Information is not a first-class citizen in corporate information systems. Worse, it is neglected. Then why do we still call them information systems? We don’t. We call them applications. And applications, quite appropriately, are built or purchased with an application-centric mindset. Consequently, data is broken into diverse fragments, tightly coupled with applications, and expensive to reassemble into coherent information entities. Software engineering and procurement practices work in sync with market forces to maintain this trend.
How does this happen? And what can be done about it?
This essay is about the corporate management of data. If an essay about corporate data management had been written a decade ago and mentioned Artificial Intelligence, that would have been a weird essay. It’s the opposite today. A weird essay is a data management essay that’s not about AI. And this will be such a weird essay. I think that AI, more concretely LLMs, came too early, before we managed to solve problems with data quality and interoperability.
Speaking of AI and weird, a recent paper1Atari, M., Xue, M. J., Park, P. S., Blasi, D., & Henrich, J. (2023). Which Humans? OSF. https://doi.org/10.31234/osf.io/5b26t reported that all LLMs have WEIRD bias. They are trained on data coming exclusively from countries that are W.E.I.R.D: Western, Educated, Industrialized, Rich, and Democratic. For LLMs, WEIRD is the norm.
In large organizations, weird is the norm as well, and here is one manifestation:
Information systems are not about information.
Now, the question is, of course,
How could that be?
- 1Atari, M., Xue, M. J., Park, P. S., Blasi, D., & Henrich, J. (2023). Which Humans? OSF. https://doi.org/10.31234/osf.io/5b26t