The digital world is much more material than we think. It seems important to me to take the measure of it in a concrete way, it is what can allow to build good strategies of organization of its data: to better manage them, not to lose them in the future and to remain the owner. The stakes are economic, cultural, human, democratic, ecologic, etc.
Digital technology seems to us to be something immaterial, because of the Internet and wireless connections present almost everywhere, which allow us to access our data (social networks, documents, music, video, photos, address books, agendas, etc.) wherever we are, without any apparent need for a concrete medium.
Nevertheless, when the connection is bad, we become aware that it is a remote link, that our data is in another place to which we must connect. But even in these situations, as the smartphone also has an internal memory and a certain amount of data is duplicated in the device (cached as we say in computer jargon), then synchronized in the “cloud” at the next connection (i.e. copied to the remote computer servers of our service providers, Google, Apple or Facebook for example), we live in a form of almost permanent connection, which plays with the constraints of reality.
Thanks to these great tools, we are evolving in a kind of “magical world of data”, which goes beyond the limits of space and time. I have nothing against this fluidity of use, which is extremely practical, but it risks making us lose our footing when it comes to making the right decisions about the security and durability of our data, both personal and professional.
Data loss is far more common than we think. The example of the fire of a data center of the French web host OVH in Strasbourg in March 2021, which caused the irremediable loss of a huge amount of data (administrative, personal, media, etc.) concerning more than 400,000 websites, has the merit of bringing us back to reality. And this is all the more true that, legally, it is not OVH that has been made responsible for the loss of data, but the owners of these data, who had not put in place backup strategies (which is their responsibility, not OVH’s).
However, some of OVH’s customers who were victims of this fire were able to recover their data almost immediately, thanks to their anticipation, and overcome the disaster with little or no damage. But others, hosted on the same servers, lost their data forever.
How to protect yourself? Let’s start with a point of understanding the materiality of the digital world, the strategies will almost follow by themselves by common sense.
The data we manipulate on a daily basis (our texts, images, sounds, posts on social networks, etc.), what are they, materially speaking? First of all, they are of several natures (to be more precise, they have several types of uses):
Data, in terms of its nature, is therefore text (for text and computer code) as well as number sequences (for images and sounds). To be able to be treated in an automated way by these machines that we call computers, stored, transmitted, transformed thanks to the computer code, these data are all transformed into “binary code”. The “binary code”, they are the “0 and the 1”, the “bits” (Binary Digit), which are the only elements that can be processed by the current computers. But how do we transform texts (text and computer code) and number sequences (images and sounds) into these billions of “0s and 1s”?
The first traces of the use of a binary codification to make calculations or information storage more reliable date back to 750 BC in China. In history until today, many mathematicians and philosophers have used binary for various types of operations (mathematics and logic mainly). This manuscript by Leibnitz from 1703, for example, already describes exactly how contemporary computers do their calculations:
Leibniz shows the correspondence between the binary code and our decimal code. In the same way that our decimal code (which consists of 10 digits) allows us to represent numbers beyond 9, the binary code (which consists of only 2 digits) also allows us to represent numbers beyond 2. We can therefore count in binary, in the same way that we count in decimal: write the numbers, add them, subtract them, etc.
The bytes
First of all, instead of considering the “0 and 1” in isolation, it was decided in 1956 to use them in groups of 8, the “bytes”. A byte, which is thus made up of 8 bits, makes it possible to count from 0 to 255.
| Decimal | Binary |
|---|---|
| 0 | 00000000 |
| 1 | 00000001 |
| 2 | 00000010 |
| 3 | 00000011 |
| 4 | 00000100 |
| … | … |
| 253 | 11111101 |
| 254 | 11111110 |
| 255 | 11111111 |
Okay... we can store numbers from 0 to 255 in bytes, but what’s the point?
Text and numbers
For text, the “ASCII code” (American Standard Code for Information Interchange) was created in 1963, which maps an alphanumeric character to each byte:
This ASCII code is the unique standard used internationally today, which means that strings of 0s and 1s processed by computers (sent, received, manipulated) are immediately converted into alphanumeric characters. These received texts always contain the necessary information to identify their nature. This identification is formatted by common protocols: address of the sender and the recipient, type of file (simple text, computer code, image or sound for example), size of the file, etc., which are always placed in the same places so that the interpretation is exact.
For images and sounds, texts describe them in the header, and the following bytes are used as numbers and no longer as texts. For example, have you noticed that in image processing software (such as Photoshop), when you choose a color, it consists of a combination of the 3 primary colors Red, Green and Blue, each of these three colors having a value between 0 and 255? This makes 16 million possible colors. Each pixel of an image uses 3 bytes to store its color.
I hope you have understood how texts, images and sounds can be transformed into billions of 0’s and 1’s in order to be stored, processed and transmitted. But, concretely, what is the materiality of the 0s and 1s?
So, there is nothing magical about the digital, it is not in the clouds! It is always well and truly stored materially somewhere.
If we don’t take care of its storage place and its durability by ourselves, there is no “God Almighty” who will do it for us. We are using new technologies, which bring us new kinds of responsibilities. And we need to build new ways of taking those responsibilities. If we don’t, we will simply lose most of our data, in the short, medium and long term. This may be a deliberate choice in a personal capacity. But in a professional capacity, it is a mistake.
Even if the big digital industrialists make our lives and uses easier by making our experiences with digital very “fluid”, we must not forget that their job is only to sell us services in the present. This ease of access should not make us forget that our digital data is indeed material and stored somewhere, by someone who does not necessarily have the altruism to ensure the guarantee and durability of our data for us. This service provider can go bankrupt, suffer a fire, a computer attack, etc. Of course, this is very rare, and this is what deceives us all the more, giving us the impression that our data is present in an eternal space, that it will be accessible forever.
In reality, our data is stored on media (nowadays hard disks) in very large data centers. It is a bit like a book in a public library, which does not really belong to you, even if you can borrow it. Unlike a book that you have in your library at home, which belongs to you fully and for which you are also fully responsible (hygrometry, fire safety, etc.).
Today, there is no reliable or durable medium for storing digital data. Hard disks have an average lifespan of 7 years and are very fragile, USB keys have a limited number of access cycles, etc. Thus, the responsible data backup goes through two temporalities:
In conclusion, care for your digital data and its media like you care for your precious books, because your digital data is not less material than your books. They have the advantage of being able to be duplicated without loss, but they have the disadvantage of not having the solidity of a book, which you only need to store on a shelf to preserve over time.
Digital data needs renewed attention, and to be duplicated in two places. These are new kinds of responsibilities that fall to us, linked to the evolution of technology. If we do not take it in hand in a simple and material way, it means that we subcontract this responsibility to others, that is to say that we choose to be irresponsible with regard to our heritage. Do we not maintain our house? It’s the same for digital data.
It is possible to benefit from the convenience of “cloud” services, while being fully owner of its data and responsible for their backup: it is enough to acquire a NAS (Network Attached Storage).
It is a hard disk with special functions, permanently switched on in our office. It is connected to the Internet box. It is permanently accessible from a distance with a computer or telephone, and offers shared office documents (cf. Google docs), file sharing, agendas, contacts, music, video, etc. NAS offers about the same services (and often better) than commercial cloud services. The difference is that it is done in control and responsibility of its data. For their basic use, NAS can be configured by non-IT specialists. There are also other responsible cloud solutions.
Yes, managing your data requires attention and work: that’s what we call responsibility. This is the real world of today’s adults. This responsibility is quite accessible, even to non-specialists, it is made of common sense, from a lucid awareness of the materiality of digital data.
In the XXIst Century, most of the human productions are made with digital tools and circulate in digital form: written, photo, sound, video, multimedia...
What is heritage? It is the access to human productions of the past and present (cultural, artistic, industrial, built, financial...). Heritage has a cultural, political, economic and historical value. Without heritage, societies have no history. Without the Eiffel Tower, without the Sacré Cœur, without the Louvre Museum and other elements of architectural heritage, Paris would not have a tourist economy, for example.
The heritage that we will be able to produce from contemporary digital productions will strongly contribute to our future wealth, in every sense of the word. But how can we identify, build up and enhance our digital heritage? Methodological, technical and strategic elements.