Click here to Skip to main content
15,892,927 members

Welcome to the Lounge

   

For discussing anything related to a software developer's life but is not for programming questions. Got a programming question?

The Lounge is rated Safe For Work. If you're about to post something inappropriate for a shared office environment, then don't post it. No ads, no abuse, and no programming questions. Trolling, (political, climate, religious or whatever) will result in your account being removed.

 
GeneralRe: Computer archeology. Pin
Amarnath S19-Aug-18 3:00
professionalAmarnath S19-Aug-18 3:00 
GeneralRe: Computer archeology. Pin
Richard Andrew x6419-Aug-18 6:17
professionalRichard Andrew x6419-Aug-18 6:17 
GeneralRe: Computer archeology. Pin
User 1106097919-Aug-18 9:17
User 1106097919-Aug-18 9:17 
GeneralRe: Computer archeology. Pin
OriginalGriff19-Aug-18 10:38
mveOriginalGriff19-Aug-18 10:38 
GeneralRe: Computer archeology. Pin
User 1106097919-Aug-18 10:51
User 1106097919-Aug-18 10:51 
GeneralRe: Computer archeology. Pin
OriginalGriff19-Aug-18 11:09
mveOriginalGriff19-Aug-18 11:09 
GeneralRe: Computer archeology. Pin
CoolTeddyBear20-Aug-18 9:25
CoolTeddyBear20-Aug-18 9:25 
GeneralRe: Computer archeology. Pin
kalberts20-Aug-18 2:01
kalberts20-Aug-18 2:01 
I participated in a few international library projects a few years ago, and encountered two quite different attitudes: One group said, "We must standardize on digital formats so that we all can read everybody's document files fifty and hundred years from now", and the other one saying: "Let each library use the formats of their choice! Fifty or hundred years from now, chances are then much higher that there will be readers for at least one of the formats that were used by all the libraries preserving the document".

I am very much in favor of standands, preferably with as few competing alternative standards as possible. Yet I see the arguments for not standardizing but preserving data in a multitude of formats. Too many times I have selected The Ultimate Format for my private files, deciding to convert all my text documents to The Ultimate Text Format, photos to The Ultimate Image Format and so on. It never works. I never get around to convert all my files to The Ultimate before it is no longer The Ultimate. I have even lost access to files that I did convert at some time to The Ultimate, but failed to convert to the next Ultimate while I still had the hardware to do so.

Long time preservation of files requires not just a single format, but a whole stack of standards. It won't help you that the file is PDF/A if you stored it on an 8" floppy. Even if you dig up an 8" floppy unit somewhere, the sectoring may be different: I have a huge pile of floppies with 2048 byte sectors (IBM and most others used 128 byte). So you write a driver that can read the huge sectors, making a disk image file on a modern PC. But the file system is neither MS-DOS, NTFS nor any Unix-family file system: Disk sectors are organized in an "unknown" way. I happen to know it - I know all the details of the Sintran file system, but not very many people do, nowadays. So I can write an extractor to select, in the right order, those sectors making up, say, a PDF/A file.

This is a simple case, when you know the format, which is well defined. There may be more layers. Once I was consulted by a company who needed to retrieve some information in unknown format from some floppies - I could extract the files, but they looked pure gibberish, not readable text. I noticed that some characters were more frequent than others, which gave away one secret: The text was not ASCII characters, but EBCDIC (i.e. IBM's old character code - which existed in umpteen variants, analogous to DOS "code pages"). What I got out was a lot of small text fragments that did not make up a coherent text, and then some seeemingly binary information. Yet, knowing that the text was IBM format, we could start searching for which types of IBM machines had been used by the company in former times, and found one that stored the data using a fairly simple DBMS - so simple that it was easier to read the format specs for the DBMS and write a small program to interpret the data structures, than to get hold of an anitique IBM machine to run it. But if I hadn't had access to the specs for the data structures, I would have been lost.

This is more like the typical case. I am sure that if the information had been available in eight or ten different formats, at least one of them would have been more easily accessible.
GeneralRe: Computer archeology. Pin
kalberts20-Aug-18 2:49
kalberts20-Aug-18 2:49 
GeneralRe: Computer archeology. Pin
Daniel Pfeffer20-Aug-18 4:04
professionalDaniel Pfeffer20-Aug-18 4:04 
GeneralRe: Computer archeology. Pin
Richard Deeming20-Aug-18 8:10
mveRichard Deeming20-Aug-18 8:10 
GeneralRe: Computer archeology. Pin
OriginalGriff20-Aug-18 8:20
mveOriginalGriff20-Aug-18 8:20 
JokeLost at home... Pin
Kornfeld Eliyahu Peter18-Aug-18 22:36
professionalKornfeld Eliyahu Peter18-Aug-18 22:36 
GeneralRe: Lost at home... Pin
Eddy Vluggen18-Aug-18 23:04
professionalEddy Vluggen18-Aug-18 23:04 
GeneralRe: Lost at home... Pin
Kornfeld Eliyahu Peter18-Aug-18 23:19
professionalKornfeld Eliyahu Peter18-Aug-18 23:19 
GeneralRe: Lost at home... Pin
Eddy Vluggen18-Aug-18 23:24
professionalEddy Vluggen18-Aug-18 23:24 
GeneralRe: Lost at home... Pin
Kornfeld Eliyahu Peter18-Aug-18 23:36
professionalKornfeld Eliyahu Peter18-Aug-18 23:36 
GeneralRe: Lost at home... Pin
Sander Rossel19-Aug-18 0:06
professionalSander Rossel19-Aug-18 0:06 
GeneralRe: Lost at home... Pin
Kornfeld Eliyahu Peter19-Aug-18 0:15
professionalKornfeld Eliyahu Peter19-Aug-18 0:15 
GeneralMaunder makes it into the Oxford English Dictionary ! Pin
BillWoodruff18-Aug-18 18:34
professionalBillWoodruff18-Aug-18 18:34 
GeneralRe: Maunder makes it into the Oxford English Dictionary ! Pin
OriginalGriff18-Aug-18 20:14
mveOriginalGriff18-Aug-18 20:14 
GeneralRe: Maunder makes it into the Oxford English Dictionary ! Pin
BillWoodruff19-Aug-18 7:07
professionalBillWoodruff19-Aug-18 7:07 
GeneralYou need to explicitly give us permission to email you (It's a CASL thing...) Pin
AnotherKen18-Aug-18 17:42
professionalAnotherKen18-Aug-18 17:42 
GeneralRe: You need to explicitly give us permission to email you (It's a CASL thing...) Pin
OriginalGriff18-Aug-18 19:25
mveOriginalGriff18-Aug-18 19:25 
GeneralRe: You need to explicitly give us permission to email you (It's a CASL thing...) Pin
Richard Deeming20-Aug-18 8:12
mveRichard Deeming20-Aug-18 8:12 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.