1 The following table lists the data sets we have collected for testing purposes.
3 ^ File name ^ Format ^ Encoding ^ Source system ^ Description ^
4 | FSL.marc | MARC21 | UTF8 | Aleph | Armenian and Cyrillic scripts, collected from the Fundamental Science Library in Yerevan, Armenia |
5 | oss.marc | MARC21 | MARC8 | Unicorn GL3.1 | |
6 | lul_fre_100.marc | MARC21 | MARC8 | Unicorn GL3.1 | 100 records, French, pre-1923 |
7 | lul_fre_500.marc | MARC21 | MARC8 | Unicorn GL3.1 | 500 records, French, pre-1923 |
8 | jazz_1k.marc | MARC21 | MARC8 | Unicorn GL3.1 | 1000 records |
9 | music_5k.marc | MARC21 | MARC8 | Unicorn GL3.1 | 5000 records |
10 | hebrew.marc | MARC21 | MARC8 | III | Hebrew scripts, 25 records |