The following table lists the data sets we have collected for testing purposes.

^ File name ^ Format ^ Encoding ^ Source system ^ Description ^
| FSL.marc | MARC21 | UTF8 | Aleph | Armenian and Cyrillic scripts, collected from the Fundamental Science Library in Yerevan, Armenia |