U bent hier
Voortbestaan
Digitising Haalpulaar Islamic Manuscripts (EAP1245 Project)
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
The Argoknot project: JSON song data
I’ve got a new project which I ought to blog about somewhere, and it’s related to file formats, so it’s going here.
There have been projects to archive information about filk songs. They’ve tended toward wikis such as the Filk Discography Wiki, which contains information about filk recordings. Many filk albums have gone out of publication and might otherwise be forgotten, and the wiki keeps them in the cultural memory. Wikis are fine, and they’re easy to participate in with little technical knowledge. They’re also fragile; if the hosting for a wiki goes away, it might find a new home, but it might disappear if no one takes prompt action.
Structured information has advantages. It’s easy for anyone with a little file storage to keep a copy and give it to others. People can create their own repositories, perhaps of songs which they have published. It’s easy to search them and extract information, e.g., all the songs by an author. This isn’t to say that we should abandon wikis, but having structured information as well strengthens the effort. With a little work, it can be fed to wikis.
This is why I’ve created the Argoknot project. It’s a Python-based project to process song data in JSON format. As of this post, it can do one thing: convert CSV files to JSON. I’m planning to add the ability to convert XML files that use the MODS schema. There is a pile of such files in the MASSFILC Filk Book Index.
One of the project’s aims is to create a JSON nomenclature for the filk community. That will let other projects work with the same JSON files to create websites, import into wikis, or do lots of other things.
What I’m doing here is just a start, and it won’t get far without the participation of others. I encourage others in the filk community to join the effort, whether working directly on Argoknot, offering suggestions on how to organize the data, or creating other coding projects.
New online - April 2022
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
Digitising Arabic Manuscripts in Mattool, North Kerala (EAP1390)
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
How broadcast FM can wreck your receiving system
Today I came upon some news weird enough to justify a post on this long-dormant blog. Ars Technica reports that it “began on January 30 and afflicted Mazdas from model years 2014 to 2017 when the cars were tuned to the local NPR station, KUOW 94.9. At some point during the day’s broadcast, a signal from KUOW caused the Mazdas’ infotainment systems to crash—the screens died and the radios were stuck on 94.9 FM.”
That shouldn’t be possible, right? A broadcast FM signal is just frequency-modulated audio. It might deafen you or damage the speakers, but it shouldn’t make the receiver stop working! Well, actually, it isn’t just audio. Broadcasters can optionally use the Radio Broadcast Data System (RBDS), which supports encoded digital data. It uses a 57 kHz subcarrier, well above the limits of human hearing. The data is encoded at 1187.5 bits per second, a strange-sounding number that yields 48 cycles of the subcarrier for every bit. Error correction codes bring the effective data rate down to 730 bits per second.
RDBS carries several types of information, including station identification, clock time, song titles or other text, traffic alerts, and images. The last one is where the problem lay.
As Ars Technica explains it, certain Mazda infotainment systems in model years 2014 to 2017 identified image files by their extension. If it saw .jpg or .png in the file name, it knew what to do with it. But KUOW broadcast some otherwise valid images with no extension in their name. That’s a problem, but it shouldn’t destroy receiving equipment. The bug was on Mazda’s side; the receiver had no idea what to do with a file lacking an extension. Ignoring the file would have been reasonable. Instead, according to a comment,
The very simplified version is that the head unit loaded this image into its cache and tried to decode it, hit the software bug, locked up and watchdog rebooted. The first thing it does when it reboots is tries to decode the image, and locks up and reboots, forever.
That’s doubly bad. First it crashes when it gets an unrecognized file; second, when it reboots, it reprocesses the bad data that made it crash. Forever.
You should be able to fix the problem with a cold reboot, right? An authorized service center should be able to do that. Apparently not. Ars Technica says:
From there, the infotainment systems became trapped in a rebooting loop, never successfully completing the task. When afflicted owners took their cars to be checked at local Mazda dealers, they were told that the “connectivity master unit” was dead and needed to be replaced.
The price of a replacement unit is $1,500. Fortunately, Mazda is offering free replacements to those afflicted by this situation.
It sounds like something out of a science fiction story. A pirate broadcaster could take advantage of this bug to send out a malicious signal, wrecking the receivers of drivers who go station-surfing. When everything contains a computer, everything is potentially vulnerable to sloppy coding.
New online - December 2021
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
East African Life-Writing and Colonial History: New Perspectives from EAP Tanzanian Church Records
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
Updated Equipment List for Round 17
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
EAP Round 17 – deadline extension
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
Application Portal now open
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
Webinars for Applicants
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
Call for applications now open
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
New online - August 2021
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
The Backstory to Digitising the Barbados Gazette
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
JHOVE Tips for Developers
I got a request for my ebook, JHOVE Tips for Developers. It’s no longer for sale on Smashwords, since I haven’t updated it since 2012, but if anyone wants it, you can download JHOVE Tips for Developers from this site.
New online - July 2021
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
EAP blog subscription ending - Please follow us on Twitter for updates
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
Help trace the stories of enslaved people in the Caribbean using colonial newspapers
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
New online - June 2021
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)
New online - April/May 2021
(From the Endangered Archives Blog: Lynda Barraclough on histories in peril)