U bent hier

Internet Archive

Abonneren op feed Internet Archive
A blog from the team at archive.org
Bijgewerkt: 2 uur 17 min geleden

TV News Record: Coverage of sexual harassment charges

1 december 2017 - 5:00pm

A biweekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman.

This week we dive into research resources, fact-checks, and backgrounders on the sexual harassment charges sweeping the worlds of media, politics, and entertainment.

Past TV news appearances by O’Reilly, Franken, Halperin & more preserved, searchable

The names of well-known and influential men accused of sexual harassment continue to pile up: Louis C.K.; Rep. John Conyers, D., Mich.; Sen. Al Franken, D., Minn.; Mark Halperin of ABC; Garrison Keillor of Prairie Home Companion fame; NBC’s Matt Lauer, Alabama GOP Senate candidate Roy Moore; Fox’s Bill O’Reilly; Charlie Rose of CBS and PBS; Leon Wieseltier, formerly of The New Republic; Harvey Weinstein, and of course the president himself, Donald Trump.

Some of these men made their living on TV and all of them have many TV appearances preserved in the TV News Archive. These include, for example, clips making the rounds in this week’s stories about Lauer’s television history: this one where NBC staff did a fake joke story about Lauer himself being harassed by a colleague; and this interview with the actress Anne Hathaway after a photographer took a photo up her skirt and a tabloid printed it.

We’ve also got this September 2017  interview Lauer did of Bill O’Reilly, in which he asks the former Fox News host: “Have you done some soul searching? Have you done some self-reflection and have you looked at the way you treated women that you think now or think about differently now than you did at the time?” O’Reilly answers, “My conscience is clear.”


TV news use of term “sexual harassment” peaked in 2011

Mentions of term “sexual harassment” since 2009 on cableTV news shows, source: Television Explorer search of TV News Archive caption data

Cable TV news programs mentions of the term “sexual harassment” are picking up, but not yet at the level they were back in November 2011, according to a search of TV News Archive caption data via Television Explorer. What was big news then? Accusations of sexual harassment against then presidential candidate Herman Cain, who the following month dropped out of the 2012 race. Cain was back on TV this week, in an interview on the current claims of sexual harassment against powerful men, on Fox News’ The Ingraham Angle.

Cain says: “Now, what’s different about my situation, and let’s just say Roy Moore’s situation, is that they came after me with repeatedly attacks and accusations, but no confirmation. They now believe that if they throw more and more and more mud on the wall, that eventually people are going to believe it. But that has backfired because, as you know, the latest poll shows Roy Moore is now back in the lead in Alabama, and the people in Alabama are going to have to decide.”


Fact-checks and backgrounders on sexual harassment charges

Our fact-checking partners have produced numerous fact-checks and background pieces on sexual harassment charges and statements.

“How politicians react to such charges often appears to reflect who is being accused. Democrats are quick to jump on allegations about Republicans — and vice versa. But the bets start to get hedged when someone in the same party falls under scrutiny,” writes Meg Kelly for The Washington Post‘s Fact Checker.

For example, here’s House Minority Leader Nancy Pelosi, D. Calif., on “Meet the Press” in 1998, defending then-President Bill Clinton. At the time, Pelosi said, referring to the investigation by Ken Starr into allegations against Clinton: “The women of America are just like other Americans, in that they value fairness, they value privacy, and do not want to see a person with uncontrolled power, uncontrolled time, uncontrolled – unlimited money investigating the president of the United States.”

And here’s President Donald Trump on the White House south lawn, defending GOP Alabama Senate candidate Roy Moore: “Roy Moore denies this. That’s all I can say. And by the way, he totally denies it.”

Kelly also wrote a round-up of sexual harassment charges against the president himself: “During the second presidential debate, Anderson Cooper asked then-candidate Trump point blank whether he had “actually kiss[ed] women without consent or grope[d] women without consent?” Trump asserted that “nobody has more respect for women” and Cooper pushed him, asking, “Have you ever done those things?” Trump denied that he had, responding: “No, I have not.”…But it’s not as simple as that. Many of the women have produced witnesses who say they heard about these incidents when they happened — long before Trump’s political aspirations were known. Three have produced at least two witnesses.”

In another piece, Glenn Kessler, editor of The Washington Post’s Fact Checker, writes  in a round up of corroborators, “Such contemporaneous accounts are essential to establishing the credibility of the allegation because they reduce the chances that a person is making up a story for political purposes. In the case of sexual allegations, such accounts can help bolster the credibility of the “she said” side of the equation.” One of the statements of denial he quotes is this one from White House Press Secretary Sarah Huckabee Sanders, who says when asked if all of the accusers are lying: “Yeah, we’ve been clear on that from the beginning, and the president’s spoken on it.”


Here’s FactCheck.org’s Eugene Kiely on Sen. Tim Kaine, D., Va., and his claim that the Clinton campaign can’t give back contributions from Harry Weinstein: “Asked if the Clinton-Kaine campaign will return contributions it received from movie mogul Harvey Weinstein, Sen. Tim Kaine repeatedly said the campaign is over. That’s true, but it doesn’t mean the campaign can’t refund donations.” FactCheck.org deemed this claim “misleading.”

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

Why I Work for the Internet Archive

28 november 2017 - 11:37am

Wendy Hanamura, Director of Partnerships

This year something magical happened.

Our film curator, Rick Prelinger, noticed a film for sale on eBay. The description said “taken at a Japanese Internment Camp,” so Rick bought it, suspecting it might be of historical significance. In October, when he digitized the 16mm reel and showed it to me, I couldn’t believe it.  

On the screen was a home movie shot in 1944 at the WWII camp in Jerome, Arkansas where 8,500 Japanese Americans were incarcerated. This American concentration camp was once the fifth largest town in Arkansas. Rick thinks the film was shot by a camp administrator and hidden away for the last 73 years.

There are only a handful of movies ever shot inside the camps—I know, because my mother and grandparents were locked up in a similar camp for three and a half years.

Sab Masada at the Internet Archive, October 2017

What a miracle, then, that the Internet Archive found this film and preserved it while there are still people who can bear witness to what we see on screen. One of them, Sab Masada, was 12 years old when a truck came to haul his family away from their farm in Fresno. At our annual event, Sab remembered:

We were shipped to Jerome, Arkansas. It turned extremely cold, the beginning of November. In fact, we had some snow. The camp was still being completed so our barracks had no heat and my father caught pneumonia. 21 days after we arrived, he died in a makeshift barrack hospital…

This film will tell America that these concentration camps we were in—it wasn’t a myth! They were real. So it’s a historical record: proof of what really happened to 120,000 Americans and legal residents.

The best part for me?  That this film will live on at archive.org, accessible to the public, forever, for free.  Filmmakers can download it. Scholars can study it.  Teachers can weave it into their lessons.

Educator, Andi Wong, teaching the lessons of Japanese American incarceration at Rooftop School in San Francisco.

Here’s our promise to you: the Internet Archive will keep updating these files every time a major, new format emerges. We will preserve them for the long term, against fire, neglect and all types of more human disaster. We will cherish your stories as if they were our own.  

I’m part of a small staff of 150, running a site the whole world depends on. I’ve worked at huge media corporations where the only things that matter are the ratings, because ratings = profits. At the end of the day, I couldn’t stomach it. I wanted to do more. I wanted to work at a place aligned with my values: creating a world where everyone has equal access to knowledge, because knowledge equals power. It’s society’s great leveler—at least that’s how it worked for my family. 

This is why I give to the Internet Archive:  because I believe every story deserves to be saved for the future.   

When you give, you’re helping make sure the whole world can access the books, concerts, radio shows, web pages and yes, the home movies, that tell our human story.

So please, support our mission by donating today. Right now, a very generous supporter will match your donation 3-to-1, so you can triple your impact.

Think of it as an investment in our children, and their children, so that one day in the future, they might understand our joys and learn from our mistakes.

Wendy Hanamura, Director of Partnerships, Internet Archive

5,000 78rpm sides for the Great 78 Project are now posted

27 november 2017 - 6:47am

From the David Chomowicz and Esther Ready Collection.
Click to listen.

This month’s transfers of 5,000 78rpm sides for the Great 78 Project are now posted.

Many are Latin American music from the David Chomowicz and Esther Ready Collection.

Others are square dance music, with and without the calls, from the Larry Edelman Collection. (Thank you David Chomowicz, Esther Ready, and Larry Edelman for the donations.)

We are still working on some of the display issues with this month’s materials, so some changes are yet to come.

From the Larry Edelman Collection.
Click to listen.

Unfortunately we have only found dates for about 1/2 of this month’s batch using our automatic techniques of looking through 78disography.com, 45worlds, discogs, DAHR, and full text searching of Cashbox Magazine.  There are currently over 2,000 songs with missing dates.

If you like internet sleuthing, or leveraging our scanned discographies or your discographies and would like to join in on finding dates and reviews, please jump in. We have a slack channel of those doing this.

Congratulations to B George’s group, George Blood’s group, and the collections group at the Internet Archive for another large batch of largely disappeared 78’s.

Net Neutrality & Competition: The Final Days of Internet Freedom

21 november 2017 - 7:15pm

Join Public Knowledge to Discuss…

Net neutrality is on the chopping block. Public Knowledge has spent nearly ten years fighting for an open internet, but we expect that at the Federal Communications Commission’s December 14th meeting a majority of Commissioners will vote to eliminate our strong net neutrality rules. The current FCC has made dominant industry interests a priority, putting startup and consumer interests at risk. As one of the world’s largest centers for businesses and individuals financing, creating, and building the technology and inputs to the digital economy, Silicon Valley will be directly impacted by these new policies. At the same time, AT&T is seeking to merge with Time Warner, and Sinclair Broadcasting with the Tribune Company — potentially undermining video and broadband competition as net neutrality rules disappear. So, Public Knowledge is coming to California to discuss these important political shifts with engaged individuals, and to build new connections with individuals who want to learn more about standing up for an open internet.

The event will kick off with a reception from 6-7pm and will include a discussion from 7-8pm. We hope that you will join us and stick around afterwards for food and drinks until 9pm.

Please be sure to register, spread the word via Twitter, and keep an eye out for additional information coming soon.

GET TICKETS HERE

Tuesday December 5th, 2017
6:00 pm Reception
7:00 pm Program
8:00 pm Reception

Internet Archive
300 Funston Ave.
San Francisco, CA 94118

Wondering what to do with your Cryptocurrency Windfall? Donations accepted here!

17 november 2017 - 12:23am

This week, as you’ve watched your Bitcoin Cash and Bitcoin rise and fall and rise again, perhaps you’ve been wondering:  how can I put my cryptocurrencies to good use?  Should I buy a new car or yacht?  Plow it into Amazon stock?  Well, at least some of you have turned to the Internet Archive—a place where you can donate your cryptocurrencies directly to help ensure that the Web is free, secure and backed up for all time.

At the Internet Archive, we are big fans of the cryptocurrency movement and have been trying to do our part to test and support alternative means of commerce. We’ve been accepting Bitcoin donations since 2012, and starting this week, we are now accepting donations of Bitcoin Cash and Zcash.

This week it all started when UKcryptocurrency tweeted us asking the Internet Archive to start accepting Bitcoin Cash. We love a good challenge and got that link up within hours.

UKcurrency tweet

Here’s how you can donate in cryptocurrency:


Bitcoin is an experimental, cryptographically secure, semi-anonymous method of transferring value between parties. Introduced in 2008, it has been successfully used as a token system between thousands of people. The Internet Archive has proudly experimented with bitcoin including paying some employees with it and encouraging local businesses to experiment as well.
Internet Archive Bitcoin Address: 1Archive1n2C579dMsAu3iC6tWzuQJz8dN


When Bitcoin was first created, developers and miners questioned whether the cryptocurrency could scale properly. To ensure its future, on August 1, 2017, developers and miners initiated what’s known as a ‘hard fork’ and created a new currency called Bitcoin Cash. For more information, visit: https://www.bitcoincash.org/
Internet Archive Bitcoin Cash Address: 12PRZjrLo5yqnHMmUCtPUse4kCyuneby3S


Zcash is the first-of-its-kind cryptocurrency that offers both privacy and selective transparency for all transactions. Zcash gives you the option of using a transparent addresses (what the Internet Archive uses, listed below) or shielded addresses which keep sender, receiver, and amounts private. For more information on Zcash visit https://z.cash/, and for details on the differences between transparent and shielded transactions visit: https://z.cash/support/security/privacy-security-recommendations.html
Internet Archive Zcash Address: t1W6JqMECmbqmDGZ9uLSXWQZ6EgxFkfuty8

We are a non-profit organization with a huge mission: to give everyone access to all knowledge—the books, web pages, audio, television and software of our shared humanity. Forever. For Free. But to build this digital library of the future, we need your help. If you’re feeling flush from a cryptocurrency-windfall, please consider giving to the Internet Archive today.

More info on why cryptocurrencies matter to us: http://blog.archive.org/2017/09/02/why-bitcoin-is-on-the-internet-archives-balance-sheet/

TV News Record: Whoops, they said it again (on taxes)

16 november 2017 - 7:54pm

A biweekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman

This week, we demonstrate GOP and Democrat talking points on taxes; display a case of mistaken facial identity; and present fact checks on the GOP tax proposal.

Whoops, they said it again

Was that House Minority Leader Nancy Pelosi, D., Calif., who said “tax cuts for the rich”? Or was that Senate Minority Leader Chuck Schumer, D., N.Y.? Wait: they both said it. Often.

Meanwhile, House Speaker Paul Ryan, R., Wis., keeps talking about tax reform being a “once in a generation opportunity,” and, coincidence!, so does Sen. Majority Leader Mitch McConnell, R., Ky. It’s a recurring theme.

These types of repeated phrases, often vetted via communication staff, are known as “talking points,” and it’s the way politicians, lobbyists, and other denizens of the nation’s capital sell policy. The TV News Archive is working toward the goal of applying artificial intelligence (AI) to our free, online library of TV news to help ferret out talking points so we can better understand how political messages are crafted and disseminated.

For now, we don’t have an automated way to identify such repeated phrases from the thousands of hours of television news coverage. However, searching within our curated archives of top political leaders can provide a quick way to check for a phrase you think you’re hearing often. Visit archive.org/tv to find our Trump archive, executive branch archive, and congressional archives, click into an archive, then search for the phrase within that archive.

Sample search results in the congressional archive

Funny, you look familiar

Wait, is this former President George W. Bush trying out a new look?

No, it’s not. This is Bob Massi, a legal analyst for Fox Business News and host of “Bob Massi is the Property Man.”  In a test run of new faces for our Face-o-Matic facial detection tool, Massi’s uncanny resemblance (minus the hair) to the former president earned him a “false positive” – the algorithm identified this appearance as Bush incorrectly.

This doesn’t get us too worried, as we still include human testers and editors in our secret sauce: we’ll retrain our algorithm to disregard photos of Massi in the TV news stream. It does point toward why we want to be very careful, particularly with facial recognition, where a private individual may be tracked inadvertently or a public official misrepresented. Our concern about developing ethical practices with facial recognition is why, for the present, we are restricting our face-finding to elected officials. We invite discussion with the greater community about ethical practices in applying AI to the TV News Archive at tvnews@archive.org.

In our current Face-o-Matic set we track the faces of President Donald Trump and the four congressional leaders in their TV news appearances. After receiving feedback from journalists and researchers, our next set will include living ex-presidents and recent major presidential party nominees: Jimmy Carter, Bill Clinton, George H.W. Bush, George W. Bush,  Barack Obama, Hillary Clinton, John McCain, and Mitt Romney. Stay tuned, while we fine tune our model.

Fact-check: everyone will get a tax cut (false)

In an interview on November 7, on Fox News’s new “The Ingraham Angle,” House Speaker Paul Ryan, R., Wis., says: “Everyone enjoys a tax cut all across the board.”

Pulling in information from the Tax Policy Center and a tax model created by the American Enterprise Institute, The Washington Post’s Fact Checker Glenn Kessler counters Ryan’s claim: “In the case of married families with children — whom Republicans are assiduously wooing as beneficiaries of their plan — about 40 percent are estimated to receive tax hikes by 2027, even if the provisions are retained.”

Ryan changed his language, according to Kessler, following an inquiry on November 8 from the Fact Checker. Now he is saying, “the average taxpayer in all income levels gets a tax cut.”

Fact-check: tax bill not being scored by CBO as is tradition (false)

In an interview on November 12 on CNN’s “State of the Union,” Senate Minority Whip Dick Durbin, D., Ill., claimed that the GOP tax plan is “not being scored by the Congressional Budget Office, as it is traditionally. It’s because it doesn’t add up.”

“Under the most obvious interpretation of that statement, Durbin is incorrect. The nonpartisan analysis for tax bills is actually a task handled by the Joint Committee on Taxation, and the committee has been actively analyzing the Republican tax bills,” reported Louis Jacobson of PolitiFact.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

Hewlett Foundation Commissions New Work by DJ Spooky & Internet Archive

15 november 2017 - 6:58pm

We are honored to announce that the Internet Archive and artists Paul D. Miller (aka DJ Spooky) and Greg Niemeyer have been awarded one of the first Hewlett 50 Art Commissions to support the creation of “Sonic Web”—an acoustic portrait of the Internet.  Sampling from the millions of hours of audio preserved in the Internet Archive, these experimental composers and artists will collaborate to create an 11-movement multimedia production for a string quartet, vocalist and original electronic instruments about the origins of the Internet and what needs to happen to keep it accessible, neutral, and free.

“Art is always a reflection of the changing dynamics of any society. Leonardo Da Vinci once said ‘Learning never exhausts the mind,'” explained DJ Spooky. “I think that we have so many things to learn from these kinds of interdisciplinary projects, and the William and Flora Hewlett Foundation is collaborating with Artists to show how these initiatives can affect the entire spectrum of the creative economy.”

The Internet Archive team is among the first 10 recipients of the Hewlett 50 Arts Commissions, an $8 million commissioning initiative that is the largest of its kind in the United States.  These $150,000 grants support Bay Area nonprofits working with world-class artists on major new music compositions spanning myriad genres including chamber, electronic, jazz, opera, and hip hop. These commissions honor the Hewlett Foundation’s 50th anniversary, commemorating decades of leadership in the Bay Area arts world.

“The Hewlett 50 Arts Commissions are a symbol of the foundation’s longstanding commitment to performing arts in the Bay Area,” said Larry Kramer, president of the Hewlett Foundation. “We believe the awards will fund the creation of new musical works of lasting significance that are as dynamic and diverse as the Bay Area communities where they will premiere.”

New media artist and UC Berkeley arts practice associate professor, Greg Niemeyer, presenting his new work, “Memory Palace,” at the Internet Archive in 2016.

“Sonic Web” is conceived to push boundaries in both music and technology.  New media artist, Greg Niemeyer, will build an original Sonic Web Instrument —a large touchscreen with a software tool to draw network diagrams. It will enable DJ Spooky to build and take apart simple networks using sampled sounds from the Internet Archive, further layered by a vocalist and string quartet.

“Sonic Web will dig into the big crate of the Internet Archive and remix internet history in a new, networked way,” says Greg Niemeyer.  “We will break out of linear musical structures towards a more networked and connected sound.”

 

The artists will also take these tools on the road, partnering with Berkeley Center for New Media, Stanford Live, Youth Radio, and Bay Area high schools for music and technology workshops and a service learning course at UC Berkeley.

The work will premiere at the Internet Archive Great Room during the summer of 2018.  We will also provide  free global access to a downloadable Sonic Web album with music videos and the livestream of the premiere  at archive.org.

NOTE: DJ Spooky, Niemeyer and the Internet Archive collaborated in 2016 to create “Memory Palace,” a new multimedia work performed at our own 20th anniversary celebration. For a taste of what’s to come, watch this.

 

 

TV News Record: With indictment, chyrons & captions get a graphic workout

3 november 2017 - 12:20am

A biweekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman

Fox News downplayed Mueller indictment, according to NYT editorial chyron analysis

In the most intensive use the Internet Archive’s Third Eye data to date, The New York Times editorial page analyzed chyron data to show how Fox News downplayed this week’s news of the indictment of former Trump campaign manager and other legal developments. The graphic-heavy opinion piece was featured at the top of the online homepage [most/much] of the day on Wednesday, Nov. 1:

Though it is far from the only possible way to evaluate news coverage, the chyron has become something of a touchstone for media analysts, being both the most obvious visual example of spin or distraction and the most shareable. Any negative coverage of the president usually prompts a flurry of tweets cataloguing the differences among networks in their chyron text. While CNN, MSNBC and the BBC are typically in alignment, Monday morning was a particularly stark example of how Fox News pushes its own version of reality.

 Read The New York Times opinion piece, and dig into the data yourself.

Captions yield insights on Mueller investigation, shooting coverage

Fox News actively tried to “plant doubt in viewers’ minds” as Mueller brought charges against former Trump campaign officials, according to an analysis of a week’s worth of closed captions by Alvin Chang of Vox News. Chang used Television Explorer, fueled by TV News Archive data, to crunch the numbers behind charts such as the one below.

And The Trace, an independent, nonprofit news organization that focuses on gun violence, used TV News Archive caption data via Television Explorer to show how TV news coverage of mass shootings declines quickly.

How fast did TV news move on from the Vegas shooting and other major events? Pretty fast, @TVNewsArchive data shows https://t.co/cBeyU8enR1 pic.twitter.com/OghOyXQzZJ

— Daniel Nass (@dnlnss) October 27, 2017

Face-o-Matic captures congressional leaders reactions on indictments

In the 24 hours following news breaking about the indictments, our Face-o-Matic data feed captured cable news networks’ editorial choices on how much face-time to allot to congressional leaders’ reactions. The answer: not much.

All together the four congressional leaders’ faces were shown for a total of 2.5 minutes on indictment-related reporting on screen by CNN, Fox News, and MSNBC. Ryan got the lion’s share of the attention. Much of this was devoted to airings of his photo in connection with his official statement,“[N]othing is going to derail what we are doing in Congress, because we are working on solving people’s problems.”

The image of Senate Majority Leader Mitch McConnell, K., Ky., was not featured by any network. House Minority Leader Nancy Pelosi, D., Calif., got attention only from Fox News, which featured her photo with discussion of her statement, in which she said despite the news, “we still need an outside fully independent investigation.”

Fact-check:Papadopoulos had a limited role in Trump campaign (had seat at table/not the whole story)

One of the most parsed statements this week was White House press secretary Sarah Huckabee Sanders’ claim that George Papadopolous, who pleaded guilty to lying to the FBI, had an “extremely limited” role in the campaign. “It was a volunteer position,” she said. “And again, no acitvity was ever done in an official capacity on behalf of the campaign.”

“Determining how important Papadopoulos was on the Trump team is open to interpretation, so we won’t put this argument to the Truth-O-Meter,” wrote Louis Jacobson, reporting for PolitiFact. Jacobson, however, laid out the known facts. For example, in March 2016, then presidential candidate Donald Trump tweeted out a photo of himself and advisors sitting at a table, saying it was a “national security meeting.” Papadopoulos is seen at the table sitting near future Attorney General Jeff Sessions. However, Jacobson also writes,“There is some evidence to support the argument that Papadopoulos was freelancing by pushing the Russia connection.”

Reviwing Sanders’ claim, as well as a Trump tweet along similar lines, Robert Farley and Eugene Kiely took a similar tack for FactCheck.org, concluding that Papadopoulos had a “seat at the table” in the campaign, but it was beyond licking envelopes and posting lawn signs:  “What we do know is that during this time — from late March to mid-August — Papadopoulos was in regular contact with senior Trump campaign officials and attended a national security meeting with Trump. We will let readers decide if this constitutes a ‘low-level volunteer.'”

Embed TV News Archive clips on web annotations

Now you can embed TV News Archive news clips when commenting and annotating the web, thanks to a new integration from Hypothes.is. From the Hypothesis.is blog:

This integration makes it easy for journalists, fact-checkers, educators, scholars and anyone that wants to relate specific text in a webpage, PDF, or EPUB to a particular snippet of video news coverage. All you need to do to use it is copy the URL of a TV News Archive video page, paste it into the Hypothesis annotation editor and save your annotation. You can adjust the start and end of the video to include any exact snippet. The video will then automatically be available to view in your annotation alongside the annotated text.

See a live example of the integration in this annotation with an embedded news video of Senator Charles Schumer at a news conference over a post that checks the facts in one of his statements.

“This integration means that one of the world’s most valuable resources — the news that the Internet Archive captures across the world everyday — will be able to be brought into close context with pages and documents across the web,” said Hypothesis CEO Dan Whaley. “For instance, a video of a politician making an actual statement next to an excerpt that claims the opposite, or a video of a newsworthy event next to a deeper analysis of it.”

Please take Hypothes.is for a spin and let us know what you think: tvnews@archive.org.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

Internet Archive to host conference about saving online news

29 oktober 2017 - 10:25pm

The Internet Archive will host the “Dodging the Memory Hole” (DTMH) forum Nov 15 and 16th.  This will be the fifth in the series of outreach efforts over the past four years. Presented by the Donald W. Reynolds Journalism Institute, with support from the Institute of Museum and Library Services, the conference will address issues related to archiving and access to online news.

We are happy to be able to present a range of people, and projects, involved in a wide cross-section of activities related to news archiving, representing local, national and world-wide efforts.  As a bonus, our special guest speaker, Daniel Ellsberg, will highlight the value of the First Amendment and the need to make sure the public has free access to accurate information in the digital age.

News has been called the “first rough draft of history.”  Some think the risk to this history is at an all time high.  The possibility exists that large portions of our cultural record, as captured by journalists and others, will be lost forever if no action is taken to provide long-term solutions for access.  The loss of digital records is happening at an unprecedented pace – faster than the loss of comparable print and analog resources.  Access and preservation are two sides of the same coin in this regard.

The Internet Archive has become increasingly important as a means of collecting and preserving online news content.  As if the challenges of capturing more traditional news sources such as newspapers and television stations aren’t enough, the rise of social media as major distribution channels has made it even more difficult to address the complex set of issues involved.  Since many of the challenges end up being technical in nature, bringing Internet Archive staff together with the DTMH community offers the chance to identify problems and approach solutions to some of the stumbling blocks we’ve encountered at this point in the journey.

Journalists, memory institutions, technologists, historians, political scientists and anyone with an interest in having long-term access to a trustworthy and accurate record of life in the digital age will find this gathering of interest.  I urge anyone interested in this urgent and important issue to come join us at the Internet Archive on Nov. 15-16.  We have a limited number of seats available.  Registration is required, but it is free. If you want register in time to allow us to order food for you, please register by Monday, Oct. 30.  Final cutoff for registrations is Nov. 5.  I hope to see you there!

For more information, and to register, click here.

TV News Record: Third Eye goes to Trump press conference

19 oktober 2017 - 6:53pm

A weekly round up on what’s happening and what we’re seeing at the TV News Archive by Katie Dahl and Nancy Watzman. Additional research by Robin Chin.

All three major U.S. cable news networks covered President Donald Trump’s impromptu press conference with Sen. Mitch McConnell, R., Ky., on Monday, October 16, but there were notable differences in their editorial choices for chyrons – the captions that appear in real-time on the bottom third of the screen – throughout the broadcast. We used the TV News Archive’s new Third Eye chryon extraction data tool to demonstrate these differences, similar to how The Washington Post examined FBI director James B.  Comey’s hearing in June 2017.

The beauty of the Third Eye tool is you can do this too, any time there is breaking news or other widely covered live events, like yesterday’s Senate judiciary committee hearing where AG Jeff Sessions testified (7:31am-9:46am PT) or the October 5 White House briefing about Puerto Rico (11:20am-11:48am PT). Third Eye data – which includes chyrons from BBC News, CNN, Fox News, and MSNBC – is available for data download, via API, in both raw and filtered formats. (Get into the weeds over on the Third Eye collection page.) Please take Third Eye for a spin, and let us know if you have questions: tvnews@archive.org or @tvnewsarchive.

For example, at 11:03 PT, Trump began answering a question about pharmaceutical companies “making money.” MSNBC chooses a chyron that characterizes Trump’s statements as a claim, whereas Fox News displays Trump’s assertion that Obamacare is a disaster. CNN goes with a chyron saying that Trump is “very happy” to end Obamacare subsidies.  In the following minute, 11:04, Fox News chooses other bold statements from Trump: “I do not need pharma money” and “I want tax reform this year.” CNN’s chyron instead says Trump “would like to seee” tax reform, a less bold statement.

(Note: these are representative chryons from the minute period and did not necessarily display for the full 60-second period.)

Later in the press conference, the discussion turns to natural disasters before then focusing on the proposed wall on the border with Mexico. Again, Fox News features Trump making bold, simple assertions: “we are getting high marks for our hurricane response,” and “PR was in bad shape before the storm hit.” MSNBC instead uses the word “claims”: “Trump claims Puerto Rico now has more generators than any place in the world.”

Watch the Trump-McConnell press conference in context on C-Span.

Fact-check: Sen. McCaskill not present for bill to weaken DEA (four Pinocchios)

The day following The Washington Post-60 Minutes report on legislation passed by Congress and signed by President Barack Obama to weaken the authority of the Drug Enforcement Agency, Sen. Claire McCaskill, D., Mo., called for repeal of the law. In an interview, she also said, “Now, I did not go along with this. I wasn’t here at the time. I was actually out getting breast cancer treatment. I don’t know that I would have objected. I like to believe I would have, but the bottom line is, once the DEA [Drug Enforcement Administration] kind of, the upper levels at the DEA obviously said it was okay, that’s what gave it the green light.”

But “despite her claim that she ‘wasn’t here at the time,’ McCaskill was clearly back at the Senate, participating in votes and hearings,” according to The Washington Post‘s Fact Checker’s Glenn Kessler. “McCaskill’s staff acknowledged the error, saying that they had forgotten she had come back at that time. ‘It was sloppy on our part, and we take responsibility,’ a spokesman said.”

Fact-check: Pressure from Trump led to stepped up NATO members’ defense spending (half true)

In an interview on October 15, Secretary of State Rex Tillerson said, “The president early on called upon NATO member countries to step up their contributions — step up their commitment to NATO, modernize their own forces… He’s been very clear, and as a result of that countries have stepped up contributions toward their own defense.”

PolitiFact reporter Allison Graves found that “25 NATO allies plan to increase spending in real terms in 2017.” And “according to NATO, over the last 3 years, European allies and Canada spent almost $46 billion more on defense, meaning increases in spending have occurred before Trump’s presidency. Experts said it’s possible that Trump’s pressure has contributed to the continuation of the upward trend, but Tillerson’s explanation glazes over the other factors that have led to increases, including the conflict in the Ukraine in 2014.”

Follow us @tvnewsarchive, and subscribe to our weekly newsletter here.

Fifth Annual Aaron Swartz Day & International Hackathon

19 oktober 2017 - 3:51am
Aaron Swartz Weekend

by Lisa Rein, Cofounder and Coordinator, Aaron Swartz Day

In memory of Aaron Swartz, whose social, technical, and political insights still touch us daily, Lisa Rein, in partnership with the Internet Archive, will be hosting a weekend of events on Saturday, November 4th and Sunday, November 5th. Friends, collaborators, and hackers can participate in a two-day Hackathon and Aaron Swartz Day Evening Reception.

Schedule of events held at the Internet Archive:

Saturday, November 4th, from 10 am – 6 pm and Sunday, November 5th, from 11am – 5pm — Participate in the hackathon, which will focus on SecureDrop, the whistleblower submission system originally created by Aaron just before he passed away, and other projects inspired by Aaron’s work.

Saturday night, November 4th, from 6:00pm – 9:30pm — Celebrate and remember Aaron, and also the grand tradition of working hard to make the world a better place, at the Aaron Swartz Day Evening Celebration:
Reception: 6:00pm – 7:00pm – Come mingle with the speakers and enjoy nectar, wine & tasty nibbles.

Migrate your way upstairs: 7:10-7:30pm – Finish your nibbles and wine at the reception, exchange contact info, and make your way upstairs to grab a seat to watch the speakers, which will begin promptly at 7:30 pm – a half hour earlier than usual, because we have so many amazing speakers this year.

Speakers 7:30-9:30 pm (Break 8:15-8:30pm)

  • Chelsea Manning (Network Security Expert, Former Intelligence Analyst)
  • Lisa Rein (Chelsea Manning’s Archivist, Co-founder Creative Commons, Co-founder Aaron Swartz Day)
  • Daniel Rigmaiden (Transparency Advocate)
  • Barrett Brown (Journalist, Activist, Founder of the Pursuance Project) (via SKYPE)
  • Jason Leopold (Senior Investigative Reporter, Buzzfeed News)
  • Jennifer Helsby (Lead Developer, SecureDrop, Freedom of the Press Foundation)
  • Cindy Cohn (Executive Director, Electronic Frontier Foundation)
  • Gabriella Coleman (Hacker Anthropologist, Author, Researcher, Educator)
  • Caroline Sinders (Designer/Researcher, Wikimedia Foundation, Creative Dissent Fellow, YBCA)
  • Brewster Kahle (Co-founder and Digital Librarian, Internet Archive, Co-founder Aaron Swartz Day)
  • Steve Phillips (Project Manager, Pursuance)
  • Mek Karpeles (Citizen of the World, Internet Archive)
  • Brenton Cheng (Senior Engineer, Open Library, Internet Archive)
GET TICKETS HERE

Saturday, November 4th, 2017
10:00 am Hackathon
6:00 pm Reception
7:30 pm Program

Sunday, November 5th, 2017
11:00 am Hackathon

Internet Archive
300 Funston Ave.
San Francisco, CA 94118
For more information, contact:
lisa@lisarein.com
https://www.aaronswartzday.org

The 20th Century Time Machine

13 oktober 2017 - 4:24am

by Nancy Watzman & Katie Dahl

Jason Scott

With the turn of a dial, some flashing lights, and the requisite puff of fog, emcees Tracey Jaquith, TV architect, and Jason Scott, free range archivist, cranked up the Internet Archive 20th Century Time Machine on stage before a packed house at the Internet Archive’s annual party on October 11.

Eureka! The cardboard contraption worked! The year was 1912, and out stepped Alexis Rossi, director of Media and Access, her hat adorned with a 78 rpm record.

1912

D’Anna Alexander (center) with her mother (right) and grandmother (left).

“Close your eyes and listen,” Rossi asked the audience. And then, out of the speakers floated the scratchy sounds of Billy Murray singing “Low Bridge, Everybody Down” written by Thomas S. Allen. From 1898 to the 1950s, some three million recordings of about three minutes each were made on 78rpm discs. But these discs are now brittle, the music stored on them precious. The Internet Archive is working with partners on the Great 78 Project to store these recordings digitally, so that we and future generations can enjoy them and reflect on our music history. New collections include the Tina Argumedo and Lucrecia Hug 78rpm Collection of dance music collected in Argentina in the mid-1930s.

1927

Next to emerge from the Time Machine was David Leonard, president of the Boston Public Library, which was the first free, municipal library founded in the United States. The mission was and remains bold: make knowledge available to everyone. Knowledge shouldn’t be hidden behind paywalls, restricted to the wealthy but rather should operate under the principle of open access as public good, he explained. Leonard announced that the Boston Public Library would join the Internet Archive’s Great 78 Project, by authorizing the transfer of 200,000 individual 78s to digitize for the 78rpm collection, “a collection that otherwise would remain in storage unavailable to anyone.”

David Leonard and Brewster Kahle

Brewster Kahle, founder and digital librarian of the Internet Archive, then came through the time machine to present the Internet Archive’s Internet Archive Hero Award to Leonard. “I am inspired every time I go through the doors,” said Kahle of the library, noting that the Boston Public Library was the first to digitize not just a presidential library, of John Quincy Adams, but also modern books.  Leonard was presented with a tablet imprinted with the Boston Public Library homepage.

1942

Kahle then set the Time Machine to 1942 to explain another new Internet Archive initiative: liberating books published between 1923 to 1941. Working with Elizabeth Townsend Gard, a copyright scholar at Tulane University, the Internet Archive is liberating these books under a little known, and perhaps never used, provision of US copyright law, Section 108h, which allows libraries to scan and make available materials published 1923 to 1941 if they are not being actively sold. The name of the new collection: the Sony Bono Memorial Collection, named for the now deceased congressman and former representative who led the passage of the Copyright Term Extension Act of 1998, which had the effect of locking up most books from the public domain back to 1923.

One of these books includes “Your Life,” a tome written by Kahle’s grandfather, Douglas E. Lurton, a “guide to a desirable living.” “I have one copy of this book and two sons. According to the law, I can’t make one copy and give it to the other son. But now it’s available,” Kahle explained.

1944

Sab Masada

The Time Machine cranked to 1944, out came Rick Prelinger, Internet Archive board president, archivist, and filmmaker. Prelinger introduced a new addition to the Internet Archive’s film collection: long-forgotten footage of an Arkansas Japanese internment camp from 1944.  As the film played on the screen, Prelinger welcomed Sab Masada, 87, who lived at this very camp as a 12 year old.

Masada talked about his experience at the camp and why it is important for people today to remember it, “Since the election I’ve heard echoes of what I heard in 1942. Using fear of terrorism to target the Muslims and people south of the border.”

1972

Next to speak was Wendy Hanamura, the director of partnerships. Hanamura explained how as a sixth grader she discovered a book at the library, Executive Order 9066, published in 1972, which told the tale of Japanese internment camps during World War II.

“Before I was an internet archivist, I was a daughter and granddaughter of American citizens who were locked up behind barbed wires,” said Hanamura. That one book – now out of print – helped her understand what had happened to her family.

Inspired by making it to the semi-final round of the MacArthur 100&Change initiative with a proposal that provides libraries and learners with free digital access to four million books, the Internet Archive is forging ahead with plans despite not winning the $100 million grant. Among the books the Internet Archive is making available: Executive Order 9066.

1985

The year display turned to 1985, Jason Scott reappeared on stage, explaining his role as a software curator. New this year to the Internet Archive are collections of early Apple software, he explained, with browser emulation allowing the user to experience just what it was like to fire up a Macintosh computer back in its hay day. This includes a collection of the then wildly popular “HyperCards,” a programmatic tool that enabled users to create programs that linked materials in creative ways, before the rise of the world wide web.

2017

After Vinay Goelthis tour through the 20th century, the Time Machine was set to present day, 2017. Mark Graham, director of the Wayback Machine and Vinay Goel, senior data engineer, stepped on stage. Back in 1996, when the Wayback Machine began archiving websites on the still new world wide web, the entire thing amounted to 2.2 terabytes of data. Now the Wayback Machine contains 20 petabytes. Graham explained how the Wayback Machine is preserving tweets, government websites, and other materials that could otherwise vanish. One example: this report from The Rachel Maddow Show, which aired on December 16, 2016, about Michael Flynn, then slated to become national security advisor. Flynn deleted a tweet he had made linking to a falsified story about Hillary Clinton, but the Internet Archive saved it through the Wayback Machine.

Goel took the microphone to announce new improvements to Wayback Machine 2.0 search. Now it’s possible to search for keywords, such as “climate change,” and find not just web pages from a particular time period mentioning these words, but also different format types — such as images, pdfs, or yes, even an old Internet Archive favorite, gifs from the now-defunct GeoCities–including snow globes!

Thanks to all who came out to celebrate with the Internet Archive staff and volunteers, or watched online. Please join our efforts to provide Universal Access to All Knowledge, whatever century it is from.

 

Syncing Catalogs with thousands of Libraries in 120 Countries through OCLC

12 oktober 2017 - 6:37pm

We are pleased to announce that the Internet Archive and OCLC have agreed to synchronize the metadata describing our digital books with OCLC’s WorldCat. WorldCat is a union catalog that itemizes the collections of thousands of libraries in more than 120 countries that participate in the OCLC global cooperative.

What does this mean for readers?
When the synchronization work is complete, library patrons will be able to discover the Internet Archive’s collection of 2.5 million digitized monographs through the libraries around the world that use OCLC’s bibliographic services. Readers searching for a particular volume will know that a digital version of the book exists in our collection. With just one click, readers will be taken to archive.org to examine and possibly borrow the digital version of that book. In turn, readers who find a digital book at archive.org will be able, with one click, to discover the nearest library where they can borrow the hard copy.

There are additional benefits: in the process of the synchronization, OCLC databases will be enriched with records describing books that may not yet be represented in WorldCat.

“This work strengthens the Archive’s connection to the library community around the world. It advances our goal of universal access by making our collections much more widely discoverable. It will benefit library users around the globe by giving them the opportunity to borrow digital books that might not otherwise be available to them,” said Brewster Kahle, Founder and Digital Librarian of the Internet Archive. “We’re glad to partner with OCLC to make this possible and look forward to other opportunities this synchronization will present.”

“OCLC is always looking for opportunities to work with partners who share goals and objectives that can benefit libraries and library users,” said Chip Nilges, OCLC Vice President, Business Development. “We’re excited to be working with Internet Archive, and to make this valuable content discoverable through WorldCat. This partnership will add value to WorldCat, expand the collections of member libraries, and extend the reach of Internet Archive content to library users everywhere.”

We believe this partnership will be a win-win-win for libraries and for learners around the globe.

Better discovery, richer metadata, more books borrowed and read.

Boston Public Library’s Sound Archives Coming to the Internet Archive for Preservation & Public Access

11 oktober 2017 - 8:06pm

Today, the Boston Public Library announced the transfer of significant holdings from its Sound Archives Collection to the Internet Archive, which will digitize, preserve and make these recordings accessible to the public. The Boston Public Library (BPL) sound collection includes hundreds of thousands of audio recordings in a variety of historical formats, including wax cylinders, 78 rpms, and LPs. The recordings span many genres, including classical, pop, rock, jazz, and opera – from 78s produced in the early 1900s to LPs from the 1980s. These recordings have never been circulated and were in storage for several decades, uncataloged and inaccessible to the public. By collaborating with the Internet Archive, Boston Public Libraries audio collection can be heard by new audiences of scholars, researchers and music lovers worldwide.

Some of the thousands of 20th century recordings in the Boston Public Library’s Sound Archives Collection.

“Through this innovative collaboration, the Internet Archive will bring significant portions of these sound archives online and to life in a way that we couldn’t do alone, and we are thrilled to have this historic collection curated and cared for by our longtime partners for all to enjoy going forward,” said David Leonard, President of the Boston Public Library.

78 rpm recordings from the Boston Public Library Sound Archive Collection

Listening to the 78 rpm recording of “Please Pass the Biscuits, Pappy,” by W. Lee O’Daniel and his Hillbilly Boys from the BPL Sound Archive, what do you hear? Internet Archive Founder, Brewster Kahle, hears part of a soundscape of America in 1938.  That’s why he believes Boston Public Library’s transfer is so significant.

Boston Public Library is once again leading in providing public access to their holdings. Their Sound Archive Collection includes hillbilly music, early brass bands and accordion recordings from the turn of the last century, offering an authentic audio portrait of how America sounded a century ago.” says Brewster Kahle, Internet Archive’s Digital Librarian. “Every time I walk through Boston Public Library’s doors, I’m inspired to read what is carved above it: ‘Free to All.’”

The 78 rpm records from the BPL’s Sound Archives Collection fit into the Internet Archive’s larger initiative called The Great 78 Project. This community effort seeks to digitize all the 78 rpm records ever produced, supporting their  preservationresearch and discovery. From about 1898 to the 1950s, an estimated 3 million sides were published on 78 rpm discs. While commercially viable recordings will have been restored or remastered onto LP’s or CD, there is significant research value in the remaining artifacts which include often rare 78rpm recordings.

“The simple fact of the matter is most audiovisual recordings will be lost,” says George Blood, an internationally renowned expert on audio preservation. “These 78s are disappearing right and left. It is important that we do a good job preserving what we can get to, because there won’t be a second chance.”

George Blood LP’s 4-arm turntable used for 78 digitization.

The Internet Archive is working with George Blood LP, and the IA’s Music Curator, Bob George of the Archive of Contemporary Music  to discover, transfer, digitize, catalog and preserve these often fragile discs.  This team has already digitized more than 35,000 sides.  The BPL collection joins more than 20 collections  already transferred to the Internet Archive for physical and digital preservation and access. Curated by many volunteer collectors, these collections will be preserved for future generations.

The Internet Archive began working with the Boston Public Library in 2007, and our scanning center is housed at its Central Library in Copley Square.  There, as a digital-partner-in-residence, the Internet Archive is scanning bound materials for Boston Public Library, including the John Adams Library, one of the BPL’s Collections of Distinction.

To honor Boston Public Library’s long legacy and pioneering role in making its valuable holdings available to an ever wider public online, we will be awarding the 2017 Internet Archive Hero Award to David Leonard, the President of BPL, at a public celebration tonight at the Internet Archive headquarters in San Francisco.

 

 

Books from 1923 to 1941 Now Liberated!

10 oktober 2017 - 9:18pm

The Internet Archive is now leveraging a little known, and maybe never used, provision of US copyright law, Section 108h, which allows libraries to scan and make available materials published 1923 to 1941 if they are not being actively sold. Elizabeth Townsend Gard, a copyright scholar at Tulane University calls this “Library Public Domain.”  She and her students helped bring the first scanned books of this era available online in a collection named for the author of the bill making this necessary: The Sonny Bono Memorial Collection. Thousands more books will be added in the near future as we automate. We hope this will encourage libraries that have been reticent to scan beyond 1923 to start mass scanning their books and other works, at least up to 1942.

While good news, it is too bad it is necessary to use this provision.

Trend of Maximum U.S. General Copyright Term by Tom W Bell

If the Founding Fathers had their way, almost all works from the 20th century would be public domain by now (14-year copyright term, renewable once if you took extra actions).

Some corporations saw adding works to the public domain to be a problem, and when Sonny Bono got elected to the House of Representatives, representing part of Los Angeles, he helped push through a law extending copyright’s duration another 20 years to keep things locked-up back to 1923.  This has been called the Mickey Mouse Protection Act due to one of the motivators behind the law, but it was also a result of Europe extending copyright terms an additional twenty years first. If not for this law, works from 1923 and beyond would have been in the public domain decades ago.

Lawrence Lessig

Lawrence Lessig

Creative Commons founder, Larry Lessig fought the new law in court as unreasonable, unneeded, and ridiculous.  In support of Lessig’s fight, the Internet Archive made an Internet bookmobile to celebrate what could be done with the public domain. We drove the bookmobile across the country to the Supreme Court to make books during the hearing of the case. Alas, we lost.

Internet Archive Bookmobile in front of
Carnegie Library in Pittsburgh: “Free to the People”

But there is an exemption from this extension of copyright, but only for libraries and only for works that are not actively for sale — we can scan them and make them available. Professor Townsend Gard had two legal interns work with the Internet Archive last summer to find how we can automate finding appropriate scanned books that could be liberated, and hand-vetted the first books for the collection. Professor Townsend Gard has just released an in-depth paper giving libraries guidance as to how to implement Section 108(h) based on her work with the Archive and other libraries. Together, we have called them “Last Twenty” Collections, as libraries and archives can copy and distribute to the general public qualified works in the last twenty years of their copyright.  

Today we announce the “Sonny Bono Memorial Collection” containing the first books to be liberated. Anyone can download, read, and enjoy these works that have been long out of print. We will add another 10,000 books and other works in the near future. “Working with the Internet Archive has allowed us to do the work to make this part of the law usable,” reflected Professor Townsend Gard. “Hopefully, this will be the first of many “Last Twenty” Collections around the country.”

Now it is the chance for libraries and citizens who have been reticent to scan works beyond 1923, to push forward to 1941, and the Internet Archive will host them. “I’ve always said that the silver lining of the unfortunate Eldred v. Ashcroft decision was the response from people to do something, to actively begin to limit the power of the copyright monopoly through action that promoted open access and CC licensing,” says Carrie Russell, Director of ALA’s Program of Public Access to Information. “As a result, the academy and the general public has rediscovered the value of the public domain. The Last Twenty project joins the Internet Archive, the HathiTrust copyright review project, and the Creative Commons in amassing our public domain to further new scholarship, creativity, and learning.”

We thank and congratulate Team Durationator and Professor Townsend Gard for all the hard work that went into making this new collection possible. Professor Townsend Gard, along with her husband, Dr. Ron Gard, have started a company, Limited Times, to assist libraries, archives, and museums implementing Section 108(h), “Last Twenty” collections, and other aspects of the copyright law.

Prof. Elizabeth
Townsend Gard

Tomi Aina
Law Student

Stan Sater
Law Student

 

 

 

 

 

 

Hundreds of thousands of books can now be liberated. Let’s bring the 20th century to 21st-century citizens. Everyone, rev your cameras!

Internet Archive’s Annual Bash this Wednesday! — Get your tickets now before we run out!

10 oktober 2017 - 12:32am

Limited tickets left for 20th Century Time Machine — the Internet Archive’s Annual Bash – happening this Wednesday at the Internet Archive from 5pm-9:30pm. In case you missed it, here’s our original announcement.

Tickets start at $15 here.

Once tickets sell out, you’ll have the opportunity to join the waitlist. We’ll release tickets as spaces free up and let you know via email.

We’d love to celebrate with you!

History is happening, and we’re not just watching

10 oktober 2017 - 12:02am
  1. Which recent hurricane got the least amount of attention from TV news broadcasters?
    1. Irma
    2. Maria
    3. Harvey
  2. Thomas Jefferson said, “Government that governs least governs best.”
    1. True
    2. False
  3. Mitch McConnell shows up most on which cable TV news channel?
    1. CNN
    2. Fox News
    3. MSNBC

Answers at end of post.

The Internet Archive’s TV News Archive, our constantly growing online, free library of TV news broadcasts, contains 1.4 million shows, some dating back to 2009, searchable by closed captioning. History is happening, and we preserve how broadcast news filters it to us, the audience, whether it’s through CNN’s Jake Tapper, Fox’s Bill O’Reilly, MSNBC’s Rachel Maddow or others. This archive becomes a rich resource for journalists, academics, and the general public to explore the biases embedded in news coverage and to hold public officials accountable.

Last October we wrote how the Internet Archive’s TV News Archive was “hacking the election,” then 13 days away. In the year since, we’ve been applying our experience using machine learning to track political ads and TV news coverage in the 2016 elections to experiment with new collaborations and tools to create more ways to analyze the news.

Helping fact-checkers

Since we launched our Trump Archive in January 2017, and followed in August with the four congressional leaders, Democrat and Republican, as well as key executive branch figures, we’ve collected some 4,534 hours of curated programming and more than 1,300 fact-checks of material on subjects ranging from immigration to the environment to elections.

 

The 1,340 fact-checks–and counting–represent a subset of the work of partners FactCheck.orgPolitiFact and The Washington Post’s Fact Checker, as we link only to fact-checks that correspond to statements that appear on TV news. Most of the fact-checks–524–come from PolitiFact; 492 are by FactCheck.org, and 324 from The Washington Post’s Fact Checker.

We’re also proud to be part of the Duke Reporter’s Lab’s new Tech & Check collaborative, where we’re working with journalists and computer scientists to develop ways to automate parts of the fact-checking process.  For example, we’re creating processes to help identify important factual claims within TV news broadcasts to help guide fact-checkers where to concentrate their efforts. The initiative received $1.2 million from the John S. and James L. Knight Foundation, the Facebook Journalism Project and the Craig Newmark Foundation.

See the TrumpUS Congress, and executive branch archives and collected fact-checks.

TV News Kitchen

We’re collaborating with data scientists, private companies and nonprofit organizations, journalists, and others to cook up new experiments available in our TV News Kitchen, providing new ways to analyze TV news content and understand ourselves.

Dan Schultz, our senior creative technologist, worked with the start-up Matroid to develop Face-o-Matic, which tracks faces of selected high level elected officials on major TV cable news channels: CNN, Fox News, MSNBC, and BBC News. The underlying data are available for download here. Unlike caption-based searches, Face-o-Matic uses facial recognition algorithms to recognize individuals on TV news screens. It is sensitive enough to catch this tiny, dark image of House Minority Leader Nancy Pelosi, D., Calif., within a graphic, and this quick flash of Senate Minority Leader Chuck Schumer, D., N.Y., and Senate Majority Leader Mitch McConnell, R., Ky.

The work of TV Architect Tracey Jaquith, our Third Eye project scans the lower thirds of TV screens, using OCR, or optical character recognition, to turn these fleeting missives into downloadable data ripe for analysis. Launched in September 2017, Third Eye tracks BBC News, CNN, Fox News, and MSNBC, and collected more than four million chyrons captured in just over two weeks, and counting.

Download Third Eye data. API and TSV options available.

Follow Third Eye on Twitter.

Vox news reporter Alvin Chang used the Third Eye chyron data to report how Fox News paid less attention to Hurricane Maria’s destruction in Puerto Rico than it did to Hurricanes Irma and Harvey, which battered Florida and Texas. Chang’s work followed a similar piece by Dhrumil Mehta for FiveThirtyEight, which used Television Explorer, a tool developed by data scientist Kalev Leetaru to search and visualize closed captioning on the TV News Archive.

 

FiveThirtyEight used TV News Archive captions to create this look at how cable networks covered recent hurricanes.

CNN’s Brian Stelter followed up with a similar analysis on “Reliable Sources” October 1.

We’re also working with academics who are using our tools to unlock new insights. For example, Schultz and Jaquith are working with Bryce Dietrich at the University of Iowa to apply the Duplitron, the audiofingerprinting tool that fueled our political ad airing data, to analyze floor speeches of members of Congress. The study identifies which floor speeches were aired on cable news programs and explores the reasons why those particular clips were selected for airing. A draft of the paper was presented in the 2017 Polinfomatics Workshop in Seattle and will begin review for publication in the coming months.

What’s next? Our plans include making more than a million hours of TV news available to researchers from both private and public institutions via a digital public library branch of the Internet Archive’s TV News Archive. These branches would be housed in computing environments, where networked computers provide the processing power needed to analyze large amounts of data. Researchers will be able to conduct their own experiments using machine learning to extract metadata from TV news. Such metadata could include, for example, speaker identification–a way to identify not just when a speaker appears on a screen, but when she or he is talking. Metadata generated through these experiments would then be used to enrich the TV News Archive, so that any member of the public could do increasingly sophisticated searches.

Going global

We live in an interdependent world, but we often lack understanding about how other cultures perceive us. Collecting global TV could open a new window for journalists and researchers seeking to understand how political and policy messages are reported and spread across the globe. The same tools we’ve developed to track political ads, faces, chyrons, and captions can help us put news coverage from around the globe into perspective.

We’re beginning work to expand our TV collection to include more channels from around the globe. We’ve added the BBC and recently began collecting Deutsche Welle from Germany and the English-language Al Jazeera. We’re talking to potential partners and developing strategy about where it’s important to collect TV and how we can do so efficiently.

History is happening, but we’re not just watching. We’re collecting, making it accessible, and working with others to find new ways to understand it. Stay tuned. Email us at tvnews@archive.org. Follow us @tvnewsarchive, and subscribe to our weekly newsletter here.

Answer Key

  1. b. (See: “The Media Really Has Neglected Puerto Rico,” FiveThirtyEight.
  2. b. False. (See: Vice President Mike Pence statement and linked PolitiFact fact-check.)
  3. c. MSNBC. (See: Face-O-Matic blog post.)

Members of the TV News Archive team: Roger Macdonald, director; Robin Chin, Katie Dahl, Tracey Jaquith, Dan Schultz, and Nancy Watzman.

TV News Record: 1,340 fact checks collected and counting

5 oktober 2017 - 2:07pm

A weekly round up on what’s happening and what we’re seeing at the TV News Archive by Katie Dahl and Nancy Watzman. Additional research by Robin Chin.

In an era when social media algorithms skew what people see online, the Internet Archive TV News Archive’s collections of on-the-record statements by top political figures serves  as a powerful model for how preservation can provide a deep resource for who really said what, when, and where.

Since we launched our Trump Archive in January 2017, and followed in August with the four congressional leaders, Democrat and Republican, as well as key executive branch figures, we’ve collected some 4,534 hours of curated programming and more than 1,300 fact-checks of material on subjects ranging from immigration to the environment to elections.

The 1,340 fact-checks–and counting–represent a subset of the work of partners FactCheck.org, PolitiFact and The Washington Post’s Fact Checker, as we link only to fact-checks that correspond to statements that appear on TV news. Most of the fact-checks–524–come from PolitiFact; 492 are by FactCheck.org, and 324 from The Washington Post’s Fact Checker.

As a library, we’re dedicated to providing a record – sometimes literally, as in the case of 78s! – that can help researchers, journalists, and the public find trustworthy sources for our collective history. These clip collections, along with fact-checks, now largely hand-curated, provide a quick way to find public statements made by elected officials.

See the Trump, US Congress, and executive branch archives and collected fact-checks.

The big picture

Given his position at the helm of the government, it is not surprising that Trump garners most of the fact-checking attention.  Three out of four, or 1008 of the fact-checks, focus on Trump’s statements. Another 192 relate to the four congressional leaders: Senate Majority Leader Mitch McConnell, R., Ky.; Senate Minority Leader Chuck Schumer, D., N.Y.; House Speaker Paul Ryan, R., Wis.; and House Minority Leader Nancy Pelosi, D., Calif. We’ve also logged 140 fact-checks related to key administration figures such as Sean Spicer, Jeff Sessions, and Mike Pence.

pie chart

The topics

The topics covered by fact-checkers run the gamut of national and global policy issues, history, and everything in between. For example, the debate on tax reform is grounded with fact-checks of the historical and global context posited by the president. Fact-checkers have also examined his aides’ claims on the impact of the current reform proposal on the wealthy and on the deficit. They’ve also followed the claims made by House Speaker Paul Ryan, R., Wis., the leading GOP policy voice on tax reform.

Another large set of fact-checks cover health care, going back as far as this claim made in 2010 by Pelosi about job creation under healthcare reform (PolitiFact rated it “Half True.”) The most recent example is the Graham-Cassidy bill that aimed to repeal much of Obamacare. One of the most sharply contested debates about that legislation was whether or not it would require coverage of people with pre-existing conditions. Fact-checkers parsed the he-said he-said debate as it unfolded on TV news, for example examining dueling claims by Schumer and Trump.

Browse or download  fact-checked TV clips by topic

The old stuff

The collection of Trump fact checks include a few dating back to 2011, long before his successful presidential campaign. Here he is at the CPAC conference that year claiming no one remembered now-former President Barack Obama from school, part of his campaign to question Obama’s citizenship. (PolitiFact rated: “Pants on Fire!”) And here he is with what FactCheck.org called a “100 percent wrong” claim about the Egyptian people voting to overturn a treaty with Israel.

This fact-check of McConnell dates back to 2009, when PolitiFact rated “false” his claim of how much federal spending occurred under Obama’s watch: “In just one month, the Democrats have spent more than President Bush spent in seven years on the war in Iraq, the war in Afghanistan and Hurricane Katrina combined.”

Meanwhile, this 2010 statement by Schumer, rated “mostly false” by PolitiFact, asserted that the U.S. Supreme Court “decided to overrule the 100-year-old ban on corporate expenditures.” The ban on giving directly to candidates is still in place; however,  corporations are free to spend unlimited funds on elections providing they do so separate from a candidate’s official campaign.

The repetition

Twenty-four million people will be forced off their health insurance, young farmers have to sell the farm to pay estate tax, NATO members owe the United States money, millions of women turn to Planned Parenthood for mammograms, and sanctuary cities lead to higher crime. These are all examples of claims found to be inaccurate or misleading, but that continued or continue to be repeated by public officials.

The unexpected

Whether you lean one political direction or another, there are always surprises from the fact-checkers that can keep all our assumptions in check. For example, if you’re opposed to building a wall on the southern border to keep people from crossing into the U.S., you might guess Trump’s claim that people use catapults to toss drugs over current walls is an exaggeration. In fact, that statement was rated “mostly true” by PolitiFact. Or if you’re conservative, you might be surprised to learn an often repeated quote ascribed to Thomas Jefferson, in this case by Vice President Mike Pence, is in fact falsely attributed to him.

How to find

If you’re looking for the most recent TV news statements with fact-checks, you can see the latest offerings on the TV Archive’s homepage by scrolling down.

screen grab of place on tv homepageYou can review whole speeches, scanning for just the fact-checked claims by looking for the fact-check icon  on a program timeline. For example, starting in the Trump Archive, you can choose a speech or interview and see if and how many of the statements were checked by reporters.

screen grab of timeline w icons

You can also find the fact-checks in the growing table, also available to download, which includes details on the official making the claim, the topic(s) covered, the url for the corresponding TV news clip, and the link to the fact-checking article.

image of fact-checks table

To receive the TV News Archive’s email newsletter, subscribe here.

###

Wayback Machine Playback… now with Timestamps!

5 oktober 2017 - 6:16am

The Wayback Machine has an exciting new feature: it can list the dates and times, the Timestamps, of all page elements compared to the date and time of the base URL of a page.  This means that users can see, for instance, that an image displayed on a page was captured X days before the URL of the page or Y hours after it.  Timestamps are available via the “About this capture” link on the right side of the Wayback Toolbar.  Here is an example:

The Timestamps list includes the URLs and date and time difference compared to the current page for the following page elements: images, scripts, CSS and frames. Elements are presented in a descending order. If your cursor over a list element on the page, it will be highlighted and if you click on it you will be shown a playback of just that element.

Under the hood

Web pages are usually a composition of multiple elements such as images, scripts and CSS. The Wayback Machine tries to archive and playback web pages in the best possible manner, including all their original elements.  Each web page element has its own URL and Timestamp, indicating the exact date and time it was archived. Page elements may have similar Timestamps but they could also vary significantly for various reasons which depend on the web crawling process. By using the new Timestamps feature, users can easily learn the archive date and time for each element of a page.

Why this is important

The Wayback Machine is increasingly used in critical procedures such as legal evidence or political debate material.  It is important that what is presented is clear and transparent, even in the light of a web that was not designed to be archived. One of the ways a web archive could be confusing is via anachronisms, displaying content from different dates and times than the user expects. For example, when a archived page is played back, it could include some images from the current web, making it look like the image came from the past when it did not. We implemented Timestamps to provide users with more context about, and in turn hopefully greater confidence in, what they are seeing.

TV News Record: Wayback Machine saves deleted prez tweets

29 september 2017 - 4:17pm

A weekly round up on what’s happening and what we’re seeing at the TV News Archive by Katie Dahl and Nancy Watzman. Additional research by Robin Chin.

In this week’s TV News Archive roundup, we explain how presidential tweets are forever, show how different TV cable news networks summarized NFL protests via Third Eye chyron data, and present FiveThirtyEight’s analysis of hurricane coverage (hint: Puerto Rico got less attention.)

Wayback Machine preserved deleted prez tweets; PolitiFact fact-checks legality of prez tweet deletions (murky)

The Internet Archive’s Wayback Machine has preserved President Donald Trump’s deleted tweets praising failed GOP Alabama U.S. Senate candidate Luther Strange following his defeat by Roy Moore on September 26. So does the Pulitzer Prize-winning investigative journalism site ProPublica, through its Politwoops project.

Kudos @propublica saving @realDonaldTrump deleted tweets. Also @internetarchive on Waybackhttps://t.co/FMkJNZ4xNShttps://t.co/xAPRTzCCb0 pic.twitter.com/zXkHzDvkLP

— TV News Archive (@TVNewsArchive) September 27, 2017

The story of Trump’s deleted tweets about Strange was reported far and wide, including this segment on MSNBC’s “Deadline Whitehouse” that aired on September 27.

In a fact-check on the legality of a president deleting tweets, linked in the TV News Archive clip above, John Kruzel, reports for PolitiFact that the law is murky but still being fleshed out:

Experts were split over how much enforcement power courts have in the arena of presidential record-keeping, though most seemed to agree the president has the upper hand.

“One of the problems with the Presidential Records Act is that it does not have a lot of teeth,” said Douglas Cox, a professor at the City University of New York School of Law. “The courts have held that the president has wide and almost unreviewable discretion to interpret the Presidential Records Act.”

That said, many of the experts we spoke to are closely monitoring how the court responds to the litigation around Trump administration record-keeping.

He also provides background on that litigation, a lawsuit brought by Citizens for Responsibility and Ethics in Washington. The case is broadly about requirements for preserving presidential records, and a previous set of deleted presidential tweets is a part of it.

Fact Check: NFL attendance and ratings are way down because people love their country (Mostly false)

Speaking of Trump’s tweets, the president ignited an explosion of coverage with an early morning tweet on Sunday, Sept. 24, ahead of a long day of football games: “NFL attendance and ratings are WAY DOWN. Boring games yes, but many stay away because they love our country.”

Manuela Tobias of PolitiFact rated this claim as “mostly false,” reporting, “Ratings were down 8 percent in 2016, but experts said the drop was modest and in line with general ratings for the sports industry. The NFL remains the most watched televised sports event in the United States.” “As for political motivation, there’s little evidence to suggest people are boycotting the NFL. Most of the professional sports franchises are dealing with declines in popularity.”

How did different cable TV news networks cover the NFL protests?

We first used the Television Explorer tool to see where there was a spike in the use of the word “NFL” near the word “Trump.” It looked like Sunday showed the most use of these words. After a  closer look, we saw MSNBC, Fox News, and CNN all showed highest mentions of these terms around 2 pm Pacific.

Spike at 2 pm (PST) for CNN, MSNBC, and CNN

Then we downloaded data from the new Third Eye project, which turns TV News chyrons into data, filtering for that date and hour. We were able to see how the three cable news networks were summarizing the news at that particular point in time.

At about 2:02, CNN broadcast this chyron: “NFL teams kneel, link arms in defiance of Trump.”

Screen grab of chyron caught by Third Eye from 2:02 pm 9/24/17 on CNN

Fox News chose the following, also seen below tweeted from one of the Third Eye twitter bots: “Some NFL owners criticize Trump’s statements on player protests, link arms with players”

FOXNEWS 2:02pm SOME NFL OWNERS CRITICIZE TRUMP’S STATEMENTS ‘. . ON PLAYER PROTESTS, LINK ARMS WITH PLAYERS i
FOX N EWS ALERT

— The Third Eye (@tvThirdEyeF) September 24, 2017

Meanwhile, MSNBC chose a different message:  “Taking a knee: NFL teams send a message.”

Screen grab of chyron caught by Third Eye from 2:02 pm 9/24/17 on MSNBC

About eight minutes later, all three cable channels were still reporting on the NFL protests:

Puerto Rico’s hurricane Maria got less media attention than hurricanes Harvey & Irma

Writing for FiveThirtyEight.com, Dhrumil Mehta demonstrated that both online news sites and TV news broadcasters paid less attention to Puerto Rico’s hurricane Marie than to hurricanes Harvey and Irma, which hit mainland U.S. primarily in Texas and Florida. Mehta used TV News Archive data via Television Explorer, as well as data from Media Cloud on online news coverage, to help make his case:

While Puerto Rico suffers after Hurricane Maria, much of the U.S. media (FiveThirtyEight not excepted) has been occupied with other things: a health care bill that failed to pass, a primary election in Alabama, and a spat between the president and sports players, just to name a few. Last Sunday alone, after President Trump’s tweets about the NFL, the phrase “national anthem” was said in more sentences on TV news than “Puerto Rico” and “Hurricane Maria” combined.

To receive the TV News Archive’s email newsletter, subscribe here.

 

 

Pagina's