Skip to Content

Extra! Extra! app may be scraping news museum's feed of front pages

TUAW came across the US$3.99 app Extra! Extra! when developer Finbarr Brady solicited a review. Extra! Extra! bills itself as an app that will supply you with the daily front pages of more than 800 newspapers from around the globe.

This sounded suspiciously similar to one of the features on the Newseum's website. The Newseum, located in Washington, D.C., is a museum that documents news media history. Each morning, more than 800 newspapers from around the world send their front pages as high-quality PDFs to the museum's gallery, which features these pages for educational purposes.

In Extra! Extra!, you can select a newspaper either from a list or a map view, and the app downloads the PDF to your iPhone, iPod touch or iPad. You can then either mark it as a favorite, e-mail it to someone, or visit the paper's site -- though several of the links I checked were incorrect.

I scanned the list of papers provided by Extra! Extra! and found that it closely mirrored the Newseum's list. The FAQ section on the app site claims that it is up to the individual newspaper to decide whether or not it is included with the Extra! Extra! app.

Full disclosure: I'm a designer with The Patriot-News just outside of Harrisburg, PA. Part of my duties when I, or my co-workers, design the front page is to send a high-quality PDF to the Newseum. I checked with our executive editor this morning, who confirmed that we only send those PDFs to Newseum and not any other organization. The Patriot-News is one of the newspapers made available through Extra! Extra!

Read on for more...

When I first downloaded the app and checked The Patriot-News feed, it was still showing the front page from the previous day, which mirrored the Newseum entry -- which usually isn't updated until around 8:30 a.m. When I redownloaded it a couple hours later through Extra! Extra!, it reflected the updated front page as shown through Newseum.

Brady says the newspaper fronts he downloads are publicly accessible. "Basically I am using freely available PDF's from the newspaper websites themselves," he said in an e-mail. "These are public URL's such as http://www.independent.ie/independent.ie/editorial/todaysPaper/todayspaper.pdf."

A visit to the Independent's website found no public link to their front page, but links to their paid e-edition. However, a Google search does confirm the link Brady provided.

However, this isn't the case for a good number of the papers, and it's most likely that Brady is scraping the Newseum feed. The app claimed that today's issue of the Corpus Christi Caller-Times wasn't available. That's because it hadn't been uploaded to the Newseum for the day.

A check of the Newseum revealed no paper from Corpus Christi for the day. The Caller-Times is one of the newspapers that makes its front-page PDFs accessible from its site and today's issue is available there. If Brady was using the newspaper's public feed, the PDF would be available in the app; if he's scraping the Newseum site, as we suspect, the missing page would mirror the status there.

Another newspaper, Referans from Turkey, no longer exists after merging with another newspaper. But, it's still part of the Newseum feed and therefore is listed in Extra! Extra!

Newseum makes it clear that it has a special arrangement with newspaper companies to display these front pages. Anyone seeking permission to use a front page must contact the newspaper directly, and U.S. copyright laws apply to both the Newseum and the US-based papers it includes. Extra! Extra!'s developer is based out of Ireland, but the app is being sold in the U.S. app store.

"I store the URLs on the server side in an XML file, so if any paper contacts me and wants to be removed, I can take them out right away," Brady said. "So far, no papers have wanted to do this, as I guess my app is driving more traffic to their website which in turn is good for them. I know from talking to the Irish papers, they love the app and are happy to get more exposure this way."

That's one possibility. Another, more likely scenario is that Extra! Extra! hasn't been noticed by people in a position to know whether the front pages are being used with permission or not.

I reached out to a few designers who work at papers included on Extra! Extra!, and they were pretty shocked by the app. Some papers, like the Express in Washintgon D.C., do make their entire paper downloadable as a single PDF document, but not the front page by itself. The PDF encryption makes it difficult to separate those pages.

"We have asked [Extra! Extra!] not to use the content from our site without each newspaper's specific permission. We have told numerous other sites the same thing. We cannot stop him from doing it without impacting the performance of our site," said Paul Sparrow, senior vice-president of broadcasting with Newseum in an e-mail to Charles Apple, who runs a visual journalism blog for the American Copy Editors Society. Charles was kind enough to contact Newseum on my behalf to get their take on this.

A growing number of newspapers do have their front pages available for download -- however, those pages are still under the copyright of their respective owners, and when an app like Extra! Extra! appears in the store, it calls into question whether Apple is effectively policing copyright violators. Just because the front pages are out there for download -- whether or not it is through the Newseum or the paper's own site -- does not mean that someone had the right to cull these front pages and make a profit off of them.

Edit (6:30 p.m. ET): Commenter Kevin was gracious enough to source out the app's XML file, [Ed. note: the link is now broken, so we've pasted an image below with proof] which revealed that all of the files are coming from the Newseum. Thanks, Kevin!





Categories

iPhone App Store iPad

TUAW came across the US$3.99 app Extra! Extra! when developer Finbarr Brady solicited a review. Extra! Extra! bills itself as an app that...
 

Add a Comment

*0 / 3000 Character Maximum Comment Moderation Enabled. Your comment will appear after it is cleared by an editor.

43 Comments

Filter by:
Pat

I am still amazed at media companies publishing free content, then admonishing those who use it.

The excellent ExtraExtra app collates free information. The value of the app lies in this collation process, not in the *freely available* news content.

October 30 2010 at 9:04 PM Report abuse rate up rate down Reply
Harbiter

Just one of the few breaking the law on the net, LOL.

I've met Finbarr Brady, and he's a top, top guy. Pity he's getting lambasted in this court of public opinion.

Ok so its not technically legal what the app is doing, but the Newseum people seem to like the app, they know about it, and have allowed it..

October 30 2010 at 4:04 PM Report abuse rate up rate down Reply
tvko

Camp Hill, PA! Woo!

October 28 2010 at 6:42 PM Report abuse rate up rate down Reply
Joe

He certainly put the work into designing an interface for Newseum's feeds, but Newseum has policies with regard to the use of them which he appears to have violated. Furthermore, it appears that he gives Newseum no credit, and indeed makes misleading statements that imply that all of the content in the app comes from "freely available URLs." Freely available does not been free. He does not have express permission to utilize this content. In the US, that's a copyright violation. Newseum made the effort to create a specialized partnership with newspapers for their front pages, so really, aside from the code of the app itself, Newseum is the one who did all the work.

October 28 2010 at 10:08 AM Report abuse rate up rate down Reply
Biba

I don't see the problem. Maybe it is illegal, but I dont think it's immoral. The information is available for free anyway.

October 28 2010 at 10:04 AM Report abuse rate up rate down Reply
David Newhouse

Megan's editor in Harrisburg here. When Brady writes "Basically I am using freely available PDF's from the newspaper websites themselves," that is clearly a lie as far as The Patriot-News is concerned because we don't publish a pdf of our front page on our web site. Absolutely the only place it is available online is on Newseum. Caught red-handed!

October 28 2010 at 8:38 AM Report abuse rate up rate down Reply
Liam

A point: the app in question provides an improved user experience for accessing the individual PDF documents linked through from the publicly available XML data. The developer, Brody, has likely expended a not insignificant amount of time & effort in producing such an interface. He is _not_ reselling on the underlying content, but the improved user experience in browsing this content. Effectively, he has written a highly specialised browser to access public data from the internet, and to present it in an easily digestible format, no different to an RSS reader. The fact that Brody is charging for his browser is irrelevant. He is not making a profit from the content, but funding his development of the presentation of that content.

For the record, I'm also an Irish app developer. However, I've never actually met this Brody character in person, but he seems like a rather nice chap.

October 28 2010 at 4:41 AM Report abuse rate up rate down Reply
Steven Levin

So by the "this must be illegal" argument, pulling those same URL's with a web browser must be illegal as well?

I'm thinking... not. And that's all this app is doing, at its heart.

October 28 2010 at 1:39 AM Report abuse rate up rate down Reply
1 reply to Steven Levin's comment
Joe

The app is charging money for someone else's work without their permission. Do you plan on pulling these URLs up on your browser, than charging someone for the right to view them after you've done so?

October 28 2010 at 10:03 AM Report abuse rate up rate down Reply
Carl

Should be simple enough for either Newseum or someone else to report this to Apple and the app would be taken off immediately for copyright issues no?

October 28 2010 at 12:05 AM Report abuse rate up rate down Reply
Dermdaly

Eh..
I've just looked through the xml file - that URL is in it.

October 27 2010 at 9:08 PM Report abuse rate up rate down Reply
3 replies to Dermdaly's comment
Buy an ad here

Tweets

© 2012 AOL Inc. All Rights Reserved.