Rome in a Day Project: 3D reconstruction from Flickr-found photos


A group of technology researchers in Washington state are attempting to construct a three-dimensional model of Rome from photographs found on Flickr -- in one day.

Entering the search term Rome on Flickr returns more than two million photographs. This collection represents an increasingly complete photographic record of the city, capturing every popular site, facade, interior, fountain, sculpture, painting, cafe, and so forth. It also offers us an unprecedented opportunity to richly capture, explore and study the three dimensional shape of the city.

In this project, we consider the problem of reconstructing entire cities from images harvested from the web. Our aim is to build a parallel distributed system that downloads all the images associated with a city, say Rome, from Flickr.com. After downloading, it matches these images to find common points and uses this information to compute the three dimensional structure of the city and the pose of the cameras that captured these images. All this to be done in a day.

Rome in a Day (via Dean Putney)

Discussion

Report this comment

Very cool idea and the preliminary results are already impressive. Although they might get even more images if they search for "St. Peter's Basilica" instead of "St. Peter's Cathedral."

Report this comment
#2 posted by Anonymous, August 4, 2009 9:38 AM

Well that didn't show anything. What photographs?

Report this comment

@ 2

I think all those up-ended pyramids are the locations of all the cameras that took all the tiny photos being combined together.

http://www.digitalartform.com/archives/2004/11/nodal_point_pan_2.html

Report this comment

would be cooler if they didn't restrict themselves to just one day

Report this comment

If those camera views were all frames from a single moving camera then this would be what programs like Boujou and Syntheyes and PF Track do.

They all generate an object/environment point cloud as part of their camera tracking process.

Report this comment

I am somewhat surprised no one has mentioned Microsoft's Photosynth: http://photosynth.net/Search.aspx?keyword=rome&sortby=Best%20Synth

BoingBoing previously covered this technology in a demonstration at TED (complete with Rome flickr photos): http://www.boingboing.net/2008/06/26/top-10-ted-talks.html

Report this comment

The automated aspect of it is related to this -

http://www.graficaobscura.com/merge/index.html

Report this comment

Interesting ... but more interesting (and practical) to me is the work of another group, "Finding Paths through the World's Photos":

http://phototour.cs.washington.edu/findingpaths/

creating a spacio- view (that could also be temporal) using multi-person exposures of 3D objects.

The possibilities for virtual tourism are wide open ... most of us can't haul ourselves everywhere we'd like to. Mob annotations would be great for education. The addition of ambient sound would be even better ... if more people would only make recordings of spaces!!

Report this comment

The innovation here isn't what's being done it's HOW it's being done:

"The key contributions of our work is a new, parallel distributed matching system that can match massive collections of images very quickly "

I assume the presentation layer is similar to Photosynth (they don't say, but it looks exactly like it). As such, this might fill in the gab that the demonstrations of Photosynth several years ago glossed over: that it took yonks to knit the pretty pictures together.


Report this comment

PS: Anyone who wants to see more of the pretty bit, as opposed to the data-crunching part that this research is about, should follow the links provided by #7.

Report this comment
#12 posted by hbl, August 4, 2009 5:04 PM

@7 - The first thing I thought of when I saw the title was Photosynth. I haven't heard about it for a while, but the other software demo'd at the TED that year, Seadragon, cropped up on iPhone. That stuff was slick. I guess it's just waiting for the killer app to come along and make it useful.

For instance, PDF viewers are complete dogmeat. If I could point Seadragon at a directory structure full of PDFs and images, docs, whatever and flip through them and zoom in the same, that'd would be the killer app. Why isn't it baked into Windows 7? All this innovation just flapping in the wind.

Leave a comment

Name:
Anonymous