{"id":286366,"date":"2011-08-24T16:49:34","date_gmt":"2011-08-24T16:49:34","guid":{"rendered":"http:\/\/www.wdev.rochester.edu\/College\/translation\/threepercent-dev\/2011\/08\/24\/discovering-books-booklamp-vs-goodreads\/"},"modified":"2018-04-16T16:17:01","modified_gmt":"2018-04-16T16:17:01","slug":"discovering-books-booklamp-vs-goodreads","status":"publish","type":"post","link":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/2011\/08\/24\/discovering-books-booklamp-vs-goodreads\/","title":{"rendered":"Discovering Books: Booklamp vs. GoodReads"},"content":{"rendered":"<p>Today&#8217;s <a href=\"http:\/\/publishingperspectives.com\/2011\/08\/is-booklamps-book-genome-project-the-future-of-discovery\/\">Publishing Perspectives feature<\/a> is all about <a href=\"http:\/\/booklamp.org\/\">Booklamp.org,<\/a> a new book discovery site that&#8217;s being referred to either as &#8220;Pandora for Books&#8221; or the &#8220;Book Genome Project.&#8221; <\/p>\n<p>Here&#8217;s a brief description:<\/p>\n<blockquote>\n<p>\u201cOur program breaks a book up into 100 scenes and measures the \u2018DNA\u2019 of each scene, looking for 132 different thematic ingredients, and another 2,000 variables.\u201d A reader can go to the BookLamp site, which was launched in beta last week, and do a keyword search for titles that meet the criteria similar to a title they plug into the site. Pundits have dubbed it the \u201cPandora for Books,\u201d though Stanton prefers the term \u201cBook Genome Project.\u201d<\/p>\n<\/blockquote>\n<blockquote>\n<p>\u201cSay you\u2019re looking for a novel like the The Da Vinci Code. We have found that it contains 18.6% Religion and Religious Institutions, 9.4% Police &amp; Murder Investigation, 8.2% Art and Art Galleries, and 6.7% Secret Societies &amp; Communities, and other elements \u2014 we\u2019ll pull out a book with similar elements, provided it is in our database,\u201d says Stanton. [. . 
.]<\/p>\n<\/blockquote>\n<blockquote>\n<p>But can a computer really accurately assess the content of a book? Stanton thinks so. \u201cOur original models are based on focus groups,\u201d he says. \u201cWe would give them a highly dense scene and a low density scene, for example, and ask them to assess them, which gave us a basis for training the models. Then we looked at books that might exceed the models and tweaked the formulas. In this way, our algorithms are trained like a human being.\u201d<\/p>\n<\/blockquote>\n<blockquote>\n<p>BookLamp quantifies such elements as density, pacing, description, dialogue and motion, in addition to numerous nuanced micro-categories, such as \u201cpistols\/rifles\/weapons\u201d or \u201cexplicit depictions of intimacy\u201d or \u201coffice environments.\u201d<\/p>\n<\/blockquote>\n<p>I&#8217;m totally a sucker for this sort of shit . . . I think it&#8217;s great that people are finally thinking about how readers find books; I think it&#8217;s maybe detrimental to only read books that fit your fiction prejudices. (Of course, what example does the founder of Booklamp use in the interview? The fucking <em>Da Vinci Code.<\/em> Dear god, please make it stop.) <\/p>\n<p>Knowledgeable recommendations used to be the function of booksellers, but since we, as a culture, seem not to need them (or bookstores) at all anymore, there are a number of book sites popping up to fill this void. 
<\/p>\n<p>I&#8217;m sure this is over-simplifying, but there seem to be two major approaches to automated recommendations: the &#8220;similar user&#8221; approach, which is what Last.fm and GoodReads use and is based on the idea that if you like A, B, &amp; C, and a lot of people who also like A, B, &amp; C, also like Q, then you&#8217;ll probably like Q as well; and the &#8220;similar component&#8221; approach, which is what&#8217;s in play with Booklamp and Pandora and uses top-down analysis to recommend books\/music with components similar to books\/music you like. <\/p>\n<p>Personally, I prefer the first approach, and have never really gotten Pandora, nor do I see how my favorite books can be accurately quantified (<em>The Sound and the Fury<\/em> is 25% suicidal tendencies and 25% narrated by a mentally challenged character? Or <em>The Crying of Lot 49<\/em> is 48% paranoia and 88% too cool for school?)<\/p>\n<p>Now that two of the three recommendation sites are at least in their beta phases&#8212;Bookish is still in the works&#8212;it seems sort of worthwhile to check and see what these sites recommend . . . It&#8217;s one thing to talk about the theory and drool over hot catchphrases like &#8220;discovery&#8221; and &#8220;genome,&#8221; but another to find out that no matter what you put in, you&#8217;re told that you should read <em>Twilight.<\/em><\/p>\n<p>I&#8217;m not sure I can convince you of this, but I&#8217;m doing this test live . . . I haven&#8217;t looked up any of these books yet, so I have no idea what I&#8217;ll find. (It&#8217;s like live blogging! Which I believe is now called &#8220;tweeting.&#8221; Anyway.)<\/p>\n<p>So, first up, the book I have tattooed on my arm&#8212;<i>The Crying of Lot 49.<\/i><\/p>\n<blockquote>\n<p>Booklamp: No results. Well that&#8217;s unfortunate . . . 
Skipped right over the paragraph in the <em>Publishing Perspectives<\/em> article about how they currently only have 20,000 Random House books in their database . . . <\/p>\n<\/blockquote>\n<blockquote>\n<p>GoodReads: <em>The Recognitions<\/em> by William Gaddis, <em>The Sot-Weed Factor<\/em> by John Barth, <em>Dog Soldiers<\/em> by Robert Stone, <em>Falconer<\/em> by John Cheever, <em>Call It Sleep<\/em> by Henry Roth.<\/p>\n<\/blockquote>\n<p>OK, not so sure about the last couple, but the GoodReads recommendations are <em>fine.<\/em> Gaddis rocks, but I love <em>JR<\/em> more than <em>The Recognitions<\/em>, and <em>Sot-Weed<\/em> isn&#8217;t even close to my favorite Barth book. Still, not bad. Predictable, but not bad.<\/p>\n<p>Since Booklamp is so limited in scope, I&#8217;m using their &#8220;Author Browse&#8221; function to find a good example . . . And oh, look, they have a listing for <em>The Sound and the Fury<\/em>! My snotty prediction about what the makeup would be was pretty crap . . . Instead of confused narrators and philosophical issues about time, the five most prominent &#8220;StoryDNA&#8221; elements according to Booklamp are: Financial Matters, Family Connections, Domestic Environments, Automobiles &amp; Vehicles, and Nature\/Forests\/Trees. The fuck?? Seriously? This does not bode well . . . 
<\/p>\n<blockquote>\n<p>Booklamp: <em>Sanctuary<\/em> by William Faulkner, <em>Moon Women<\/em> by Pamela Duncan, <em>Leo and the Lesser Lion<\/em> by Sandra Forrester, <em>Good-bye Marianne<\/em> by Irene Watts, and <em>Telling Lies to Alice<\/em> by Laura Wilson<\/p>\n<\/blockquote>\n<blockquote>\n<p>GoodReads: <em>Appointment in Samarra<\/em> by John O&#8217;Hara, <em>Loving<\/em> by Henry Green, <em>The Death of the Heart<\/em> by Elizabeth Bowen, <em>Under the Net<\/em> by Iris Murdoch, and <em>An American Tragedy<\/em> by Theodore Dreiser<\/p>\n<\/blockquote>\n<p>Although they aren&#8217;t the first authors that come to mind when I think of William Faulkner (I was expecting Flannery O&#8217;Connor), I do love me some Henry Green and Elizabeth Bowen. And to be completely frank, I have no fucking idea what any of the Booklamp recommendations are. Irene Watts? <em>Leo and the Lesser Lion<\/em>?? Maybe I&#8217;m just ignorant and missing out on great fiction . . . <\/p>\n<blockquote>\n<p><em>Leo and the Lesser Lion<\/em>. Listed by the publisher as Juvenile Fiction: A heartwarming family story set during the Depression that reads like a classic. Everyone&#8217;s been down on their luck since the Depression hit. But as long as Mary Bayliss Pettigrew has her beloved older brother, Leo, to pull pranks with, even the hardest times can be fun. Then one day, there\u2019s a terrible accident, and when Bayliss wakes up afterward, she must face the heartbreaking prospect of life without Leo. And that\u2019s when her parents break the news: they\u2019re going to be fostering two homeless little girls, and Bayliss can\u2019t bear the thought of anyone taking Leo\u2019s place. But opening her heart to these weary travelers might just be the key to rebuilding her grieving family.<\/p>\n<\/blockquote>\n<p>Oh, my. Booklamp fail. &#8220;Juvenile Fiction&#8221;?? I&#8217;ll bet $1 million that there&#8217;s not a single stylistically interesting thing about this novel. 
And I&#8217;ll also bet that there&#8217;s no possible way I&#8217;d read this book and be like, &#8220;wow, I&#8217;ve never read a book <em>more<\/em> like <em>Sound and the Fury<\/em> than Sandra Forrester&#8217;s little gem!&#8221; Fuck. And no. <\/p>\n<p>So far Booklamp is coming in a distant second . . . And exposing all the flaws of this top-down recommendation approach. This system really doesn&#8217;t seem to account for writing style, which, in my opinion, is maybe the most important feature of any work of fiction. Sure, it&#8217;s got &#8220;automobiles&#8221; and I do like to read about people who move from one point to another, but if the writing is shitty, there&#8217;s no number of &#8220;automobile\/vehicle&#8221; scenes that will save a novel. And how do you identify the &#8220;genome&#8221; for exciting writing? (Not to bang a dead drum, but this is why booksellers and librarians and actual readers are so goddamn important.)<\/p>\n<p>Let&#8217;s try one more, this time with feeling: <em>Independent People<\/em> by Halldor Laxness.<\/p>\n<blockquote>\n<p>Booklamp: the first four are all Laxness books, which seems a bit of a cheat, so I&#8217;ll skip those . . . <em>The Writer and the World<\/em> by V.S. Naipaul, <em>Dead Souls<\/em> by Nikolai Gogol, <em>Agnes Grey<\/em> by Anne Bronte, <em>A Country Doctor<\/em> by Sarah Orne Jewett, and <em>Walt Whitman&#8217;s Secret<\/em> by George Fetherling<\/p>\n<\/blockquote>\n<blockquote>\n<p>GoodReads: <em>Njal&#8217;s Saga<\/em> by Anonymous, <em>Angels of the Universe<\/em> by Einar Mar Gudmundsson, <em>The Pets<\/em> by Bragi Olafsson, <em>The Blue Fox<\/em> by Sjon, and <em>Growth of the Soil<\/em> by Knut Hamsun.<\/p>\n<\/blockquote>\n<p>Interesting to end with this book . . . Booklamp&#8217;s recommendations are a lot less country-specific than GoodReads&#8217;. But on the whole, I think I like the GoodReads recommendations better, especially since they include one of our books. 
<\/p>\n<p>There&#8217;s no conclusion to this post except that I find this all very interesting in theory, and flawed in execution. I&#8217;m sure both of these sites (and Bookish) will get better and better as time goes on and data accumulates, and will play a larger role in how books find an audience as booksellers continue to decrease in number . . . <\/p>\n<div class=\"ad_banner\">\n<a href=\"http:\/\/www.readthisnext.org\/\"><img decoding=\"async\" src=\"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/images\/762.jpg\"  \/><\/a>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Today&#8217;s Publishing Perspectives feature is all about Booklamp.org, a new book discovery site that&#8217;s being referred to either as &#8220;Pandora for Books&#8221; or the &#8220;Book Genome Project.&#8221; Here&#8217;s a brief description: \u201cOur program breaks a book up into 100 scenes and measures the \u2018DNA\u2019 of each scene, looking for 132 different thematic ingredients, and another 
[&hellip;]<\/p>\n","protected":false},"author":292,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[67486],"tags":[41926,40356,41916,6826,24256,3826],"class_list":["post-286366","post","type-post","status-publish","format-standard","hentry","category-articles","tag-book-discovery","tag-bookish","tag-booklamp","tag-goodreads","tag-publishing-perspectives","tag-rant"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/posts\/286366","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/users\/292"}],"replies":[{"embeddable":true,"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/comments?post=286366"}],"version-history":[{"count":1,"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/posts\/286366\/revisions"}],"predecessor-version":[{"id":343186,"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/posts\/286366\/revisions\/343186"}],"wp:attachment":[{"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/media?parent=286366"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/categories?post=286366"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.rochester.edu\/College\/translation\/threepercent\/wp-json\/wp\/v2\/tags?post=286366"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}
]}}