As a long time book fiend and collector of all things in paper the cascade of Amazon’s latest features intrigued me enough to dig into it and explore what it was all about- I’ll start with the whole SIPs thing, but it doesn’t end there as Amazon goes for the grand slam on social media optimization…
From Amazon:
Amazon.com’s Statistically Improbable Phrases, or “SIPs”, are the most distinctive phrases in the text of books in the Search Inside!™ program. To identify SIPs, our computers scan the text of all books in the Search Inside! program. If they find a phrase that occurs a large number of times in a particular book relative to all Search Inside! books, that phrase is a SIP in that book.
SIPs are not necessarily improbable within a particular book, but they are improbable relative to all books in Search Inside!. For example, most SIPs for a book on taxes are tax related. But because we display SIPs in order of their improbability score, the first SIPs will be on tax topics that this book mentions more often than other tax books. For works of fiction, SIPs tend to be distinctive word combinations that often hint at important plot elements.
Click on a SIP to view a list of books in which the phrase occurs. You can also view a list of references to the phrase in each book
Contrast that with the Wikipedi SIP entry just for fun:
Sips (Structural Insulating Panels) are a composite material used in building. They consist of a sandwich of two layers of structural board with an insulating layer of foam in between. The board is usually OSB (Oriented Strand Board) and the foam either polystyrene foam or polyurethane foam.
If you click on a SIP, you see all other works that also contain those phrases. Remember they are not just restricted to the book itself, but to other related books- ranked by their improbability score. This is not to be confused with the Adam’s Infinite Improbability Drive- we are not approaching normalcy–not even close. Let’s take a leap for fans…and neophytes:
For non Adam’s readers The Infinite Improbability Drive is a fictional faster-than-light drive in Douglas Adams’ The Hitchhiker’s Guide to the Galaxy series of books (get the ultimate guide with the books) which I highly recommend- you can watch the movie, but it really would be cheating, and you would miss great lines like :
And then, one Thursday, nearly two thousand years after one man had been nailed to a tree for saying how great it would be to be nice to people for a change, one girl sitting on her own in a small cafe in Rickmansworth suddenly realized what it was that had been going wrong all this time, and she finally knew how the world could be made a good and happy place. This time it was right, it would work, and no one would have to get nailed to anything.
I don’t want to spoil the plot so let’s get back to the drive (hijacked by Zaphod)- the most prominent usage of the drive is in the starship Heart of Gold. It is based on a particular perception of quantum theory: a subatomic particle is most likely to be in a particular place, such as near the nucleus of an atom, but there is a fantastically small probability of it being found a long way away, for example close to a distant star. Thus, a body could travel from place to place without passing through the intervening space (or hyperspace), if you had sufficient control of probability.
So you can note how it could work as a semi-spoiler in fiction (perhaps not in Adams’ books), especially since Amazon is also integrated in CAPS or capitalized phrases that occur frequently in the text. I note I am not sure what kind of testing they are running e.g. bivariate or multivariate, etc. But I assume that is what is going on. On to CAPs
Capitalized Phrases (CAPs): Capitalized Phrases, or “CAPs”, are people, places, events, or important topics mentioned frequently in a book. Along with our Statistically Improbable Phrases, Capitalized Phrases give you a quick glimpse into a book’s contents.
Click on a Capitalized Phrase to view a list of books in which the phrase occurs. You can also view a list of references to the Capitalized Phrase in each book.
For an example, assuming it hasn’t changed check out Get Back in the Box: Innovation from the Inside Out, by Douglas Rushkoff. (I also highly recommend reading Media Virus! by Rushkoff too…which you can pick up cheap in the after market. Don’t get turned off by the negative reviews. The text was written in the mid-90’s and remarkably visionary- to put it into context BoingBoing was just a zine back then- and after reviewing refreshing myself from the history depths I found an interesting aside on stats among aggregators.)
Amazon is also providing a concordance based on “heatmapped tag clouds” on some of their books. This is an interesting use of a concordance because, themes and keywords visually pop-out as a result. I see a number of applications for clouds as meta-data becomes more and more in demand. Meta-data- it will be everything in a couple of years- start collecting yours now.
They are also using readability and complexity analysis and sometimes citations- citations simply being other books a particular book cites. To step back here is an example on the Text Stats.
The Text Stats feature calculates a variety of statistics for each book in the Search Inside!™ program. The Complexity calculations indicate the complexity of the words and sentence structure in the text of a book. A word is considered “complex” if it has three or more syllables.
Neat. Reminds me of work we did on the Spware / Adware EULA Analyzer.
I will add that for some complex might qualify as anything over two syllables…
Are we done yet? No. Plogs. Amazon now offers Plogs for folks. Like your rifle your plog is your own. It does follow along systems theory providing a feedback loop within a feedback loop although one has to wonder if Amazon’s system could be getting too complex?
Your Amazon.com Plog is a personalized web log that appears on your customer home page. Every person’s Plog is different (hence the name) and just like a blog, your Plog is sorted in reverse chronological order. Each post also gives you the opportunity to provide feedback to the sender as to whether you liked the post or not. This feedback loop means your Plog becomes even more relevant and interesting over time. Your Plog will appear if you are logged into our web site and is visible only to you.
Think personalized blog- although most blogs are personal anyway. The play is pretty obvious- authors comment and it gives readersa a great opportunity to learn more about their author of choice and Amazon becomes a more interactive place to visit and manage. Either way Amazon has really innovated this time. Truly grasping SOM- social media optimization - as it applies to books and putting it to practical use.
Did I mention SOM- social media optimization? Yes I did, some will call it a “buzz word”, but having put it in practice I know it works. It won’t become a buzzword until some calls it “Hyper-Social Media Optimization”.
If it is SOM and all Web 2.0′ish Porter where are the damn tags?
Forgive me…after some exploring I did indeed turn up tags, or bookmarklets, so Amazon didn’t forget the tags…and I will load up on a few at the end just for finesse or overkill depending on your viewpoint- and eventually get a tag cloud generator installed.
Sample tags for Rushkoff’s text:
business (2)
rushkoff (2)
books - business (1)
business web (1)
innovation (1)
open source (1)
opensource (1)
to_read (1)
Some Tags for this Post:
Technorati Tags:
SIPs, Amazon, Rushkoff, business, web2.0, socialbookmarks, tags, tag clouds, tagging, som, social media optimization, wayneporter.com,
Furl Tags:
SIPs, Amazon, Rushkoff, business, web2.0, socialbookmarks, tags, tag clouds, tagging, som, social media optimization, wayneporter.com, innovation, marketing, amazon.com, AMZN
Del.icio.us Tags:
SIPs, Amazon, Rushkoff, business, Douglas Adams, reviews, web2.0, socialbookmarks, tags, tag clouds, tagging, som, social media optimization, wayneporter.com, innovation, marketing, amazon.com, AMZN
Blogging E Commerce Language & Sound Net Lifestyle Online Education Reading & Literature Recreation Social Bookmarking Popularity: 5% [?]
Share This