http://en.wikipedia.org/wiki/Statistically_Improbable_Phrases
Statistically Improbable Phrases
From Wikipedia, the free encyclopedia
Statistically Improbable Phrases, or SIPs, are a system developed by Amazon.com that compares all of the books indexed in its Search Inside! program and finds the phrases in each book that are least likely to occur in any other indexed book.[1] The system is used to identify the most nearly unique portions of a book for use as a summary or as keywords.
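The comparison described above can be illustrated with a small sketch. The article does not describe Amazon's actual method, so everything here is an assumption made for illustration: the function names `ngrams` and `improbable_phrases`, the use of two-word phrases, the add-one smoothing, and the toy `books` data are all hypothetical. The sketch ranks the phrases of one book by how much more frequent they are in that book than in all the other books combined.

```python
from collections import Counter


def ngrams(text, n=2):
    """Split text on whitespace and return its lowercase n-word phrases."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def improbable_phrases(books, title, n=2, top=3):
    """Rank phrases of one book by in-book frequency over out-of-book frequency.

    Hypothetical scoring, not Amazon's published algorithm.
    """
    target = Counter(ngrams(books[title], n))
    others = Counter()
    for other_title, text in books.items():
        if other_title != title:
            others.update(ngrams(text, n))

    target_total = sum(target.values())
    others_total = sum(others.values())
    vocab = len(set(target) | set(others))  # distinct phrases, used for add-one smoothing

    scores = {}
    for phrase, count in target.items():
        p_in = count / target_total
        # Add-one smoothing keeps phrases absent from the other books from dividing by zero.
        p_out = (others[phrase] + 1) / (others_total + vocab)
        scores[phrase] = p_in / p_out

    # Highest ratio first; ties are broken alphabetically so the output is stable.
    return sorted(scores, key=lambda p: (-scores[p], p))[:top]


# Toy corpus, invented for this example.
books = {
    "whales": "the white whale swam and the white whale dived",
    "gardens": "the red rose grew and the red tulip grew",
    "ships": "the old ship sailed and the old crew sang",
}

print(improbable_phrases(books, "whales", top=2))  # ['the white', 'white whale']
```

A phrase such as "and the", which appears in every toy book, receives the lowest score, while phrases found only in one book rise to the top. A real system would need proper tokenization and a much larger corpus.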
SIP is also used more generally to refer to a search string likely to generate meaningful results from a search engine; that is, a string whose chance of occurring in a desirable result is much greater than its chance of occurring in a non-desirable result.
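The condition in this more general sense can be read as a likelihood ratio: how often the string occurs in desirable results divided by how often it occurs in non-desirable ones. The following is only a rough sketch under stated assumptions; the function name `search_string_ratio`, the substring matching, the add-one smoothing, and the sample documents are all hypothetical and do not come from any search engine's API.

```python
def search_string_ratio(query, desirable, undesirable):
    """Smoothed ratio of the query's hit rate in desirable vs. non-desirable documents."""
    q = query.lower()
    hits_good = sum(q in doc.lower() for doc in desirable)
    hits_bad = sum(q in doc.lower() for doc in undesirable)
    # Add-one smoothing so that a query with no hits on one side does not divide by zero.
    p_good = (hits_good + 1) / (len(desirable) + 2)
    p_bad = (hits_bad + 1) / (len(undesirable) + 2)
    return p_good / p_bad


# Sample documents, invented for this example.
desirable = [
    "amazon computes statistically improbable phrases",
    "statistically improbable phrases summarize a book",
]
undesirable = [
    "the weather is statistically average",
    "a book about weather",
]

print(search_string_ratio("statistically improbable", desirable, undesirable))  # 3.0
print(search_string_ratio("book", desirable, undesirable))  # 1.0
```

A ratio well above 1 marks a string that fits the description above; a ratio near 1 means the string does not help separate desirable results from the rest.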
Googlewhack: a two-word search query that returns exactly one webpage, as indexed by Google
[1] "What are Statistically Improbable Phrases?". Amazon.com. Retrieved on 2007-12-18.
This page was last modified on 18 December 2007, at 12:47.
Revision history of Statistically Improbable Phrases
(cur) = difference from current version, (last) = difference from preceding version, m = minor edit, → = section edit, ← = automatic edit summary
(cur) (last) 12:47, 18 December 2007 D.brodale (Talk contribs) m (2,326 bytes) (→See also: website->webpage (Googlewhack))
(cur) (last) 12:40, 18 December 2007 D.brodale (Talk contribs) m (2,326 bytes) (→Referencess: typo of my own doing)
(cur) (last) 12:32, 18 December 2007 D.brodale (Talk contribs) m (2,327 bytes) (noun-verb agreement. is there a rationale why this article isn't classed as "Statistically Improbable Phrase"?)
(cur) (last) 12:29, 18 December 2007 D.brodale (Talk contribs) m (2,319 bytes) (add reflist, minor reformat of existing SeeAlso)
(cur) (last) 12:28, 18 December 2007 D.brodale (Talk contribs) (2,289 bytes) (EL->cite (Amazon.com help document on SIP))
(cur) (last) 12:21, 18 December 2007 D.brodale (Talk contribs) (2,280 bytes) (interesting extension, but unsourced. trimmed EL (see prior edit) did not appear, upon perusal of results returned, to constitute verifiable reference, so flag as in need of evidentiary support.)
(cur) (last) 12:18, 18 December 2007 D.brodale (Talk contribs) (2,255 bytes) (→External links: rm baffling EL that seems a matter internal to Wikipedia in terms of shoring up claim to usage beyond Amazon. just plain odd.)
(cur) (last) 13:17, 28 November 2007 SmackBot (Talk contribs) m (2,577 bytes) (Standard headings &/or gen fixes. using AWB)
(cur) (last) 10:55, 6 September 2007 Kevinalewis (Talk contribs) (2,579 bytes) (recat)
(cur) (last) 22:03, 24 August 2007 Kfogel (Talk contribs) (2,573 bytes) (Mention that "SIP" has a more general usage now.)
(cur) (last) 18:09, 6 August 2007 Metaeducation (Talk contribs) (1,983 bytes) (See also Googlewhack, has similar character of finding statistically improbable pairings)
(cur) (last) 23:56, 26 February 2007 Selket (Talk contribs) (Deleted stale merge request with no consensus)
(cur) (last) 22:14, 29 January 2007 Vlad (Talk contribs) m (Not a stub anymore IMHO)
(cur) (last) 22:01, 15 January 2007 BorgQueen (Talk contribs) m (Reverted edits by 66.209.89.177 (talk) to last version by MPS)
(cur) (last) 21:58, 15 January 2007 66.209.89.177 (Talk)
(cur) (last) 01:16, 18 November 2006 MPS (Talk contribs) (not OR)
(cur) (last) 06:07, 7 November 2006 Iamunknown (Talk contribs) (update merge tag and + {{Original research}})
(cur) (last) 07:44, 5 October 2006 Alphachimpbot (Talk contribs) m (BOT - updating merge tag)
(cur) (last) 22:11, 1 October 2006 Alaibot (Talk contribs) m (Robot: Automated text replacement (-{{[Rr]etail-stub}} +{{US-retail-stub}}))
(cur) (last) 21:03, 28 September 2006 Alphachimpbot (Talk contribs) m (BOT - updating merge tags to appear in Category:Merge by month)
(cur) (last) 17:57, 15 August 2006 Beland (Talk contribs) m (→External links: markup fix)
(cur) (last) 02:59, 9 August 2006 Tree Biting Conspiracy (Talk contribs) (+template)
(cur) (last) 03:43, 24 May 2006 JonHarder (Talk contribs) (Sharpen category.)
(cur) (last) 16:51, 23 May 2006 Dp462090 (Talk contribs) m (Limited spellcheck, unicode, and minor fixes using AWB using AWB)
(cur) (last) 06:00, 28 April 2006 Lukobe (Talk contribs)
(cur) (last) 08:58, 20 April 2006 Notinasnaid (Talk contribs) (Help end unique abuse now!)
(cur) (last) 07:25, 28 March 2006 Keycard (Talk contribs) (categorising stub)
(cur) (last) 17:21, 27 March 2006 Keycard (Talk contribs) (rv - it only works with multiple texts)
(cur) (last) 15:18, 27 March 2006 Crzrussian (Talk contribs)
(cur) (last) 15:12, 27 March 2006 Crzrussian (Talk contribs) (trimmed away the inept examples.)
(cur) (last) 18:33, 22 March 2006 BorgQueen (Talk contribs) m (stub +)
(cur) (last) 18:32, 22 March 2006 BorgQueen (Talk contribs) m (copyedit; fmt)
(cur) (last) 02:05, 24 November 2005 Bryan Derksen (Talk contribs) m (minor copyedit)
(cur) (last) 01:38, 22 November 2005 Trevor MacInnis (Talk contribs) m (Reverted edits by 70.112.86.228 to last version by 152.163.100.7)
(cur) (last) 01:37, 22 November 2005 70.112.86.228 (Talk)
(cur) (last) 08:42, 6 November 2005 152.163.100.7 (Talk)
(cur) (last) 13:59, 5 October 2005 Phil Boswell (Talk contribs) m (→External links: annotating link properly)
(cur) (last) 09:17, 29 September 2005 WindFish (Talk contribs) m
(cur) (last) 15:14, 21 September 2005 Thepcnerd (Talk contribs)
Talk:Statistically Improbable Phrases
I would imagine that if this kind of analysis were to be used on blogs or essays it could make for a wonderful new matchmaking tool. —The preceding unsigned comment was added by 209.144.249.197 (talk • contribs) 03:00, 30 May 2006 (UTC)
As SIP is (potentially) a generic term and not confined to Amazon.com, I think it should have its own page. Geneffects 21:05, 28 August 2006 (UTC)
Wow. Another useless entry.
Nice enough examples, but really does a terrible job of actually explaining the concept and the practice. 134.241.224.121 20:31, 29 March 2007 (UTC)
Are there any related statistical models or methods out there? Some references or connections would spice this up nicely. 15:35, 5 April 2007
I'd like to have the SIPs algorithm to use as a tool, something like this http://www.onelook.com/reverse-dictionary.shtml reverse dictionary (24.68.170.164 19:05, 15 May 2007 (UTC))
Regarding the "citation needed" tag for the phrase's more general usage (which stemmed from this edit): you can get 400 hits from search://"statistically improbable phrase" -amazon/ as of January 2008. I mostly hear "SIP" used in chat rooms and other places (without reference to Amazon); I'm not sure what the best way to provide a Wikipedia-quality citation is, but am pretty sure it's in general use now, for some reasonable definition of "general use". --Karl Fogel 15:45, 25 January 2008 (UTC)
Eh, looking at the "Google hit" metric in detail (the results returned above), it doesn't seem remarkable or reliable. Application of the phrase (SIPs) in contexts other than Amazon appears to mean whatever the person typing the term wants it to mean. I don't see a consistent meaning as stated in the article (noted above as flagged for supporting citation). I also didn't note anything resembling a reliable source commenting on the usage stated. Perhaps it was overlooked? D. Brodale (talk) 19:00, 25 January 2008 (UTC)