A massive language research database responsible for bringing words such as "podcast" and "celebutante" to the pages of the Oxford dictionaries has officially hit a total of 1 billion words, researchers said Wednesday.
Drawing on sources such as weblogs, chatrooms, newspapers, magazines and fiction, the Oxford English Corpus spots emerging trends in language usage to help guide lexicographers when composing the most recent editions of dictionaries.
Oxford University Press publishes the Oxford English Dictionary, considered the most comprehensive dictionary of the language, which in its most recent August 2005 edition added words such as "supersize," "wiki" and "retail politics" to its pages.
Oxford University Press lexicographer Catherine Soanes said the database is not a collection of 1 billion different words, but of sentences and other examples of usage and spelling.
"The corpus is purely 21st century English," said Judy Pearsall, publishing manager of English dictionaries. "You're looking at current English and seeing what's happening right now. That's language at the cutting edge."
As hybrid words such as "geek-chic," "inner-child" or "gabfest" increase in usage, Pearsall said part of the research project's goal is to identify words that have lasting power.
"English gets really creative, really fun. What we're putting in dictionaries is words that will stick around," she said.
I guess it's quantity not quality that counts...

GJ