Tuesday, March 8, 2011
Sneaked vs. Snuck: What Ngram Tells Us
By Betty Azar
Author, Azar Grammar Series
I have just discovered an online tool that I think will be fun and useful for all of us who are fascinated by English language usage. It is Google Labs Ngram Viewer, released by Google in December, 2010.
The Ngram Viewer graphs usage frequency from 1800 to 2000, based on the corpus of millions of books that Google has thus far scanned.
The first word I looked up was snuck. Through my years of writing textbooks, I debated whether to include snuck in an advanced-level reference chart of irregular verbs. I had been taught in school that snuck was nonstandard, not used in “educated language usage.” As the years have gone by, I’ve observed that more and more “educated” writers and speakers seem to use snuck freely.
In the latest edition of Understanding and Using English Grammar, my co-writer (Stacy Hagen) and I decided to include snuck as well as sneaked after consulting the Longman corpus and a corpus researcher. Ngram confirms what we had observed; the graphs show that usage of snuck greatly increased in frequency from the mid-1990s to 2000, while sneaked (though still more common than snuck) had a spike in the 1940s, but otherwise has stayed relatively steady. In other words, just as it had seemed to me, more and more people have been using snuck more and more often. (Does your spellcheck allow snuck or mark it as an error?)
I read about this site in the New York Times (Sunday, February 27, 2011) in the On Language column, written by Ben Zimmer (and formerly, for many years, by William Safire). In the article, Zimmer says that Google’s site “…allows anyone to plumb the history of language patterns in the corpus.” It is easy to use and available to everyone who can connect to the internet.
If you go to the site and enter the words war, peace, you will readily see a visual illustration of how usage frequency reflects cultural concerns and events.
Here are some of the other things my maiden voyage into Ngram discovered:
- whom is much on the decline (no surprise there!)
- gotta is much on the ascent (ditto)
- the (the most frequently used word in English) has stayed steady; in every 100 written words, 5.5 of them will be the
- passenger pigeon goes up and down like a disturbed heart monitor until the extinction of the species, then levels out
- no problem is skyrocketing, while you’re welcome is flat-lining
- the name Betty reached its height around 1940 (I was born in 1941), and has been, with a few exceptions, in decline since
There’s perhaps no new news here, but it certainly is fun and instructive to see how language usage reflects our understandings and beliefs about our society, our history, and our times. For teachers, it is a tool to inform us of the actual usage frequency of structures, words, and topics we include in our classrooms and materials.
I hope you enjoy the site as much as I do.