Download the Ranked Wikipedia Lists
Last update: December 2022- Wikipedia: RankedWiki.zip
- Wiktionary: RankedWiktionary.zip
- Combined: RankedWikiWikt.zip
- FamousNames.txt
The structure of the files is simple: it looks like
Olav V of Norway@100 Archimedes@100 Port Phillip@100 Optimus Prime@100 Merv@100 French people@100 Three's Company@100 ...
Don't bother trying to open it in Excel; it's too big. Instead, just run some combination of grep and sed on it to find the entries you're looking for. (Windows users can get grep and sed with the excellent Cygwin utility.)
Since this data comes from Wikipedia and Wiktionary, it is distributed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. In any case, you are free to share and remix the work as long as you give attribution.
Example searches:
# Find the top 10 entries of the form ??E?L?? > grep -iP '^..e.l..@' RankedWiki.txt -m 10 Heerlen@99 Shellac@98 Gremlin@98 Siedlce@98 Ixelles@98 EHealth@96 Apelles@96 Feedlot@95 Peebles@95 Kremlin@95 # Find the top 10 entries of the form ??E?L?? in "crossword mode" > sed 's/[^A-Za-z0-9@]//g' RankedWiki.txt | grep -iP '^..e.l..@' -m 10 Heerlen@99 SheilaE@98 Shellac@98 Gremlin@98 Siedlce@98 Ixelles@98 ThePlay@97 TheBlob@97 TheBled@96 EHealth@96 # Find all people with first name "Ben" and score 100 > grep -iP '^ben \w+@100' RankedWiki.txt Ben Harper@100 Ben Bernanke@100 Ben Folds@100 Ben Hogan@100 Ben Jonson@100 Ben Roethlisberger@100 Ben Affleck@100 Ben Stiller@100 # Find the top 10 entries of 15 letters or less with a hidden "LOL" in crossword mode > sed 's/[^A-Za-z0-9@]//g' RankedWiki.txt | grep -iP '.lol.+@' -m 50 | grep -iP '^.{1,15}@' -m 10 BristolOldVic@99 Philology@98 SpecialOlympics@98 MarcelloLippi@98 Malolos@98 MickeyLolich@98 Vexillology@98 WillOldham@97 RunLolaRun@97 AlOliver@97