Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register
SECURITY ADVISER  

MySpace password exploit: Crunching the numbers (and letters)

Analysis of 34,000 real examples backs up theories that it's not hard to guess most end-users' passwords

By Roger A. Grimes
November 17, 2006
 

I didn't intend to discuss passwords again this week, but a unique opportunity presented itself. A major phishing attack occurred at MySpace, and I got over 34,000 real passwords to analyze for character frequency.

Free IT resource

Virtualization Insights from Top Experts - Learn how virtualization gets real!

Sponsored by Dell

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

The phishing attack occurred because hackers were able to use MySpace’s HTML home page abilities to craft a malicious overlay page. That meant when a MySpace user thought they were logging in to a friend’s MySpace home page or profile, they were often sending their log-on names and passwords to hackers, who collected them on other compromised Web servers. It is estimated that the hackers collected more than 100,000 log-on names and password combinations before the phishing attack was noticed.

The hackers erred in making the collected passwords available for anyone to see and download. There were at least five different collection points. I picked up the password files from two of the locations; they came in at over 2GB.

After removing tons of garbage (some from coding errors, some from helpful people trying to crash the hacker’s collection files), I came up with more than 34,000 different log-on account/password combinations. Being able to collect and analyze such a large number of passwords from a wide range of users doesn’t usually happen when you’re on the white-hat side of things.

I collected the passwords into MS Access and MS Excel databases and then analyzed them for word and character distribution. Here are some of my findings from my initial queries:

*As expected, English vowels are by far the most frequent occurring password symbols (E, 48 percent; A, 46 percent; I, 34 percent; O, 33 percent). Other high-ranking letters included R (35 percent), S (32 percent), N (31 percent), L (28 percent), T (25 percent), C (21 percent), and M (21 percent).

*The letters, B, D, G, H, P, U, and Y appeared in 10 to 20 percent of the passwords.

*As expected, the letters Q (1 percent), X (3 percent), and Z (3 percent) were not popular.

*Numbers were used in well over half the passwords. The number 1 appeared 45 percent of the time, followed by the numbers 2 (22 percent), 0 (16 percent), and 3 (15 percent). Numbers 4 through 9 appeared roughly 9 to 11 percent of the time.

*As I’ve written many times, including in my last column, numbers are most often placed at the end of the password when used. For example, when the number 1 appeared, it only showed up 7 percent of the time as the first character, and only 15 percent of the time as one of the first four characters in the password.

*MySpace accounts don’t require complex passwords, so capital letters and other keyboard symbols -- such as ~, !, &, @, #, and so on -- were not present most of the time. The exclamation point was the most commonly used non-alphanumeric character at almost 3 percent, followed by the period symbol at 1.6 percent.

*Almost 1 percent of users had the word "password" as, or as part of, their password. Not real clever.

*Words, colors, years, names, sports, hobbies, and music groups were very popular. FYI, your girlfriend or boyfriend’s name isn’t that uncommon in most cases. I, too, luv Brandi, Bob, or Joe.

*The color red was twice as likely to be used in a password as blue. No other colors came close in popularity percentage-wise. I guess "chartreuse" is a relatively safe password choice.

*Other popular words include: angel, baby, boy, girl, big, monkey, me, and the.

*Cuss words were very popular. Boy, there’s a lot of aggression out there.

*I was surprised about how many Christian-sounding -- for example, "Ilovejesus" -- log-on names were associated with the worst cuss words.

*Names of sports -- golf, football, soccer, and so on -- were as popular as professional sports teams and college team nicknames.

*Certain specific letter combinations -- aa, ee, oo, dr, ea, lo, la, and so on -- appeared in a given password about 3 percent of the time.

One last note: The password list contained several e-mail/log-on account names from popular OS and software vendors. Although we can’t be assured that the passwords used on the exploited site were the same as the employee’s company password, I’m sure some are matches.

Remember this and learn from it: An exploited Web site that's completely unrelated to your company could still put your company at risk. Remind all employees not to use their company passwords on noncompany Web sites, if at all.

After going through all the files, I revealed no startling password distribution data. All of it backed up my previously published conjecture and studies (such as Perfect Passwords by Mark Burnett and The Great Password Debates by Dr. Jesper Johansson) by other friends.

And in case you’re wondering, hard-working network and security experts spent many hours notifying the ISPs and affected companies about their compromised users' passwords. Of course, I’m willing to bet that a moderate percentage of those contacted will not change their password because they will think the warning notice from their ISP is a phishing message. So they will delete it without responding or changing their password. That’s the world we live in at the moment.





 


 
InfoWorld Test Center Contributing Editor Roger A. Grimes is a Foundstone Ultimate Hacking instructor/consultant teaching Windows, Linux, Unix, and Solaris security. He is also the author of several books, including Malicious Mobile Code: Virus Protection for Windows.

  More of Roger A. Grimes' column

Newsletter Check out all of our free newsletters!
Enter e-mail address:




 

TOP NEWS:


»  Update: HP in talks to buy EDS for up to $13 billion
Deal would strengthen HP's competitive position against IBM, but still would leave it about $10 billion short of IBM's global services revenue

»  SOA Software buys LogicLibrary
Combination links governance automation, repository technologies

»  Sun to clarify JavaFX open-source plan later this year
Sun had promised that JavaFX would be open source, but a FAQ on Sun's site indicates that only parts of the RIA development family will be open source

»  Microsoft readies service packs for dev tools
Improvements to Visual Studio 2008 include database and Office 2007 Ribbon support

»  Cisco's TelePresence gets personal
The high-definition virtual meeting system will be available at a less expensive entry price for midsized businesses later this year

»  Developers' role shifting from apps to platforms
Untrained workers are moving into app dev space, pushing career developers into the platform space, a Sun engineer noted at JavaOne




Virtualization: A Step by Step Approach to Success
Your virtual machines can be up and running in a matter of minutes. HP and Citrix have integrated XenServer with HP ProLiant servers and management tools, powered by hardware-assisted Intel Virtualization Technology to enable high- performance, cost-savings solutions for server consolidation and disaster recovery. Sponsor: HP

»  Click here to view this Webcast
  Storage is big, and getting bigger
The only certainty is that your requirement for storage will never be satisfied. While you clean out space and authorize POs, you might consider another alternative: outsourcing. The best way to deal with storage might be to let someone else deal with it. Sponsored by SGI

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 
SEE ALSO
• Phishing attack targets MySpace users


FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS  IT EXEC-CONNECT   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist