
InfoWorld
CTO CONNECTION  

RSS bandwidth blues

Making RSS more manageable on the server side takes extra effort

By Chad Dickerson  
July 30, 2004
 

My recent column “RSS growing pains” provoked passionate discussion in the blogosphere, which really picked up speed when the article was linked from Slashdot. Many readers pointed out ways to make RSS more manageable on the server side. I got the sense from various Weblog posts and e-mails that the word isn’t out on these methods, so consider this column my attempt to help.

In a post on his Weblog, Dare Obasanjo suggested two approaches that would help InfoWorld and other RSS feed providers limit bandwidth consumption. The first is HTTP compression, a simple but seldom-used capability of Web servers and browsers, best illustrated by an example. Most Web browsers send a header to Web servers indicating whether they accept compressed content. The header generally looks something like “Accept-Encoding: gzip,” which indicates that the browser can decompress content compressed with gzip. When browsers make requests to an Apache server with the mod_gzip module installed, the server applies gzip compression on the fly as clients request files. The result is a substantially smaller file being sent to the client, thereby reducing bandwidth requirements.
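The negotiation described above can be sketched in a few lines of Python. This is only an illustration of the idea, not how mod_gzip is implemented: the function name `negotiate_body` and the sample feed are hypothetical, and the header parsing is simplified (it ignores quality values such as `gzip;q=0`).

```python
import gzip
from typing import Dict, Tuple


def negotiate_body(body: bytes, accept_encoding: str) -> Tuple[bytes, Dict[str, str]]:
    """Return the payload and extra headers a server might send (hypothetical helper)."""
    # Split "gzip, deflate" into individual tokens; any ";q=..." suffix is dropped.
    offered = {token.split(";")[0].strip().lower() for token in accept_encoding.split(",")}
    if "gzip" in offered:
        # The client said it can decompress gzip, so compress on the fly.
        return gzip.compress(body), {"Content-Encoding": "gzip", "Vary": "Accept-Encoding"}
    # No gzip support advertised: send the bytes unchanged.
    return body, {}


# A made-up feed with repetitive markup, which is typical of RSS and compresses well.
feed = (
    '<rss version="2.0"><channel>'
    + "".join(
        f"<item><title>Story {i}</title><link>http://example.com/{i}</link></item>"
        for i in range(50)
    )
    + "</channel></rss>"
).encode("utf-8")

payload, headers = negotiate_body(feed, "gzip, deflate")
print(len(feed), "bytes uncompressed;", len(payload), "bytes sent;", headers)
```

A client that sends no Accept-Encoding header gets the original bytes back with no extra headers, so older clients keep working.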

The second method Dare proposed is the use of the HTTP conditional GET. A full explanation of this method requires more space than I have here, but a Google search for “HTTP conditional GET” will turn up Charles Miller’s “HTTP Conditional Get for RSS Hackers” page with all the details. To quote from Miller’s page, the logic behind a conditional GET request is simple: “If this document has changed since I last looked at it, give me the new version. If it hasn’t, just tell me it hasn’t changed and give me nothing.” The conditional GET combined with HTTP compression can make a huge performance difference -- most newsreaders won’t download the full RSS feed unless it has changed, and when they do, the file will be compressed.
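The server side of that exchange might look roughly like the following Python sketch. The function name `respond` and the sample values are hypothetical, and the comparison is deliberately simplified: it treats both validators as opaque strings and requires an exact match, whereas a production server would handle lists of entity tags and parse dates.

```python
from typing import Dict, Tuple


def respond(
    request_headers: Dict[str, str],
    body: bytes,
    etag: str,
    last_modified: str,
) -> Tuple[int, Dict[str, str], bytes]:
    """Decide between a full 200 response and an empty 304 (hypothetical helper)."""
    validators = {"ETag": etag, "Last-Modified": last_modified}
    if_none_match = request_headers.get("If-None-Match")
    if_modified_since = request_headers.get("If-Modified-Since")

    if if_none_match is not None:
        # When an entity tag is supplied, it takes precedence over the date.
        unchanged = if_none_match == etag
    elif if_modified_since is not None:
        unchanged = if_modified_since == last_modified
    else:
        # First visit: the client has nothing cached to validate.
        unchanged = False

    if unchanged:
        # "It hasn't changed": status 304 and no body at all.
        return 304, validators, b""
    return 200, validators, body


feed = b"<rss version='2.0'><channel><title>Example</title></channel></rss>"
current_etag = '"v1"'
stamp = "Fri, 30 Jul 2004 12:00:00 GMT"

# First request: no validators, so the full feed is returned with its validators.
status, headers, payload = respond({}, feed, current_etag, stamp)
# Follow-up request: the newsreader echoes the validators it was given.
status_again, _, payload_again = respond(
    {"If-None-Match": headers["ETag"], "If-Modified-Since": headers["Last-Modified"]},
    feed,
    current_etag,
    stamp,
)
print(status, len(payload), status_again, len(payload_again))
```

The newsreader still makes a request each time it polls, but an unchanged feed costs only a set of headers rather than the whole document.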

In my experience, the annoyances in serving RSS have less to do with bandwidth and more to do with supporting regular surges of simultaneous connections from newsreaders. This is not a new problem, and there are a number of ways to solve it. I’ll go from cheapest to most expensive. First, configuring your Web servers to handle a higher number of simultaneous connections is critical. In the Apache world, that means configuring your MaxClients setting as high as your server can realistically support. Alternatively, you could use a high-speed front-end caching server such as the open source Squid to serve RSS clients more quickly. Finally, you can sign up with third-party CDN (content delivery network) services such as Akamai and Speedera to handle some or all of your RSS load.
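To see why a front-end cache helps with surges, consider this minimal Python sketch of a time-limited cache sitting in front of an origin server. It is a toy model of the idea, not a description of how Squid works: the class `FeedCache`, the five-minute default lifetime, and the feed path are all hypothetical, and the code is not thread-safe.

```python
import time
from typing import Callable, Dict, Tuple


class FeedCache:
    """Serve repeated requests from memory until an entry expires (illustrative only)."""

    def __init__(
        self,
        fetch_from_origin: Callable[[str], bytes],
        ttl_seconds: float = 300.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch_from_origin
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, bytes]] = {}
        self.origin_hits = 0  # how many requests actually reached the origin

    def get(self, path: str) -> bytes:
        now = self._clock()
        entry = self._entries.get(path)
        if entry is not None and now - entry[0] < self._ttl:
            # Fresh copy in memory: the origin server never sees this request.
            return entry[1]
        # Missing or stale: fetch once from the origin and remember the result.
        body = self._fetch(path)
        self.origin_hits += 1
        self._entries[path] = (now, body)
        return body


def origin(path: str) -> bytes:
    # Stand-in for the real Web server generating the feed.
    return b"<rss version='2.0'><channel/></rss>"


cache = FeedCache(origin, ttl_seconds=300.0)
for _ in range(1000):  # a surge of newsreaders polling the same feed
    cache.get("/rss/news.xml")
print("requests served:", 1000, "origin hits:", cache.origin_hits)
```

In this model a surge of requests for the same feed costs the origin a single fetch per cache lifetime, which is the effect a caching front end or a CDN aims for.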

Aside from the actions RSS providers can take to mitigate performance issues on their server farms, we can also pull for certain companies to succeed. One of the companies I’m pulling for is Bloglines, which provides a nice Web-based aggregator that I use daily. Bloglines not only acts as a proxy for a large pool of users (making one hourly request for each of our RSS feeds to serve hundreds of users) but also tells me the number of subscribers I have to each of my feeds in the requests it makes to my Web server.
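A provider who wants to read such subscriber counts out of server logs could do something like the following Python sketch. The column says only that the count arrives in the requests; the sketch assumes it appears in the User-Agent header in a form like "137 subscribers", and the aggregator name in the sample string is made up for illustration.

```python
import re
from typing import Optional

# Assumed format: a number followed by the word "subscriber" or "subscribers".
SUBSCRIBERS = re.compile(r"(\d+)\s+subscribers?", re.IGNORECASE)


def subscriber_count(user_agent: str) -> Optional[int]:
    """Extract a reported subscriber count from a User-Agent string, if present."""
    match = SUBSCRIBERS.search(user_agent)
    return int(match.group(1)) if match else None


# Hypothetical User-Agent values, not copied from any real aggregator.
sample = "ExampleAggregator/1.0 (http://aggregator.example; 137 subscribers)"
print(subscriber_count(sample))
print(subscriber_count("Mozilla/5.0"))
```

Summing the most recent count per aggregator and per feed gives a rough readership figure that raw hit counts cannot, since one proxied request may stand in for many readers.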

RSS traffic is not absolutely crushing InfoWorld’s Web servers, but scaling RSS traffic does require conscious thought and effort. With the right approach, mild annoyances can be overcome.





 


 
Chad Dickerson is CTO of InfoWorld.

