HTTP Archive switches to Chrome

May 23, 2016 9:08 am | 7 Comments

The HTTP Archive crawls the worldâ€™s top URLs twice each month and records detailed information like the number of HTTP requests, the most popular image formats, and the use of gzip compression. In addition to aggregate stats, the HTTP Archive has the same data for individual websites plus images and video of the site loading. It’s built on top of WebPageTest (yayÂ Pat!), and all our code and data is open source. HTTP Archive is part of the Internet ArchiveÂ and is made possible thanks to our sponsors:Â Google, Mozilla, New Relic, Oâ€™Reilly Media, Etsy, dynaTrace, Instart Logic,Â Catchpoint Systems, Fastly, SOASTA mPulse, and Hosting Facts.

I started the HTTP Archive in November 2010. Even though I workedÂ at Google, I decided to use Internet Explorer 8 to gather the data because I wanted the data to represent the typical user experienceÂ and IE 8 was the world’s most popular browser. Later, testingÂ switched to IE 9 when it became the most popular browser. Chrome’s popularity has been growing, so we started parallel testing with Chrome last year in anticipation of switching over.Â This month, itÂ was determined thatÂ Chrome isÂ the world’s most popular browser.

In May 2011, I launched HTTP Archive Mobile. This testing was done with real iPhones. ItÂ started by testingÂ 1,000 URLs and hasÂ “scaled up” to 5,000 URLs. I put that in quotes because 5,000 URLs is far short of the 500,000 URLs being tested on desktop. Pat hosts these iPhones at home. We’ve found that maintaining realÂ mobile devices for large scale testing is costly, unreliable, and time-consuming. For the last year we’ve talked about how to track mobile data in a way that would allow us to scale to 1M URLs. We decided emulating Android using Chrome’s mobile emulation features was the best option, and started parallel testing in this mode early last year.

Today, we’re announcing our switch from IE 9 and real iPhones to Chrome and emulated Android as the test agents for HTTP Archive.

We swapped in the new Chrome and emulated Android data starting March 1 2016. In other words, if you go to HTTP ArchiveÂ theÂ data starting fromÂ March 1 2016 is from Chrome, and everything prior is from Internet Explorer. Similarly, if you go to HTTP Archive MobileÂ theÂ data starting fromÂ March 1 2016 is from emulated Android, and everything prior is from real iPhones. For purposes of comparison, we’re temporarily maintaining HTTP Archive IE and HTTP Archive iPhoneÂ where you can see the data from those test agents up to the current day. We’ll keep doing this testing through June.

This switchover opens the way for us to expand both our desktop and mobile testing to the top 1 million URLs worldwide. It also lowers our hardware and maintenance costs, and allows us to use the world’s most popular browser. Take a look today at our aggregate trends and see whatÂ stats we have forÂ your website.

7 Responses to HTTP Archive switches to Chrome

Eric Lawrence | 23-May-16 at 2:14 pm | Permalink |

Is the archive set up to track how many servers offer Brotli-compressed content?
Steve Souders | 23-May-16 at 5:53 pm | Permalink |

Hi, Eric! I don’t think we look at Brotli specifically vs just “compressed”. Now that we’re using Chrome we can get more stats on things like Brotli and H2. It would be great if you filed a ticket ( https://github.com/HTTPArchive/httparchive/issues ) esp. if you could explain what we can look for to detect Brotli (and other) compression (eg, which header values).
Joseph Scott | 26-May-16 at 3:33 pm | Permalink |

What connection profile is being used for the mobile tests?
Steve Souders | 26-May-16 at 6:22 pm | Permalink |

Joseph: We’re still using the 3G connection profile.
Les Murphy | 24-Jul-16 at 12:31 pm | Permalink |

Does the mobile emulation still use DummyNet for simulating 3G network characteristics, or are you now using the Chrome Network Conditions emulation?
Josh Paiva | 01-Aug-16 at 1:45 pm | Permalink |

Great article. I’m a fan of Chrome and for me it’s been easier to crack the one million url benchmark w some of my blogs. Thanks for the content was a great read!
admission | 21-Aug-16 at 6:14 am | Permalink |

One thing though â€“ and I donâ€™t mean to be nitpicky, but when you compare 4 percentage *points* (increase in requests with caching headers) to 12 *percent* (increase in number of requests per page) thatâ€™s rather misleading.
The increase from 42% to 46% cached resources is an increase of 9.5 percent. Still a far cry from the 24% increase in total transfer size and certainly disappointing that itâ€™s continuing to trail behind the reqs per page, but still.

SteveSouders.com

HTTP Archive switches to Chrome

7 Responses to HTTP Archive switches to Chrome