[HN Gopher] Show HN: A self-updating list of the most current us...
       ___________________________________________________________________
        
       Show HN: A self-updating list of the most current useragents
        
       Hi Hacker News!  I made a site which displays the most common
       useragents found on the web.  The site updates weekly with data
       sourced from the server access logs of another site I run in order
       to give an accurate picture of the devices and browsers being used
       on the web.  I do a lot of web scraping in my work and it's this
       group of people who I had in mind when creating the site.  The data
       is presented as useragent, browser, os, and relative percentage of
       occurrence. It can be viewed as a table on the site or via JSON in
       the API.  Please let me know your thoughts or feedback and I hope
       you find it useful!  Thanks!
        
       Author : stayml
       Score  : 15 points
       Date   : 2022-12-17 21:50 UTC (1 hour ago)
        
 (HTM) web link (www.useragents.me)
 (TXT) w3m dump (www.useragents.me)
        
       | dj_mc_merlin wrote:
       | First of all, good job on the project. I do not mean to be too
       | negative, but this:
       | 
       | > The site updates weekly with data sourced from the server
       | access logs of another site I run in order to give an accurate
       | picture of the devices and browsers being used on the web.
       | 
       | If you only source data from one website, it is not an accurate
       | picture by any means. Consider that other websites have different
       | user bases. Your Firefox usage seems way too high, for example;
       | it's likely your other website is largely used by other technical
       | people.
        
         | playingalong wrote:
         | AFAICT they are not trying to provide current browser usage
         | stats. They are only after having a list of popular browsers
         | which is only a subset of the former. The last paragraph about
         | scraping seems to indicate that.
         | 
         | So they only need what is a reasonable UA as of this week. They
         | don't need what is the most popular one.
        
           | stayml wrote:
           | Yes, this too. It should just be a -passable- sample of
           | what's popular and seen on the web
        
         | stayml wrote:
         | Thanks! And yep, fair comment. I had noticed this as well,
         | even more so in last week's list. I have been thinking about
         | how I could adjust the numbers in some way to counteract this
         | or add another data source.
        
       | modeless wrote:
       | Where are Android and iOS?
        
       | jeroenhd wrote:
       | How do you control for the faked user agents scrapers use? A bad
       | scraper will be much more common in your logs than a well-behaved
       | browser, after all.
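
One possible way to blunt the effect raised above, sketched under assumptions: count each useragent once per distinct client IP instead of once per request, so a single noisy scraper cannot dominate the totals. This is not described anywhere in the thread as the site's method, the `(client_ip, user_agent)` record shape is hypothetical, and it would not help against scrapers that rotate IP addresses.

```python
from collections import defaultdict


def ua_share_by_distinct_ip(records):
    """Compute each useragent's percentage share by distinct client IPs.

    `records` is an iterable of (client_ip, user_agent) pairs, assumed to
    have been parsed from an access log already (hypothetical input shape).
    """
    ips_per_ua = defaultdict(set)
    for client_ip, user_agent in records:
        ips_per_ua[user_agent].add(client_ip)

    # Each (useragent, IP) pair counts once, however many requests it made.
    total = sum(len(ips) for ips in ips_per_ua.values())
    if total == 0:
        return {}
    return {ua: 100.0 * len(ips) / total for ua, ips in ips_per_ua.items()}


if __name__ == "__main__":
    # One scraper hammering from a single IP, three ordinary visitors.
    log = [("10.0.0.1", "FakeBrowser/1.0")] * 1000
    log += [("10.0.0.2", "Browser/2.0"), ("10.0.0.3", "Browser/2.0"),
            ("10.0.0.4", "Browser/2.0")]
    print(ua_share_by_distinct_ip(log))
```

By raw request count the scraper in this example would hold over 99% of the list; by distinct IP it holds a quarter.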
        
       | drusepth wrote:
       | Always interesting to browse these lists. Figured I'd add this
       | week's browser %s from a site I run to the thread for others
       | interested in another source:
       | 
       | > Chrome 62.09%
       | 
       | > Safari 26.55%
       | 
       | > Edge 3.23%
       | 
       | > Opera 2.94%
       | 
       | > Firefox 2.64%
       | 
       | > Samsung Internet 0.88%
       | 
       | > Internet Explorer 0.54%
       | 
       | > Android Webview 0.24%
       | 
       | > YaBrowser 0.16%
       | 
       | Bias: site audience is mostly writers/authors
        
       | black3r wrote:
       | The OS column comes off as a bit deceptive, because Windows 11
       | still presents as Windows NT 10.0 and newer versions of macOS
       | still present themselves as "Intel Mac OS X 10_15_7", even if
       | you're on an ARM chip.
       | 
       | If you're parsing this info into the OS column, you should
       | probably display this info in there at least with a note, or
       | something like "Windows 10/11", "macOS 10.15 or newer".
        
         | stayml wrote:
         | Good point, thanks. I'll add that in
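
The OS labelling suggested in this subthread could look roughly like the sketch below. The function name `os_label` and the exact label strings are illustrative assumptions, not the site's parser; it only covers the two cases raised here (Windows NT 10.0 and the macOS version token) and reports everything else as unknown.

```python
import re

# Matches the desktop macOS token, e.g. "Mac OS X 10_15_7" or "Mac OS X 10.15".
MAC_VERSION = re.compile(r"Mac OS X (\d+)[_.](\d+)")


def os_label(user_agent):
    """Return a hedged OS label for a useragent string (illustrative only)."""
    # Windows 10 and Windows 11 both report "Windows NT 10.0".
    if "Windows NT 10.0" in user_agent:
        return "Windows 10/11"

    match = MAC_VERSION.search(user_agent)
    if match:
        major, minor = int(match.group(1)), int(match.group(2))
        # Newer macOS releases keep reporting 10_15_7, so 10.15 is a floor.
        if (major, minor) >= (10, 15):
            return "macOS 10.15 or newer"
        return f"macOS {major}.{minor}"

    return "Unknown"


if __name__ == "__main__":
    print(os_label("Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) "
                   "AppleWebKit/605.1.15 (KHTML, like Gecko) "
                   "Version/16.1 Safari/605.1.15"))
```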
        
       ___________________________________________________________________
       (page generated 2022-12-17 23:00 UTC)