[HN Gopher] Show HN: Hacker News user blogroll
       ___________________________________________________________________
        
       Show HN: Hacker News user blogroll
        
       I saw this [0] pretty cool thread by user revskill, and wanted a
       quicker way to search through it, but also to keep them all in one
       place so I can read them at my leisure whenever I get time.  Right
       now is like 60 lines of Ruby using Nokogiri, but I will certainly
       look into it further down the line and improve the list.  There's a
       cronjob checking the thread every 12 hours but I will eventually
       shut that down and it will become static after that.  There are
       some really awesome blogs in there. I really recommend going
       through the list, it made my day.  [0] "Could you share your
       personal blog here". https://news.ycombinator.com/item?id=36575081
        
       Author : deathbypenguin
       Score  : 356 points
       Date   : 2023-07-05 19:06 UTC (3 hours ago)
        
 (HTM) web link (dm.hn)
 (TXT) w3m dump (dm.hn)
        
       | LordDragonfang wrote:
       | Since hn karma probably correlates to how much hn readers would
       | enjoy a blog, I'd love a column/columns that includes the user's:
       | - hn profile karma        - total karma of posts from that domain
       | - as above, but Sum(log(post_karma[i]))
       | 
       | ...or something similar.
       | 
       | Whatever is feasible. For a while I've wanted a list of
       | "blogs/domains that hn likes" that isn't polluted by general-
       | high-traffic domains.
        
         | ploum wrote:
         | That's an awesome idea, I would be really curious to see the
         | result.
         | 
         | (hoping that this does not backfire, for example encouraging
         | people to spam HN with their own posts to gain some karma on
         | the blogroll)
        
           | LordDragonfang wrote:
           | To address the exact situation in your parenthetical, I
           | considered putting Sum(log(post_karma[i]-k)), for a k such
           | that the expected log value is negative unless you get enough
           | upvotes.
        
       | joseferben wrote:
       | Thanks for putting this together, love the name!
        
       | WoodenChair wrote:
       | Unfortunately I made a typo in writing my URL (should be
       | https://observationalhazard.com/ not
       | https://obervationalhazard.com/) and this has no way to update
       | it.
        
       | jefftk wrote:
       | Neat! It looks like something is broken with unicode handling?
       | For example the "smart" apostrophe in
       | https://news.ycombinator.com/item?id=36594375 (U+2019, RIGHT
       | SINGLE QUOTATION MARK) is being rendered as "a". Perhaps
       | something is interpreiting utf-8 as latin1?
        
         | deathbypenguin wrote:
         | Indeed. I'm having a fight with that at the moment and the line
         | breaks as well.
        
       | oneeyedpigeon wrote:
       | It's great. Is there any real point in sorting on description or
       | url? I guess url does group http and https, which might be
       | useful, but description definitely seems like it would be nicer
       | if the sort option were removed.
        
       | b8 wrote:
       | Hmm, my blog wasn't added. Maybe when the data was scraped I
       | hadn't posted it yet?
        
         | deathbypenguin wrote:
         | It will be picked up in a few hours.
        
       | voigt wrote:
       | I'm missing the good old time of webrings. This is very close :)
        
         | PartiallyTyped wrote:
         | You may be interested in [1], it's run by a few friends of
         | mine. Hopefully it won't get hugged to death.
         | 
         | [1] https://32bit.cafe
        
       | jakebasile wrote:
       | Look ma, I'm in an HN link! This is pretty neat.
        
       | Moncefmd wrote:
       | This is great! Would've been cool to also be able to sort by
       | votes though.
        
       | xwdv wrote:
       | I'm going to train an LLM on all these blog posts, make a true HN
       | AI.
        
       | deathbypenguin wrote:
       | Added feeds thanks to JSTucker. They are being fetched from the
       | Gist. I think the cron ran, so there are more blogs now.
       | 
       | json: https://dm.hn/blogroll.json (I'll add the feed to each item
       | in a minute)
        
       | astuyvenberg wrote:
       | Oh neat - I saw mine on the list earlier but is now removed,
       | maybe some kind of rate limiting? Anyway, neat project!
        
       | levysoft wrote:
       | I can't believe you had the same idea and necessity but you
       | preceded me. Good job!
        
       | zdwolfe wrote:
       | Looks cool, thanks for making this.
        
       | ghomem wrote:
       | Clap clap clap. This is excellent public service @deathbypenguim.
       | Yesterday I was scrolling through that enormous thread and using
       | control+F to look for keywords of interest on the posted blog
       | descriptions. Now it will be much easier to follow fellow
       | bloggers. Thanks for having my blog on your list too.
        
       | AndrewKemendo wrote:
       | We reinvented web rings!
        
       | I-M-S wrote:
       | Can we do this for HN users' podcasts?
        
         | swyx wrote:
         | shameless plug for my pod: https://www.latent.space/podcast
        
       | 1attice wrote:
       | Naked self-promotion here, but I was late to the party on the
       | original blogroll -- is there any way to add blogs post-ex-facto?
       | Is there a submission mechanism?
        
         | TOGoS wrote:
         | I commented, but mine got missed, somehow. Maybe because my
         | phone auto-capitalized the "H" in "http" and the script didn't
         | account for funny capitalization. Sad!
        
           | 1attice wrote:
           | I did too, but I didn't include the protocol, just the bare
           | domain :/
           | 
           | FWIW it's https://lizmars.net
        
         | eigenhombre wrote:
         | Same, I think this is a great idea and would like to submit
         | mine as well -- maybe an "add" feature on the page would make
         | sense, or re-inquire here at intervals, maybe monthly/yearly?
        
       | petercammeraat wrote:
       | Brilliant. Easy to use as filter for subjects (if people
       | described their blog)
        
       | do-me wrote:
       | That's awesome and so much more practical than scrolling through
       | HN. It would also be possible to integrate semantic search so
       | people don't necessarily need to know the keywords. If you're
       | interested, feel free to ping me or take a look at
       | https://github.com/do-me/SemanticFinder. In case I could just
       | create a pre-indexed version based on your data dump which would
       | be quite convenient to use.
        
       | tenkabuto wrote:
       | These threads make me wish that I had a blog, not just a regular
       | website. :(
        
         | nelsonfigueroa wrote:
         | I took at look at your website and it seems like you could
         | easily add blog posts to it!
        
           | tenkabuto wrote:
           | Hah, thanks. I've been hoping to do so, but still haven't
           | gotten around to it. There's some quirks with the static site
           | generator that I use[0] that lead me to keep postponing
           | setting up blog-ish features, and I don't know enough python
           | to fix them.
           | 
           | [0]: https://github.com/gordonbrander/lettersmith_py
        
             | nelsonfigueroa wrote:
             | If you ever want to try a new static site generator, I use
             | Hugo[0] to generate my site. There's a lot of pre-built
             | themes[1] you can use. Most (if not all) have blogging
             | functionality built in, all you need to do is drop in a
             | Markdown file with your content. You may need to learn a
             | little bit if Golang if you want to customize themes. Just
             | throwing it out there as an option.
             | 
             | [0]: https://gohugo.io/
             | 
             | [1]: https://themes.gohugo.io/
        
               | tenkabuto wrote:
               | Thanks! In writing out my reply to you I realized that I
               | should look into other generators (specifically looking
               | into Hugo, as I think I've seen it used by people like
               | myself who take notes in Obsidian). The key features I
               | want are backlinks support and blogging features, along
               | with Markdown support.
        
       | victorbjorklund wrote:
       | Would be cool to make an RSS feed that combined all the RSS feeds
       | from all the blogs.
        
         | amadeuspagel wrote:
         | Here's an HTML feed: https://webloglist.com/hn
        
       | samsquire wrote:
       | Thanks for creating this. I will use this to go through people's
       | blogs.
       | 
       | I think my blog/journals hasn't been picked up yet
       | 
       | https://news.ycombinator.com/item?id=36588927
       | 
       | My blog is on GitHub, how do you parse the URLs?
        
         | deathbypenguin wrote:
         | It should show in a few hours when the cronjob triggers.
         | Parsing: nokogiri
        
       | ghoomketu wrote:
       | Looks great and congrats on shipping. If it were up to me I'd
       | still be deliberating the best framework and design to use for
       | this, and how I can pipe the comments through chatgpt to extract
       | the category, keywords and do things that make it the best
       | blogroll ever.
       | 
       | And then I would have just thought it's too much work for nothing
       | and that'd be the end of it :P
        
         | myth2018 wrote:
         | Dude I can definitely relate
        
         | deathbypenguin wrote:
         | haha I had the exact same ideas, but then I was "bah! I'll put
         | it out there and I'll add functionality over time"
        
           | swyx wrote:
           | did you just have the .hn TLD standing by? where is that
           | from? must have cost a pretty penny
        
       | smokel wrote:
       | Great work, thank you for sharing this.
       | 
       | I would prefer to see the entire list, so that I can easily
       | search for keywords in the browser. Apparently, all data is
       | available on the client side, but the table renderer seems to
       | limit the table size to at most 100 entries.
        
         | ryan-duve wrote:
         | A workaround while you're waiting for this to be supported by
         | OP is to go to inspector and change the last dropdown option to
         | <option value="10000">10000</option>
         | 
         | then select it in the UI.
        
       | Aissen wrote:
       | It's very nice, thanks ! It would be nice if descriptions had new
       | lines; some aren't readable, while they work quite well on HN.
        
         | deathbypenguin wrote:
         | I'm on it.
        
       | scarface_74 wrote:
       | This is awesome
        
       | abathur wrote:
       | Hmm. Any idea why some wouldn't show up? I posted in
       | https://news.ycombinator.com/item?id=36588940 but don't see it in
       | the list.
        
         | toyg wrote:
         | Same for me. Maybe the scraper choked on pagination, maybe they
         | just took a snapshot before we posted.
        
       | JSTucker wrote:
       | Heres an OPML with all the feeds I could detect from the list!
       | https://gist.github.com/Josh-Tucker/030b8cba6557927a27f1c7e6...
        
         | swyx wrote:
         | if you share the code for OPML conversion maybe OP could
         | incorporate it quickly
        
       | re wrote:
       | This made me think of "planets", which I feel had a heyday back
       | in the late 2000s before Reddit and social media took over
       | everything. Anyone want to take all the blogs with RSS/Atom feeds
       | and build an HN planet? :)
       | 
       | > In online media a planet is a feed aggregator application
       | designed to collect posts from the weblogs of members of an
       | internet community and display them on a single page
       | 
       | https://en.wikipedia.org/wiki/Planet_(software)
        
         | soegaard wrote:
         | https://planet.scheme.org/
        
         | ploum wrote:
         | Yeah, planet were awesome. I'm proud to say that my blog was
         | both on planet.gnome (the original one) and planet.ubuntu.
         | 
         | Now, I feed that the most interesting planet is planet.debian,
         | which offers lot of variety without being focused on Debian.
         | 
         | The great feature I liked was that Planet were _not_ about a
         | given project. It was about the people contributing to the
         | project. Their life. Their interests.
         | 
         | At some point, lot of planets started to ask only "on-topic"
         | posts with a specific RSS feeds. Those planets became boring as
         | it was mainly stuffs you could find on forum or any tech
         | related websites.
        
           | tenkabuto wrote:
           | Yes! I've loved Planet Python[0] because it really lets you
           | see that the Python community is quite varied, fun, and
           | human.
           | 
           | [0]: https://planetpython.org
        
         | susam wrote:
         | I still follow a few planets. For example:
         | 
         | https://planet.lisp.org/
         | 
         | https://planet.emacslife.com/
        
         | amadeuspagel wrote:
         | Here you go: https://webloglist.com/hn
        
       | susam wrote:
       | Very interesting! Thanks for sharing your project here. Out of
       | curiosity, I did some searches with some interesting strings. At
       | the time of posting this comment, here is what the search results
       | look like:
       | 
       | Vim: 8 entries
       | 
       | Emacs: 7 entries
       | 
       | Python: 24 entries
       | 
       | Rust: 24 entries
       | 
       | Lisp: 5 entries
       | 
       | Clojure: 3 entries
       | 
       | Haskell: 5 entries
       | 
       | Zig: 5 entries
       | 
       | Elixir: 4 entries
       | 
       | Scheme: 0 entries
       | 
       | Postgres: 4 entries
       | 
       | MySQL: 0 entries
       | 
       | SQLite: 3 entries
       | 
       | Jekyll: 9 entries
       | 
       | HTML: 40 entries
       | 
       | Markdown: 6 entries
       | 
       | LaTeX: 1 entry
       | 
       | Hugo: 12 entries
       | 
       | Next.js / Nextjs: 4 entries
       | 
       | Gatsby: 2 entries
       | 
       | Pelican: 0 entries
       | 
       | .com: 495 entries
       | 
       | .dev: 90 entries
       | 
       | .net: 84 entries
       | 
       | .io: 82 entries
       | 
       | .me: 53 entries
       | 
       | .org: 43 entries
       | 
       | .xyz: 15 entries
       | 
       | .page: 6 entries
       | 
       | github.io: 46 entries
       | 
       | medium.com: 18 entries
       | 
       | blogspot.com: 8 entries
       | 
       | wordpress.com: 4 entries
       | 
       | livejournal.com: 0 entries
       | 
       | tech: 178 entries
       | 
       | programming: 66 entries
       | 
       | random: 61 entries
       | 
       | thought: 49 entries
       | 
       | math: 16 entries
       | 
       | musing: 12 entries
       | 
       | blag: 1 entry
       | 
       | favorite: 28 entries
       | 
       | favourite: 9 entries
       | 
       | Now all of these results are string search results, so there is
       | always going to be a little bit of noise when we try to draw
       | conclusions out of these results. For example, the results for
       | ".dev" also contains results that look like "*dev*.com".
       | 
       | Despite the noise, I found these results interesting. I remember
       | in the early days when the blogosphere was being constructed 20
       | km above the tag clouds, it was very fashionable to have blogs
       | for random musings or random thoughts. So I am delighted to see
       | that most blogs out here are tech blogs. Surprisingly there is
       | only blag. I expected at least a few more.
       | 
       | One of the Lisp entries is mine. Also, one of the Vim entries is
       | mine. It is a bit ironical because I am actually an Emacs user.
       | If I had known the comments we write on HN would become part of
       | the search string in this blogroll, I might have chosen my words
       | in my comment to the "Ask HN" port more judiciously! :)
        
         | boricj wrote:
         | reverse engineering: 5 entries
         | 
         | Ghidra: 1 entry (mine)
         | 
         | On one hand it does bring some level of perspective on the
         | popularity of a particular topic you're into. My first reaction
         | was "Just 0.5% for reverse-engineering? I guess I'm down in a
         | deep dark rabbit hole..."
         | 
         | On the other hand, I haven't seen the blogs of Ken Shirriff,
         | Alex Ionescu or Raymond Chen on that list, which I know are
         | quite popular and regularly make it to the Hacker News front
         | page.
        
       | cavalcade119 wrote:
       | [dead]
        
       | kiruio wrote:
       | Cool, I forgot to add descriptions. Would be nice if I could fix
       | it
        
       | version_five wrote:
       | Can you say what criteria you used to filter the thread into
       | valid blogs?
        
       | hyperific wrote:
       | Thanks for doing this!
        
       | verse wrote:
       | Love this, thanks for building it!
        
       | amadeuspagel wrote:
       | The latest posts from these blogs: https://webloglist.com/hn
        
         | jakebasile wrote:
         | That's cool! Did you pull RSS from all the sites you could and
         | use that to aggregate it?
        
           | amadeuspagel wrote:
           | Yes, webloglist uses RSS autodiscovery.
        
             | darekkay wrote:
             | It seems the autodiscovery didn't work for my blog (link in
             | profile). I've posted something 2 days ago but it doesn't
             | appear on your site. My feed is on the list from JSTucker,
             | who also used some sort of autodiscovery.
        
               | amadeuspagel wrote:
               | Atom isn't supported yet. Working on it.
        
         | rambambram wrote:
         | Nice list! I was almost going to ask you if you have an OPML
         | file with all the feeds, but then I decided to check the list
         | manually for interesting latest posts and grab only their
         | feeds. Thanks for the list!
        
         | addandsubtract wrote:
         | Now we just need ChatGPT to read them all and give us a daily
         | update on the interesting ones.
        
       | skilled wrote:
       | Good job. I would honestly love this but with RSS feeds also, but
       | I know it's a tough ask unfortunately. (Not for you, but in
       | general)
        
         | xoranth wrote:
         | Most blogs that have RSS also have a `<link rel="alternate"
         | type="application/rss+xml">` tag that redirects you to the RSS
         | feed. If you pass the link to the homepage to a feed
         | reader[^0], it will follow the link tag and find the RSS feed.
         | 
         | [^0]: At least, Liferea on Linux, NetNewsWire and Vienna on
         | Mac, do this. AFAIR NetNewsWire is even smarter than that, and
         | can sometimes find the RSS feed even when there is no link tag.
        
           | marginalia_nu wrote:
           | A bit of a snag is that many CMSes generate multiple feeds,
           | and there is no way I'm aware of for identifying which is the
           | "canonical" feed.
        
       | 1270018080 wrote:
       | Conspiracy: That post was only made to harvest data for someone's
       | model
        
         | bachmeier wrote:
         | Well, given that blogs are public and the whole point is for
         | others to read them, I think that's okay.
        
       | leejoramo wrote:
       | This is great. An OPML version of this would be great to bulk
       | IMPORT the RSS/ATOM feeds into your favorite feed reading app.
        
         | JSTucker wrote:
         | I've scraped what I could from the list and exported and opml
         | here: https://gist.github.com/Josh-
         | Tucker/030b8cba6557927a27f1c7e6...
        
           | tommy_axle wrote:
           | A feed is now added to https://codeinsider.dev
           | (https://codeinsider.dev/rss.xml)
        
             | JSTucker wrote:
             | Have added your feed to the list
        
               | [deleted]
        
           | deathbypenguin wrote:
           | Thanks man, it's been added: https://dm.hn/blogroll.json and
           | next to each entry.
        
         | rambambram wrote:
         | Indeed! But I guess not every blog has a feed, or there's no
         | quick way of letting people add one to the list after the fact.
        
           | prepend wrote:
           | I expect that every blog has an rss or atom feed. It would be
           | strange for someone to go to the effort of writing a blog and
           | not setting up a feed. That and most blogs have feeds
           | automatically.
           | 
           | That being said, any blog that doesn't have a feed and has
           | some proprietary subscription is not one I want to subscribe
           | to. So not including feedless blogs is a positive for me.
        
       | TimCTRL wrote:
       | Saw the .hn domain and I was like What...HN has its own TLD. Then
       | i searched google and saw it belongs to honduras...daft me i
       | guess..
        
         | mcmcmc wrote:
         | All two-letter TLDs are country codes.
        
           | TimCTRL wrote:
           | Thanks!
        
       | akiselev wrote:
       | Our future AI overlords sincerely thank you for this pristine
       | data set.
        
       | alfiedotwtf wrote:
       | Weird. I added to that original post, but I'm not on your list.
       | Maybe your code didn't go to the "See more comments" page?
        
       | syx wrote:
       | I would add a shuffle button that opens a random blog so it's
       | nicer to discover something new compared to endless paginations.
        
         | deathbypenguin wrote:
         | Noted. I will be correcting a few bits and adding new
         | functionality over the next few days/weekend.
        
         | scastiel wrote:
         | +1
         | 
         | I would even add a "I'm feeling lucky" button, to redirect to a
         | random blog ;)
        
         | splitbrain wrote:
         | That's what https://indieblog.page was made for
        
         | deathbypenguin wrote:
         | Random blog button up now.
        
           | alonsonic wrote:
           | Love this, have been reading random blogs for the last
           | 30minuts already
        
         | cavalcade119 wrote:
         | [dead]
        
       ___________________________________________________________________
       (page generated 2023-07-05 23:00 UTC)