[HN Gopher] GITenberg is an open source community for publishing...
       ___________________________________________________________________
        
       GITenberg is an open source community for publishing ebooks in the
       public domain
        
       Author : bilinualcom
       Score  : 171 points
       Date   : 2020-08-02 13:14 UTC (9 hours ago)
        
 (HTM) web link (www.gitenberg.org)
 (TXT) w3m dump (www.gitenberg.org)
        
       | maire wrote:
       | This looks like a subset of Project Gutenberg?
       | 
       | The Girl from Alsace in Gitenberg:
       | https://www.gitenberg.org/book/35926
       | 
       | The Girl from Alsace in Gutenberg:
       | http://www.gutenberg.org/ebooks/35926
       | 
       | The numbers are even the same which seems suspicious. Hmmm.
        
         | bhickey wrote:
         | It's a version controlled dump of Gutenberg.
        
         | chmod775 wrote:
         | "GITenberg is an exploration of how Project Gutenberg might
         | work if all the Gutenberg texts were on Github, so that tools
         | like version control, continuous integration, and pull-request
         | workflow could be employed."
         | 
         | https://ebookfoundation.org/
        
         | sethish wrote:
         | Yes, absolutely. Apologies if that's not clear from the
         | website. GITenberg started as an experimental fork of PG, but
         | due to the work of Eric Hellman at the Free Ebook Foundation,
         | much of the infrastructure such as metadata formats, CI/CD for
         | building books, and DVCS backend are being ported upstream to
         | the Project Gutenberg infrastructure.
        
           | maire wrote:
           | Thanks for the clarification!
        
             | bilinualcom wrote:
             | OP here, I doubt that the website is "a SUBSET of Project
             | Gutenberg". This is not mentioned anywhere in the website.
             | As mentioned in PG website, while you can use the book
             | freely, "The name 'Project Gutenberg' is a registered
             | trademark.". I guess this is the reason they didn't mention
             | PG.
        
               | sethish wrote:
               | Now I remember, that was the reason I wasn't more clear
               | about the connection to PG when I wrote the website
               | https://github.com/gitenberg-dev/giten_site/. Since then
               | my co-founder Eric Hellman has been doing engineering
               | work for Project Gutenberg, as well as running the rest
               | of the Free Ebook Foundation, which is the parent org of
               | GITenberg, free-programing-ebooks, and Unglue.it.
               | 
               | I think that the GITenberg collection contains all of the
               | books in PG. At this point, the creation of new repos is
               | automatically done when Distributed Proofreaders creates
               | a new book in PG. Originally, I didn't include around 400
               | PG books due to their creators claiming copyright, and
               | didn't include Bruce Sterling's book because he wouldn't
               | let me re-license it creative commons rather than his
               | pseudo-public-domain license.
               | 
               | Not much has been happening with GITenberg itself in the
               | past few years. But luckily, a lot of the concepts and
               | code are getting upstreamed into PG. Which in my opinion,
               | is way way better.
        
               | bilinualcom wrote:
               | Thanks for clarification.
        
       | sethish wrote:
       | You might be more familiar with another project the Free Ebook
       | Foundation maintains: https://github.com/EbookFoundation/free-
       | programming-books/ Which is one of the top-10 repos on github by
       | number of stars.
        
       | anaphor wrote:
       | I like how they're able to accurately tag translators vs original
       | authors, e.g. https://github.com/GITenberg/The-History-of-the-
       | Peloponnesia...
        
       | koolba wrote:
       | They're probably going to have to change their name to something
       | that does not have "git" in it: https://public-
       | inbox.org/git/20170202022655.2jwvudhvo4hmueaw...
        
         | sethish wrote:
         | Gitenberg was one of the alternate spellings of Gutenberg in
         | the 1400s.
        
           | agumonkey wrote:
           | I propose Gytenberg
        
             | artiszt wrote:
             | or, widely accepted standard [in german and slavic-
             | languages] way back then, 'v' instead of 'u', and vice
             | versa
        
               | agumonkey wrote:
               | ah well for some reason the visual asthetics of gytenberg
               | vs gvtenberg made me chose the former (even though at
               | first I typed gvtenberg)
        
         | bhickey wrote:
         | > It's hard to hold them responsible for picking a name that
         | violated a policy that didn't yet exist.
         | 
         | fwiw, Gitenberg has been around at least since 2012.
        
         | metiscus wrote:
         | They've been named like that for six years apparently so I'm
         | not sure how that works for what you linked above. It is a
         | valid point that hopefully doesn't cause problems.
        
       | gravitas wrote:
       | The https://standardebooks.org/ project has been rebuilding
       | ebooks (open ePUB format) with an eye on quality and readability
       | on mobiles/tablets (fonts, copyedit, etc.) which might be of
       | interest, all books and the website revision are tracked in git.
        
         | sethish wrote:
         | I'm a big fan of standardebooks! They're doing fantastic work
         | producing better ebooks from Project Gutenberg sources.
        
           | bilinualcom wrote:
           | OP here, I found this website when I was looking for a way to
           | get the updated version (with correction) of a PG (Project
           | Gutenberg) book and all changes/diffs from the point that I
           | scrape the book from PG website for my language learning side
           | project: https://www.bilinual.com
           | 
           | The bilinual Project also rebuild ebooks (modern HTML, PDF
           | and open ePUB format) with better quality and readability(
           | while it is not its prime goal). Take a look at one example
           | here:
           | 
           | https://www.bilinual.com/book/18043sven/sv/en#line=68&lpp=23
           | https://www.bilinual.com/download/18043sven-sv-en.pdf
           | https://www.bilinual.com/download/18043sven-sv-en.epub
        
             | yesenadam wrote:
             | Wow, they look awesome, thank you! (Learning spanish here)
        
       | voldemort1968 wrote:
       | What is improved by adding Git to PG?
        
         | sethish wrote:
         | It's not so much about just adding git to books, but git to the
         | transcriptions of books from printed materials. There are
         | errors in the transcriptions, many books are lacking
         | formatting, and many books pre-date Unicode and are in ascii.
         | Tracking the book's source files in git makes it easier to
         | collaborate on making these changes, or seeing what changes
         | have been made.
        
         | bilinualcom wrote:
         | Well, for my side project, https://www.bilinual.com I needed
         | version controlling for changes that are made on a book (fixed
         | typos, ...) and I found GITenberg project. There are several
         | interesting tools developed during the project accessible here:
         | https://github.com/gitenberg-dev
        
       | hnarayanan wrote:
       | I don't know why, but I expected some amazing typography in their
       | PDFs. :(
        
         | sethish wrote:
         | [Standard Ebooks](https://standardebooks.org/) has accomplished
         | a lot in producing better ebooks of public domain texts.
        
           | bilinualcom wrote:
           | OP here, it seems standardebooks doesn't provide the books in
           | PDF format. My side project, https://www.bilinual.com rebuild
           | the ebooks in PDF format with translation hints, if you don't
           | mind about learning a new language while reading your
           | favourite books ;)
        
             | sethish wrote:
             | There are about 400 GITenberg books that have CC-by
             | licensed covers provided by Recovering the Classics. If
             | you're interested in using that art for your PDFs I can
             | find you the index!
        
               | bilinualcom wrote:
               | Thanks, it would be great. I am curious to know why do
               | they have CC license and not public domain?
        
               | robin_reala wrote:
               | We have this problem with Standard Ebooks. The number of
               | people that say things are public domain without actually
               | checking it is very high. A CC0 licence is an _explicit
               | grant_ of public domain status by the licensor, and hence
               | the legal issues rest with them in the event of any
               | problem.
               | 
               | Public domain obviously can be ascertained, but if CC0
               | hasn't been granted we rely on dated reproductions:
               | basically a photograph of the artwork in question in a
               | book or journal with a copyright date of 1924 or earlier.
               | 
               | (that's obviously specifically a US legal reading, but SE
               | from a legal point of view is a US project)
        
       | thangalin wrote:
       | I wrote a technical comparison between various public domain
       | eBook projects:
       | 
       | https://dave.autonoma.ca/blog/2020/04/11/project-gutenberg-p...
        
       ___________________________________________________________________
       (page generated 2020-08-02 23:00 UTC)