hngopher.com

       [HN Gopher] Show HN: Some blind hackers are bridging IRC to LMMs...
       ___________________________________________________________________
        
       Show HN: Some blind hackers are bridging IRC to LMMs running
       locally
        
       Author : blindgeek
       Score  : 61 points
       Date   : 2024-01-31 19:50 UTC (3 hours ago)
        
 (HTM) web link (2mb.codes)
 (TXT) w3m dump (2mb.codes)
        
       | DustinBrett wrote:
       | You could run an LLM in the browser with WebLLM and then connect
       | to IRC via WebSockets using something like KiwiIRC. Fully client
       | side AI on IRC.
        
       | xpe wrote:
       | If you didn't know... LMM = Large Multimodal Models
        
       | jpsouth wrote:
       | Hey! I don't understand too much about AI/ML/LLMs (and now LMMs!)
       | so hoping someone could explain a little further for me?
       | 
       | What I gather is this is an IRC bot/plugin/add-on that will allow
       | a user to prompt an 'LMM' which is essentially an LLM with
       | multiple output capabilities (text, audio, images etc) which on
       | the surface sounds awesome.
       | 
       | How does an LMM benefit blind users over an LLM with voice
       | capability? Is the addition of image/video just for accessibility
       | to none-blind people?
       | 
       | What's the difference between this and integrating an LLM with
       | voice/image/video capability?
       | 
       | Is there any reason that this has been made over other available
       | uncensored/free/local LLMs (aside from this being an LMM)?
       | 
       | Thanks in advance.
        
         | jpsouth wrote:
         | As a follow up to this I'd like to ask any partially sighted or
         | blind people the issues they currently experience using a LLM
         | such as ChatGPT, Bard, Llama or otherwise - both from a UI
         | perspective and an API perspective.
        
         | petercooper wrote:
         | It's the multimodal _input_ capability that seems to be of
         | value here - see the transcript at
         | https://2mb.codes/~cmb/ollama-bot/#chat-transcript .. Namely,
         | being able to interrogate images in a verbal fashion, such that
         | someone without sight (or perhaps even someone who just doesn't
         | _want_ to see an image) can get an appreciation for their
         | contents.
        
       ___________________________________________________________________
       (page generated 2024-01-31 23:00 UTC)