[HN Gopher] Bloom's 2 Sigma Problem
       ___________________________________________________________________
        
       Bloom's 2 Sigma Problem
        
       Author : oidar
       Score  : 85 points
       Date   : 2023-02-25 19:05 UTC (3 hours ago)
        
 (HTM) web link (en.wikipedia.org)
 (TXT) w3m dump (en.wikipedia.org)
        
       | ultra_nick wrote:
       | Our startup started working on solving this problem with
       | extremely customized continuous AI Tutoring last year. In his
       | research paper, Bloom says that private tutoring is the most
       | effective learning method, but similar results can be obtained
       | through a combination of scientific learning methods. However, in
       | his time, these methods were too computationally expensive for
       | teachers to implement.
       | 
       | Now, we have the technology (LLMs) for everyone to have an
       | accelerated contextual learning experience. We started by
       | tracking mastery learning with graphs, but the effort required
       | was too much for users. Now, we're augmenting the previous system
       | with ML to automatically track and explain new concepts. In the
       | far future, I'm hoping we'll be able to accelerate learning rates
       | by 150%, so that motivated highschool students can graduate with
       | a bachelor's degree.
       | 
       | Our first AI Tutor should be out by April.
       | 
       | Follow us on LinkedIn to get an alert when we launch!
       | https://www.linkedin.com/company/conceptionary/
       | 
       | To learn more, checkout our website:
       | 
       | https://conceptionary.app/
        
       | a-dub wrote:
       | i wonder how much all the redundant and slightly different
       | resources on the internet today close that gap... this paper is
       | from 1984.
       | 
       | also would be curious how good today's llms are at getting
       | someone to grok a topic they're having trouble grasping... (this
       | can be actually a hard problem as it can require understanding
       | the misunderstanding in order to guide away from it effectively.)
        
         | ipnon wrote:
         | If I ask ChatGPT how transformers work the results are
         | sufficient but brief. It's difficult to probe ChatGPT, and it
         | doesn't have an intuition for your weak points like a tutor
         | does. If I ask ChatGPT about who influenced John Maynard Keynes
         | however it will hallucinate a whole corpus of economists and
         | papers that never existed but whose existence is proposed so
         | confidently it is incredulous.
         | 
         | Mainly we should appreciate the dearth of capabilities that
         | arise merely as emergent properties of ever larger language
         | models. Some kind of architecture will be needed for each use
         | case, which means we once again find ourselves facing difficult
         | engineering problems.
        
       | [deleted]
        
       | bowsamic wrote:
       | Do we need to fix this? I am not sure that everyone needs a
       | "mastery" of education
        
         | medler wrote:
         | Education is important in the modern world, especially now that
         | the Industrial Revolution has led to an unprecedented growth in
         | knowledge work.
        
       | HDMI_Cable wrote:
       | I wonder how aging populations and large-scale language models
       | are going to change the difficulty of this problem. It seems to
       | me that we're just about to unlock 1-to-1 learning, without
       | actually needing 1 teacher per child.
       | 
       | It isn't hard to imagine a language model (or something similar,
       | I'm not in AI myself so I might not have the best idea of the
       | specifics) giving personalized instruction to a student based on
       | a knowledge bank and generative prompts + reinforcement from what
       | the child understands.
       | 
       | Add to that aging populations leading to less children and higher
       | worker-child ratios, + automation in other industries, and its
       | plausible to imagine that there could be a large movement towards
       | training new teachers, thereby reducing class sizes. This could
       | increase feedback between the two, and probably reduce
       | behavioural problems as teachers could address them more easily.
       | 
       | I guess one problem of this approach is that this 'atomizes' the
       | student, where they only engage with their device with the
       | language model, removing some of the collaborative aspects of
       | modern instructional methods. One easy fix to this is to increase
       | the amount of recess or structured play, thereby increasing
       | socialization. This seems to work in Finland [1]. If advances in
       | AI can make learning more efficient, we can increase the amount
       | of time for unstructured play.
       | 
       | ---
       | 
       | [1]: https://www.theatlantic.com/education/archive/2014/06/how-
       | fi...
        
         | civilized wrote:
         | It's a nice dream. I would just caution that we will unlock
         | people _claiming_ that this works and foisting a half-baked
         | version of it on large populations to save money, look modern,
         | and make profits for tech companies... long before we unlock
         | the actually good version.
        
         | [deleted]
        
       | swyx wrote:
       | theres a lot of hope in the educational AI space that we can do
       | personalized teaching and essentially raise everybody to the 90+
       | percentile implied by the 2 sigma problem (see
       | https://lspace.swyx.io/p/chatgpt-gpt4-hype-and-building-llm 20
       | min mark - disclaimer its my podcast with openai's new
       | representative)
       | 
       | i think this is a "last mile" type situation where we've seen
       | great progress with ChatGPT on the first 90%, but the last mile
       | will take the remaining 90% to do. not impossible, just won't
       | come anywhere as soon or as easily as people want. still worth
       | pursuing ofc, but not as a quick hustle but more if you are just
       | intensely insightful and passionate about education and figure
       | out how to apply AI to personalize it properly
        
         | bee_rider wrote:
         | Since ChatGPT often writes untrue statements, it seems like
         | quite a poor teacher. I guess it could be used as an assist for
         | an instructor (send me a question, chat GPT will answer it,
         | I'll check the output) but that seems not so revolutionary.
        
       | ohhhhhh wrote:
       | "too costly for most societies to bear on a large scale" this is
       | changing as we speak with language models.
        
         | c0brac0bra wrote:
         | Makes me think of the book in Diamond Age.
        
           | hgsgm wrote:
           | What's that book called?
        
         | pantalaimon wrote:
         | _A Young Lady 's Illustrated Primer_
        
         | nullc wrote:
         | "A Young Lady's Illustrated Primer"
        
       | l_theanine wrote:
       | Education is going to be radically changed when AI can manage to
       | helpfully teach students with a Microsoft Tay situation
       | happening.
       | 
       | Very quickly after that, I think a sort of global shift will
       | begin, as schools all around the world begin to see the power of
       | legitimate curriculums, instead of lobotomized "think how I
       | think" bs. I'm thinking of Florida right now, but I know the same
       | shit goes on in India, China, countries in Africa, and probably
       | other places too.
       | 
       | On a longer scale, as models become more and more advanced, maybe
       | there will be teachers that are constantly absorbing new
       | information and are able to keep students up-to-date with best
       | practices, new research, etc. That would not bode well for the
       | leagues of McKinsey-ites and that whole ilk.
       | 
       | I am particularly hopeful that AI will be able to radically
       | advance education.
       | 
       | That is, if this hypothetical technology is democratized and kept
       | open, rather than horded and sequestered to the ruling capital
       | class until there exists a financial, educational, biological and
       | altogether insurmountable moat between the ultra-wealthy and the
       | rest of us.
        
       | hinkley wrote:
       | We need to study what amounts of 1:1 tutoring have positive
       | outcomes. If society can't afford 1:1 tutoring for everyone, all
       | the time, can we afford 20% of the time?
        
         | hgsgm wrote:
         | That's why schools should not waste time in lecture. Students
         | should read/listen/watch and at home and bring questions to
         | school
        
         | 082349872349872 wrote:
         | My understanding is that the knee of the curve is ~10-12
         | students. Much cheaper than 1:1 or 1:2, but still more
         | expensive than the 30+ classrooms of my youth.
        
           | zozbot234 wrote:
           | You can teach large classes efficiently, but it takes a very
           | direct, rote, scripted approach that's exactly what most
           | teachers _don 't_ want to use - because fancy education
           | schools have told them that it "demeans" the profession, and
           | they've drunk the Kool-Aid. A key component is to come up
           | with memorable text that the students must be able to repeat
           | _word for word_ when prompted - both as a group and
           | individually. (Problem solving similarly starts with trivial
           | cases that the students must answer on the spot; difficulty
           | is very gradually increased, and there 's a lot of retracing
           | and review of earlier steps.)
           | 
           | This is as close as you can get to "mastery learning" in a
           | large classroom; keep all students on lockstep and highly
           | engaged, trying to ensure that no one falls through the
           | cracks.
        
       | trane_project wrote:
       | I am already working on fixing this: https://github.com/trane-
       | project/trane/
       | 
       | No AI needed. Just an old fashioned depth first search through a
       | graph of skills and dependencies.
       | 
       | I made it to help me practice music, but I have been branching
       | out and using it to study math for a few weeks. I find myself
       | saying "just one question more" and then spending another half
       | hour in it.
       | 
       | Still needs more material to be useful to other people but it's a
       | solid experience. I learned and memorized how to play most of the
       | notes in the saxophone with good intonation in about a week, as a
       | complete beginner.
        
         | cjblack wrote:
         | I really like simple, reliable, tools like this - just watched
         | the demo video. I hadn't heard of mastery learning before, but
         | now I'm thinking about trying to map some Excel lessons towards
         | this and see if I can implement it at work.
         | 
         | I've found that our youngest employees are coming out of school
         | with little-to-no knowledge of how to work spreadsheet software
         | - even those who are otherwise pretty technically proficient.
         | 
         | While programmers tend to eschew tools like Excel, in
         | consulting (or at least the consulting we do) it's critical to
         | being able to understand how a process works before designing
         | an automated solution. Excited to potentially have a way to
         | share that understanding.
        
           | trane_project wrote:
           | Thanks for the feedback. An excel course should be doable.
           | Currently the most general way to write material is to use
           | this: https://trane-
           | project.github.io/generated_courses/knowledge_... Basically a
           | folder of lessons where the flashcards are pairs of markdown
           | files.
           | 
           | There's also another tool to write simple flashcards and
           | lessons in a json file, run a command, and build all those
           | directories and markdown files for you, but I have not
           | written the documentation yet. Here's an example for a course
           | based on a reworking of Euclid's Elements:
           | https://github.com/trane-project/trane-
           | math/blob/master/cour...
           | 
           | But plain markdown files for now should get you going.
        
         | wizzwizz4 wrote:
         | Trane looks good. I've been trying to design a similar system,
         | but for mathematics: the difference there is that there can be
         | multiple routes towards a particular understanding, with
         | different routes better for different people.
         | 
         | Prerequisites for calculus: an understanding of the standard
         | reals OR an understanding of the hyperreals. Yes, real and
         | hyperreal analysis are radically different in many ways, but
         | they're provably equivalent, and they get you to the same
         | answers. Some people find one more intuitive than the other,
         | and some problems are easier in one than the other.
         | 
         | Imo, the biggest issue with mathematics education is that it
         | tries to push you down one, standardised way of thinking about
         | things: if you don't _get_ it, that cripples your ability to
         | think and reason for yourself. The main thing I do in one-to-
         | one mathematics tutoring is finding how the student thinks, and
         | nurturing _that_ (instead of stamping it out and replacing it
         | with a  "more formal" approach).
         | 
         | I'd like to see something branchy, but I haven't found a good
         | solution. Do you have any ideas for tackling this?
        
           | trane_project wrote:
           | I started working on this a few weeks ago:
           | https://github.com/trane-project/trane-math It currently
           | needs a README, but you could take a look at the courses on
           | how I am building the flashcards. It's easy to reference
           | external resources, so that's what I have been doing, rather
           | than trying to create exercises of my own.
           | 
           | I am starting with a very basic olympiad-style book and a
           | book based on Euclid's Elements, because I don't have the
           | understanding required to clearly work out the dependencies
           | of more advanced stuff. And I also would like to start at the
           | beginning to make sure I don't miss anything.
           | 
           | The ideal end state is to have courses that cover all the
           | undergrad and grad math curriculum. I am also curious on
           | whether this could be used by researchers to keep up to date
           | with the latest research on their fields. But all of that is
           | a long way out.
           | 
           | As for your question, there are a couple of ways that Trane
           | could handle multiple paths through similar material.
           | 
           | 1. Just have separate curriculums. You could copy the
           | courses, but the second copy has different dependencies,
           | courses/lessons IDs. For example, one could have a series of
           | courses teaching the undergrad MIT math curriculum and
           | another the Harvard curriculum. They might share a lot of the
           | material, but the order will be different.
           | 
           | 2. Trane does not lock you into a specific order. There are
           | filters that let you specify which parts of the graph you
           | want to study. You are free to get questions from specific
           | courses and lessons. You can also use the metadata in the
           | courses to say things like "give me questions from all
           | lessons teaching linear algebra" or "give me questions from
           | all courses on real analysis but not from the lessons on set
           | theory". The dependencies between the lessons that match that
           | metadata are still respected. There are a few more options,
           | but you get my point. The dependencies are not set in stone,
           | and there's freedom to jump around and study specific topics.
           | 
           | I actually use option 2 most days. If I want to practice
           | guitar, I just set a filter to give me exercises from the
           | guitar. Similar thing when I want to practice the saxophone.
        
             | wizzwizz4 wrote:
             | Mathematics doesn't really _have_ dependencies. Everything
             | can be thought of in terms of everything else: you need an
             | in, which is usually arithmetic or geometry (but, for
             | physicists, is often physics - and for programmers, is
             | often concepts related to their favourite programming
             | languages), and then everything _can_ be defined in terms
             | of something else.
             | 
             | However, many people find it easier if you have concrete
             | anchors, rather than making everything an abstract ivory
             | tower. A certain kind of musician might get Fourier series
             | quite quickly by analogy to the behaviour of sound, and
             | then pick up Taylor series by analogy to that; somebody
             | like me might find it easier to translate the most general
             | Fourier series to a Maclaurin series, then compare
             | coefficients. All these approaches would give you an
             | understanding of Fourier, Maclaurin and Taylor series',
             | which then lets you learn things that build upon them.
             | 
             | The "separate courses" approach is doable (and the best
             | I've come up with so far), but it doesn't scale: courses
             | that cover the same things without providing further
             | insight don't get merged together. It can't handle the
             | combinations that, intuitively, I feel like there should be
             | a way to handle. Human teachers can cope with it just fine!
             | 
             | One nitpick on trane-math: running `cargo test` will wipe
             | out the .trane directory, if it exists. That seems not very
             | good; perhaps there should be a way to point trane at a
             | temporary directory, so you can test it better?
        
         | airstrike wrote:
         | This looks great, thanks for sharing. I do think it would be
         | more ergonomic to have this served via a website given that it
         | feels CRUD-like with some browsing of exercises, relationships
         | between them, etc.
         | 
         | The CLI approach assumes people are comfortable using the
         | terminal, which unfortunately limits the number of potential
         | users
        
           | trane_project wrote:
           | Definitely. I have not worked in frontend/ux stuff for over
           | 10 years, and I am working on this by myself, so I have been
           | focusing on the CLI to quickly iterate and prove that it
           | works reasonably well. Part of the reason of trying it out on
           | math was to a find a more technical group of users.
           | 
           | The ideal state is a graphic interface where the exercises
           | are presented on the screen to reduce the friction. Gotta
           | start somewhere.
        
           | jackthetab wrote:
           | I'm more than happy to do a CLI app, but I think this needs a
           | way to run it on a phone (noticed I don't necessarily mean
           | "app" ;-). I can see me doing this on my phone while waiting
           | for the barista to make my coffee.
           | 
           | I'm going to do something similar with Anki. I'll try to
           | compare the two...
        
             | trane_project wrote:
             | Main difference with anki is that anki explicitly
             | discourages trying to make dependencies between flashcards
             | or decks. I tried to work out something like that into
             | Anki, but as I saw it, dependencies were core to what I had
             | in mind. There's some similar functionality in supermemo,
             | but it's based on a tree, not a graph, so it was inherently
             | limited.
             | 
             | No experience in mobile at all so that'll have to wait, but
             | a GUI/website/app that works on most platforms is something
             | I would like to work out eventually.
        
         | admsmz wrote:
         | This sounds very similar to the system behind
         | http://mathacademy.com . You follow a graph of skills and
         | dependencies and because you always have the necessary
         | prerequisites it is always doable.
         | 
         | Their team of mathematicians have created a curriculum from
         | k-12 to a bachelors in mathematics.
        
           | trane_project wrote:
           | Yeah, this is cool and similar to the end state I have in
           | mind. The main difference is that I am not trying to create
           | the curriculum/exercises. I am just creating flaschards that
           | say "Solve exercise x.y.z in that textbook". Plus the whole
           | free-software angle and being able to share the material as
           | text files.
           | 
           | The math thing is just a side project at the moment to try to
           | see how it works for other fields. I am primarily using this
           | for music. I was hoping something like this existed already,
           | but all the solutions are either very specialized like this
           | one or do not support dependencies as a core feature, like
           | Anki. Had Anki supported something like this, I wouldn't have
           | needed to make my own thing.
        
             | zozbot234 wrote:
             | It would be nice to simply include support for dependency
             | tracking in existing flashcard software. Unfortunately, the
             | most commonly used flashcard software, viz. Anki, has not
             | been adding such features. I can only assume that the
             | project is very lightly maintained and/or working on paying
             | down technical debt - new innovative features don't seem to
             | be a priority.
        
               | trane_project wrote:
               | I think ultimately it's a very different approach to
               | their current one, so it might be too difficult to do in
               | the existing codebase. Maybe not, I have not looked at
               | the code. But the public docs and learning philosophy
               | around anki all seemed to discourage it, so I decided to
               | make my own thing.
               | 
               | And yeah, I don't think Anki gets a lot of support. I am
               | an open-source maintainer myself, so I know that story.
        
           | soferio wrote:
           | This looks great. My son uses "beast academy" from Art of
           | Problem Solving (https://beastacademy.com/), which is
           | fantastic. But mathacademy looks like it might be a good
           | competitor at the post year 6 level.
        
       | precompute wrote:
       | Similar to the "2 Sigma humor gap".
       | 
       | Also related: sharply increasing odds of social maladjustment /
       | mistreatment after a certain IQ score.
        
         | hgsgm wrote:
         | It's completely different. The only similarity is that a
         | standard deviation is involved.
        
       | lofatdairy wrote:
       | Also discussed in the aristocratic tutoring article posted here:
       | 
       | https://erikhoel.substack.com/p/why-we-stopped-making-einste...
       | 
       | and discussed here:
       | 
       | https://news.ycombinator.com/item?id=30698624
        
       | levzettelin wrote:
       | What's problematic about this? Why is this named a "problem"?
        
         | swyx wrote:
         | it is a core driver of compounding inequity, assuming the rich
         | can afford personal tuition and the poor cannot, which then
         | gives rich kids access to better opportunities to continue
         | being rich. it gives the lie to "equality of opportunity not
         | equality of outcome" because any game of consequence involves
         | outcome being exclusive opportunity.
        
         | choxi wrote:
         | I believe it's that Bloom's study implies that 1-on-1 pacing
         | and teaching produces the best learning results, but that Bloom
         | thought it was infeasible to scale to everyone: it's "too
         | costly for most societies to bear on a large scale"
        
         | trane_project wrote:
         | That personal tutoring leads to better outcomes is not the
         | problem. That will obviously be the case.
         | 
         | The problem is how to bridge the gap in a way more people
         | realize similar benefits.
        
         | elcomet wrote:
         | The problem is that we are not giving the best education we can
         | to everyone, as it is too costly to have one-to-one tutoring
         | for all. So is there a method that works as well as one-to-one
         | tutoring but affordable by our society?
        
       | Taikonerd wrote:
       | Has anyone used any of the "intelligent tutoring systems"
       | mentioned in this Wikipedia article?[0]
       | 
       | Lots of comments here are saying "we'll have an intelligent
       | tutoring system one day, based on LLMs" -- but there are many
       | such systems that already claim to be getting good results
       | now.[1] I'd be interested to hear from anyone who's used one.
       | 
       | [0] https://en.wikipedia.org/wiki/Intelligent_tutoring_system
       | 
       | [1] https://www.lesswrong.com/posts/vbWBJGWyWyKyoxLBe/darpa-
       | digi...
        
         | admsmz wrote:
         | I am a customer of mathacademy.com, an intelligent tutoring
         | system with a focus on math. Right now they are putting the
         | finishing touches on their courses like abstract algebra and
         | discrete mathematics. They already have linear algebra and
         | calculus.
         | 
         | It absolutely works as advertised and I highly recommend it.
        
           | SunghoYahng wrote:
           | I took a look at the site you mentioned, but I don't see how
           | it's useful for learning. It uses AI to determine what the
           | learner's current level of knowledge is, but the way the
           | learner learns seems completely typical. Then I may as well
           | read classic textbooks. It doesn't seem to me to be that
           | meaningful to determine the learner's current level of
           | knowledge.
        
             | admsmz wrote:
             | Can you explain your objections? I don't really understand
             | this comment.
        
         | trane_project wrote:
         | Going through the list, it seems like all of them only work on
         | very specialized subjects and/or are not published. I have read
         | about the DARPA tutor, but I don't think it's been made
         | publically available. So I doubt anyone here has used it,
         | unless they were in that specific branch of the military at the
         | time it was used.
         | 
         | I don't see how LLMs deliver on this. I think they could speed
         | up how the curriculum is generated, and I have played around
         | with ChatGPT to do just that, but the questions only need to be
         | generated once. It's the guiding of students through the
         | exercises that needs to be personalized, not the exercises
         | themselves.
         | 
         | As for their effectiveness, I am the author of Trane (see my
         | other comment). It's worked pretty well for me, but I am
         | splitting my time between my job, working on the software,
         | creating materials, and actual practice, so I can't really say
         | what the upper limits are until I can focus on just the last
         | part.
        
       ___________________________________________________________________
       (page generated 2023-02-25 23:00 UTC)