hngopher.com

       [HN Gopher] A/B testing gets misused to juice metrics in the sho...
       ___________________________________________________________________
        
       A/B testing gets misused to juice metrics in the short term
        
       Author : pdxdmz
       Score  : 415 points
       Date   : 2022-07-12 12:05 UTC (10 hours ago)
        
 (HTM) web link (www.zumsteg.net)
 (TXT) w3m dump (www.zumsteg.net)
        
       | mattgreenrocks wrote:
       | I recall when Booking.com rolled out the false urgency features.
       | I was amazed at how utterly trashy and desperate they were.
       | 
       | The problem is it's not subtle at all; there's a handful of those
       | features that, when combined, end up being overbearing and noisy:
       | "3 people looked at this listing within the past 3 days! 12 rooms
       | left at this rate!" I don't care. I'm looking to book business or
       | vacation travel. If a spot fills up I'll just go somewhere else.
       | It'll be fine either way.
       | 
       | I don't use them anymore for that reason. Old soul (me) is old.
       | (I'm probably in a minority, judging by their advertisement
       | budget.)
        
         | LeonM wrote:
         | Booking.com (or any other hotel/flight booking sites) are the
         | masters of dark patterns. This is what happens when software is
         | 'finished', companies start to optimize for profit, regardless
         | of customer experience.
         | 
         | But unfortunately, it works.
         | 
         | I've seen friends that I consider intelligent panic buy
         | tickets/hotels, "because prices are going up since the last
         | time I checked!"
         | 
         | Next time you want to book anything, browse around, ignore any
         | of the fake urgency notifications, ignore the price (while
         | staying broadly in your price-range, of course). Then when you
         | found a destination you like, open the page in a private
         | browsing window (or clear your cookies), and you'll see that
         | prices and availability are back to normal.
        
           | tnolet wrote:
           | Interestingly enough, booking this ridiculous A/B testing
           | almost from the start. Super hard data driven company. There
           | is a great book about them, written by 3 journalists, but
           | it's only in Dutch I think. https://www.amazon.com/Machine-
           | ban-van-Booking-com-Dutch-ebo...
        
         | iamben wrote:
         | It's amazing how much this works though. I remember getting a
         | call from my mum saying "the website is telling me London is
         | already 78% booked for these dates!" It felt ridiculous having
         | to say "Mum, it's March. You're staying there in November. I
         | promise you it'll be fine..."
        
         | coldcode wrote:
         | Having worked in the OTA space, every time I see their
         | (sometimes funny) ads, I want to call them Booking.Nope
         | 
         | OTA make comparisons a bit easier, but everything is negotiated
         | and contractually controlled to keep people from just going to
         | the hotel directly. Secret hotel prices (like HotWire if that
         | still exists (Expedia) or Travelocity's Top Secret hotels if
         | that still exists (also Expedia)) are an even more crazy
         | negotiation. Hotel Tonight at least used to contact the hotel
         | chains every day for that day's options, though since they were
         | bought by AirBnB who knows what they do.
         | 
         | These days I just find a nice hotel and book with them/their
         | system directly. Airlines too, since airlines fail to give all
         | their options to the OTAs.
         | 
         | In some ways its sad that aggregators don't work all that well
         | in the main travel industry (Flight/Hotel/Car) but travel is
         | extremely complicated, highly competitive and still very
         | fractured except for airlines. Pricing comparisons are not very
         | useful since they are so mangled and obfuscated that you may as
         | well just go to several sites and do it yourself by hand. For
         | example Spirit Airlines used to give us prices for their
         | tickets at $X and were always cheaper than everyone else; yet
         | once you booked at that price they hit you for everything extra
         | (bags, res, for all I know oxygen) then our customers
         | complained we were fooling them and the real cost was higher.
        
       | hbn wrote:
       | I've talked about this on HN but I'll say it again.
       | 
       | A couple years ago, after being an Android fan for the better
       | part of a decade, I finally bought myself an iPhone and pried
       | myself away from Google's ecosystem wherever I could. And Apple
       | didn't even need to do any work for me to make this decision. It
       | was the years of abuse from Google that you experience when you
       | decide to use a Google product or service. And a big part of that
       | was the constant A/B/C/D/E/F testing. I never felt like I was
       | using a complete product, everything felt like a constant beta
       | that could be changed or rearranged at any point, and I was just
       | doing free testing work for them while they harvest all my data.
       | 
       | Every app update was a risk of the app rearranging itself, or
       | features appearing/disappearing. Eventually it didn't even come
       | from app updates in the Play Store, and new interfaces would just
       | appear one day when a server somewhere marked your account as
       | being in the group that gets the new UI. This app that you were
       | familiar with could at any point be rearranged when you open it
       | on any given day. Then maybe a week later you open it and it's
       | back to how it was before. A button you thought was here suddenly
       | isn't, and you question whether something actually changed or if
       | you're losing your mind. It's a subtle gaslighting that
       | eventually I couldn't stand any more.
       | 
       | To me, A/B testing means you don't respect your users. You see
       | them as just one factor in your money machine that can be poked
       | and prodded to optimize how much money you can squeeze out of
       | them. That's not to say a company like Apple is creating products
       | out of the goodness of their heart, but at least it feels like it
       | was developed by humans who made an opinionated call as to what
       | they thought was the right design decision, and what they would
       | want to use. And in my 2 years of owning an iPhone, I've never
       | opened my reminders app to find out that it's completely
       | unrecognizable, or my messages app has been renamed or rethemed
       | for the umpteenth time.
        
         | bentcorner wrote:
         | Google recently shut off Hangouts on me with a flight (since
         | several of my contacts reported that Hangouts worked fine for
         | them).
         | 
         | It's kind of mind boggling they'd decide to do that - the
         | replacement they direct me to (Google Chat) doesn't even have
         | feature parity so I just dropped them and moved my social
         | circle using Hangouts to a different app (since at this point
         | they all faced the same problem and we decided on a different
         | platform).
         | 
         | I'm really curious how the A/B testing for this went down -
         | Google is willingly throwing customers away because somebody
         | wants to pump numbers for a new app that is objectively worse
         | than the old one.
         | 
         | At this point Google Maps is the only product that is keeping
         | me with them, but even that one is beginning to wear thin.
        
           | jklinger410 wrote:
           | > Google Chat) doesn't even have feature parity
           | 
           | Which features are you missing?
        
             | cupofpython wrote:
             | check the AB test results
        
             | bentcorner wrote:
             | Video chat. It redirects me to Google Meet (yet another
             | app!) and the model that Google Meet uses is completely
             | backwards (create a meeting, then invite people) compared
             | to how I use Hangouts (call someone).
             | 
             | It's a complete pita and the family-acceptable-factor is
             | low.
        
         | adrr wrote:
         | We AB tested a performance enhancement to our frontend web app
         | to show that speed had significant benefits to the business. we
         | use the results justify the investment cost. We spent the next
         | six months working on making the site faster because of it. It
         | is a tool. How would you measure things without ab testing?
        
           | fisf wrote:
           | The fact that you need an A/B test to demonstrate that
           | frontend performance has an impact (on user experience first
           | and foremost) in 2022 speaks volumes.
        
             | cupofpython wrote:
             | this is the stance I take on AB tests. The are objectively
             | good at things you shouldnt need objective evidence for
        
             | ajmurmann wrote:
             | But how big is the impact and does that return compare to
             | that of other investments you aren't making because you are
             | improving performance instead?
        
             | jklinger410 wrote:
             | Yet it says nothing. Large companies are slow and stubborn.
        
         | costcofries wrote:
         | "To me, A/B testing means you don't respect your users. You see
         | them as just one factor in your money machine that can be poked
         | and prodded to optimize how much money you can squeeze out of
         | them. "
         | 
         | Your perspective is extremely short-sighted. A/B testing can
         | result in this type of behaviour but that's just poor A/B
         | testing. Good A/B testing focuses on removing distractions from
         | the experience and helping users derive more value from the
         | product. Bad A/B testing tries to make things more
         | discoverable, where discoverability is often just noise and
         | distractions. Good A/B testing ensures that the money machine,
         | as you put it, pays its dues to users by making the product
         | experience delightful.
        
           | NyxWulf wrote:
           | A/B testing is a tool, and as Deming said, the aim defines
           | the system. In your definition of Good, you are defining the
           | aim.
           | 
           | I've been in a similar situation, where I created a
           | relatively sophisticated A/* testing and control system. My
           | idea of good use of the system ended up being very different
           | from how the team employing the system thought about it.
           | 
           | I believe that is part of the point of the post, that
           | unintended, and even unimagined side effects plague even the
           | best of ideas.
        
           | koheripbal wrote:
           | Do people honestly think that Apple doesn't do A/B testing?
        
             | teakettle42 wrote:
             | Apple unequivocally does not do active A/B testing on their
             | users by changing applications or the operating system out
             | from under them.
        
               | selfhoster11 wrote:
               | It's almost like they use a dedicated focus/beta testing
               | group or something, instead of making all their users
               | join in.
        
               | daniel_reetz wrote:
               | I can't think of an Apple engineer who would make such a
               | definitive statement on behalf of the company, in public.
               | I also can't think of anyone working there who would have
               | definitive knowledge across all apps and software,
               | because disclosure would prevent such knowledge.
               | 
               | Apple introduced A/B testing to Testflight in 2017 and
               | more A/B testing to the App store this year.
               | 
               | Source: I worked at Apple, but not in software.
        
               | mrkstu wrote:
               | Yep- it does this in betas, but not in production apps.
        
           | manmal wrote:
           | UX feedback is usually retrieved via user interviews.
           | 
           | I personally have never heard from a product person ,,Let's
           | A/B test whether this is delightful". And I think that's
           | because delightfulness or satisfaction is impossible to
           | quantify in A/B tests. You only get to measure things like
           | engagement, signup rates, retention etc. - cold hard taps on
           | the screen, and no more.
           | 
           | And I must say that I'm glad that, right now, apps can't just
           | scan my face (or cortisol levels, or pheromones or...) for
           | emotional clues while I read their pesky push notifications
           | that want to coax me back into their daily active user base.
        
           | loriverkutya wrote:
           | I never seen an A/B test where the goal was not to maximise
           | profit for the company.
        
             | koheripbal wrote:
             | This comment is way too broad and cynical.
             | 
             | It reminds me of the sentiment I sometimes hear from my
             | teenage children before I explain to them that the world
             | isn't a Bernie Sander/Reddit gotcha soundbite and that
             | reality is complicated.
        
               | skupig wrote:
               | Let me be even broader and more cynical: everything a
               | company does is ultimately to maximize profit.
        
               | jen20 wrote:
               | > This comment is way too broad and cynical
               | 
               | Or, as anyone who has been on the internet since the 90s
               | will tell you, bang on the damn money.
        
               | ThalesX wrote:
               | What would the other advantages of A|B testing be for a
               | corporation instead of widening the funnel, increasting
               | lifetime revenue or other such bottom-line focused goals?
        
               | notriddle wrote:
               | Assuming you aren't running a charity, increasing the
               | bottom line is always your goal. The difference between a
               | company that burns user goodwill in the process, and one
               | that doesn't, is the difference between short-term and
               | long-term thinking.
               | 
               | The downside with hired testers is that they are unlikely
               | to be a statistically representative sample of your
               | target market. I don't think this is actually a problem
               | for most startups or new products being launched by major
               | players, since a lot of UX issues early in product
               | development will be obvious to _anyone_ who isn't the
               | original developers, and the number of actual users you
               | have won't be big enough to serve as a statistically
               | representative sample of your target market anyway.
        
               | throwaway98797 wrote:
               | you can genuinely find out what message makes more sense.
               | 
               | explaining what your product does is hard when you're the
               | product expert. A/b testing ad copy helps folks parse
               | what your product does.
               | 
               | If your product is actually good then having more people
               | understand what it is, is also good.
        
               | dvtrn wrote:
               | And all of that amounts to what if no one decides to
               | cough up money to use your product?
        
               | pessimizer wrote:
               | "Reality is complicated" is often a way to rationalize
               | "I'm going to do it how I've always done it, regardless
               | of how it looks."
        
             | costcofries wrote:
             | That's probably intentional and a sign that it was a well
             | designed A/B test :)
        
             | mattnewton wrote:
             | Over what time horizon? You should optimize short term
             | metrics (for feasibility) that are (as best you can
             | approximate) the cause of long term profits. The profits
             | part is what makes it the business, using metrics for
             | decisions are what makes it an A/B test. I think where
             | people run into trouble is when they are optimizing only
             | for this quarter's revenue directly because long term value
             | is too hard to measure.
        
             | adrr wrote:
             | Isn't that any goal of an organization? Everything ladders
             | up to the success of the company. Especially if you're a
             | corporation where you have fiduciary responsibility to the
             | shareholders.
             | 
             | AB testing is beyond UI. It could be AB testing algorithms
             | for recommendations or search results. When i worked for a
             | CPG company, we would AB test the sample sizes in the
             | boxes. To answer the question what is right amount of
             | product for a sample. We would test shipping to speed, was
             | it worth the upgrade to expedited shipping in terms of LTV
             | to justify the extra cost.
        
             | bentcorner wrote:
             | What would an A/B test where the goal was not to maximize
             | profit even look like? The very act of creating an A/B test
             | is because the worker wants to improve something in search
             | of higher profits.
        
             | mcrad wrote:
             | Oh yeah? I've never seen a marketing department with a
             | decent understanding of microeconomics. Most in fact are
             | trying to maximize budget ie. expense.
        
           | saagarjha wrote:
           | Well, then, the reason why people hate A/B testing is because
           | everyone does it wrong. People can write fast software in
           | Python too but the reason it's known as being a slow language
           | is because it invariably gets used differently.
        
           | drewcoo wrote:
           | "You're doing it, but in some subtle but very important way
           | that's not at all obvious to you, you're doing it wrong."
           | 
           | How many tech startup patterns fit that? That's a sign that
           | either the pattern does not generalize well or it's snake
           | oil.
        
             | selfhoster11 wrote:
             | > "You're doing it, but in some subtle but very important
             | way that's not at all obvious to you, you're doing it
             | wrong."
             | 
             | Doesn't that exact statement describe how Agile (or many
             | other concepts, really) is used in a lot of companies?
             | There's nothing wrong with it in principle, but practice is
             | all over the place.
        
             | Lutger wrote:
             | I don't think so. A/B testing is just not a cure-all and
             | alternative for good design, that is all. I mean, if the
             | pattern is 'don't bother thinking too much about ux, just
             | A/B test everything in PROD to death', then yeah, you are
             | right.
             | 
             | However, there are many products which allow users to enter
             | into a beta or test group, where they are the willing
             | subjects to their experiments (in exchange for the latest
             | shiny new stuff). This has the aspect of consent, leaving
             | the 'stable ring' free of such variability. The fact that
             | google and many startups are not using such consent and
             | offering stability, doesn't mean it can't be done or isn't
             | done.
        
             | costcofries wrote:
             | I think of it as validating and improving imperfect user
             | assumptions.
        
               | thatguy0900 wrote:
               | This reads exactly like parody corporatespeak jargon.
        
               | MegaButts wrote:
               | Clearly it's the users who are wrong with their stupid
               | assumptions and we will correct them with our mandatory
               | weekly update.
        
           | DiggyJohnson wrote:
           | I'm not sure this addresses the core criticism of using your
           | users as a testbed.
        
           | jcelerier wrote:
           | > Your perspective is extremely short-sighted.
           | 
           | it's the perspective of the normal users.
           | 
           | every time i'm using a website and it does not behave exactly
           | the same than for other people or I notice some AB testing,
           | in my head it goes "who the fuck these people think they
           | are?". The computing experience _must_ be consistent and
           | repeatable. If I wanted something that can change depending
           | on the current position of the stars I 'd ask another human,
           | not a computer.
           | 
           | A/B testing can result in this type of behaviour but that's
           | just poor A/B testing.
        
           | svnpenn wrote:
           | > Your perspective is extremely short-sighted
           | 
           | no, yours is. if some company wants to do some testing, they
           | SHOULD PAY users for that. A/B testing is just exploiting
           | users to get free testing.
        
         | ChrisMarshallNY wrote:
         | _> means you don 't respect your users._
         | 
         | I was just pontif- er, _talking_ about this to someone, a
         | couple of days ago.
         | 
         | I _love_ the users of my products. Most of my products are
         | free, and are carefully-crafted, highly-polished, _complete_
         | deliverables, and I fret over how they are used -even if by a
         | tiny number of end users-, like a nervous hen. I do what I do,
         | out of love for the craft, and out of a genuine desire to make
         | people 's lives easier, through the technology I have at my
         | disposal.
         | 
         | It is my belief that most tech companies _despise_ their user
         | base. Users are little more than cattle, to be fattened and
         | slaughtered.  "Caring about the user" means optimizing for
         | "engagement," or keeping them trapped within their own
         | ecosystem. John Oliver did a rant about this, recently[0]. It
         | has nothing to do with actually caring about the user, or
         | solving their problem. It is about _harvesting_ users.
         | 
         | In fact, my discussion about this, came about, because someone
         | wanted to keep users inside the app I'm writing, as opposed to
         | linking them to a more familiar app, on their phone (for the
         | record, it was for videoconferencing). Linking is a "no-
         | brainer," as I can link out to _dozens_ of installed apps,
         | using the simple URL scheme method, built into iOS[1], and
         | "keeping them in the app," would have required several months
         | of extra work, polluting the app with megabytes of junk code,
         | because I'd need to use SDKs, and also kill the ability to
         | easily scale to add new clients (contrary to popular belief,
         | Zoom is not the only videoconferencing option). It would also
         | have possibly put us on the hook, legally, for what happened in
         | those videoconferences.
         | 
         | [0] https://youtu.be/jXf04bhcjbg?t=638
         | 
         | [1] https://developer.apple.com/documentation/xcode/defining-
         | a-c...
        
         | mgoetzke wrote:
         | Recently they remove YouTube PiP on iOS. Then it came back to
         | my device but not others in the family. We pay for YT Premium.
         | This is beyond infurating
        
           | [deleted]
        
         | bratbag wrote:
         | Depends on where A/B testing is used.
         | 
         | If it's something one-and-done (like different permutations of
         | a signup flow to see what is easier for users), then I don't
         | see the harm in it.
        
         | dakial1 wrote:
         | The problem is not the AB testing, as it can be a good thing to
         | improve the experience, the problem is poorly set, short
         | sighted, OKRs. The author points that in the text, as he
         | mentions many times that the leadership never asks the right
         | questions, mainly how it will impact the client in the long run
         | (being NPS evolution, lifetime value, etc..)
        
         | brynjolf wrote:
         | Ironically the place I'm getting forced A/B testing now is the
         | Playstore. They moved the update apps section so it is now 2-3
         | clicks more, making me have to make a seperate shortcut just to
         | reach the app update quickly.
         | 
         | The point is of course to make it annoying to manually update
         | apps and enable auto update. I have been burned too many times
         | with a auto update so I refuse.
         | 
         | This wasn't enough, they really want to force me enable auto
         | updates to the point of the update section of the app having
         | 50% of the visible space on my big screen being covered with a
         | message to enable auto update over WiFi. [0]
         | 
         | Whoever is doing this at Google... Stop. Just stop. It is
         | cringe.
         | 
         | [0] https://i.imgur.com/MbXn9gy.jpg
        
         | Lutger wrote:
         | > To me, capitalism (A/B testing) means you don't respect your
         | citizens (users). You see them as just one factor in your money
         | machine that can be poked and prodded to optimize how much
         | money you can squeeze out of them.
         | 
         | You just described doing business in todays world.
         | 
         | Being a bit more generous towards A/B testing, I would make a
         | counter point: _not_ doing any kind of user testing, of which
         | large scale automated A/B tests are just a subset, means you
         | don't respect your users. Because it means you just assume you
         | know what their experience is like, or worse: you don't even
         | care about it and bother to learn something.
         | 
         | Your complaint seems to be more about the scale and aspect of
         | automation honestly, and continuity of the services, which is a
         | valid complaint against Google but not about A/B testing in
         | general.
        
           | saagarjha wrote:
           | > Because it means you just assume you know what their
           | experience is like, or worse: you don't even care about it
           | and bother to learn something.
           | 
           | A/B tests are not the only, or even the best, way of
           | collecting user feedback.
        
         | fouric wrote:
         | A few months ago, I came to the belief that Google is the ADHD
         | toddler of user-facing software development - absolutely unable
         | to sit still and concentrate on anything, hence the constant
         | UI/UX churn, half-baked products, and graveyard[1] of shiny
         | things that they worked on for a few years before abandoning.
         | 
         | Google seems to be really good at making _developer tools_ like
         | Borg and Blaze - however, I think that as an organization they
         | have some deficit that makes them _not responsible enough to
         | develop user-facing software_ (like, uh, an operating system).
         | 
         | Maybe Google would be better as a B2B company.
         | 
         | [1] https://killedbygoogle.com/
        
         | aceazzameen wrote:
         | You nailed it. Google is constantly forcing users to relearn
         | most of their products year after year. Give me Google products
         | from a decade ago and I'd still be happy. Now I'm moving on
         | from Google also. It's an untrustworthy brand.
        
         | corrral wrote:
         | > Every app update was a risk of the app rearranging itself, or
         | features appearing/disappearing.
         | 
         | This shit drives my parents insane. Me too, when I have to help
         | them. I've had to spend tens of seconds looking at a major
         | screen in the _phone app_ , of all things, to figure out WTF
         | I'm looking at so I could help _them_ figure out what was up.
         | Re-arranged every update (or new phone) for absolutely no
         | reason, terrible affordances, poor use of _their own_ design
         | language. Ugh.
         | 
         | I'd get them on iOS but they need larger screens and the $400
         | small iPhones (what I have) are already more expensive than
         | they think a phone "should" cost, so they keep buying $200
         | Android phones about once a year (hoping the next one will be
         | better) and not being able to use them because the UI is
         | garbage.
        
           | bombcar wrote:
           | It's doubly sad because the larger phone or iPad would be
           | perfect, but every year it's another $200 to tell Google
           | they're doing the right thing.
           | 
           | At least on iPad/iPhone you can set the apps to access Google
           | mail, etc, which doesn't change as often, but still too
           | often.
        
           | selfhoster11 wrote:
           | At one point I was trying to set up my grandma with a popular
           | video calling app on a dedicated device so we could stay in
           | touch.
           | 
           | Before I could give her the freshly grandparent-proofed
           | device, said video calling app upgraded on my parents' PC
           | first and changed _literally every single element of the UI_
           | beyond recognition. To someone the age of my grandma, that
           | would be literally like bricking the device remotely, because
           | none of the buttons would look the same, and she would not be
           | able to work out how to use the new interface.
           | 
           | STOP CHANGING THINGS! Even if the new UI is better
           | (debatable), some people just like or rely on a particular
           | layout to operate the device or app. Don't rearrange without
           | giving a ~permanent setting to use the old layout.
        
             | corrral wrote:
             | > STOP CHANGING THINGS! Even if the new UI is better
             | (debatable), some people just like or rely on a particular
             | layout to operate the device or app. Don't rearrange
             | without giving a ~permanent setting to use the old layout.
             | 
             | The fact that we're 15 years into smartphones being popular
             | and that phones & computerized address books were basically
             | fully-solved interfaces _long_ before those took off, and
             | they 're _still_ fucking around with phone app UI in big
             | ways, is a sign of some kind of institutional or industry-
             | market failure to me. Or both. Allowed, I suppose, because
             | Google 's market position (i.e. monopoly across _several_
             | markets) both gives them the surplus profit (i.e. rents) to
             | fuck around and waste money on _harmful to users_ crap like
             | phone app redesigns, and insulates them from any actual
             | threat to their profit due to those bad choices.
        
               | thatguy0900 wrote:
               | Well, all of these companies employ teams of ui and ux
               | designers. They will never show up to work one day and
               | tell their boss the ui is done. No matter how perfect the
               | design currently is
        
               | cupofpython wrote:
               | of course not. that's the responsibility of the company
               | management to say "okay UI is good lets divert resources
               | elsewhere"
        
               | [deleted]
        
             | zanecodes wrote:
             | In the same way that we try to use semantic versioning to
             | prevent unintended breakages in dependents of our public-
             | facing APIs, I think we developers should start considering
             | the UI/UX of our apps to be their public-facing API, and
             | use the philosophy of semantic versioning accordingly.
             | 
             | When we rename or remove a function from an API, that's a
             | breaking change, any dependent software will no longer work
             | unless it's modified to take that change into account.
             | 
             | Similarly, when we move, rename, or remove a UI element,
             | keyboard shortcut, or visual affordance, that should be
             | considered a "breaking change" for our dependents, the
             | humans on the other side of the screen. And in the same
             | way, we should avoid making such changes unless the long-
             | term benefit of doing so outweighs the short-term cost.
             | Plus, users will know that moving from 7.x.x of your
             | software to 8.x.x will require them to relearn some aspect
             | of it.
        
           | jl6 wrote:
           | If you put a dollar value on your own time spent providing
           | training and tech support to your parents, the iPhone options
           | start to look much cheaper.
        
             | corrral wrote:
             | I've considered just getting them one but that's _another_
             | new interface for them to learn. Though one that 's much
             | more stable across years.
        
         | amelius wrote:
         | Yes, I hate Apple, but I'm starting to hate Google even more.
         | One of these days, I might switch too.
         | 
         | Why can't we have nice things?
        
         | quickthrower2 wrote:
         | I moved back to iPhone in the 2015s onwards (I think) because
         | of the crap quality of devices available, and the bloatware
         | atop if android.
         | 
         | In many cases the hardware was so poor it was hard to make a
         | call due to the touchscreen.
         | 
         | Since the primary thing I want the phone to do is make a call I
         | switched to the "it just works" camp and haven't regretted it.
         | 
         | Except getting photos off the phone. Until I realised the best
         | tool for that is ... Ubuntu!
        
       | twawaaay wrote:
       | I think the main issue is that testing is misused to create
       | better version of something when it should be used to create
       | knowledge.
       | 
       | So if you do testing and it gives you some kind of result, the
       | crucial step is trying to understand what it really means, is
       | there something we can learn from it.
       | 
       | Unfortunately, this is also the hard part that requires actual
       | effort and intelligence and is difficult to scale -- and so is
       | frequently skipped.
        
       | Eddy_Viscosity2 wrote:
       | I'd say this is a sub-category of the saying: "There is nothing
       | in this world that an MBA can't and won't make worse".
       | 
       | It happens when people change perspectives from building and
       | sustaining businesses to exploiting and squeezing every employee,
       | supplier, and customer for the last drop.
        
         | dang wrote:
         | Please don't take HN threads into flamewar. The one you started
         | here was particularly shallow and gratuitous.
         | 
         | https://news.ycombinator.com/newsguidelines.html
        
         | jollybean wrote:
         | This is just casual bigotry.
        
           | Eddy_Viscosity2 wrote:
           | I'll say that there are managers out there who do fantastic
           | and important work, but they are the ones that have a build &
           | sustain perspective. If you feel offended by this comment,
           | its because you are in the squeeze & exploit group and you
           | know it.
        
             | jollybean wrote:
             | Wow, this is some really strange gaslighting.
             | 
             | I'm an Engineer and an MBA, very proud of both
             | designations, and proud of almost all of my colleagues in
             | both of those fields.
             | 
             | I'm a better person for knowing the people I worked and
             | studied with.
             | 
             | I'm not hugely offended by the OP comment (or even yours),
             | it's not a big deal, rather I'm pointing out that it is
             | straight up a kind of misplaced bigotry - maybe not the
             | best word, but it's correct. It's more ridiculous than
             | anything.
             | 
             | Just re-read what you wrote: "because you are in the
             | squeeze & exploit group and you know it"
             | 
             | Seriously? What the F is your problem? Why would you even
             | conceive to write that to a random commenter on HN?
             | 
             | Do you folks not see this weirdly dark and perverse
             | cynicism coming out here? What's wrong with you people?
             | 
             | I think maybe there is an odd, intellectually lazy thing
             | happening whereby some people, possibly lacking the
             | understanding of a lot of the mechanics of the 'business
             | world', and knowing that 'bad business people exist' ...
             | just want to throw it onto 'MBAs' for some strange reason,
             | not understanding how odd and misinformed that
             | rationalization is. It's really weird. Guys, stop this,
             | it's just misinformed.
        
           | dang wrote:
           | Please don't feed flamewars on HN, regardless of how bad
           | another comment is or you feel it is. It just makes
           | everything worse.
           | 
           | https://news.ycombinator.com/newsguidelines.html
           | 
           | Edit: you broke the site guidelines particularly badly later
           | in the thread. We ban accounts that do that, so please don't
           | do it again. More here:
           | https://news.ycombinator.com/item?id=32072856.
        
           | verve_rat wrote:
           | Calling that bigotry is offensive to anyone that has been
           | subjected to actual bigotry. You should be ashamed.
        
             | dang wrote:
             | Taking HN threads further into hellish flamewar is against
             | the site guidelines, so please don't.
             | 
             | Attacking someone personally is not ok. We're trying to
             | avoid the online callout/shaming culture here: https://hn.a
             | lgolia.com/?sort=byDate&type=comment&dateRange=a...
             | 
             | Having a flamewar about the definition of a word is
             | particularly pointless because different people have very
             | different associations with the same word--especially when
             | you remember that this place literally has commenters
             | coming from all over the world.
             | 
             | Edit: you've unfortunately been breaking the site
             | guidelines in other threads as well, e.g. here:
             | https://news.ycombinator.com/item?id=32050947. We
             | eventually ban accounts that do that, so please don't do
             | that. If you wouldn't mind reviewing
             | https://news.ycombinator.com/newsguidelines.html and taking
             | the intended spirit of the site more to heart, we'd be
             | grateful.
        
             | jollybean wrote:
        
               | aixi wrote:
               | You're being responded by a bunch of resentful engineers,
               | but you're correct; it's the definition of bigotry.
        
               | dang wrote:
               | Flamewar comments will get you banned here, regardless of
               | how wrong someone else is or you feel they are. If you'd
               | please review
               | https://news.ycombinator.com/newsguidelines.html and
               | stick to the rules when posting here, we'd appreciate it.
        
               | aixi wrote:
               | yeah, my bad
        
               | pjc50 wrote:
               | The problem here is the use of identity "I AM an MBA"
               | rather than mere posession "I have an MBA" or even "I
               | have been awarded an MBA". You've made a certificate into
               | part of your identity and are getting into fights on that
               | basis.
               | 
               | "Engineer" can at least be a verb. I don't think MBA can?
        
               | verve_rat wrote:
               | I hate to break it to you, but if you have a problem with
               | 'technical types', this might not be the best place to
               | hang out...
        
               | Jasper_ wrote:
               | Before you commented, I thought "there is nothing in this
               | world an MBA can't make worse" was a bit too harsh. Now,
               | I see it's actually accurate.
        
               | dang wrote:
               | Flamewar comments will get you banned here, regardless of
               | how wrong someone else is or you feel they are. If you'd
               | please review
               | https://news.ycombinator.com/newsguidelines.html and
               | stick to the rules when posting here, we'd appreciate it.
        
               | bobkazamakis wrote:
        
               | Asooka wrote:
               | Now _that 's_ bigotry.
        
               | grapeskin wrote:
               | MBAs aren't something people are born into or end up
               | through no choice of their own. Bigotry against MBAs is
               | impossible.
               | 
               | It's like being a loan shark or pimp. People actively
               | choose that life and society talks poorly about them
               | because of the social harm they bring. There's nothing
               | stopping someone from washing their hands of any of those
               | things and being productive.
        
               | jollybean wrote:
        
               | [deleted]
        
               | AlexandrB wrote:
               | The best communicators I've worked with have generally
               | been scientists, especially physicists. Many MBAs seem to
               | communicate in platitudes and vague business jargon that
               | makes simple concepts sound complicated. This is obvious
               | when you start probing for specifics after terms like
               | "value add" or "synergy" are thrown around.
        
               | jen20 wrote:
               | > MBA's are generally better communicators
               | 
               | "Convenience Fee" "Synergy" "Circle back to this"
               | "Double-click on this issue"
        
               | Miraste wrote:
               | > MBA's are generally better communicators and much
               | easier to get along with.
               | 
               | This is exactly the problem. The MBA skill set is
               | climbing corporate hierarchies. Not product design,
               | customer service, empathy, or long-term thinking; only
               | promotions. That's the whole reason anyone gets an MBA.
               | 
               | MBAs often lack domain knowledge, but even when they have
               | it, the MBA problem remains because their priorities are
               | unrelated to product improvement. They spend their time
               | ingratiating themselves and taking over the top layers of
               | management, hiring more MBAs all the while, until a
               | formerly functional company ends up like Boeing or
               | Sculley's Apple.
               | 
               | Too many MBAs will inevitably degrade product quality.
        
               | jollybean wrote:
               | "You can tell the witches by their propensity to gather
               | with one another - which is proof indeed they are
               | witches, and must be burned, lest they infect the rest of
               | us good people!"
               | 
               | My. God. Man.
               | 
               | The amount of misunderstanding, arbitrarily 'made up
               | stuff' and naive speculation on this thread ... it's just
               | ridiculous.
               | 
               | It's kind of general professional designation, and
               | valuable education for those with no business background
               | (though likely not useful unless from one of the better
               | schools). I'll gather that more than 1/2 of them come
               | from hard professional backgrounds (i.e. CFA, Finance,
               | Eng, Science, Econ) and that they may not be directly
               | oriented towards a specific 'role or trade' frankly is
               | not that important.
               | 
               | Almost everything that has been specifically said in this
               | thread is a bit ridiculous (almost comical), and a lot of
               | it kind of petty as well.
               | 
               | "Those sneaky manipulators!"
               | 
               | "Their priorities are this or that!"
               | 
               | And really, really weird the amount of hyperbole here.
               | 
               | I'm going to guess that maybe you don't really know MBA
               | is (that's ok, it's not an insult, most people don't),
               | and because you don't speak the language of finance,
               | macro/micro economics, marketing, you may be inclined to
               | assume that those things 'are not irrelevant', which is
               | and oddly peculiar form of ironic glibness.
               | 
               | It's a generalist designation.
               | 
               | Some of them are good, some are not.
               | 
               | It's not that big of a deal, and it's not at all like
               | what some people are implying here.
        
               | Miraste wrote:
               | This is a website for a startup accelerator, not a pure
               | tech site. Almost everyone here, including myself, has
               | plenty of contact with MBAs. You might consider that when
               | reading all the negative feedback you've received.
               | 
               | > the language of finance, macro/micro economics,
               | marketing
               | 
               | This is like saying people get computer science degrees
               | to learn about programming. Is it important for the job?
               | Yes. Will you learn it in the program? Yes. Is it why
               | people pursue the degree? Absolutely not. An MBA is an
               | expensive, time-consuming certification. Nobody gets a
               | graduate degree in business administration because they
               | enjoy the classes. People pursue them because they
               | provide opportunities for career advancement at the
               | higher levels.
               | 
               | > Some of them are good, some are not.
               | 
               | This is the core of the problem. For a company, it
               | doesn't matter whether the MBAs are good, because they're
               | almost definitionally climbers and as climbers their
               | incentives are fundamentally misaligned with long term
               | success.
               | 
               | I'll use Boeing as an example again, because it's such a
               | textbook case. Management (composed primarily of MBAs)
               | discovered senior engineers were expensive and laid them
               | all off. Twenty years later, their planes crash, their
               | capsules leak, and their rockets don't work well enough
               | to launch at all. Did they make the wrong decision? Well
               | it looked very good on a lot of quarterly revenue
               | reports, EBITDA was up, and the CEO responsible for most
               | of those layoffs (James McNerney, MBA) retired well
               | before the disasters to accolades, trusteeships, and not
               | tens but hundreds of millions of dollars. From his
               | perspective, he achieved the goal behind getting an MBA
               | in the first place.
               | 
               | There's nothing wrong with ambition, but being wary of
               | people with too much of it has been a tenet of humanity
               | since before the refinement of bronze.
        
               | verve_rat wrote:
               | Your behaviour here isn't a good argument for this
               | thread's feelings about MBAs being "obstinate or
               | unreasonable".
        
               | at_a_remove wrote:
               | Most shysters are great communicators and quite a lot of
               | fun to be around. _It 's how they operate_.
        
               | badRNG wrote:
               | > "obstinate or unreasonable attachment to a belief,
               | opinion, or faction; in particular, prejudice against a
               | person or people on the basis of their membership of a
               | particular group"
               | 
               | This is just the fallacy of definition. Technically being
               | anti-racist or anti-terrorism is "bigotry" on a
               | fallaciously "strict" interpretation of some short
               | definition of the word if one refuses to acknowledge its
               | social and historical context (and how the word is used
               | in the real world.)
        
               | shijie wrote:
               | Not every critical or mean-spirited comment is bigotry.
               | You got your fee-fees hurt because there's probably some
               | truth to the comment, even if it's mean. Buck up, quit
               | complaining, prove the parent comment wrong in your day-
               | to-day work. Right now you're just deepening a few MBA
               | stereotypes at the moment.
        
               | dang wrote:
               | Flamewar comments, name-calling, and personal attacks
               | will get you banned here, regardless of how wrong someone
               | else is or you feel they are. Please don't post like this
               | to HN.
               | 
               | If you wouldn't mind reviewing
               | https://news.ycombinator.com/newsguidelines.html and
               | taking the intended spirit of the site more to heart,
               | we'd be grateful.
        
               | Bancakes wrote:
        
               | jollybean wrote:
               | I'm both a Software Engineer and an MBA, and I can assure
               | you that you're wrong, and with a comedic, glib kind of
               | 'Musk-ian' lack of self awareness that's weirdly common
               | among technical types. I mean it's getting funny now, I
               | was hoping that your comment was satire (!), alas, no.
        
               | ben_w wrote:
               | You can only choose your own responses to what you
               | experience, not other people's.
               | 
               | Engineering without professional leadership leads to
               | wasted effort, solutions in search of problems, and also
               | occasional successes such as Linux.
               | 
               | Leadership without engineering (or equivalent in other
               | fields) is the B-Ark.
               | 
               | If you let yourself respond with anger, _no matter how
               | justified you feel_ , you're going to end up in the
               | second group -- leadership is a position of strength, and
               | if there's one thing people like even less than sore
               | losers, it is sore winners.
        
               | dang wrote:
               | If you keep posting like this we're going to end up
               | having to ban you again, so please stop.
               | 
               | Flamewar comments are not ok, and personal attacks are
               | very not ok. No more of this please, regardless of how
               | wrong someone else is or you feel they are.
               | 
               | https://news.ycombinator.com/newsguidelines.html
        
               | Bancakes wrote:
               | >You're wrong because..... ugh! you just are, okay?!
        
               | dang wrote:
               | Posting like this will get you banned here. If you'd
               | please review
               | https://news.ycombinator.com/newsguidelines.html and
               | stick to the rules when posting here, we'd appreciate it.
        
               | josefx wrote:
        
       | mschuster91 wrote:
       | My biggest issue with A/B testing isn't even mentioned here...
       | gaslighting your customers is absolutely _not OK_. Particularly
       | with older people, the constant  "where the f..k did Outlook now
       | put feature XYZ" (in the case that comes to my mind, the CC bar
       | which used to be tab-reachable, now you have to tab+space or
       | manually click on the tiny gray "cc" letters) onslaught is just
       | absurd. When you change how applications behave without telling
       | the users, it's a direct attack on their muscle memory at best
       | and makes them question their sanity at worst.
       | 
       | My second biggest issue is: it's _rare_ that companies offer
       | actual, live-human support these days anyway. When marketing adds
       | A /B testing, shit becomes _really_ annoying if something breaks
       | as a result - usually the phone lines are suddenly flooded, the
       | agents have no idea what has happened either and try to reproduce
       | and figure out what 's going on (and sometimes _can 't_ because
       | they aren't part of the test group!), and so even people who
       | haven't been in the testing group are going to be very pissed
       | off.
       | 
       | IMHO, A/B testing without explicitly notifying the customers _in
       | advance_ should be banned by law, and that ban be harshly
       | enforced. Customers are not guinea pigs, and with the rise of
       | elderly people on the Internet this becomes an actual public
       | safety issue (as ever-changing stuff makes it easier for
       | scammers!).
        
         | Akronymus wrote:
         | Not to mention the lovely "This option that used to be here no
         | longer is here." getting the response of "its still here (for
         | me)". Youtube specifically loves to do this to me.
        
         | mjburgess wrote:
         | You're downvoted, but this issue is more common than it seems;
         | and I agree, more serious than it seems.
         | 
         | You're describing adversarial UI changes to small populations
         | _of then unsupported_ customers. This can have outsized impacts
         | on vulnerable populations, eg., esp. the elderly.
        
         | emsixteen wrote:
         | > it's rare that companies offer actual, live-human support
         | these days anyway.
         | 
         | This is one of my most intense frustrations in the modern age.
         | Complete and utter disrespect for your customers' time and
         | knowledge.
        
         | johnnymorgan wrote:
         | Love the post and I tend to agree.
         | 
         | As Product manager/owner I've only found A/B testing useful
         | when trying to narrow in on a _specific_ demographic and you
         | are trying to find some optimization.
         | 
         | The marketing/sales funnel use of it is kind of gross and has
         | ruined , imo, something that has utility in a very narrow
         | scope.
         | 
         | Cheers, also very much agree customers should be informed and
         | allowed to opt out.
         | 
         | 'hey we have a new UX to try..would you like to switch?' the
         | data from people that opt-in is way better
        
         | donmcronald wrote:
         | > Outlook now put feature XYZ" (in the case that comes to my
         | mind, the CC bar which used to be tab-reachable, now you have
         | to tab+space or manually click on the tiny gray "cc" letters)
         | 
         | I can sort of understand wanting to hide stuff on mobile, but
         | the discovery of controls to unhide things should be better. I
         | often help people that are stuck trying to figure out how to do
         | something in an app and not realizing they can click on
         | something that gives no indication it's clickable is a common
         | thing.
         | 
         | Desktop is another world. I often have 20+ inches of horizontal
         | space and a hamburger menu. It's infuriating, especially when
         | the hamburger menu is hiding _one_ button.
        
       | readingnews wrote:
       | Hrm, looks like A/B testing has destroyed the website.
        
       | tracerbulletx wrote:
       | If you're upset about internet retail, I hope you're also upset
       | about milk being in the back of the store to get you to walk
       | through the whole thing, because this has been merchandising's
       | bread and butter for a very long time.
        
         | epolanski wrote:
         | Eggs and salt are a common dark pattern, milk is in the fridges
         | with diary products in most of europe, hard to miss.
        
       | weeksie wrote:
       | Bright eyed PMs whispering "statistically significant" to
       | themselves over and over as they nervously scan their data
       | aggregation dashboards for wiggles.
        
       | jerf wrote:
       | "As an experiment, I went through a list of holiday weekend
       | sales, and opened all the sites. They all -- all, 100% --
       | interrupted my attempt to give them some money."
       | 
       | This is a good touchstone to use for "you've overoptimized your
       | site, tone it back". I am also taken aback every time I'm on a
       | site, I've got something in my shopping cart, I'm headed for the
       | "check out" button, or I'm even _on the checkout page_ , and some
       | stupid interstitial pops up. Dude, I'm trying to enter my credit
       | card information! Back off! Especially stupid for a "sign up for
       | our newsletter" popup; we all know that unclicking the "yes, we
       | can email you every 17 seconds from now until the heat death of
       | the universe with valuable offers from 'our affiliates' which we
       | define as 'anyone we share a species with'!" box on the checkout
       | form is mandatory, and if we don't see it immediately we'd best
       | go hunting for it. You've already default populating the checkbox
       | to "yes" on this _very screen_ , get out of the way!
       | 
       | Less unbelievably stupid, but related, is when I'm examining
       | product X and just after I scroll down a bit to read more you pop
       | up something related to... well... anything other than product X!
       | I'm signalling interest in product X as hard as I can, and you've
       | AB tested that this is a great time to jangle your keys over
       | there instead? Your AB testing is stupid and can't possibly fail
       | to be some stupid statistical fluke or other terrible error. What
       | fisherman goes out on his boat, hooks a fish, and then rushes to
       | throw another completely different lure out to the hooked fish
       | and get them on that hook instead? This is another good
       | touchstone for being "overoptimized".
        
         | joe_the_user wrote:
         | Let me propose a different possibility.
         | 
         | Suppose the site isn't concerned about the sale very much at
         | all?
         | 
         | Suppose the thing that the site uses to reel people in, is a
         | good deal that isn't very profitable to the site but what the
         | site then tries to sell is a very profitable near-scam/ripoff.
         | Scaring off half the ordinary customers becomes worth it to get
         | even 10% of the customers buying the scam.
         | 
         | What seems like "poor optimization" can easily optimization for
         | something and could be seen as "the scammification of the web".
        
           | bsuvc wrote:
           | Exactly this.
           | 
           | Many here are focusing on a single interaction. While the
           | outcome of that single interaction is negative to the
           | company, the aggregate outcome must be positive somehow,
           | perhaps in the way you said, but it doesn't even have to be a
           | scam or ripoff. Some products just have a higher margin
           | and/or customer LTV.
           | 
           | As an individual, it is annoying, but the company is focusing
           | only on the macro effect when it does something like this.
        
         | tracerbulletx wrote:
         | I would really warn you against thinking your intuitions are
         | going to be a good sign for whether or not something is a good
         | retail decision.
        
         | oxfordmale wrote:
         | I love to sign-up to news letters and get a discount. Of course
         | I am giving you my spam account I set up for this exact
         | purpose.
        
         | naravara wrote:
         | Honestly instead of a cookie law I wish GDPR has imposed a rule
         | that required all those stupid interstitial pop-ups to conform
         | to a standard that could be easily blocked by the browser. I
         | mean they _are_ asking for emails, which is a massive and
         | totally unnecessary proliferation of personally identifiable
         | information.
         | 
         | I hate them so much. It makes it feel like so much more of a
         | chore to try to do research or look for things online. I'd
         | honestly prefer 56k page-load speeds if the pages were free of
         | this garbage.
        
           | jdlshore wrote:
           | Pet peeve: GDPR is not the cookie law, and is fact a very
           | sensible collection of restrictions on how companies can
           | collect and process personal data. The annoying banners you
           | see are against the spirit of the GDPR, and quite possibly
           | against the letter of it, too.
        
         | EGreg wrote:
         | Maybe your money is no longer the most important thing for them
         | at that moment? Given that you'll probably proceed with the
         | purchase anyway, they could be making more overall from the
         | crowd which also signs up for updates.
        
         | sharemywin wrote:
         | Most people are just browsing so it's optimizing for that when
         | it should be optimizing for sales
        
         | CSMastermind wrote:
         | Recently I got an account on a developer tool (Checkly) because
         | the company I joined uses it. I then got 5 different emails
         | from them in a 48 hour period.
         | 
         | Like I'm sure many users sign up then drop out of your funnel
         | but I'm part of an organization that's a paying customer. I'm
         | already going to use your stuff. What possible business benefit
         | could there be to you spamming me? If anything you're risking
         | the inverse - it made me want to migrate away from the tool.
        
           | mattgreenrocks wrote:
           | The 'cost' of email is borne by the recipient, mostly.
        
           | chairmanwow1 wrote:
           | Sounds like they are optimizing for a KPI on time to full
           | integration. Someone else paid for it, now they want to make
           | sure that you are actually using it.
           | 
           | Still absurd, but I know this is a problem friends of mine
           | have had.
        
             | bombcar wrote:
             | The "here's how to use the product you paid for" emails
             | that trickle out over a few weeks or a month is vastly
             | different than "throw you into every single marketing email
             | bucket we have" but companies seem to lean toward the
             | latter.
        
           | tnolet wrote:
           | Hey CSMastermind, I'm founder at Checkly and I got a ping we
           | were mentioned. We do send out some "getting started"
           | messages on autopilot. We also did a product launch Thursday
           | and then our regular changelog on Monday. That probably was
           | overwhelming. If you could email me on tim -at- checklyhq
           | -dot- com I will track down if we hit the the spam cannon too
           | hard.
        
             | kaoD wrote:
             | I know this is well-intentioned and not an automated
             | message, but I find it ironic that you managed to get an
             | additional message to him over here.
        
               | tpoacher wrote:
               | one asking them to establish contact for communication to
               | continue, no less. xD
               | 
               | I can't help be reminded of those "if you'd like to
               | unsubscribe call us so we can harass you with offers even
               | more" you see in some company terms and conditions.
        
             | musicale wrote:
             | > if we hit the the spam cannon too hard
             | 
             | Maybe consider, oh I don't know, not deploying a _spam
             | cannon_ in the first place?
             | 
             | The good news is that this sort of thing is enough to
             | trigger the spam filter in my email program so I'll never
             | see it.
        
             | wccrawford wrote:
             | I don't use your tool and I'm unrelated to CSMastermind,
             | but I can tell you that you absolutely hit the "spam
             | cannon" too hard. 5 messages in 48 hours? That's absolutely
             | ridiculous. They could all have been 1 message if it was
             | important that they get that info, and it could have been 0
             | messages if it wasn't.
        
               | tnolet wrote:
               | You are correct. I will figure out two things.
               | 
               | 1. Why our fancy expensive mailing/marketing tool
               | (Intercom) does not spread these messages in a relaxed
               | fashion (it should)
               | 
               | 2. If of those five messages, maybe two were the
               | obligatory "confirm your email" and standard "Welcome to
               | Checkly, this is what we do"
               | 
               | TL;DR we should not spam.
        
               | tnolet wrote:
               | We checked our Intercom. We spammed. One user got six
               | emails in a 5 day span. There were some separate
               | initiatives going on. We didn't check the settings and
               | current outgoing mail. We will change it.
               | 
               | BTW: all of this was done without any bad intent. It's
               | 100% us being stupid and not coordinating and being
               | diligent enough.
        
               | stalfosknight wrote:
               | You only get one chance to make a first impression.
        
               | cto_of_antifa wrote:
        
               | tpoacher wrote:
               | Send a seventh to apologize! /s
               | 
               | (on a serious note, good on you for taking action though.
               | +1)
        
               | sonofhans wrote:
               | This is such an impressive series of responses. I know
               | some folks are flipping you shit here, but I've seen a
               | lot of people try to "engage" with customers, in HN and
               | elsewhere, and I think you've done it really nicely. You
               | sound like a human; you admit mistakes; you follow up.
               | Good on you.
        
             | oxfordmale wrote:
             | tnolet, if I get five emails from a company with 48 hours,
             | I will set up a spam rule for them. If I really need to get
             | that much help to get started, your UX design is likely not
             | very good.
        
               | tnolet wrote:
               | You are correct. I'm the same. Our emails are even pretty
               | good. Our UX is pretty damn good. Somehow we dropped the
               | ball here.
        
         | nemo44x wrote:
         | A lot of this happens because different managers have different
         | metrics/KPIs they are optimizing around and they all find "good
         | places" to do things to help meet their goals. The secondary
         | effects aren't considered. One managers quest for outperforming
         | their goals comes at the expense of another managers goals.
        
           | d23 wrote:
           | There was a point a few years ago where you could not see a
           | single piece of user-generated content above the fold on the
           | reddit home page. A bunch of teams had jockeyed for having
           | their little carousels and banners put on top, and of course,
           | metrics were always cited.
           | 
           | I left a screenshot in slack and it ended up causing a couple
           | of teams to have to roll back their widgets, but it always
           | baffled me that we were able to focus so much on the
           | individual trees of metric optimization that we would miss
           | the forest to that extent.
        
             | saagarjha wrote:
             | It seems like every company goes through some version of
             | this. At Twitter there was a channel called #ios-six-bars
             | or something like that that started when an engineer posted
             | a screenshot of the home timeline with six bars of things
             | on it, all from jockeying teams trying to grab a spot on
             | that page: Home, Spaces, all the new features just had to
             | be thrown in the face of somebody who probably just wanted
             | to read some tweets. Discussions were had, product cohesion
             | was brought up, then things went quiet for a bit. Until
             | someone posted a new screenshot a couple months later with
             | seven bars on the screen.
        
             | nemo44x wrote:
             | > but it always baffled me that we were able to focus so
             | much on the individual trees of metric optimization that we
             | would miss the forest to that extent
             | 
             | Always look to the decision maker's incentives and you'll
             | almost always discover why things are the way they are. And
             | often, to your point, there's an aspect of tunnel vision
             | associated with it because considering the bigger picture
             | is difficult as a company grows and becomes more complex
             | and creates friction in achieving goals.
             | 
             | Ultimately, this is the purpose of senior leadership. But
             | the Peter Principal really begins to kick in at that level
             | and the truth is, many senior leaders are in over their
             | head and are unable to materialize the broader strategy and
             | understand how their various units are affecting it. So we
             | end up with crappy products.
        
               | bluGill wrote:
               | The best metric is the end of year bonus, tided to yearly
               | total company financial results, but that only gets
               | measured once per year. I can measure many things on
               | every transaction, but how they in total work out to my
               | end of year bonus and paycheck are much harder to see.
               | 
               | Of course if my bonus is some small KPI I can optimize
               | that at the expense of overall performance.
        
               | the_gipsy wrote:
               | Gervais principle says that senior leaders work for
               | themselves, not the company, up to the point of working
               | against the company.
        
           | retcon wrote:
           | The primary not secondary effect of random sampling is noise.
        
           | spookthesunset wrote:
           | This is why any good org will make sure to observe all
           | important KPI's while doing an A/B test. If your "email
           | signup" KPI went to the moon but tanked your "bought shit"
           | metric... you should probably roll back.
        
             | travisjungroth wrote:
             | It's really easy for this to be noise from false negatives.
             | On an A/A test with five guardrail metrics and a threshold
             | of p>0.05, you'll get a false negative 22.6% of the time.
        
           | DrewADesign wrote:
           | Having experience design expertise at the executive level can
           | mitigate this. If nobody is advocating for good user
           | experiences, nobody is advocating for the usefulness of your
           | online product as a whole, and it shows.
        
         | enlyth wrote:
         | Speaking of 'sign up to our newsletter', one of the latest dark
         | patterns I've found that astounded me was adding a checkbox to
         | the login form [0], where you'd normally expect the 'remember
         | me' checkbox to be. You almost click it out of muscle memory if
         | you don't read what it says.
         | 
         | [0]
         | https://i.postimg.cc/HW89hs7r/Screenshot-2022-07-12-145957.p...
        
           | Akronymus wrote:
           | Thats one of the things I REALLY dislike about GoG lately. It
           | tries really hard to bait you into signing up for the
           | newsletter when buying stuff.
        
           | eloisius wrote:
           | Email marketing in general blows my mind. Marketers typically
           | have absolutely no respect for consent, and the costs are
           | completely borne by the recipient. The whole industry depends
           | on dark patterns, shady list sharing, and scraping your email
           | to add it to their lists despite you having no relationship
           | with them. I know it's not simple, and it's just my
           | frustration speaking, but I don't understand how my mail host
           | can't ban all Mailchimp et al IPs for me, or implement some
           | standard such that it costs them a penny to send me an email.
        
             | ryanmcbride wrote:
             | That's why the only email subscription service on my site
             | is completely transparent and details exactly what we're
             | storing, and it's impossible to click by accident.
             | Everything I do for myself I try to build like a service
             | I'd like to use myself. But the second managers or
             | marketers are involved it's all out the window. I remember
             | early in my career my boss had me add every email included
             | in a TED booklet to their marketing email list. I told him
             | that morality issues aside he could likely catch a fine for
             | that, especially since the type of people listed in a TED
             | booklet are likely more litigious than the average bear.
             | Didn't care, wanted more eyes on the marketing.
        
             | bbarnett wrote:
             | I'm not a big fan of ipv6, but fan or not, I bet if all
             | spammy mailchimp type provider IP ranges were confiscated
             | and freed, we'd be in ipv4 land for another 20 years.
             | 
             | And as a second thought, the way China amd Russia are
             | going, maybe we should just reclaim all their ipv4
             | addresses, and just give each country 1 IP, they can proxy
             | through it on their end.
        
               | quickthrower2 wrote:
               | Then say goodbye to the internet and hello to a mesh of
               | country-specific networks.
               | 
               | Finally, true decentralisation!
        
               | bbarnett wrote:
               | ? That's my point, as this is China already, and Russia
               | is not far behind.
        
               | jamiek88 wrote:
               | We already have that. It's just we are in denial as a
               | society about it.
               | 
               | Globalization is over. The post Berlin wall fall
               | consensus undermined and ruined.
               | 
               | The last to realize, loses.
        
             | bottled_poe wrote:
             | Mostly, it doesn't even seem to matter whether you agree or
             | not. Inevitably you end up receiving affiliate emails
             | regardless.
        
               | dwighttk wrote:
               | Yeah my _mortgage_ company sent me a letter saying to opt
               | out of affiliate marketing Emails or snail mail I have to
               | send them a letter requesting it. This was 4 months after
               | they bought my mortgage so the most the letter woulda
               | done is stop them after selling my info for 4 months.
        
               | Spooky23 wrote:
               | I had a coworker who sent those letters as a side hustle.
               | He had a few different ones and would send the letters
               | certified mail. Companies are very poor at compliance,
               | and certain violations allow you to sue the company.
        
               | dspillett wrote:
               | This is why every company gets a different address for
               | me. If junk starts coming in, that address is blocked,
               | and I stop doing business with that company (if I haven't
               | already).
               | 
               | This sometimes falls foul of spammers adding some random
               | addresses of the form blahblah@mycatchall.domain.tld or
               | <commonname>@mycatchall.domain.tld into their lists, but
               | that hasn't happened often enough to be a problem. That
               | it isn't much of a problem surprises me a little, given
               | how much <commonname>@domain.tld (no sub-domain)
               | addresses are used this way. I have considered trying the
               | pattern somename@<sub-domain-per-company>.domain.tld as
               | an alternative if that becomes a problem, but before
               | implementing that I need to change my email setup (doing
               | that anyway soon as running Zimbra's OSS version is going
               | to get more difficult next year) and maybe my DNS server
               | of choice (if wildcard MX records are an issue, I've not
               | looked into that).
               | 
               | Sometimes I get funny looks for addresses like this,
               | especially as I usually work the company/other name in
               | there somewhere. I had one website refuse to accept an
               | address based on their name, which was a rad flag and I
               | backed away from going any further into dealing with that
               | organisation.
        
               | 3dGrabber wrote:
               | > If junk starts coming in, that address is blocked, and
               | I stop doing business with that company (if I haven't
               | already).
               | 
               | AND CALL them, if possible: "I've received marketing
               | emails from your company recently, how is this possible,
               | I've never signed up, yaddayadda... "
               | 
               | Generate some cost on their side.
        
               | dreamcompiler wrote:
               | I typically do it with
               | 
               | <emanynapmoc>@mydomain.tld
               | 
               | Spelling the company's name backwards makes it easy to
               | match to a company for use by my own spam filter without
               | setting off their pattern detectors.
        
               | aaaaaaaaaaab wrote:
               | You can just use [id]-[sha1 hmac]@domain.tld
               | 
               | The id could be anything, and the SHA1 HMAC takes 32
               | characters in base32 (which is an email-address-safe
               | encoding). Then just configure your spamfilter to reject
               | any address where the HMAC doesn't check out.
               | 
               | Of course, the drawback is that you'll need a computer to
               | generate a new address... At which point you may as well
               | store an explicit whitelist of valid addresses.
        
             | fatnoah wrote:
             | > Email marketing in general blows my mind
             | 
             | My favorite thing is when the companies outsource the email
             | marketing, so that it has absolutely zero relevance. I've
             | been using the same online tax preparer for 10 years, and
             | I've had exactly zero refunds, yet their emails during tax
             | season always let me know that "my refund is waiting".
        
             | propogandist wrote:
             | beware that Meta/FB and Tiktok scripts are among those that
             | siphon off email data even before a web form is submitted
             | 
             | https://arstechnica.com/information-
             | technology/2022/05/some-...
        
               | idrios wrote:
               | This is a good reminder to keep your hosts file updated
               | to block at least some of these sites' attempts to take
               | your data.
               | 
               | https://someonewhocares.org/hosts/
        
           | SoftTalker wrote:
           | I _never_ check  "remember me" so maybe that's good for me?
        
           | bombcar wrote:
           | "Remember me in your newsletter list" is the next one. Send
           | me money!
        
           | dwighttk wrote:
           | Heh. I've stopped clicking remember me boxes because they
           | never work.
        
           | cardamomo wrote:
           | Yes! I just noticed this for the first time yesterday and
           | thought, "I hope this isn't another terrible trend in dark
           | patterns."
        
           | CodesInChaos wrote:
           | atlassian does this as well:
           | https://i.postimg.cc/zfwbG5Ft/atlassian-login.png
        
             | geysersam wrote:
             | I knew Atlassian hates their users. But that much??
        
           | aroccoli wrote:
           | Yeah, I booked a flight on WizzAir two days ago, and this
           | felt like a low blow even from WizzAir.
        
             | _puk wrote:
             | Predicting the flight will be cancelled in 3.. 2.. 1.. But
             | then the newsletter will haunt you for far far longer..
             | 
             | Not a burnt WizzAir customer at all! /s
        
               | enlyth wrote:
               | Happened to my dad on Sunday, and the only replacement
               | flight they would offer is for this Friday, what a
               | complete joke air travel is in 2022.
        
               | nebusoft wrote:
               | I mean you're complaining about an ultra-low cost
               | airline. why would you expect it to have a good customer
               | experience?
        
               | enlyth wrote:
               | Yes I am. They also overbooked seven people on that
               | flight, delayed it for hours, and then completely
               | cancelled. There were people in wheelchairs left stranded
               | at the airport after waiting there all day. This is just
               | plain incompetence.
        
         | tshaddox wrote:
         | The thing is, you haven't really shown that these sites aren't
         | successfully optimizing for conversions. Couldn't it very well
         | be the case that UI which annoys some high-intent users by
         | interrupting them or adding steps to the checkout process also
         | increases overall conversions?
        
           | jl6 wrote:
           | True, A could be "annoy users" and B could be "don't annoy
           | users", and A could perform better overall, but in this
           | framework you might be missing C which is "annoy users except
           | those already deep in the funnel".
        
             | tshaddox wrote:
             | My point is more that there can be two groups of users with
             | mutually exclusive desires, and it can be practical to
             | choose to satisfy one group over the other based on your
             | particular goals. There's not always some monotonically
             | increasing function where you can over time satisfy a
             | higher and higher portion of your users.
        
         | Jenk wrote:
         | > I'm signalling interest in product X as hard as I can, and
         | you've AB tested that this is a great time to jangle your keys
         | over there instead?
         | 
         | If I may... I have seen data from a big retailer that shows any
         | user that doesn't immediately purchase an item, is actually not
         | that interested in the product on the screen. If a customer is
         | going to buy something, they will do it promptly. Anyone else
         | is just browsing.
         | 
         | YMMV, grain of salt, context dependent, etc, etc.
        
           | axus wrote:
           | Could the popup be a punishment for reading the fine print?
        
           | jerf wrote:
           | In this case, what I'm referring to is:                   1.
           | Clicked on page.         2. Took maybe 10 seconds to take in
           | what is "above the fold".         3. Scrolled down to see
           | what else there is.         4. BAM! Popup triggered by
           | scrolling down.
           | 
           | While I understand what you're getting at, they do not yet
           | have the info to know that I'm browsing or whatever. They
           | were so excited about their stupid popup that they didn't
           | even get that far.
           | 
           | I will say, generally, when I'm to the point that I'm
           | entering credit card info, I've put up with it, but I _have_
           | been chased off of sites by this use case before. Especially
           | if that popup also crosses with some other popup and now I 'm
           | chasing down the tiny little 6pt light-grey-on-white little
           | "x"s to click away the popups in the _right order_.
           | 
           | Actually, let me add that to my touchstone list. _OF COURSE_
           | hiding the dismissal icon for the popup increases
           | "engagement" with the popup. You don't even need to run a
           | test for that, because _what other result could it have_?
           | "We shrank the close icon, moved it to the lower right corner
           | where nobody expects it, and made sure to kill the constrast
           | even harder, and customers dismissed it 2.5 seconds _more
           | quickly_ on average "? Of course that's not possible. But...
           | that's the wrong question! And AB testing is _really good_ at
           | answering the question you 're asking, it has no mechanism in
           | and of itself to see whether you're asking the right
           | question. If you're getting down to this you've
           | overoptimized.
        
             | Akronymus wrote:
             | Or a popup that triggers for moving the mouse towards the
             | top bar. I constantly highlight text for reading purposes
             | (A habit I have) of course that moves the mouse. Not a
             | reason to annoy me with that shit.
        
             | dpe82 wrote:
             | In fact, you _want_ the dismiss button to be easily
             | discovered and used. Dismissals are an important signal
             | about the quality of the content; just as important as
             | clicks. When you make the button impossible to use you rob
             | yourself of that signal while simultaneously making click
             | data far less reliable.
        
               | bbarnett wrote:
               | Unless you're a manager, not getting the results desired.
               | 
               | Then the problem is people, and "they're just clicking
               | close out of reflex!".
               | 
               | Cue hiding this, and results you wanted appear! Success!
               | Raise! Promotion! Or, maybe more funding, due to signs of
               | greater engagement.
        
               | dpe82 wrote:
               | Sure; until a company that actually understands this
               | stuff comes along and eats that company's lunch. In the
               | long run, reality eventually wins.
        
           | sharemywin wrote:
           | The number to figure out is how much time do you wait to
           | interrupt. Also wonder if it's person dependant. Some people
           | aren't impulsive buyers.
        
         | legalcorrection wrote:
         | Indeed, quantitative data without qualitative understanding is
         | useless. You can't understand data without understanding
         | mechanisms, because there's an infinite number of possible
         | confounding factors that you can only dismiss through your
         | qualitative understanding of the dynamic you are measuring.
        
       | fleddr wrote:
       | I absolutely agree that A/B testing in the way described in the
       | article is a catalyst for creating dark patterns in a UI. Because
       | dark patterns work, they deliver short term increases in
       | particular metrics.
       | 
       | The author's idea is that this short term gain damages longer
       | term metrics. That sounds logical and agreeable, but that doesn't
       | make it true. Not in my experience anyway.
       | 
       | Probably the people complaining the most about annoying UI
       | patterns weren't going to convert anyway. Whilst those coming
       | with a specific conversion goal to your site will convert even if
       | annoyed in the process.
       | 
       | Anyway, the true root cause goes all the way to the top. When you
       | give a team a 20% sales increase target and "deliver by next
       | quarter or be fired"...this is what you get. If the executive
       | level dismisses a healthier, more sustainable long term growth
       | model, then there's pretty much no way to stop this.
       | 
       | It's so hard to stop because it actually works. It works short
       | term and evidence that it harms long time is typically lacking or
       | it simply isn't true.
        
       | kjhgkjghkj wrote:
       | Intentional or not, one outcome on sites that are relentlessly
       | A/B tested is that the resulting UI design lets users know that
       | content they want is there, they just need to click and scroll a
       | bit more to find it.
       | 
       | Having left FB years ago, I now watch people "navigate" their
       | site/apps with disbelief.
        
         | regularfry wrote:
         | Isn't that exactly the problem? The resulting UI isn't
         | designed, it's aggregated across a disjointed set of granular
         | tweaks.
        
           | kjhgkjghkj wrote:
           | A problem for who? Given that people already invested in the
           | product ecosystem seem to have almost limitless patience to
           | scroll for the right content, I'm sure it improves almost
           | every user time and attention metric.
           | 
           | It's why I saw it as my moral duty to leave (as well as the
           | other FB properties), so that at least in a small way, I
           | "produce content" that is only available by interacting with
           | me as a person.
        
           | ssharp wrote:
           | This is part of the "unchecked" part of AB testing the
           | headline mentions.
           | 
           | You, of course, need to ensure the granular tweaks can be
           | rolled up into something usable as the granular tweaks prove
           | successful. You can't just keep bolting on UI changes while
           | losing sight of the larger experience. Each incremental A/B
           | test is testing against a previously successful variant so
           | eventually the control is radically different from where it
           | started and you're only concerned about beating the control.
           | Using a longer-term holdout group or reseting the control
           | experience during incremental testing can help mitigate this
           | and get you zoomed out a bit from the local maxima.
        
       | test1235 wrote:
       | archive: https://archive.ph/fuUPG
        
       | amluto wrote:
       | Reading this makes me think of the handful of sites, often
       | targeted at professionals, that highly optimize for the
       | experience of actually buying things. McMaster-Carr comes to
       | mind. Their users shop there over and over, and McMaster wants to
       | keep them. So you can find things for $2 or $2000, shipping
       | prices are inoffensive, customer service is friendly but rarely
       | needed, and there are minimal distractions on the way to checking
       | out or even after checking out. The only real issues are mostly
       | related to the fact that they sell so many products that one can
       | get lost in the 4000+ items that all match the search. Well done.
       | 
       | This is an interesting contrast to Amazon that also makes
       | checkout easy but bombasts the user with thousands of listings,
       | mostly mildly fraudulent and consisting of absolute crap, and
       | still somehow gets repeat business.
        
         | saagarjha wrote:
         | > This is an interesting contrast to Amazon that also makes
         | checkout easy
         | 
         | Hey, would you like Prime with that? Do you know we provide
         | free two-day shipping with Prime? If you sign up for Prime
         | today you can get a $100 discount!
        
         | rightbyte wrote:
         | McMaster-Carr might be the single thing I miss the most from my
         | time in the US. It is like ... stupid good. Their listings
         | catalogization is like godlike compared to alternatives.
         | 
         | The Amazon or Google way of throwing all things into the bin
         | and spew it out to the users is BS. We are saying we live in an
         | information age but I firmly believe stuff were way better
         | catalogizised back when it was done manually by paid
         | gatekeepers.
        
         | int_19h wrote:
         | https://www.usplastic.com/ is another "industrial" example.
        
       | causi wrote:
       | The problem with AB testing is that it's a short-term strategy.
       | For example, if a news site runs AB testing with headlines,
       | they'll find that bullshit clickbait headlines get more pageviews
       | than concise, accurate headlines, but the constant use of
       | clickbait headlines will over time destroy overall traffic to
       | your site. More frustratingly, sites run by smart people tend to
       | fall into a balance where the worst articles get the most
       | alluring headlines.
        
         | boruto wrote:
         | Indeed,
         | 
         | If do more ad placements increase revenue is the test and then
         | there is 20% jump what are you as an engineer going to do? Tell
         | to management that its bad?
        
         | multivariate wrote:
         | I write A/B tests for headlines for a news site, this is too
         | broad a generalization. Clickbait titles aren't great for
         | building subscribers or establishing trust, which is what we
         | really care about (LTV). To the author's credit, our deepest
         | testing insights come from analyzing a lot of historical data
         | (not just last week's).
        
         | tomrod wrote:
         | This highlights the major downside to "data-driven" policy and
         | decisions.
         | 
         | Data can "lie". What is observed is not always reality, simply
         | what we can see of it.
         | 
         | Consider auctions. You never actually "see" the bidder's demand
         | or utility. Yes, there are some ways to structure auctions that
         | in theory show willingness to pay and such (ignoring
         | confounding factors and irrationality), but you don't actually
         | observe anything beyond the bid.
         | 
         | Similarly, on websites, you don't always know the causal
         | reasons people click here or there. You know perhaps enough to
         | predict a step-wise behavior, but don't (usually) understand
         | the full behavioral lifecycle -- especially if a metric
         | improves but at the hidden cost of decrements to conversion and
         | similar.
        
         | dr_dshiv wrote:
         | I'm a huge fan of metrics. Huge! But they are worthless when
         | not combined with qualitative experience. AB testing needs to
         | be combined with human-centered "actually talking to people
         | about their experiences." Otherwise, you drift and the metrics
         | no longer match the objective.
        
         | naravara wrote:
         | > but the constant use of clickbait headlines will over time
         | destroy overall traffic to your site
         | 
         | I'd add a bit of nuance here. They are very good at _driving
         | traffic_ , but very bad at _building an audience_. You do this
         | long enough and your news site is now optimized for attracting
         | hot-take appreciators who engage with the news like a tabloid.
         | This drives away everyone who doesn 't want to be reading a
         | tabloid and makes you more dependent on keeping up with
         | traffic-gaming strategies to continuously drive traffic. You've
         | basically shifted your business from being a place that
         | produces journalism to being a place that figures out ways to
         | game social media trends and SEO.
        
         | goodside wrote:
         | There's nothing about AB testing that requires you to use
         | short-term metrics. I used to manage AB tests for online dating
         | sites (OkCupid, Grindr) where subscription revenue is what
         | matters, and the gains of any strategy will take months to
         | materialize. We were well aware that, say, raising prices would
         | yield more short-term revenue at the expense of long-term
         | revenue. That didn't stop us from testing, it just made the
         | statistics more complicated.
        
           | Philadelphia wrote:
           | OkCupid has completely destroyed its interface and utility,
           | so whatever they're doing doesn't seem to be working anymore.
        
             | goodside wrote:
             | I left in 2015, as soon as it became apparent the party was
             | over. OkCupid went downhill for a lot of reasons, but
             | overly aggressive A/B testing wasn't one of them.
        
           | im3w1l wrote:
           | Did you A/B test the matching algorithms?
        
           | tsimionescu wrote:
           | Sure, but in many cases, such as the example given by GP,
           | long-term AB testing is hard or almost impossible. For the
           | testing to have validity, you need the A and B cohorts to be
           | stable, and have little or no overlap, and that is hard for
           | long time spans for anything that is not account based (and
           | somewhat dangerous even for account-based things, as people
           | will almost certainly start to notice that they are getting a
           | different experience than their peers, which may upset them).
        
             | goodside wrote:
             | In online dating, at least, this is a non-issue. Using an
             | online dating app is, ironically, a solitary enough
             | activity that people don't go around comparing whether
             | their UI is different from their friends' UI. You of course
             | can't let the same user see two versions, but that just
             | means doing permanent group assignment on signup. We used
             | to A/B test subscription prices over enormous ranges (e.g.,
             | randomly giving some people 90% discounts) and
             | approximately nobody noticed outside of obscure Reddit
             | threads.
        
               | avisser wrote:
               | I wonder if you two are talking past each other a little.
               | I'm thinking that A/B testing for content is a different
               | beast than A/B testing for experience.
        
               | goodside wrote:
               | I'm not disagreeing -- My point is really, "not all AB
               | testing is bad, even if the kind you're most familiar
               | with leads to shitty content." My second comment was just
               | more of side note.
        
               | tsimionescu wrote:
               | Sure, this is a niche with some very nice properties for
               | this type of thing.
        
       | nuc1e0n wrote:
       | This kind of optimisation for short term gains at the expense of
       | long term sustainability is what is causing climate change and
       | the collapse of the global economy. But the politicians and heads
       | of industry who preside over this situation will all be
       | retired/dead before it becomes a problem. Or so they thought.
        
       | redleggedfrog wrote:
       | I think that you have to take into account the popularity of
       | these methods when evaluating whether to implement them. It would
       | seem the more sites that do these obtrusive UI patterns the less
       | effective they become. Anecdotally nearly every method described
       | in the article is an automatic back button off the site for me.
        
       | AndrewThrowaway wrote:
       | Imagine this beautiful business software which during the years
       | and numerous A/B tests, "best UX practices", design languages and
       | whatnot became this all "applesque", minimalist UI with 80% of it
       | being a white space. By the way winning numerous design awards.
       | 
       | However entering e.g. client's information take a lot of steps,
       | you are constantly clicking "Next" throughout these beautiful
       | wizards and pages. After some time everybody starts to feel that
       | there must be a better way.
       | 
       | What is the solution?
       | 
       | Spreadsheet import! Where you can just do everything in this
       | "complicated" UI of Microsoft Excel, with formulas, and hundred
       | buttons at once on the screen. Fill in hundreds of rows of
       | information and just import it to the "beautiful business
       | system".
        
       | lifeplusplus wrote:
       | A nice way to summarize this article to think about local maxima
       | and global maxima.
       | 
       | A/B testing right now is done on cohort basis and tests are ran
       | for weeks to couple of months. This means where lifetime span of
       | a customer is beyond few weeks and months, it's really not
       | possible to tell if global maximum was missed.
       | 
       | I.e. you increase the number of promotional emails the customers
       | get per week. You do it for 3 weeks and see that customers who
       | got those emails had higher conversion. But you didn't get to see
       | that customers who kept getting those higher number of emails
       | completely unsubscribed after 3 months of pain. But by this time
       | all customers are on higher frequency group so it's hard to tell
       | what would be driving the unsubscriptions.
       | 
       | I'm no expert but here are some solutions:
       | 
       | 1. You should have really delayed long running control groups.
       | Preferably going well beyond average duration your customer
       | sticks around. These groups should get onto new things a year
       | after. But even then it'd be not possible to take out WHAT
       | feature is affecting them, because in 1 year main group would
       | have accumulated lot of features. But still something...
       | 
       | 2. You should really have lots of secondary KPIs that measure
       | things that affect long term KPIs. Sure conversion is better, but
       | is time spent reading newsletters increasing? Are buyers feeling
       | good about their experience with the brand... some of these KPI
       | are more qualitative and can't be just automated.
       | 
       | what else?
        
       | ghostly_s wrote:
       | Nothing torpedoes my opinion of a brand more effectively than one
       | of those insulting "Yes, spam me!"/"No, I'm a moron who hates
       | saving money" popups. Absolutely mind-boggling that any thinking
       | person thought that was an okay way to talk to customers.
        
       | EricMausler wrote:
       | /rant
       | 
       | AB testing is and always has been fish oil for management. The
       | only things it can actually prove, are more easily identifiable
       | by common sense. So wherever it actually works, it was probably a
       | waste of time / overkill for evidence.
       | 
       | - sincerely, a business analyst
        
         | tbranyen wrote:
         | Have to disagree. I've found plenty of issues that affect real
         | production users through the use of AB testing. Problems that
         | were small enough to escape review, testing, and reporting, but
         | large enough to be stat-sig. They always lead to a bug, or
         | issue with test vs control.
         | 
         | I will always use AB testing for uncertain code in the future.
         | I was skeptical when I first started writing AB tests, but they
         | have proven their worth over and over again.
        
           | ratww wrote:
           | Sure, but that's not really A/B testing, those are more often
           | called staged rollouts or progressive rollouts.
        
             | tbranyen wrote:
             | I'm talking about running week or month long tests with
             | control and multi test cells containing new functionality,
             | configuration, or code to determine the viability of a
             | single or combination of changes by analyzing statistical
             | output driven by p-value and pre-determined target metrics.
             | 
             | These types of experiments are extremely valuable in
             | uncovering hard-to-find bugs, assuming you have sufficient
             | logging and confidence around your metrics. They let you
             | know a problem exists and roughly where it is in the
             | product. From there you can drill down and investigate your
             | source code until the discrepancy is found.
        
               | EricMausler wrote:
               | This makes sense to me. Not the kind of AB testing I had
               | in mind, but fair point. I was thinking more about
               | decision making processes, not operations
               | troubleshooting.
        
         | [deleted]
        
       | benja123 wrote:
       | I say this a lot and I will keep saying it. Conversion !=
       | customer obsession. There is a place for A/B testing. It is
       | necessary and can be extremely beneficial in helping your
       | customers enjoy and use your product more successfully.
       | 
       | The main issue is that people mix conversion with customer
       | obsession! Whenever you work on a product or feature you should
       | be asking yourself "Is this really good for my customer" - if the
       | answer is no, then no matter what the A/B tests/conversion rates
       | show you don't do it.
       | 
       | Unfortunately we mostly hire the wrong people as PMs, who then
       | hire clones of themselves. They are not truly customer obsessed
       | and use A/B tests incorrectly which results in products that
       | trick or force customers to do things they don't understand/want
       | to do. Long term this is bad for the product and company
        
         | jfoster wrote:
         | Yeah, this is key. Improving a product in the direction of
         | customer intent vs against customer intent.
        
         | 10x-dev wrote:
         | My 'favorite' silly thing PMs do is UX research studies
         | (typically on 5-10 people) and essentially ask completely
         | untrained people if we should go with X/Y or Z. It's a super-
         | effective way of avoiding responsibility for product decisions
         | ("the data suggest we should go with Y"). If only building good
         | products were as easy as asking what customers think they want.
        
           | HWR_14 wrote:
           | Either they're doing the UX research wrong or (more likely)
           | you're misunderstanding the process. You don't ask them if
           | you should do X/Y/Z. You ask them to do X in the program, and
           | see that none of them can find widget Y which controls it
           | because they keep clicking on widget Z.
           | 
           | It's about observing the users fumble through your UX when
           | you know their motivation.
        
             | 10x-dev wrote:
             | I wish the problem were my misunderstanding the process,
             | because then I could fix it easily by learning more about
             | the process. I do get where you're coming from though.
        
             | ThalesX wrote:
             | > It's about observing the users fumble through your UX
             | when you know their motivation.
             | 
             | Some time ago we did such a test. We called 10 customers to
             | our offices and had them do some flows in the application.
             | They didn't fumble. They pretty much did what they had to
             | do and left positive reviews.
             | 
             | That whole thing got scrapped because consultants convinced
             | our CEO that qualitative data is not good for global scoped
             | startups, and that we should be building based on
             | quantitative data.
             | 
             | Honestly, in less than a year, our customer experience was
             | already taking a dive because all the extra little features
             | we would add and strange UI elements, it became a confusing
             | mess and our tracked NPS (Net Promoter Score) showed that.
             | I've since left the company, but I check on them from time
             | to time and they never really recovered and continue doing
             | A | B in the hopes of hitting that sweet spot. It's just an
             | unrecognizable monster at this point in my opinion.
        
           | throwaway98797 wrote:
           | only listen to customers problems and never their solutions
        
         | jklinger410 wrote:
         | > The main issue is that people mix conversion with customer
         | obsession!
         | 
         | The logic is: If they hate your app, they won't spend money. If
         | they love your app, they will. Which is what would make you
         | think A/B testing and UX work are the same thing.
         | 
         | There's really nothing new about this issue at all. Playing
         | towards the average creates a lot of shitty stuff, in
         | apps/websites as well as politics and wherever else there are
         | metrics to track.
         | 
         | The genius of a good product is that it will make a stand and
         | not give in to the whims of over-optimization in order to
         | maintain its original intent. This is what made Apple unique.
         | 
         | It requires leadership with guts who aren't chasing the latest
         | shiny object.
        
         | time_to_smile wrote:
         | The term "customer obsession" has become a red flag for me when
         | interviewing because I've never worked at or chatted with a
         | company that had "customer obsession" as value that wasn't
         | aggressively working to squeeze every dime from their users
         | with zero interest in whether or not this squeezing was harmful
         | to the customer.
         | 
         | An actual, sincere customer obsession (and btw I think we both
         | completely agree here) means that you are willing to lose out
         | on some conversion and revenue in order to make sure your
         | customers are top priority.
         | 
         | Real customer obsession isn't just an ethical principle either,
         | it makes business sense. The problem is that the value of
         | customer obsession is realized over the span of years or
         | decades. Companies that have a sincere customer obsession are
         | the kinds of places that survive economic ups and downs, where
         | people's children grow up and are loyal to the product because
         | they remember the time their parents were treated well by the
         | company.
         | 
         | If your only company focus is Q4 KPIs then you really can't
         | have "customer obsession".
        
       | forgotmypw17 wrote:
       | I have developed a personal strategy of ridding the Web of these
       | things. Anytime it happens, I close the tab and move along. Very
       | little of value is lost.
        
         | donmcronald wrote:
         | This is basically what I do. Anything that pops up or tries to
         | grab my attention gets instant closed before I look at it and
         | if I can't find the control to close it in 1s I just close the
         | whole tab.
        
       | WhitneyLand wrote:
       | What an excellent write up.
       | 
       | I agree with the sentiment on AB testing but I think the bigger
       | insight is that we need to be reminded to see the forest for the
       | trees with any process, tool, or goal.
       | 
       | Sometimes these intangibles are hard to measure and almost need
       | to be sensed.
       | 
       | It reminds me of how you can see the exact same development
       | methodology used at two different companies, where at one company
       | it works beautifully and at the other it becomes a bureaucratic
       | albatross.
        
       | epolanski wrote:
       | Anecdotal: we released plenty of improved features, like a better
       | gallery to see the items in our shop, users used it a lot +250%,
       | but conversion rate went down 4%.
       | 
       | They spent more time seeing the items and..didn't like the pics
       | and conversion went down. In the end we reverted to the crap
       | gallery we had before, they don't click it anymore and conversion
       | went back up again..
        
         | bee_rider wrote:
         | If it possible that there'd be a long term effect like:
         | 
         | * Users know you have a nice gallery
         | 
         | * They are more likely to shop at your store
         | 
         | * In the end, you get more sales despite the lower conversation
         | rate
        
           | epolanski wrote:
           | I agree with your point, but after finding out that in this
           | industry you just need to be able to monkey some code to be
           | called an engineer, random people are now data analysts
           | because they can pull "experiment A revenue up, experiment B
           | revenue down" and call it a day.
        
       | ryanmarsh wrote:
       | For many businesses revenue is a function of aggressive deal
       | making. Full stop. In an undifferentiated market of discretionary
       | (impulse) purchases if you don't hustle the customer you make
       | less. The author of this article is confusing companies that are
       | bad at hustling with hustling being bad.
       | 
       | One time offers, limited time offers, mailing list signups, up-
       | sells, and cross sells are time tested ways to increase sales
       | dating as far back as radio era telephone and catalog sales.
       | 
       | Steve Madden is a perfect example of this. They sell
       | undifferentiated popular shoe styles less expensive than high
       | fashion but more expensive than knockoffs. They have to hustle
       | you to get you on their mailing list (for 10% off your order) in
       | the hopes that you'll make another impulse purchase later when
       | you get a text or email from them. If they weren't as aggressive
       | you might never make another impulse purchase with them again as
       | there are tons of brands selling nearly identical products.
       | 
       | Some companies are just horrible at hustling so they actually get
       | in the way of you completing your purchase. In a competitive
       | market this is a self correcting problem.
        
       | Kaotique wrote:
       | AB testing shows zero respect to your customers. It is the
       | equivalent of testing your theories on lab rats.
       | 
       | Instead try to improve the customer experience, make better
       | products, improve customer service.
        
         | treis wrote:
         | >Instead try to improve the customer experience, make better
         | products, improve customer service.
         | 
         | Without a metric to say what is "better" and a method to
         | measure it this is empty advice.
        
           | Kaotique wrote:
           | There are many other metrics that do not involve AB testing.
           | You can just survey customer experience before, during and
           | after a purchase for example. I never said to throw out all
           | metrics.
           | 
           | With AB testing your are optimising for a specific outcome.
           | Usually higher conversion. As pointed out in the article
           | eventually you'll end up with a bunch of colourful buttons
           | and scary texts that persuade the user to click. A lot of the
           | "only 2 seats/rooms available" are lies to scare the user
           | into a conversion.
        
           | josefx wrote:
           | But does AB testing provide the only or even best metric for
           | that? It probably is the cheapest way requiring the least
           | engagement with the lab mice.
        
             | treis wrote:
             | AB testing is how you isolate a change and measure the
             | impact. It's the only real way to be able to associate
             | cause and effect. Best you can do otherwise is measure
             | something over time while making changes. You can try and
             | correlate changes with outcomes but it's hard to be sure
             | the change is what drove the outcome.
        
         | Saturdays wrote:
         | That's a very weak blanket statement, there are totally
         | reasonable A/B tests you can run that don't deteriorate a
         | user's experience, and the results can guide you to a better
         | customer experience overall.
        
           | Kaotique wrote:
           | It did not mean it too seriously, of course there are also
           | good AB tests, but there are a lot of bad ones out there.
           | Those are what the article was about.
           | 
           | (edited for clarification)
        
         | smeyer wrote:
         | >Instead try to improve the customer experience
         | 
         | AB testing can be (although isn't always) used to improve the
         | customer experience. Assuming you know exactly what will make
         | the customer experience best without actually testing it can
         | also lead to a worse experience.
        
           | throwaway290 wrote:
           | A/B testing helps you maximize a metric, not make customer
           | experience better. Those are different things.
        
             | noirbot wrote:
             | They're only different if you've selected bad metrics. If
             | you've got two different search algorithms, running an A/B
             | test and measuring how often the user selects the first
             | item returned is a good measure for how well your search
             | algorithm is returning the information the customer wanted,
             | which is good customer experience.
        
               | throwaway290 wrote:
               | They are always different. You cannot hold a conclusive
               | A/B test for customer experience.
               | 
               | Search engine, a single-purpose tool, is as simple as
               | they come regarding customer experience. Still, a good
               | search algorithm can make me click on the first result if
               | it is good, and a bad search algorithm can make me click
               | the first result because they are so bad that scrolling
               | further is a waste of time, especially if I already
               | needed to scroll through widgets and ads to get to the
               | first result.
               | 
               | It's not about just selecting good metrics, it's about
               | higher level picture that A/B testing can never get you.
        
             | ssharp wrote:
             | Better customer experiences often times lead to increased
             | metrics. They are not totally different things.
        
               | wongarsu wrote:
               | Maybe if you are very aware of the fact that your goal
               | and metric aren't totally aligned, but that often gets
               | lost. As a result A/B testing for longer website visits
               | can make websites that make it more obvious that the
               | information you want is there, but also make the path to
               | actually get it longer. A/B testing for engagement might
               | promote divisive behavior and fights. A/B testing for
               | read rate or clicks might lead to trust loss.
               | 
               | I think a lot of lessons from AI safety apply
               | surprisingly well to A/B testing, mainly around how hard
               | it is to align your actual goals with the metrics you use
               | for optimization, and how disastrous the consequences can
               | be. It doesn't have to go wrong, but it's incredibly hard
               | to ensure it goes right, especially if it's the only
               | feedback you have.
        
               | ssharp wrote:
               | I've spent a lot of my career doing A/B testing,
               | including doing that role exclusively for a number of
               | years. I specialize in ecommerce, so maybe I have too
               | narrow of a view here, but in that vast majority of
               | cases, I am optimizing for revenue per visitor, which is
               | a function of conversion rate and average order value.
               | There are sometimes leading indicators like engagement,
               | but in ecommerce, you're afforded the luxury of basing
               | things on revenue or even bottom line.
               | 
               | I really don't like the positioning of ALL A/B testing as
               | unethical behavior where you're hostilely trying to take
               | advantage of a user. It's quite the opposite. There are a
               | lot of extremely poor user experiences out there and a
               | quality testing program can help improve user
               | experiences, remove risk from making sweeping changes,
               | and help you learn more about your audience and market.
               | 
               | The vast majority of the successful testing I've done is
               | done around trying to _HELP_ users navigate the site and
               | product catalog, understand the product, and purchase the
               | product. Attention spans are fleeting with online
               | shopping and even the smallest points of confusion or
               | friction can turn shoppers off.
               | 
               | Additionally, often times I'll read into test results
               | after a month or so to see if there were any issues with
               | orders that might indicate purchases from disinterested
               | people or misaligned expectations.
        
               | noirbot wrote:
               | But that's just optimizing for bad metrics. At this
               | point, anyone who thinks "engagement" and "time spent on
               | page" are customer-positive metrics is in a different
               | mental space than you and I. There's a lot of ineffable
               | things that make up good customer experience that would
               | be hard-to-impossible to A/B test, but it doesn't mean
               | that A/B testing is "unsafe" just because it could be
               | used to optimize for bad things any more than any other
               | telemetry or metrics gathering could be bad because you
               | could optimize for evil things. And at the same time, bad
               | management and product leadership can optimize and
               | develop towards bad goals with plenty of tools that
               | aren't A/B testing.
               | 
               | It seems to miss the point to blame/stigmatize a specific
               | tool because it's been used poorly by a few bad actors in
               | a public way.
        
               | rout39574 wrote:
               | I think the point is that most metrics are "bad metrics"
               | for this purpose, as suggested by Goodheart's law.
               | 
               | https://en.wikipedia.org/wiki/Goodhart%27s_law
               | 
               | Further, I imagine that the obvious "known bad" metrics
               | are not selected only by "A few bad apples". I think it's
               | likely they are selected by the mass of business actors
               | looking for current quarter results.
        
               | noirbot wrote:
               | For sure. I don't think there's general "overall" metrics
               | that you want to be testing against every single time on
               | every change outside of basic performance metrics for
               | loading or rendering in real-world environments.
               | 
               | I wasn't at all trying to say that only a few places are
               | optimizing for bad things, but as you see all over this
               | thread, there's a number of companies that immediately
               | come to people's minds as bad actors when it comes to A/B
               | testing - Google, Meta, Microsoft. There's plenty of
               | other companies that are more ethical about it, or use it
               | as part of rolling out general changes and collecting
               | feedback. I know half of the time I log into the AWS
               | Console it has some sort of "Hey, we're testing out a new
               | upcoming UX for this page. Click here if you want to go
               | back to the old one", which seems like a decent way for
               | them to get feedback on the new designs while not
               | drastically disrupting things.
        
               | throwaway290 wrote:
               | Bad management can certainly ruin things without A/B
               | testing.
               | 
               | It doesn't excuse A/B testing simply being a poor tool
               | among all you have access to. Talking to users and
               | stakeholders, for example, provides infinitely more
               | input. (Edit: yeah in many cases measuring what users do,
               | directly watching or via analytics, is also useful.)
        
               | noirbot wrote:
               | Definitely - I'm not trying to say A/B testing is
               | amazing, just that a lot of the comments have a strong
               | "if you do A/B testing you're evil and are out to
               | manipulate people" bent to them, which I think is too far
               | in the other direction.
               | 
               | Talking to people is great, but getting a representative
               | sample is hard, and often people are bad at both
               | understanding what they want, expressing it, or even
               | being accurate about how they use things. I know when I
               | was working closer to the UX side of the business before,
               | I was constantly surprised by both what users would say
               | they want AND by how users actually used the products.
               | 
               | In my mind, A/B testing is good as a sort of "final pass"
               | to serve as broad, semi-random validation that the change
               | you're looking to make does actually do the thing that
               | it's intended to do. It's not great for early on when you
               | don't really know what to measure or look for, or if the
               | change is remotely reasonable, but it can help check for
               | if your focus group/user panel happened to be weirdly
               | skewed in their usage/desires.
        
           | mschuster91 wrote:
           | > Assuming you know exactly what will make the customer
           | experience best without actually testing it can also lead to
           | a worse experience.
           | 
           | For that you usually hire a market research company or do
           | what they will do: take an interviewer, two cameras (one
           | front-face, one top-hands) and hire an as-diverse-as-possible
           | pool of test candidates that you then put through whatever
           | workflow optimization you want to do. Then afterwards, you
           | interview them - side benefit, you can get really interesting
           | general side knowledge that you'd _never_ gain from a dumbass
           | A /B scheme: is your font style/color scheme legible, can the
           | site be used by colorblind people, are there stock photo
           | choices that give off stereotypical vibes...
           | 
           | It's real fun and a worthwhile experience for everyone
           | involved.
        
             | ssharp wrote:
             | It's not really an either/or option. You can use testing to
             | validate the changes stemming from market research.
             | 
             | Having seen lots of site redesigns go horribly wrong due to
             | 100% earnest people trying their best and utilizing the
             | research that was afforded to the process, I always
             | recommending incrementally testing into changes on high-
             | value / high-risk applications, even when the
             | "improvements" were backed by solid research. You never
             | know until you release.
        
               | mschuster91 wrote:
               | The "or" was meant to be the distinction on who does the
               | user testing - I've seen both in-house testing operations
               | and outsourced ones. For small scale operations, it may
               | actually be cheaper to run them in-house and only hire
               | external testers... cameras are dirt cheap these days.
               | 
               | Hiring a market research company is usually worth it if
               | you have a contract with them anyway (which gives you
               | better rates on the testing) or lack someone on staff who
               | knows how to deal with cameras.
        
         | rgavuliak wrote:
         | I have experience where the company paid a UX agency to create
         | a flow that was by all standards better customer experience and
         | a better product, nicer too. They ran an AB test, turns out
         | people were more likely to pay with the old version. AB testing
         | is good that it challenges what UX people think is better
         | experience or product with hard metrics.
        
           | nkrisc wrote:
           | Which is why A/B testing is an important part of the UX
           | _toolkit_. It 's a tool among others, and is one way to
           | validate assumptions. A good UX designer will try to base
           | their designs on data and reasonable hypotheses drawn from
           | the data, but a new design or flow is necessarily based on
           | some amount of assumptions, so it requires validation.
           | 
           | That said, an A/B test does not tell you _why_ something didn
           | 't work. You can make further assumptions based on the
           | results and develop new hypotheses, but it never tells you
           | why. Typically you would do some kind of qualitative UX
           | research on a prototype or even static concepts beforehand to
           | identify these kinds of issues before you even expend the
           | effort to do a live A/B test. Far cheaper to do a study with
           | 6-12 people and a prototype than to build out a full,
           | functioning A/B test experience.
           | 
           | It's possible the flow they created _was_ generally better
           | but perhaps it had one fatal flaw. Perhaps that flaw could
           | easily be remedied once identified.
           | 
           | A/B testing is just one small part of a good UX process.
        
       | scotty79 wrote:
       | Giraffe neck is the result of a/b testing.
       | 
       | If you known its inside anatomy you know what I mean.
        
       | andreareina wrote:
       | Getting a 503, so here's an archive:
       | https://web.archive.org/web/20220712122630/https://www.zumst...
        
       | _the_inflator wrote:
       | Maybe Apple will come up with a reality distortion field and will
       | remove "urgency" warnings and informations from websites on
       | Safari, as well as blocking "Join our Newsletter now and get a
       | discount" pop-ups.
       | 
       | What once was ads everywhere, is now psychological gaming.
       | 
       | I hope someone comes up with a Google Extension, and maybe Apple
       | with a new "Access Website" mode.
       | 
       | These messages are boring to be honest. Once you noticed them
       | everywhere, game over for me. Time to move on.
        
       | londons_explore wrote:
       | You can get long term results from AB tests long after the test
       | has ended...
       | 
       | For example, you can see if Group A or Group B from a test are
       | more likely to still use the site 1 year later.
       | 
       | You hypothesize that those ways to 'juice the metrics in the
       | short term' hurt the user experience in the long term... Well if
       | your hypothesis is right, these long term AB results should show
       | it.
        
         | epolanski wrote:
         | > For example, you can see if Group A or Group B from a test
         | are more likely to still use the site 1 year later.
         | 
         | This isn't very feasible on most products and certainly limited
         | by the amount of data collected.
        
       | chunkyks wrote:
       | "If a study came out that said deafening high-pitched noises
       | increased conversion rates, we would all be bleeding from our
       | ears by end of business tomorrow, right?"
       | 
       | Netflix auto play? Is that you? You were a hateful idea, no one
       | liked you, yet you stubbornly hung on for far too long
        
         | _tom_ wrote:
         | I'm convinced that Netflix uses "number of hours watched" as a
         | success metric. Autoplay raises that.
         | 
         | I'd pay twice as much to watch half as much quality
         | programming, but that would tank what they _think_ is a
         | positive metric.
        
       | axg11 wrote:
       | A/B testing is local optimization. It should only be done on a
       | mature(-ish) product when you have given up on finding a global
       | minimum.
       | 
       | Running experiments and A/B tests are popular because it is
       | _guaranteed_ to give you signal. If you have a large engineering
       | team and you're not sure how to filter the quality of results,
       | gating everything through A/B tests is a well understood
       | methodical way to ensure only positive work makes it way through.
       | 
       | Early stage startups should never A/B test. When you're searching
       | for product market fit, you're doing global optimization within
       | the search space. Your product will change drastically as you
       | make new learnings. Premature optimization (A/B tests) will only
       | be detrimental.
        
         | jaggederest wrote:
         | > Running experiments and A/B tests are popular because it is
         | _guaranteed_ to give you signal. If you have a large
         | engineering team and you're not sure how to filter the quality
         | of results, gating everything through A/B tests is a well
         | understood methodical way to ensure only positive work makes it
         | way through.
         | 
         | It's almost guaranteed to ensure only _false_ positive work
         | makes its way through. If you 're picking 0.05 as your P value,
         | and you're running dozens to hundreds of tests, your false
         | positives are almost certain to exceed your actual positives.
         | 
         | When I'm working for clients that do a lot of A/B testing, I
         | suggest that they should always run A/A tests to ensure that
         | they're not incorrectly rejecting the null hypothesis. If your
         | A/A tests are showing significant differences, you have a
         | problem in your testing pipeline that by definition can't be
         | cured by more testing. You need holdout groups and selectivity
         | about what to test, instead of just throwing everything at the
         | proverbial wall.
        
           | jakubmazanec wrote:
           | That's why we calculate stuff like effect size and power of a
           | test (or even better, use Bayesian statistics); just p < 0.05
           | is practically almost meaningless.
        
           | purplerabbit wrote:
           | Great insight. Without this approach, A/B testing could be
           | used to generate an infinite stream of meaningless work
        
           | 3pt14159 wrote:
           | Even checking A/A tests won't surface all the issues. A
           | proper A/B test is one that samples over a long enough time
           | to adjust to the true audience of the service.
           | 
           | For example, imagine a costume shop that ran a couple dozen
           | A/B tests over the summer. Those results may look
           | statistically significant. They may even stand up against the
           | A/A test. But people that buy costumes in the summer are
           | very, very different than people that buy them in October,
           | and if 90% of the store's business is in the run up to
           | halloween, then all these micro optimizations could actually
           | make your total business performance worse.
           | 
           | I'm a A/B testing skeptic too, though I admit they have a
           | time and a place. My favourite are ones that can be reasoned
           | about as actual hypotheses. This usually involves some degree
           | of data analysis or segmentation. For example, increasing
           | font sizes may boost conversion, and a later analysis shows
           | that this was almost solely a lift in conversion rates
           | amongst the 45+ cohort. The data in this case isn't just
           | blindly driving design decisions, it's helping inform the
           | staff on how to better design in the future for the audience
           | we have.
        
           | marcosdumay wrote:
           | Well, if you are running hundreds of tests with 0.05 p-value,
           | you will get plenty of false ok A/A tests, and there isn't
           | much of a reason to expect them to be correlated to actual
           | signal on your A/B tests.
           | 
           | A/A tests do test your methodology as you said. But they do
           | not fix a p-value one order of magnitude higher than it
           | should be. (And yeah, I'm aware _you_ know that, but your
           | comment places them on the same context, so it got
           | misleading.)
        
         | andsoitis wrote:
         | Even for a mature product where you might be doing A/B tests to
         | explore hypotheses that you think will improve the product for
         | the user, it is also worth considering doing mountain tests
         | where you try to escape the local maxima.
        
         | ravivyas wrote:
         | "Running experiments and A/B tests are popular" ... because you
         | can give up on your own judgment and opinions and say "the data
         | says"
        
           | 411111111111111 wrote:
           | > _give up on your own judgment and opinions and say "the
           | data says"_
           | 
           | The beauty of AB testing is that you don't have to give up
           | your opinion. You can just change irrelevant things until the
           | result you desire gets proven by chance and now you've got
           | data to base your opinion on!
        
       | fairity wrote:
       | It doesn't seem like the author has any hard data that supports
       | his claim that long term LTV and K-factor losses outweigh short
       | term conversion rate wins. Maybe I missed it? Without said
       | evidence, it's probably safe to assume his generalized claim is
       | wrong in most cases.
        
       | commandlinefan wrote:
       | > Next to some hotels, a message that supply was limited.
       | 
       | It's also worth noting that there's no way in hell they actually
       | know that with any sort of precision. No GDS has proper up-to-
       | date knowledge of bookings from all the various sources that
       | hotel reservations actually go through (they overbook _airline
       | flights_ ). What they're really saying is that the small
       | inventory of rooms that _are reserved for them to book
       | exclusively_ are almost gone.
        
       | gumby wrote:
       | Such perfect timing: I just tried to place a take out lunch order
       | with a restaurant. Opening the page popped up a modal box that
       | said "Join Our List Subscribe to find out about new specials,
       | community events, store openings and more." There were no buttons
       | to click, no place to enter my email address (had I wanted to)
       | and clicking did not dismiss it. The modal had a background that
       | obscured the actual page.
       | 
       | I finally opened the inspector and deleted it, so that I could
       | use the menu to select "order online", which took me to a page
       | ... with the same modal.
        
       | ravivyas wrote:
       | In todays world of algorithms optimising marketing, and constant
       | updates on marketing channels, it is hard to say if an A/B test
       | worked as quality of users is never consistent.
       | 
       | I currently work in a game publishing company, here are 2
       | anecdotes from it
       | 
       | 1. We run an A/B for game performance but we keep changing the
       | bids for our games, and thus get varied quality of users, A/B
       | tests don't really help in such a case 2. Once by mistake we ran
       | the same creative on FB for 2 different ads.. both ended up
       | having totally different metrics
        
       | walrus01 wrote:
       | If you _really_ want to see a massive amount of additional offers
       | and small /partially hidden "no thanks" links, check out the work
       | flow to reserve and rent a small light duty trailer with U-Haul.
       | 
       | You have to click through at least 10 pages of additional offers
       | (and many extra price things that are added by default!) before
       | you get to the actual checkout page.
        
       | phendrenad2 wrote:
       | A/B testing can be powerful, but you quickly lose your editorial
       | voice and your headlines become the same clickbait garbage that
       | works for bottom-tier blogspammers. Look at a site like The
       | Register. Could they use A/B testing to pick headlines? If they
       | do, it's a light usage, because the clever and witty headlines
       | have an internal consistency that I've come to enjoy and expect.
        
       | happimess wrote:
       | I had a PM who pushed us to A/B test _everything_. We hired a new
       | graphic designer who suggested that we change our product links
       | from ALL CAPS to Title Case (a very popular idea on the team, and
       | his first real suggestion after a few weeks with us), and she
       | insisted that we A/B test it first. It felt like an insult to
       | him, and a dumb test since title case looked way better.
       | 
       | The three key outcomes I observed from the relentless A/B testing
       | were UI antipatterns, team burnout, and a well-attended
       | conference talk about "how we ran 105 A/B tests in a year, and
       | what we learned".
        
         | gorbachev wrote:
         | Was one of the learnings "Everyone hated the product manager"?
        
         | edmundsauto wrote:
         | I've had a similar experience, although my learning was
         | "people, even experts, are really terrible at understanding
         | which treatment will perform better".
         | 
         | We always run >=3 variants, surveyed the dozen team members on
         | which one they thought would run. Over the years, there was no
         | clear pattern over who could make that prediction.
         | 
         | IE, it's not possible to predict which is the most effective
         | treatment, even when you include a really bad idea in the
         | treatments!
        
       | ryanmcbride wrote:
       | I give an incredibly similar warning every time a company I'm
       | working for starts trying to dip their toe into A/B testing. I
       | have a lot of experience with it at scale (one at a fortune 100
       | company) and I've even built an a la carte testing framework in
       | aws for a company that didn't like Target or Optimizely.
       | 
       | Every single time I warn them about how the bill of goods they've
       | been sold with A/B testing is almost completely unattainable,
       | especially in the way that they want to go about it. They won't
       | magically start getting more conversions by changing a button
       | color. Even if they start getting more clicks, they rarely start
       | getting more complete conversions, because the increased numbers
       | is usually from people who weren't good leads in the first place.
       | 
       | On top of that every company I've worked with has no idea what
       | the real methodology for good tests is, no matter how many times
       | I explain it or put it in a slide deck. I would constantly get
       | requests to use A/B testing for feature rollout.
       | 
       | Them: "Hey, could you do an A/B test of our existing site design
       | and our upcoming redesign?"
       | 
       | Me: "if the old design performs better are you going to toss out
       | the redesign?"
       | 
       | Them: "No we're going with the redesign but we want metrics on
       | how it'll affect traffic"
       | 
       | Me: "Those metrics are useless if you aren't going to listen to
       | them, and if the results come back and the old design performs
       | better, you're not even going to put it in a presentation because
       | it's counter to your planned actions. There's literally no point
       | in running this test"
       | 
       | Them: "Run it anyway"
        
       ___________________________________________________________________
       (page generated 2022-07-12 23:00 UTC)