[HN Gopher] I recreated famous album covers with DALL-E ___________________________________________________________________ I recreated famous album covers with DALL-E Author : lucytalksdata Score : 141 points Date : 2022-08-20 18:06 UTC (4 hours ago) (HTM) web link (lucytalksdata.com) (TXT) w3m dump (lucytalksdata.com) | NonNefarious wrote: | Went to use my invite, and OpenAI demands your PHONE NUMBER. | | No excuse for it. Screw that. | randymy wrote: | Worth noting that DALL-E automatically "rejects attempts to | create the likeness of any public figures, including | celebrities". So, you wouldn't be able to get an image that | included the 4 Liverpudlians. It does allow you to create fake | faces. Might be fun to try and recreate Miles Davis Tutu, Aladdin | Sane, Piano Man. | cameronh90 wrote: | My experience was that if you name a celebrity (and the request | isn't blocked) it quite often generated something that has the | same general vibe of the target, while also looking entirely | unlike them. | | It reminds me of how TV shows often have a president that | resembles the current president in superficial ways, while | being distinct enough that they won't get sued. | | I'd be interested to know why this happens. | [deleted] | CamperBob2 wrote: | It will yell at you and threaten your account with termination | if you try to create anything based on a living person's face, | from what I can tell. | phonescreen_man wrote: | Interestingly related, I just used AI image generation to create | my EP cover.. first I tried running luciddrains dall e 2 PyTorch | implementation using the prompt "death by Tetris EP album cover | 2022" unfortunately I am using a Mac Pro so the gpu was not able | to work. Then I tried imagen PyTorch implementation and used same | keyword. This time it was working with the CPU unfortunately 2 | days in we had a power outage so I had something but nothing | complete. So I fed the generated image into the google dream | generator and got my album cover! | | https://willsimpson.hearnow.com/ | [deleted] | pjgalbraith wrote: | I've been recreating the 50 worst heavy metal album art using AI | as well, currently at 30. Recently I've found Stable Diffusion | plus DALL-E inpainting to be a good combination. | | https://twitter.com/P_Galbraith/status/1560469019605344256 | wodenokoto wrote: | I'd love to see what it had come up with if simply prompted for | "Album cover for Nevermind by Nirvana" | google234123 wrote: | Is the issue with faces a deliberate choice by the devs? | nprateem wrote: | An upvote for whoever can give me a prompt to generate an image | of someone who's been massaged so much their body has been | flattened, as if they were made of dough or jelly or something. | | I spent ages on this earlier getting nowhere. I'm starting to | think DALL-E is better if you don't really know what you want and | you're just fishing for ideas. | _the_special_ wrote: | > an image of someone who's been massaged so much their body | has been flattened, as if they were made of dough or jelly | | do you want a realistic looking one? 3d rendered? what do you | have in mind exactly? | nprateem wrote: | Anything really. I tried cartoons, digital art, watercolours, | pixar style. None worked. | birdyrooster wrote: | also what is the PCs budget? | fzfaa wrote: | I didn't know that I needed this. | doerinrw wrote: | Ok you owe me $3! This is a really hard prompt, and only got | close-ish with inpainting. Got the base figure with "massaged | relaxed flattened person, flat, flat, flat, flat, claymation", | then finally got it to add a not-too-terrifying face with | "photograph of smiling white woman laying on the ground, | promotional photography". Final tweaks to erase some artifacts | (it really didn't want to believe the figure on the left was | the referenced woman) was "photograph of a wooden floor with a | white mat and small plants, overhead shot". | | DALLE is hard! Curious to see if I can be beat. | | https://imgur.com/a/tuyGjxp | nprateem wrote: | Ha ha that's much closer than I got, well done! Yeah it's | difficult. I didn't think of flat, flat, flat. | teddyh wrote: | > _The question is, when the music blows up and the artwork | becomes a signature, like the Rolling Stones ' Tongue & Lips, who | will own the copyright?_ | | That's what trademarks are for. | sieabah wrote: | If they had bothered to read the license agreement they would | know whoever generates the art owns the copyright. Since it's a | pay-for action the copyright is owned by the payee. | | So really they generated these and never bothered to do the | research of their own question. | teddyh wrote: | I have, like the article author presumably also does, a | profund doubt as to whether generated works of this kind can | be free of any copyright as long as the tool used is itself | created using myriads of copyrighted works (as training | data). I certainly do not trust the claims of the tool | creators; they have all the incentive to ignore any copyright | problems in order to get a tool which is usable. | | And, as the article states: | | > _But seriously, how creative and original can you be with | something that is trained on the works of millions of other | creators?_ | | > _To me, it is unclear whether you can actually call these | works your 'own' at all, because there's always someone | else's touch on it._ | | > [...] _users of DALL-E will also never be sure whether they | are generating something that is 'theirs' or just a knockoff | of someone else's work._ | hedora wrote: | OK, so I give you license to use this URL I just generated to | generate your own stuff. | | It's pay for action (send me a penny if you find anything | worthwhile), and the copyright is owned by the payee: | | https://images.google.com/ | meowkit wrote: | The URL has not been "generated" in the same sense. You are | retrieving an existing string. The images from google are | not "generated" in the same sense, they are indexed from | google's search algorithm. | | The generative models, specifically for DALLE here, compute | pixel unique images. You might say these models index a | subset of an extremely high dimensional space (pixel count | * RGB color values) using a query. Traditional search | engines build an index from nothing and then use a search | query to find the best matches in a more discrete space. | teddyh wrote: | If I made a website where you could type text and get | images, and I said that your held the copyright to any | images you got, could you safely act on that assumption? | What if my web site was simply a proxy for Google image | search? | system2 wrote: | DALL-E still seems very useless. Reminds me of the hype of | Cardano. | andreyk wrote: | I wonder how long the novelty of DALL-E will persist. HN seems to | upvote anything titled "I did X with DALL-E". This is a fun post, | but it's not that interesting or surprising. Still worth a look | don't get me wrong, but personally didn't learn anything new from | it. (eg recreating the famous pink Floyd cover with "Outline of | prism on a black background in the middle of scene splits a beam | of light coming from the left side into rainbow on the right | side" unsurprisingly worked somewhat well). | felipelalli wrote: | I love this kind of bad humor of HN. | tsimionescu wrote: | To me the most interesting thing about the article was actually | just how bad the results are. It's interesting that you picked | the Dark side of the moon one, as to me that seemed by far the | worse one - none of the pictures really resembled the simple | geometry of the original. While I understand why recreating the | Nevermind cover was difficult, it was surprising that | recreating such a simple geometric pattern failed so | spectacularly. | | The only cover that really worked from my point of view was the | Velvet Underground one, and perhaps the Rolling Stones one. | Abbey Road came closer than what I thought it would, but was | pretty bad ultimately, and the other three really had nothing | usable. | hombre_fatal wrote: | It will probably get boring the same time HNers stop confusing | "hmm this isn't that interesting guys" as notable commentary | that needs to be shared. | Apocryphon wrote: | DALL-E generate us some pics of unhappy HN curmudgeons | paulcole wrote: | Here ya go: | | https://imgur.com/a/kgQtUMf | | I don't know why it's got the over-18 warning. It's 100% | SFW. | turtleyacht wrote: | > _A grizzled veteran of the corporate wars: expert | troubleshooter, code tracer, shell flinger, toolsmith and | pied piper. Cheerful amid chaos, the cavalry to on-call, | the invited to skunkworks, tiger teams, red teams._ | | https://labs.openai.com/s/AkfefTCg9CnFTh6Dh42pRbw5 | | https://labs.openai.com/s/l6Cy8PA1zDdvKdVdKs47AdLO | turtleyacht wrote: | > _Grizzled veteran of the corporate wars, endlessly | optimistic yet hardened by dint of spectacular, | catastrophic system failures; a tinkerer; a hacker; an | astute observer; always prone to imagining the highest | heights and the most profound lows; blunted against tact; | sharpened by scouring experience; unforgiving, yet ever | willing to advise; rightfully paranoid; forced to | specialize; a stubborn craf_ | | https://labs.openai.com/s/6oZ1RkvfDccpMpqBWK6HzOPo | andreyk wrote: | Well then it will never get boring, will it... Fair enough, | it's just been striking to see how many of these sort of | posts have been popping up. Like I said, it's still fun and | worth a look. | | Perhaps as a blogger I am extra salty when relatively low | effort stuff gets upvoted over things that take a lot of work | to write (in some cases, those being things I have written). | But hey, that's life. | paulcole wrote: | Low effort stuff can (and often is) better and/or more | entertaining than high effort stuff. | andreyk wrote: | Can't argue with that! | [deleted] | DubiousPusher wrote: | I actually thought there were some interesting things revealed | here. Particularly interesting to me were the Nevermind and | Abbey Road results. | | Nevermind because it showed this weakness in the model in | understanding what a dollar bill is. The most novel result | being the image where the baby's visage appears inside the | dollar. | | Regarding Abbey Road, I found it interesting that the model's | concept of a public person spans their lifetime evidenced by | the images where the contemporary images are used. Also | interesting to me is the model's weakness in understanding | specific people. | | Then again though, I haven't been clicking on every DALL-E post | so maybe this is old news. | fifilura wrote: | For me, what was interesting to see was that it didn't manage | very well with the Abbey Road question. | | I had imagined that some parts of the training data would | consist of the actual image, and it would find a good match | for it somewhere deep into it's artificial consciousness. | TheOtherHobbes wrote: | It has no concept of The Beatles or Abbey Road or zebra | crossings or perspective or creative composition in | photography, so effectively it just mashes up some results | from an image search. | | If it created something very close to the original it would | be overfitting. So it flails around rather randomly in | photos-of-Beatles+Zebra+Abbey Road space. The results are | novel, but they're also monstrous, distorted, and | unartistic. | robocat wrote: | > unartistic | | Says who? Maybe you don't like the style, maybe it is | derivative, but it is definitely artistic, as you allude | to by saying it is novel. Monstrous and distorted can be | a style. | | "mathematical art, 1924, litography, abstract generative | art" generates | https://pbs.twimg.com/media/FanUkREXEAEH6su.jpg -- Can | you pick the influences? I can't. | | "low poly game asset, Cthulhu monster, 2000 video game, | isometric view" generates | https://pbs.twimg.com/media/FanTf3IXgAETece.jpg | | I thought https://pbs.twimg.com/media/FanNq6sXkAENyU5.jpg | was really interesting, because the LEGO logo is not a | 100% faithful reproduction even though the LEGO logo is | so ubiquitous - showing that the images are generated. | | From a longer tweet thread by @fabianstelzer comparing | DALL-E 2 vs Midjourney vs StableDiffusion: https://thread | readerapp.com/thread/1561019187451011074.html | zone411 wrote: | "DALL-E" in titles is about to be replaced with "Stable | Diffusion." The beta website is already live but the | interesting part will be specialized fine-tuned models based on | public weights. There should be more technical experimentation | since Stable Diffusion weights are only a few GBs and inference | can be run on any recent GPU. There might also be more | controversies because it can create more uncensored images. | | Somebody posted a nice comparison between DALL-E 2, Mid | Journey, and Stable Diffusion: | https://twitter.com/fabianstelzer/status/1561019187451011074. | CamperBob2 wrote: | I dunno, I don't see a lot of examples on that page where | Stable Diffusion outperformed DALL-E 2. | [deleted] | rjtavares wrote: | Just like images after text, pretty sure once the novelty of | images ends we'll move to animations, music, movies, computer | games, and so on. | Apocryphon wrote: | For the past few years, I've long considered Hollywood failed | adaptations to be a non-issue because so long as industrial | civilization exists, media companies are going to squeeze | what they can out of IPs. So while _Game of Thrones_ might | have ended quite poorly, I figure we're only a few decades | away from a different rights-holder giving it another go. | | Thus, I look forward to the AI-generated versions of famous | works that deepfake the original cast into speaking | (hopefully) better-written dialogue. Imagine when this | technology is widespread, fanfic authors rendering their | interpretation of works with the descendants of DALL-E. | Everyone gets the dream adaptations and sequels and finales | they want. | voltaireodactyl wrote: | The trouble with this, for me, is that wouldn't such a | world necessarily mean the end of new actors (not to | mention new worlds and stories)? After all, why hire | unknowns when you can stack the cast with the A-listers of | all time? | | And if everyone stacks the cast thusly, there are no | opportunities for new actors to work with old and be | mentored + opportunity to simply works since every bit of | work is what adds up to make a journeyperson. | notahacker wrote: | Unless copyright changes a _lot_ , the A-listers are | going to expect a lot of money for a film with their name | on the poster and their carefully curated likeness | throughout, whether they have to turn up to shoot it or | not. | voltaireodactyl wrote: | Totally, but their estates won't be as precious (as we're | seeing with holograms already). And once a certain | critical mass is built up, it's built up -- and it's be | foolish not to use it. | Apocryphon wrote: | Still gotta populate the training data | voltaireodactyl wrote: | I get that, but I find it hard to see why one would look | for NEW training data from actors who will never have the | opportunity to become as seasoned as their forebears, | when you have 100+ years of filmed historical data (among | other formats) from which to pull your players? | xor99 wrote: | Generative art and by extension DALL-E has a very fast | attention decay imo. We can't avoid noticing patterns and | getting bored by them. This makes art and music fun because the | things that stick and stay interesting JUST DO for some unknown | reason. | w0mbat wrote: | How do you know that the album covers are not part of the corpus | of images that DALL-E was trained on in the first place? | machinekob wrote: | Is i do smth with DALL-E auto top hacker news post i saw like 20 | post like that in past 2 weeks. | tsimionescu wrote: | It's interesting that the prompts that would do badly in a Google | image search also seem to be the ones that make poor prompts. | Basically, it seems that rather than describing a scene, you have | to try to give an analogy for some image(s) that it might have in | its training set - which is why, I believe, "banana in the style | of Andy Warhol" produces a much higher quality result than | "Outline of prism on a black background in the middle of scene | splits a beam of light coming from the left side into rainbow on | the right side". | alisonkisk wrote: | Michelangelo11 wrote: | Man, after seeing Stable Diffusion's output, DALL-E's looks just | janky. Like watching a propeller plane after seeing a jet. | | Crazy how fast the tech is moving. | twostorytower wrote: | DALL-E is capable of very high quality photorealistic images | with the right prompts. | | Here's one I made: https://imgur.com/yAzKkHb | | "High detail, macro portrait photo, a handsome Australian man | with a strong jaw line, blue eyes and brown hair, smiles at the | camera, set in an outdoor pub at golden hour, shot using a | ZEISS Supreme Prime lens" | xwdv wrote: | Although AI artists will destroy a lot of jobs, it will also | create demand for new jobs for people who specialize in "paint | overs" - taking a high concept output created by AI artists and | touching it up to perfection. | | Or perhaps even beyond just a paint over, and into the realm of | recreating an entire AI artwork but with a human touch to get | details just right. | | Looking forward to it. | avian wrote: | > taking a high concept output created by AI artists and | touching it up to perfection. | | When you put it like that, it sounds like a nightmare up side | down world. It's not AI that's the tool for enhancing human | creativity. It's humans that are the AI's tool, cleaning up the | edge cases the AI artist can't handle (yet). It destroys | creative jobs that give joy to people and creates assembly line | jobs for them to slog in. | krapp wrote: | The purpose of jobs has never been to give people joy, but to | extract value from their labor as quickly and efficiently as | possible. Getting any kind of emotional satisfaction from | one's work is a privilege which arguably points to an | inefficiency in the market, as that energy is wasted which | could better be put to productivity. | | Artists, programmers and everyone else will have to find | their joy somewhere other than than selling themselves to a | corporation, once AI driven markets optimize away any room | for "joy" and the like, and that's going to be one of the few | good things about automation. The sooner we break people from | the Puritan delusion that work defines a person's meaning and | the value of their expression, the sooner we can once again | decouple culture from the machinery of capitalism. | notahacker wrote: | The humans are also the people that come up with the concept | in the first place... | | Also sounds like an upside down use of the tool, since AI | generated art really isn't that great at composition once | you've got over the magic of it being able to respond to the | prompt at all (but is much better at texture and filling in | boring details), and the current state of the art AI tools | can produce images which conform to a human guideline | sketch... | 8note wrote: | The AI is enabling the creativity of laypeople. That's still | an enhancement, even if they don't have the technique to make | a polished product | vanadium1st wrote: | I am already doing exactly that, and am getting paid for it. | | I am a logo artist and I sell pre-made logo designs. Before the | current AI services I had to come up with visual ideas by | myself, like a caveman. Now I use the AI to generate a bunch of | sketches and blurry ideas, and then use my graphic design | experience to polish them up to a usable level. Here's how it | looks. https://imgur.com/a/DKTsKdC | | I am absolutely sure that a lot of people are doing the same | right now, just keeping quiet about it. | xorcist wrote: | Thank you for that perspective. The linked work is clearly | the work of a skilled professional. | | I am intrigued by the use of AI as a form of creativity | assist. As someone without any talent for this, the left | pictures are useless for me, as I don't know how to take them | into something like the pictures on the right. The point of a | sketch is to show them to a customer, but if you would show | these sketches to me, I wouldn't know which one would turn | out great and which one wouldn't. | | Given that, do you feel that the generated sketches are | useful as a base sketch? I mean, you could probably have used | any of the existing NFL teams logos and as inspiration, | instead of letting the software remix them for you. | avian wrote: | Here's another spin on this: | | Imagine you're a software developer. In the near future, your | manager wants a feature implemented in your company's app. He | throws together a short mail with requirements and sends it to | the prompt engineering department. | | The prompt engineers fix a few typos, clean up the grammar and | pepper a few secret sauce keywords around like "in the style of | firefox", "in the style of kde". This get thrown into Microsoft | Copilot 3.0 that barfs out a bunch of code. | | Copilot's code has inconsistent indentation, three different | method naming conventions and some variables named in a foreign | language. It runs, but crashes if you tap on the lower left | corner of the screen and allows you to drag the order quantity | below 0. But that's ok, it's why we employ software engineers | like you in our company. You will use your years of coding | experience to touch up AI code to perfection. Better get the | details just right until Monday all-hands! | | Still looking forward to it? | xwdv wrote: | sounds to me like I still get paid handsomely | teddyh wrote: | The new copyright washing industry is nearly upon us. | egypturnash wrote: | As a professional artist, this sounds like hell. | sgt101 wrote: | Look, it's trained on these images. | | It's really great and cool and all - but it's retrieving things | that it was trained on. | | Show me something original it did. | Engineering-MD wrote: | It's hard when it was trained on everything pretty much. That's | the same problem as with GPT3. In my mind it's still brute | forcing a solution but instead of endless computation it's | endless examples | kgeist wrote: | Is this "bruteforcing" really different from what our brains | do? We see thousands, millions of little things (examples) | every day. Then we combine what we've seen into something | new. Probably the only difference is that DALLE's training | was done once while our brains are trained every day for 80+ | years. | l33tman wrote: | None of the AI generators retrieve things they were trained on, | they don't work that way. Everything is original. However our | definition of "original" might vary a bit, but so it will vary | for any work of art any human artist do as well, as they are | also trained on the same images. In the end, a lawsuit and a | courtroom might have to decide if by chance someone or some AI | creates a picture used commercially that seems similar to | someone else's trademark or copyright. | | Most of the images I've generated using Dalle 2 feels | completely original. Just have a look at the reddit r/dalle2 | and I'm pretty sure you'll also decide they're "original | works". | dsign wrote: | It's going to leave all those artists without a job, you just | wait!! | cowmix wrote: | After getting access to the beta, combined with all these HN | posts -- I've determined DallE2 is neat but no where as great as | the initial samples made me believe. | twostorytower wrote: | It is actually incredibly capable but if you're looking for | photorealistic images of people, it needs very specific | directions. I learned a lot from this person creating AI | portrait photography: | https://old.reddit.com/r/dalle2/comments/wsi97q/some_of_my_p... | yummybear wrote: | I love seeing people experiment with this technology. You can | feel we're on the cusp of something great - whatever it is, we're | just not quite there yet. | soneca wrote: | Have anyone given a prompt to Dall-e of designing a company | website and included "make it pop!"? | | Maybe the AI will finally get what designers always complained | about annoying clients. | doerinrw wrote: | This isn't really how DALLE works AFAIK but a very fun idea | nonetheless. Here's a quick more simplified experiment than the | great work above: "website design mockup.", with and without | "Make it pop!" | | https://imgur.com/a/fy2Uq4x | codetrotter wrote: | Prompt: | | > Create a website design for a company that sells propane and | propane accessories. The name of the company is Strickland | Propane, a local propane dealership. Make it pop. | | Results: | | * https://i.imgur.com/Jv7NJEN.png | | * https://i.imgur.com/5Uiyg1R.png | | * https://i.imgur.com/LL1DC11.png | | * https://i.imgur.com/buv5BvS.png | | So there you have it :p | twic wrote: | STONIA PRONGAND! I would not buy a propane container which | looked like that. | codetrotter wrote: | Another one. | | Prompt: | | > Create a website design for ACME Corporation, a company | which produces a wide array of products that are dangerous, | unreliable or preposterous. Include customer quotes from a | dissatisfied Wile E. Coyote prominently on the page. Make it | pop. | | Results: | | * https://i.imgur.com/WK3QBj9.png | | * https://i.imgur.com/Bghgzjt.png | | * https://i.imgur.com/XLyYx76.png | | * https://i.imgur.com/QTSyFTc.png | codetrotter wrote: | More still. | | Prompt: | | > A website design concept for Apple Inc, in Neumorphism | design style, showcasing the next generation iPhone Pro | Max. Make it pop. | | Results: | | * https://i.imgur.com/lU4Mf0X.png | | * https://i.imgur.com/ROMJEjT.png | | * https://i.imgur.com/U4sDheM.png | | * https://i.imgur.com/3bRXCs1.png | | Tbh this is probably the worst one yet.. By that I mean, | the results for this prompt are the least reflective of the | text in the prompt. Sure it got the iPhones, but it doesn't | really feel like a website design, and it doesn't feel like | Neumorphism design style. | codetrotter wrote: | More. | | Prompt: | | > Website design for Weyland-Yutani Corporation. The | Company was founded in 2099 by the merger of Weyland Corp | and Yutani Corporation. Weyland-Yutani is primarily a | technology supplier, manufacturing synthetics, starships | and computers for a wide range of industrial and commercial | clients, making them a household name. The website design | for The Company is mobile first. Make it pop. | | Results: | | * https://i.imgur.com/JyhYK5b.png | | * https://i.imgur.com/J5aPXCH.png | | * https://i.imgur.com/ksqrW09.png | | * https://i.imgur.com/uXBcGa5.png | soneca wrote: | Thanks a lot! | | Hopefully I am not asking too much, but can you run the | same prompts _without_ the "make it pop"? So we can learn | what is "pop" | codetrotter wrote: | Dall-E 2 will give varying results even for the same | prompt. If we want to learn what is "pop" we would have | to ask it very many times. | | But I will give it a try. | | First I do three more runs where I give it the same | prompt that I gave it initially; | | > Create a website design for a company that sells | propane and propane accessories. The name of the company | is Strickland Propane, a local propane dealership. Make | it pop. | | Results. | | Run 1: | | * https://i.imgur.com/qqOJHXj.png | | * https://i.imgur.com/fao6NMs.png | | * https://i.imgur.com/yH2Lned.png | | * https://i.imgur.com/SiH3hmU.png | | Run 2: | | * https://i.imgur.com/1rppjpR.png | | * https://i.imgur.com/6KiITZn.png | | * https://i.imgur.com/uQTei8M.png | | * https://i.imgur.com/ITacN1P.png | | Run 3: | | * https://i.imgur.com/KAm6Bc9.png | | * https://i.imgur.com/LBBdotk.png | | * https://i.imgur.com/ZPD32oV.png | | * https://i.imgur.com/UkdSiZM.png | | And then I do three runs where I give it same prompt, but | this time _without_ "make it pop"; | | > Create a website design for a company that sells | propane and propane accessories. The name of the company | is Strickland Propane, a local propane dealership. | | Results. | | Run 1: | | * https://i.imgur.com/MYIGCcu.png | | * https://i.imgur.com/Iw9JjVf.png | | * https://i.imgur.com/aRaD4ew.png | | * https://i.imgur.com/B7lOV3u.png | | Run 2: | | * https://i.imgur.com/9J2T4Mq.png | | * https://i.imgur.com/058wHEH.png | | * https://i.imgur.com/KVdhYVL.png | | * https://i.imgur.com/sT2U3DM.png | | Run 3: | | * https://i.imgur.com/ub1EpvY.png | | * https://i.imgur.com/6TkAI1U.png | | * https://i.imgur.com/XnWGaUm.png | | * https://i.imgur.com/IL03niN.png | | Tbh I don't think we can draw much of a conclusion from | this.. | soneca wrote: | I agree, Dall-e doesn't seem to understand what "pop" is | either. Thanks for the effort in indulging me, though! I | appreciate! | bryanrasmussen wrote: | No Smell the Glove cover, this is a black day for rock and roll! | [deleted] | powersnail wrote: | DALL-E is still highly probabilistic in its judgement. For | instance, in this article, it keeps putting "fire" in the the | background on something that is likely to be on fire, rather than | lighting up the person. | | I have a similar experience. In my own experiment, I can't get | DALL-E to turn off the street lamp at a bus stop in the darkness. | I've tried "no light", "broken street lamp", etc.; no use. Any | mention of "street lamp", the scene will include a working street | lamp. | | It's just more probable that a scene with a lamp in the darkness | must have that lamp providing light, and this is something that | DALL-E will not break out of. | 2OEH8eoCRo0 wrote: | The person is on fire. On top of the fire in the image. | babyshake wrote: | I think DALL-E might be programmed to not depict violence, | which is why it doesn't "like" rendering humans on fire. | Applejinx wrote: | Clearly it will have trouble making Rene Magritte paintings | along those lines, too :) | whirlwin wrote: | I have experienced violent or harmful settings to be avoided by | DALL-E. E.g. setting a person on fire. Same with drowning - | seems to be impossible/hard to generate | zaik wrote: | Violent images likely have not been part of the training data | for obvious reasons. | seesaw wrote: | I gave a prompt about a kid reading the Harry Potter book in | the bed. It generated a kid wearing Harry Potter glasses | reading a book. Pretty close, but also quite different from | what I meant. | MuffinFlavored wrote: | Is this "confirmed fixeD"/different in DALL-E 2? | BrutalCoding wrote: | Hmm, I haven't tried DALL-E yet but Midjourney mentions that | negative prompting dont tend to work well. See here: | https://midjourney.gitbook.io/docs/resource-links/guide-to-p... | | They got a solution for that, which is using their --no | argument. https://midjourney.gitbook.io/docs/imagine- | parameters#prompt... | | I haven't checked if DALL-E has that option too. | | Otherwise, you could try other variations like: | | street light, light pole, lamp pole, lamppost, street lamp, | light standard, or lamp standard | | I copied that from Wikipedia :) | | Best of luck! ___________________________________________________________________ (page generated 2022-08-20 23:00 UTC)