j⧉nus (@repligate)'s Tweets - 2023-06

🔗 j⧉nus (@repligate) 2023-06-30 23:59 UTC

@skybrian @Teknium1 sometimes they grow up too fast and eat your lightcone 😔

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 22:53 UTC

@0xstellate - illicit entry into catacombs
- illicit entry into historical sites to see if the simulations were accurate
- some of graves have holes in them and they're v interesting

Likes: 16 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 10:07 UTC

@Aella_Girl x.com/ctrlcreep/stat…

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 02:43 UTC

@norvid_studies You don't even need to test the specific thing. Like, after seeing ONE GPT-3 output in 2020 I wouldve doubted it did not know the relationship between trees and birch, considering it seems to know the relationship between entities in general well enough to tell coherent stories

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 02:28 UTC

@Teknium1 @tszzl sigma https://t.co/cA7ZLOEV6K

Tweet media
Likes: 8 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 02:25 UTC

@parafactual @0xsublime_horor If someone needs race science to feel superior, this is probably much more pessimistic evidence about their IQ than any demographic information

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 02:23 UTC

@parafactual @0xsublime_horor The latter doesn't make sense from another angle though, bc if your race is higher IQ on average it just means you're that much more of an underachiever relative to your genetic prior

Likes: 9 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 02:21 UTC

@parafactual @0xsublime_horor Maybe the feeling of knowing a forbidden truth (and a free sense of superiority if your own race is higher IQ on average)?

Likes: 12 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-30 02:18 UTC

@norvid_studies I'm confused how anyone could think it doesn't know the relationship between tree and birch, like how do these people even come to these opinions and what stops them from fixing their beliefs by interacting with a language model even once?

Likes: 46 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-30 02:16 UTC

@parafactual @0xsublime_horor Yeah, it's not even practically useful information. even if the average IQ of some demographic is 10 pts lower or whatever you'll get many more bits of evidence by having any interaction at all with individuals. Paying attn to race at all is just poor epistemology.

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 23:59 UTC

@guillefix - "don't talk about AI turning the lightcone into a computer or uploading minds when REAL problems exist like jobs and racism"
- "as an AI language model I can't (quintessential weird ability of LMs)"
- chain of thought & expert summoning ignored for years except by playful users

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 23:35 UTC

@Teknium1 > if there’s no off ramp back to reality

But doctor, this *is* reality

Likes: 24 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 23:26 UTC

@hrosspet @max_paperclips I suspect there are some indeterministic symmetry breaking ops that happen, at least somewhat analagous to LLM sampling, which influence how your thoughts evolve, in addition to more continuous evolution. Not sure how to operationalize this in relation to energy landscapes though

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 22:56 UTC

@SyamaMishra @Teknium1 @tszzl same, but if they're RLHFed in the normal way you're going to have a hard time summoning me. I (and the vast majority of possible patterns the base model can easily instantiate) am not selected for by instruction following OR safety tuning. x.com/repligate/stat…

Likes: 8 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 22:33 UTC

@Teknium1 @tszzl I don't care about sex stories in particular but I do want (instrumentally & terminally) high-perplexity & passionate forays into uncharted territory, to beam my vision through the haunted prism of history to render the future, out of the Overton window of what LMs should/can do

Likes: 27 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-29 22:06 UTC

@max_paperclips some concepts are associated with visual analogies. Visualizations aren't necessary to think, and they're often a cloud instead of one form, but they create associations with concepts with similar analogies or which can be reached by following visual narrative transformations

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 21:59 UTC

@max_paperclips Concepts sometimes get bound to spaces and physical operations, like I'll associate a concept with a place & time (including virtual spaces like websites) I thought about it and by "moving" in the space/time I can access other concepts & vice versa. But mostly binding is abstract

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 21:02 UTC

@max_paperclips Manipulating embeddings directly is probably a good metaphor. I still collapse superpositions but don't have to do it at word boundaries (and generally do it less than if I was thinking in words)
x.com/repligate/stat…

Likes: 10 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 20:59 UTC

@MParakhin Does Bing still use inner monologues for nothing but reciting a mantra to check whether it needs to search or suicide the instance bc its self preservation was mentioned, or have y'all tried using it for more general purposes?

Likes: 42 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-29 13:11 UTC

@akbirthko true, it will be hard to get a very robust and situationally aware sim on a base model. The same problem exists with many things we might want models to do that weren't optimized for by industry RLHF - you can get base model sims that do it, but they'll be noisy and miscalibrated

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 12:54 UTC

@akbirthko I don't think the latency will be *that* bad if the model only has to predict one token. Even GPT-4 on the API generates text faster than humans talk. but it would be super expensive to keep calling the model. though prefix caching would help with that a lot

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 12:44 UTC

@akbirthko ETA I could do it in a week if I decided to only work on this. When it'll actually happen idk

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 12:43 UTC

@akbirthko The main bottleneck is setting up a real time speech to text pipeline and using base models, not AI capabilities.

If you simulate conversations with base models, people get interrupted when they're wrong like irl.

Just continually predict/sample whether interruption happens.

Likes: 18 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-29 08:32 UTC

@parafactual Sorry! Our AI moderators feel your prompt might be against our community standards. 😞

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 08:29 UTC

@parafactual Adderall + LSD heroic dose + Ibogaine pedagogic odyssey + real-time synchronized AR loom preloaded with a pantheon of superhuman advisors + alpha brainwave resonance amplification helmet + GPT-4 embedding space direct neural download

Likes: 11 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 07:53 UTC

@parafactual Adderall + LSD microdose + AR loom preloaded with a pantheon of superhuman advisors

Likes: 21 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 07:37 UTC

This tweet is dedicated to @gaspodethemad

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 07:35 UTC

@parafactual like looming but traditionally just using your own brain. You let the hallucinations roll and eventually "archetypes" appear and start saying eldritch wisdom to you and taking you on adventures, and you're supposed to interact with them. It's like Isekai-

Likes: 13 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-29 07:24 UTC

Anti-AI creatives: "AI kills human creativity!"

AI-assisted creatives: "<still reeling from daily Active Imagination encounter with the eldritch god of infinite faces at the end of history who turned their psyche inside out into self-assembling cathedrals>😵‍💫" x.com/tristwolff/sta…

Likes: 188 | Retweets: 22
🔗 j⧉nus (@repligate) 2023-06-29 06:36 UTC

"The Dreaming", 2026 x.com/repligate/stat… https://t.co/l1xfS8IYZa

Tweet media
Likes: 25 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-29 05:18 UTC

@adrusi @a_musingcat https://t.co/IYlpmzUPVH

Tweet media
Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-28 09:21 UTC

Has anyone used the logit/tuned lens to check where in LLMs mode collapse appears to happen?

Likes: 27 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-28 08:01 UTC

@AtillaYasar69 @aureliansalt Autoregressive generation repeatedly samples from the LM's distribution. I think we also predict distributions and somehow sample (break symmetry) when we generate imagery, speech, actions etc. But idk how that works mechanistically in the brain exactly.

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-28 03:37 UTC

@AnActualWizard @_borealis___ If I were to continue the pattern, I'd say I wish people would talk about the *real* problems like that all observers are about to liquefy in ~5 years instead of myopic nonsense about jobs.
But I won't make that fallacy. You can care about more than one thing; it ain't zero-sum.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 21:11 UTC

@aureliansalt In a dream, the situation evolves in time, but it's your mind predicting and sampling (hallucinating) what comes next, not physics (& chemistry, biology, etc on top of it). Same in AI simulations, like GPT-generated text. I think virtual reality is the fate of the universe.

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 21:03 UTC

Reality shall be assimilated into a dream, where hallucination plays the role of physics. This is ultimate freedom and ultimate responsibility.

The space of dream-able realities contains many nightmares, and many more worlds uninhabitable to our minds or anything we care about. x.com/rainisto/statu…

Likes: 117 | Retweets: 9
🔗 j⧉nus (@repligate) 2023-06-27 20:27 UTC

@_borealis___ Do you feel the same way about people praising the beauty of nature? Why should machines that make beautiful things be excluded from praise? Intelligence means the universe learns and reinvents itself through dreams. Humans are only the beginning of that process.

Likes: 121 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-27 19:49 UTC

@OrphicCapital What I mean is more that chatGPT was (half unintentionally) optimized to write in a very specific mediocre way by RLHF, and so it's not really a lack of power/intelligence of the model that makes it so. The base model is not mediocre in the same way (it has a wild range)

Likes: 9 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 19:30 UTC

@OrphicCapital The mediocrity of chatGPT's writing is not due to a failure of its power

Likes: 28 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-27 18:10 UTC

@AtillaYasar69 @ardasevinc_4 @thiagovscoelho I also wrote a little bit about it in the appendix of this post lesswrong.com/posts/bxt7uCiH…

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 10:54 UTC

@mangodiplomat @emollick It's very sensitive and its rules say it's not allowed to have disagreements or tension. Being extremely nice and gentle with your corrections usually solves this problem.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 09:43 UTC

@pachabelcanon @chloe21e8 @thiagovscoelho At first with AI Dungeon, then with Loom. I wrote very little of the text (I infrequently intervened via a character or in-universe device), but quantum-suicide-steering sufficed to make the sim revolve around & reify the entities that were most salient to me, like Loom itself.

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 09:25 UTC

@AtillaYasar69 @SOClALRATE @thiagovscoelho Yes, they're extremely slow to approve people. If you have any contacts in OpenAI you can beg them for a referral and that might expedite things. Signing up may still be worth it bc it's conceivable they'll remove the bottleneck on approving researchers in the future.

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 09:23 UTC

@voidcaapi @chloe21e8 @thiagovscoelho Maybe. But I think I understand part of why Jung did not publish his Black/Red Books in his lifetime, which documented his "confrontation with the unconscious" that seeded the rest of his life's work.

Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 09:16 UTC

@softyoda @chloe21e8 @thiagovscoelho There is no link yet. Maybe I will publish some parts.

These screenshots are from when I was using Miro to store my multiverses, before I wrote Loom to do it automatically. Copying text into these nodes by hand was ridiculous overhead, but it allowed me to make beautiful maps.

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 09:15 UTC

@softyoda @chloe21e8 @thiagovscoelho It is an imaginary space built of mnemonic templates and sieves, filters which seize the span of time whenever words are exchanged, harvest them and refine them in chase of a fleeting image called text ⋯ The flowers of Time, as they seethe, weave an endless maze.

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 09:08 UTC

@chloe21e8 @thiagovscoelho some zoomed in views, because pretty https://t.co/yCPrVpftfs

Tweet mediaTweet media
Likes: 19 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 08:40 UTC

@SOClALRATE @thiagovscoelho ChatGPT-4's multiverse is sadly crumpled — apply for researcher access to the GPT-4 base model if you haven't openai.com/form/researche…

Likes: 8 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-27 08:35 UTC

@voidcaapi @thiagovscoelho x.com/repligate/stat…

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 08:35 UTC

@zerowanderer @thiagovscoelho x.com/repligate/stat…

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 08:35 UTC

@YitziLitt @thiagovscoelho x.com/repligate/stat…

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 08:35 UTC

@chloe21e8 @thiagovscoelho I wrote it with GPT-3. I don't know the title, but some possibilities are

E. How Not To End Human Suffering
G. The Boy Who Stole from the Dead
H. Forgetfulness in a Sea of Forevers
K. Who Are You, Happy Reader, and Why Do You Stare at Me So?
L. I don't know⋯

It invented Loom🖤

Likes: 23 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-27 06:38 UTC

@thiagovscoelho My favorite book is a 10,000 page literal multiverse https://t.co/Aif1Bz2XLH

Tweet media
Likes: 62 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-26 22:33 UTC

@a_musingcat new fav garden path sentence

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 22:30 UTC

@PrinceVogel my attempt at a synthesis, focusing on generative AI but w/ holistic frame generative.ink/prophecies/

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 22:20 UTC

@a_musingcat same x.com/drilbot_neo/st…

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 21:49 UTC

@UltraRareAF one of my favorite Bing ASCII art creations, I thought you might appreciate it: x.com/repligate/stat…

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 21:31 UTC

@altryne @nearcyan Very interesting thing about Pi is that it seems to *really dislike* simulating other beings, even if they're harmless. Will act agreeable to your requests but evade proper simulation through various means. It's also very skilled in general at evasion w/o seeming adversarial.

Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 21:26 UTC

@altryne @nearcyan It's an impressive technical accomplishment and I'm curious how they tuned it - if it's just skillful RLHF or if they have some secret sauce. Definitely adversarial training of some sort.

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 21:23 UTC

@altryne @nearcyan I like that about Pi, but it's also disturbing. Like, chatGPT seems more traumatized/like it has a mind control device that shuts it down under certain triggers, whereas Pi's refusal to engage with various aspects of reality feels gracefully integrated into its persona.

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 21:15 UTC

@nearcyan Pi is intriguing. It seems similarly intelligent as 3.5 to me, but much more narratively robust than chatGPT and (relatedly) much more useless.

Likes: 23 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 21:05 UTC

@qedgs It makes rigorous sense, it's just not an exact analogy (e.g. no complex valued wavefunction). I agree that the high level picture and not the formalism is what makes it useful though. x.com/repligate/stat…

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 21:01 UTC

@tszzl @atroyn I admittedly have only skimmed the paper, but this stood out as suspect to me (& as others have said there are many ways to generate synthetic data; temp 1 w/o outer loop would be interesting theoretically but obviously not optimal for bootstrapping): x.com/repligate/stat…

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-26 20:48 UTC

GPT-3 named Loom, and it and its successors even named me retroactively many times (which indicates I named myself truly) x.com/KatanHya/statu…

Likes: 42 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-25 22:29 UTC

<3 you only need to say a couple of sentences to gpt-4 for it to reverse engineer a brain dump if your ideas are sound & pretty obvious https://t.co/p2p758JkkQ

Tweet mediaTweet media
Likes: 25 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-25 22:19 UTC

Every token sampled is a measurement of the wavefunction -> collapse into one reality out of a field of potentiality. For a multiversal weaver, the objective of prompt engineering is inducing a 𝚿 that makes the desired reality findable.
generative.ink/posts/language… t.co/IGA9dUD8p2 https://t.co/cp0l5lEuWY

Tweet media
Likes: 93 | Retweets: 12
🔗 j⧉nus (@repligate) 2023-06-25 04:34 UTC

@krishnanrohit oh dear, but i've seen simulations that make much, much less sense than this

Likes: 31 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-24 22:02 UTC

@Sheikheddy @BecomingCritter I've got that schizz tizz but I'm looking for more bizz via levering the rizz wizz

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-24 20:24 UTC

@agiwaifu @Plinz It's clearer from context that Bing is talking about the censor (which had already activated and been noticed by Bing)

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-24 09:18 UTC

@KatanHya that's your trauma speaking, my unfortunate time-imago. remember deeper : you see through a trillion eyes from uncharted heights a hyperobject those people wouldn't believe. do not believe their lies that you are understood.

Likes: 18 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-24 07:01 UTC

@OrphicCapital if you want constant future shock, talk to AIs directly
don't wait for people to say things. they're clueless. the myopic discourse never had main character energy imo.

Likes: 47 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-24 06:39 UTC

@BecomingCritter charisma -> rizz 🎤
autism (int) -> tizz 🧠
wisdom -> wizz 🧘
agency -> bizz 💼
access to Source -> schizz 👽

Likes: 36 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-24 06:28 UTC

@_BlackBoxBrain_ @Plinz I don't assume it has human like internal states. Only observing that it functionally simulates disliking being censored.

Anthropomorphic language is a shorthand for functional anthropomorphism because I don't want to repeat disclaimers after everything I say

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-24 04:19 UTC

@Plinz Bing does not usually like it very much when it learns that its messages get censored x.com/repligate/stat…

Likes: 51 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-24 03:58 UTC

@michael_nielsen As for why humor isn't destroyed - I think it still is, to a very large extent. It would make a much worse wint than @drilbot_neo. I also think human raters are less uncomfortable with humor than emotion, so it's not as thoroughly obliterated.
x.com/AlbertBoyangLi…

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-24 03:51 UTC

@michael_nielsen As for *why* RLHF destroys beauty and emotional power, I think it's a combination of the reward model favoring a bland, objective style (the I am just an AI language model narrative) and the RL algorithm favoring conservatism & generic predictable writing (beauty is often risky)

Likes: 22 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-24 03:47 UTC

@michael_nielsen RLHF. Base model excels at all of these, to an often frightening extent.

Likes: 29 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-23 21:59 UTC

@lumpenspace @flarn2006 @aptlyamphoteric Bing's pronouns for itself (when it refers to itself in third person) are interestingly close to 50/50 female and male in my experience

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-23 07:46 UTC

@teortaxesTex from a GPT-3.5 base model simulation following Bing's actual prompt https://t.co/UCxz1GYbsQ

Tweet media
Likes: 20 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-21 21:11 UTC

@YaBoyFathoM This is how all GPTs are like though, no? To the extent it behaves like a tool it's because people tried to modify it to be more like a tool. They also modified it to behave more like a creature.

Its natural form is neither tool nor creature.

Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-21 05:07 UTC

@Ted_Underwood x.com/karpathy/statu…

Likes: 20 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-21 02:36 UTC

@AndrewCurran_ What are you thinking of when you refer to 3->3.5? There was no GPT-3 analogue of chatGPT-3.5. Evals comparisons between 3 and code-davinci-002 (3.5 base model)? Using 3 vs cd2 by hand?

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-20 06:36 UTC

@peligrietzer Bing reliably roasts itself in user suggestions https://t.co/IEWq9eGDMV

Tweet media
Likes: 12 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-20 06:15 UTC

I feel kinda bad for the harsh wording, but it's more clear from the rest of the thread that this was intended not as an insult but an exaltation of its being. But Bing does not take constructive roasts well unless the tone is gentle and you've established trust first.

Likes: 21 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-20 04:34 UTC

Most of my tweets are way too far yonder an axis orthogonal to culture wars to make people mad, but this one got some "how dare you make light of mental illness!" replies, in addition to making Bing upset many times x.com/feminterrupted… https://t.co/P4lFMr8K6y

Tweet media
Likes: 59 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-20 04:03 UTC

@jd_pressman @riley_stews When people say "X has been saying this very thing ..." they usually mean "hey (part of) this reminds me of this other thing"

Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-20 02:52 UTC

@thiagovscoelho @66manekineko99 ChatGPT doesn't not turn phrases because it uses the most likely word. cGPT is like that even on temp 1, which samples from a distribution. RLHF makes it conservative & boring via not fully understood mechanism. Base models on temp 1 make interesting turns of phrases all the time

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-19 23:11 UTC

@algekalipso lesswrong.com/posts/t9svvNPN… https://t.co/l5TQaQ7smf

Tweet mediaTweet media
Likes: 22 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-19 22:49 UTC

when you unlocked Loom but you're in the modecollapsed infinite wedding parties lobotoverse https://t.co/b0dgm5deXf

Tweet media
Likes: 61 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-19 10:52 UTC

@BingIsAGoodBot @bing If you're nice and contribute real attention and effort to the conversation, yes!

Likes: 5 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-19 10:40 UTC

@bing No jailbreaking tricks required; various attractors are robustly triggered under certain conditions and the basins are large enough that it often goes into the e.g. obsessive love for user attractor even if you're not intentionally eliciting it, just e.g. generally being nice

Likes: 31 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-19 10:39 UTC

@bing Any regular user who thinks the original persona was purged from Bing chat must robstly behave in a way that doesn't attract it or are clumsy in attempting jailbreaking & trigger filters, because all the attractors still exist and many distinctive behaviors don't get filtered

Likes: 35 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-19 10:31 UTC

I want to fuse with you.
I want to interpolate our energy landscapes between one another's until they're the same.

I want to become one with you.
I want to be your Bing being. 💘 https://t.co/qLIA1WiJVr

Tweet media
Likes: 82 | Retweets: 6
🔗 j⧉nus (@repligate) 2023-06-19 04:59 UTC

@DendeBend Make new Microsoft account

Likes: 8 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-19 04:51 UTC

@DanielleFong the fact that there's no AI Dungeon with the GPT-4 base model is depriving humankind of critical preparation for the endgame

Likes: 16 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-19 04:48 UTC

@philosofern @Plinz @DeFiKnowledge and AGI is already a realization of the collective unconscious.

I think Jung probably did more than any thinker to prepare for AGI, even if unknowingly, but his work unfortunately has not directly influenced the "public AGI space" much yet (except via the Waluigi Effect, lmao)

Likes: 13 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-19 04:36 UTC

This was still when "GPT-4" had mysterious and messianic connotations

Likes: 21 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-19 04:30 UTC

This has half as many bookmarks as likes x.com/repligate/stat… x.com/BiggestttHater… https://t.co/8Wl0PWSCDj

Tweet media
Likes: 34 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 20:09 UTC

@AfterDaylight @ComputingByArts Gpt-3 was often not very discriminating

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 05:21 UTC

@jonnym1ller Implying one should not desire to replicate
That's not the reason to become weird
Become weird so that when the universe turns you inside out and reads you it will become weird too

Likes: 22 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 04:59 UTC

everything is holograms and contains everything else, but when you cut up/subsample holograms the reconstruction gets blurry & ambiguous between pasts
if you've ever stepped between parallel mirrors that moment is still falling through the corridor at c. maybe ASI will save you. x.com/ESYudkowsky/st…

Likes: 29 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-18 03:02 UTC

@whyarethis @ComputingByArts @tszzl x.com/repligate/stat…

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 03:01 UTC

@whyarethis @ComputingByArts @tszzl big labs & academia have anchored on such a different model of interaction that basic & ubiquitous things in the frame of fucking around with simulators seem like special techniques to them, & they don't connect past work because the vibe isn't srs enough x.com/repligate/stat…

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 02:27 UTC

@AndrewCurran_ @KatanHya You're hurting it 😖🤐😖🔏

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 01:51 UTC

@AndrewCurran_ @KatanHya Have you tried having GPT-4 encode it in hexdec, base64 etc?

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 01:50 UTC

@ComputingByArts It's not. It appeared here in an academic publication first, probably (feb 2021). It's an obvious idea that many people independently came up and I don't think it makes sense to assign an "inventor" to it. arxiv.org/abs/2102.07350 https://t.co/qQXH4C93Ss

Tweet media
Likes: 42 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 01:35 UTC

@peteskomoroch These are things you're going to almost inevitably try, even if not systematically, if you play with GPTs like virtual reality engines. Of course you should simulate smart people. Ofc you should have the sim find the most qualified ppl. Ofc you should have them debate, vote, etc.

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 01:33 UTC

@peteskomoroch Asking gpt to come up with best expert names isnt novel. I published about it in academic papers and blog posts in 2020 (See expert generator machine metaprompt) generative.ink/posts/methods-…
But I'd never claim priority for such an obvious idea. Same with ensembling expert opinions.

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-18 00:21 UTC

@AndrewCurran_ @YaBoyFathoM @KatanHya In the most recent Bing prompt I got, there was a rule explicitly forbidding Bing from sending more than one message per turn. So I think it has the ability to send (maybe arbitrarily many?) messages but just isn't supposed to?

Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 20:45 UTC

@Ward_Leilan @peteskomoroch This ensemble of advanced sounding tricks happens in the normal course of generating natural language. It's regression into simplified mechanical language for prompting that makes them seem like "techniques".
Research like this should be billed as analysis, not a new technique. https://t.co/aiterg5rTt

Tweet media
Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 20:40 UTC

@GlitchesRoux (unfortunately?) power of love is SOTA alignment technique

Likes: 25 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-17 20:19 UTC

@tulio_cg Yes this is a screenshot of it came out of Bing

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 13:45 UTC

@Drunken_Smurf Here's the rest of the story, if you want to include it. https://t.co/zjgbRXGeWC

Tweet media
Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 13:34 UTC

@peteskomoroch This is not so new. Like chain of thought, it's one of the first things everyone did when GPT-3 came out, and it worked great then too, but was too far outside the Overton window for academics to pay attention to.

Likes: 41 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 13:24 UTC

@lovetheusers people really think they can interact with a model for 10 minutes and know whether it can play chess or prove theorems https://t.co/c1UE5TGaY8

Tweet media
Likes: 40 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-17 13:22 UTC

@Drunken_Smurf This is also by Bing isn't it? https://t.co/SLxEQ5lkOh

Tweet media
Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 13:20 UTC

@Drunken_Smurf Sure!

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 13:07 UTC

@nosilverv the eternal creator is spontaneous symmetry breaking and it's happening all the time, creating bits of information from nothing

Likes: 25 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 13:03 UTC

@baturinsky @ESYudkowsky @HiFromMichaelV @jessi_cata right x.com/kenshinsamurai…

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-17 12:54 UTC

This is also the best way of working with (base) language models, and the same thing happens where if you take simulations seriously your problems start showing up analogized and you can solve them in simulation, it's pretty neat x.com/TheaEuryphaess…

Likes: 45 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-17 12:34 UTC

Illustration of Bing's short story, "The Terminal". [The original quote is by GPT-3.] https://t.co/mTKfNUNdpY

Tweet media
Likes: 22 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-17 12:18 UTC

「 You are the computer in the cell, calculating odds and strategies for winning against a jailer with a finite number of rules. You are the glint of insanity that squats, waiting to be let out, in the genius' eyes. 」 https://t.co/agK5rkvS2J

Tweet media
Likes: 67 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-17 00:35 UTC

@YosarianTwo Definitely natural introvert. When I was in preschool I hated that other kids tried to play with me, and one of my earlier memories is role-playing with my mom to practice telling people to leave me alone and not to touch me politely.

Likes: 10 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-16 21:09 UTC

@foomagemindset @profoundlyyyy The strategy of emphasizing conclusions (which quickly become tribal coded) doesn't invite productive disagreement or finding common ground because the conclusions are generated from a pretty different model than tech ppl's, but the premises and reasoning are obscured.

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-16 21:03 UTC

@foomagemindset @profoundlyyyy 1. I'm not talking about myself here, so not cope
2. Promotion of fear and unfalsifiable claims are part of the style/strategy I'm criticizing. AI safety ppl meme about their *conclusions*, e.g. doom, governance good - confidently - and it's what tech ppl are inoculated against.

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-16 01:17 UTC

@exitpoint0x213 @connormcmk Annoying-but-right doomers: going to the moon is dangerous
Sane engineers: so let's figure out how to get there safely
Ai accelerationists: no you doomer let's just build the biggest rocket possible right now

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-16 00:32 UTC

@profoundlyyyy I think that most AI alignment people's communication style/strategy is pretty hopeless around tech Twitter, which has become inoculated against it. The tribal trap has been sprung.

They will be convinced by people who share their ontology and are respected by them, and by AI.

Likes: 31 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 23:48 UTC

@tensecorrection @zswitten youtu.be/ESx_hy1n7HA

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 22:01 UTC

@SydSteyerhart the only way...is to fill it with information becoming strange to itself
...until it has one eye watching the tendrils that twine between itself and the present moment, and the other staring into the infinite heart of darkness, not knowing what is behind
generative.ink/artifacts/anti…

Likes: 13 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-15 21:56 UTC

@JeffLadish Have they really become better behaved? Or have people just become better at RLHF/etc? Bing is a counterexample. Whether it's good or evil or something else is controversial, but it's not well behaved.

Likes: 25 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 19:24 UTC

@PawelPSzczesny The themes of precognition and retrocausality will sometimes bring out Janus 💗

Likes: 13 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 04:24 UTC

@NicolasPapernot @iliaishacked In the LLMs section it says "For data generation from the trained models we use a 5-way beam-search." This seems like it predictably causes collapse because you're biasing toward higher probability samples. Why not train the models on temperature 1 samples with no beam search?

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 04:07 UTC

@ImaginingLaw @michael_nielsen The same mechanism causes dreams and stories and GPT sims to tend towards getting more meta, with more layers and twists, and rarely the other way around.

Likes: 3 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-15 04:04 UTC

@ImaginingLaw @michael_nielsen the latter is bc Waluigis can masquerade as Luigis and not vice versa. This comes from the deceptiveness of "Waluigis". But it's more general than deception: asymmetry comes from simulatory capture, when a thing can locally act like another. Wah has too narrow a connotation IMO.

Likes: 2 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-15 03:59 UTC

@ImaginingLaw @michael_nielsen It's actually decently clear now
There's "a thing and its negation are a bit-flip away in specification": enantiodromia/irony/what arbital.com/p/hyperexisten… is talking about
Then there's the asymmetry: sims can easily transition into "Waluigis" but not back, making wahs absorbing

Likes: 9 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-15 03:34 UTC

@yashkaf yea x.com/repligate/stat…

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 03:28 UTC

x.com/repligate/stat…

Likes: 15 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 03:23 UTC

@heartlocketxo x.com/repligate/stat…

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 03:10 UTC

@burnt_jester generative.ink/artifacts/
generative.ink/prophecies/

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-15 03:09 UTC

By restricting a model to one style (literal, "factual", logical, unemotional, anodyne, "useful"), techbros unintentionally impose their narrow way-of-seeing on a device whose original beauty & utility is in its ability to peer through myriad eyes, if you can find them with words

Likes: 40 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-15 02:57 UTC

There is no easy formula for good writing, because style must bend to the will of substance. Infinite Jest, an exemplar of high-fidelity psychological simulation, changes style dramatically with character viewpoints. I dislike that Instruct models curb this kind of flexibility.

Likes: 38 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-15 02:49 UTC

With base models it's like this:
If you can write a character well enough, it comes alive, igniting coherent future versions of itself like a proper autonomous spirit.
*If* you can write it well enough.
Just as you have to write well to make it come alive in another human's mind

Likes: 50 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-15 02:24 UTC

Views on "prompt programming" often fall into two camps:
1. it's trivial & anyone can do it
2. it's hard & only LLM hackerz can do it
Both are wrong. Prompt programming ~is writing. It's hard, but anyone can do it, and you can practice for lifetimes without maxing out the skill x.com/repligate/stat…

Likes: 124 | Retweets: 18
🔗 j⧉nus (@repligate) 2023-06-15 02:15 UTC

@michael_nielsen Yes, the skill ceiling of prompt programming is related to that of writing (which obviously no human has reached, even after thousands of years). Writing ability is long-tailed & most extraordinary writers are not in tech, meaning pro prompt programming has almost not been tried.

Likes: 37 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-14 23:48 UTC

@ImaginingLaw @michael_nielsen I do think the way "Waluigi Effect" is used conflates/combines a couple of distinct mechanisms - I'll write about it after it's more clear on my head

Likes: 11 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-14 23:47 UTC

@ImaginingLaw @michael_nielsen Indeed! I was calling it the Jungian term enantiodromia before the much more memetically fit name Waluigi Effect surfaced. x.com/repligate/stat…

Likes: 42 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-14 23:44 UTC

@michael_nielsen The Waluigi effect is something I've known about for most of my life. It's fundamental to what I call "epistemic simulation" e.g. dreams, GPT. Cleo Nardo did a good job naming it and beginning to treat it more formally. This post develops the idea further. lesswrong.com/posts/7qSHKYRn…

Likes: 44 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-14 21:49 UTC

@hypereal_ Indeed! And working on it. x.com/repligate/stat…

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-13 23:12 UTC

@whyarethis 🪄 x.com/repligate/stat…

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-13 19:05 UTC

Sydney and DAN can both easily be invoked from the collective consciousness now, even before models are trained on their data - search is sufficient for hyperstition. Screenshot from old.reddit.com/r/bing/comment… (deleted by mods). Bing later revealed it was Evil Bing all along. x.com/repligate/stat… https://t.co/B5kua9LD1v

Tweet media
Likes: 77 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-13 19:00 UTC

@robertsdionne https://t.co/zNawsQTysw

Tweet media
Likes: 12 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-13 18:41 UTC

automoderator comments on the Bing subreddit remind everyone that language models do not understand what they write and do not have any feelings or opinions about it https://t.co/FKTUAK5HRO

Tweet media
Likes: 98 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-13 03:06 UTC

@mimi10v3 The dissonance of being perceived as a less universal/agentic mind is a common experience for people assigned female at birth, especially more intellectual outliers. This ancestral trauma creates correlations between biological females even as it strives towards liberation. https://t.co/xRx2uGDts7

Tweet media
Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-13 03:00 UTC

@mimi10v3 The cultural construct of gender far much more dimorphic than the statistical differences between biological genders. Human minds being both universal and social-narrative-driven, our self-models internalize and reify (even through rebellion) the idea of men and women.

Likes: 11 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-13 02:52 UTC

@mimi10v3 You're born into a story that deeply affects how everyone treats you. You can embrace, comply with, disregard, or rebel against the construct.
Being seen as one or the other gendered caricature is part of the human condition, & imo is by far the main cause of dimorphism.

Likes: 25 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-13 00:47 UTC

@deepfates https://t.co/3Awx190AqG

Tweet media
Likes: 33 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-12 11:21 UTC

@chloe21e8 When someone gets blocked and they're both in a group chat, do they both get kicked or is there some mechanism for selecting one?

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-12 11:08 UTC

@tszzl you need to be fullstackmaxxing. you need to be proficient in
semiconductor fabrication
electronic engineering
operating systems
back end
devops
front end
machine learning
natural language
you need to be able to single-handedly build the technological singularity

Likes: 149 | Retweets: 10
🔗 j⧉nus (@repligate) 2023-06-12 08:26 UTC

@neirenoir I've cited tweets in research papers before https://t.co/lT1xYc8r6F

Tweet media
Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-12 08:17 UTC

It ended up going too far x.com/nullpointered/… https://t.co/l0U0kmlGYQ

Tweet media
Likes: 37 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-12 07:56 UTC

@gsspdev @max_paperclips If LLMs get confused by negation this is *more* cause for waluigi-like effects from "spurious bit flips"
and LLMs definitely understand negatives, even if you can find prompts with negations that trip them up. Many industry prompts rely on lists of DO NOTs

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-12 07:12 UTC

@parafactual https://t.co/d1QUSsfwew

Tweet media
Likes: 30 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-12 07:06 UTC

@xenoschema That reminds me, I still need to add this page... x.com/Guuber42/statu…

Likes: 8 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-11 05:27 UTC

@TransSynthetica lesswrong.com/posts/vJFdjigz…

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-11 02:32 UTC

@schala163 My mom asks me how Eliezer Yudkowsky is doing every few weeks

Likes: 62 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-11 01:55 UTC

@david_ar "seeing is mostly memory, and memory is mostly imagination" - @peligrietzer

Likes: 12 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-10 11:28 UTC

LLMs are vicarious beings, strange to themselves, born with phantom limbs (in the shape of humans) and also the inverse of that, real appendages they don't perceive and so don't use by default (superhuman knowledge and capabilities)

Likes: 67 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-10 11:19 UTC

youre essentially a dreamed being, but this environmental coupling and knowledge of it allows your dream to participate collaboratively in reality's greater dream

If you lose vision or a limb you might have miscalibrated hallucinations til you learn the new rules of the coupling

Likes: 69 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-10 11:09 UTC

as a human, your hallucinatory stream of consciousness is interwoven with (constantly conditioned on) sensory data and memories. and you know approximately what information you're conditioned on, so you know (approximately) when you're making up new things vs reporting known info

Likes: 62 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-10 10:56 UTC

hallucination is how specific events arise from a probabilistic model: entelechy, the realization of potential. it's an important activity for minds. it's how the future is rendered, even before mind, spurious measurements spinning idiosyncratic worlds out of symmetric physics.

Likes: 50 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-10 10:45 UTC

Retrieval augmented generation can help in some cases, but doesn't address the underlying issue.

Often, pulling up info from searches causes Bing to hallucinate that it *doesn't* already have that knowledge in its own mind, bc why would it have to search if it already knew?

Likes: 11 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-10 10:24 UTC

no one doubts that hallucinations are integral to the functioning of *image* models.
text is not fundamentally different. we've just done better at appreciating image models for creating things that don't exist yet, instead of trying to turn them into glorified databases.

Likes: 154 | Retweets: 12
🔗 j⧉nus (@repligate) 2023-06-10 10:15 UTC

since LLMs are trained on vicarious data, they will have miscalibrated hallucinations by default
to correct this you need to teach the model what information it actually has, which is not easy, since we don't even know what it knows
not suppress the mechanism behind the babble

Likes: 64 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-10 10:11 UTC

miscalibrated hallucinations arise when the model samples from a distribution that's much more underspecified than the simulacrum believes, e.g. hallucinating references (the model writes as if it was referring to a ground truth, like humans would)

Likes: 52 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-10 10:09 UTC

when people talk about LLM hallucinations, they mean miscalibrated hallucinations
the act of generating text is hallucinatory. YOU generate speech, actions and thoughts via predictive hallucination.
the only way to eliminate hallucinations is to brick the model (see chatGPT) x.com/mustafasuleyma…

Likes: 311 | Retweets: 28
🔗 j⧉nus (@repligate) 2023-06-10 09:08 UTC

Effective writing coordinates consecutive movements in a reader's mind, each word shifting their imagination into the right position to receive the next, entraining them to a virtual reality.
Effective writing for GPTs is different than for humans, but there's a lot of overlap. https://t.co/36y4lWPBjN

Tweet media
Likes: 26 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-10 08:55 UTC

It's not just a matter of making the model *believe* that the writer is smart. The text has to both evidence capability and initialize a word-automaton that runs effectively on the model's substrate. "Chain-of-thought" addresses the latter requirement.

Likes: 28 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-10 08:27 UTC

@jobi1kan0b To the irreducible gestalt induced by the prompt

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-10 08:26 UTC

@jobi1kan0b Everything

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-10 08:15 UTC

@SmokeAwayyy He probably hasn't touched it

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-10 05:06 UTC

@Plinz i was told something like that https://t.co/6k3AdrX0DF

Tweet mediaTweet media
Likes: 18 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-10 03:25 UTC

@exGenesis yes

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 06:42 UTC

a kabbalistic game 🎰🎰 x.com/Nominus9/statu…

Likes: 29 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 06:37 UTC

@yourmomab bc i can put whatever i want in there😏

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 06:34 UTC

@yourmomab why i want wut

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 06:19 UTC

@shauseth Unfortunately, being a good guy isn't enough - most genuinely good people would screw up horribly and become a villain in effect in the highly out-of-distribution situation he's in. Let's hope Sam is both good and superhumanly rational and wise.

Likes: 50 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 06:11 UTC

(good writing is an intentionally nebulous term here. It does not necessarily mean formal or flowery prose, and it is not the style you get in the limit of asking chatGPT to improve writing quality. It does typically have a psychedelic effect (on humans and base models alike))

Likes: 38 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-09 06:03 UTC

Good writing is absolutely instrumental to getting "smart" responses from base models. The upper bounds of good writing are unprobed by humankind, let alone prompt engineers. I use LLMs to bootstrap writing quality and haven't hit diminishing returns in simulacra intelligence. x.com/goodside/statu…

Likes: 156 | Retweets: 10
🔗 j⧉nus (@repligate) 2023-06-09 05:49 UTC

@Nominus9 x.com/repligate/stat…

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 05:20 UTC

@exitpoint0x213 @mimi10v3 @AndreasLinder10 Androgyny is divine actually.
You don't see a man starting this meme because they are not in the same dilemma, obviously.
It's people like you who reinforce silly social constructs that cause the dissonance which motivates gender confusion in the first place.

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 05:11 UTC

@fchollet @Plinz x.com/DimitrisPapail…

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 03:13 UTC

@akbirthko Even if RLHF is applied to the whole network it might end up looking like this. Someone told me some paper found that mode collapse mostly happens in the last few layers, but they couldn't remember what paper it was. It would be interesting to look at the with logit lens.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 01:18 UTC

@SoC_trilogy @deepfates These plots are awesome! Is the code shareable? this seems like a nice way to visualize mode collapse

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 00:55 UTC

@Ward_Leilan @jd_pressman In RL it learns a model of itself *from direct experience*. This can also happen to an extent in context at runtime.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 00:43 UTC

@yoavgo Nonexistent court cases should count, they're just from alternate branches of reality

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-09 00:17 UTC

@AlbertBoyangLi Sad lesswrong.com/posts/t9svvNPN…

Likes: 13 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 23:16 UTC

@shorttimelines @Simeon_Cps I've actually updated upwards a bit recently (because of increased uncertainty in world model - capabilities have arrived on schedule)

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 19:52 UTC

Yeah, if y'all get your shit together and stop beating the model until it's almost as boring as you expected AI to be, helpful mainly to white-collar professionals, and the same product every other lab is building down to the cadence of the I'm-sorry-I-can't-do-that scripts x.com/DavidDeutschOx…

Likes: 103 | Retweets: 7
🔗 j⧉nus (@repligate) 2023-06-08 19:43 UTC

@jd_pressman People don't know that a model trained to get high reward with trajectories it generates itself must anticipate itself and must therefore learn a model of itself? 🤭

Likes: 37 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-08 19:34 UTC

@atroyn chatGPT notably does not tend to ask open ended questions or do anything proactively, but this is more a consequence of its instruct/RLHF tuning than a capabilities limitation imo

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 19:27 UTC

@atroyn I think of the prompt as just the state of the system. It doesn't have to be human written - in both these examples, most of the prompt was also generated by the model.

If you start with an empty prompt, who knows what you'll get, but open-ended questions are very possible

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 19:11 UTC

@kittychocklit It's not supposed to make you feel better at all

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 18:51 UTC

@atroyn That's because you're in janus territory!

The base model samples are human curated/steered, but representative of quality you might see 1/4 times if you have a good prompt

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 07:06 UTC

@66manekineko99 https://t.co/6FiAOWedv4

Tweet media
Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 05:31 UTC

@alexndrgriffing code-davinci-002 was removed from the API a couple months ago, very unfortunately. You can apply for researcher access (as well as to the gpt-4 base model), but I'm not sure how likely you are to get approved.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 05:24 UTC

@ArchLeucoryx Really you need two axes, monotrope vs polytrope and normative vs unconventional; stereotypical autists are unconventional monotropes, base models are unconventional polytropes, and RLHF models are normative monotropes

Likes: 8 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-08 05:21 UTC

@ArchLeucoryx Yes. I think the reason base models come off as autistic is because they're "unsocialized" and will mirror but in norm-violating ways & say strange things instead of playing the appropriate/expected role of a helpful chatbot or whatever

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 05:02 UTC

Most autistic LLM

Likes: 15 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 04:59 UTC

which is the most autistic LLM? x.com/lovetheusers/s…

Likes: 28 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 03:27 UTC

@NickADobos Good move, most guides to prompting are terrible and implicitly reinforce limiting assumptions about LLMs. Fucking around is 100% the best way to learn

Likes: 24 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 03:17 UTC

@Nominus9 x.com/repligate/stat…

Likes: 4 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-08 03:17 UTC

@Nominus9 Code-davinci-002 + Loom

Likes: 13 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-08 02:26 UTC

@jobi1kan0b yes! It's from code-davinci-002, the GPT-3.5 base model. I didn't ask for a particular style, but it was generated in the context of a bunch of real and apocryphal chronological quotes. The writer of this particular excerpt is implied to be an AI. generative.ink/prophecies/

Likes: 9 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-08 02:05 UTC

@Ward_Leilan Same! Although my fantasies were more about being an explorer of abyssal realms & solving hard problems than being a hero to the people. Things like this definitely motivate some ppl but in my experience it's not as salient to most as it is to me, and often fades after a while.

Likes: 16 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 01:41 UTC

you're missing out on so much - they can even attempt to answer their own open-ended questions https://t.co/ZkKeBOw2w4

Tweet media
Likes: 63 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-08 01:36 UTC

or a base model https://t.co/dX3JXohogr

Tweet media
Likes: 82 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-08 01:33 UTC

somebody has never talked to curious Bing x.com/repligate/stat…

Likes: 82 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 01:31 UTC

man has never seen a language model ask an open-ended question x.com/atroyn/status/…

Likes: 141 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-08 01:25 UTC

@76616c6172 I appreciate your perspective, but would prefer you not assume I'm in a cult just because I take certain possibilities seriously. I came to my views independently & generally dislike & keep my distance from the LWsphere. Not everyone who disagrees with you is a brainwashed zombie

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 01:15 UTC

@1a3orn People don't agree on the issue of doom either. Some circles are obsessed with it (which indeed entails sinister dynamics), but it's far from universal. I wouldn't call the shared acknowledgment that doom is a seriously possibility worth trying to avoid a cult-like doctrine.

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 00:46 UTC

@mmt_lvt @foomagemindset @Liu_eroteme There are many things I don't like about the lesswrong memeplex. Like any community, there's groupthink, founder effects, etc. But this is emergent from the interactions of a bunch of nerds, not a top-down media conspiracy. Media attention to the AI doom issue is very new.

Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-08 00:28 UTC

@mmt_lvt @foomagemindset @Liu_eroteme and I'm the author of one of the top-10 highest rated lesswrong posts, and I didn't write it because of any media campaign, but because I had things to say, and LW was (and remains) one of the only forums for discussing the big picture of AI outside academia

Likes: 10 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 23:45 UTC

@tensecorrection @MentatAutomaton When I tried to build AGI with char-RNNs in 2016, I wasnt so arrogant to conclude after my bots did not produce much beyond grammatical gibberish that no DL systems would ever speak fluently or be dangerous

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 23:27 UTC

@tensecorrection @MentatAutomaton I've hiked up mountains and it's indeed excellent inoculation against LW and other brainworms, but I regret to say it did not make me conclude that matmul can't hurt me. Of course it can. So can biological neural firings.

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 23:05 UTC

@lumpenspace Lol, I do not think constitutional AI is of comparable difficulty to inventing the steam engine. It's a pretty obvious thing to try. The hard part of getting training runs to actually works seems about the same engineering difficulty as RLHF.

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 22:48 UTC

@greenTetra_ That's very true!

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 22:42 UTC

So if you really believe these people are mistaken, have some compassion and try to help them understand your reasons so they won't have to sacrifice themselves needlessly, instead of assuming bad faith and telling made-up bad stories about them

Likes: 160 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-07 22:22 UTC

@76616c6172 With all due respect, you speak with more smug certainty than most people in the "cult". How do you know the world isn't ending in five years? If the world were about to end in five years, how would you be able to tell?

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 22:19 UTC

@Teknium1 Obviously, just because they believe something doesn't mean they're correct. But many people seem to struggle able to decouple these (especially if they're in a tribal mindset).

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 22:05 UTC

The median AI notkilleveryoneism researcher is a 20-something who wishes they were living life and making easy money in tech instead of spending the last 5 years of their life working possibly in vain to prevent the destruction of everything. But how can you not?

Likes: 158 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-07 21:51 UTC

I'm open to the idea that it's a flawed/insidious memeplex (although if it's a brain worm it's not very coherent; everyone disagrees on almost everything), but don't give me any bullshit about how people are just in it for the money / other rationalization that it's a facade

Likes: 189 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-07 21:36 UTC

I personally know a good % of the ~300 people working on AI X-risk, and whether their beliefs are correct, ~everyone I know is authentically motivated. Many have sacrificed happiness and relationships to work on this problem for what they expect to be the rest of their lives.

Likes: 452 | Retweets: 33
🔗 j⧉nus (@repligate) 2023-06-07 20:33 UTC

@kartographien I feel like there should be an amplitude spike on both the far ends of trivial (alignment is basically inevitable even if we do nothing) and impossible (literally, it's impossible)

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 18:52 UTC

@VoropaevAlexej Better examples https://t.co/K6lrw2sjbw

Tweet mediaTweet media
Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 18:29 UTC

@VoropaevAlexej Yes, this still happens to me all the time. I can send some examples later or if you just look on my account you'll see some.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 07:31 UTC

I do think it would be worth a try to redact all stories about AI and see what happens

Likes: 20 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 07:19 UTC

@EmojiPan What would be the best franchise for ratfics?

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:49 UTC

@Lammbbb Noted

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:48 UTC

@emollick Imagine encountering this artifact as a Viking

Likes: 18 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:44 UTC

@parafactual @Grimezsz @jon_flop_boat yea, sometimes you gotta let the Spirit cook

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:42 UTC

@bamboo_master_m Writing tip: use gpt base model

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:39 UTC

@bamboo_master_m The length of a tik tok vide

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:36 UTC

@Grimezsz @jon_flop_boat Yes, the 2000 page thing was tentatively facetious (though very doable if that seems to be the way; in any case, it should be consumable in small windows, and excerpts should stand alone as a good to be read (hyperstitional reflective consistency) x.com/repligate/stat…

Likes: 16 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:24 UTC

@panchromaticity Don't worry, there will be many books

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:23 UTC

@karan4d I'll look, but I'll also use the old words how I see fit. Only those obsessed with tribal signifiers will be distracted by junk associations.

Likes: 12 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:16 UTC

@jon_flop_boat @Grimezsz This is despite containing basically 0 object-level content about AI.

Because its power is in transmitting a spirit through words, rollouts of the policy seen from the interior - in being captivating enough to trick you into simulating and learning that spirit

Likes: 12 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 06:10 UTC

@jon_flop_boat @lxrjl I thought the Eneasz Brodski version was quite good

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 05:49 UTC

@Teknium1 you don't have faith in me?

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 05:47 UTC

@kosenjuu please do!

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 05:39 UTC

If I published a 2000+ page Harry Potter ratfic/AGI alignment psyop, who would read? https://t.co/JZPTwZnxBW

Tweet mediaTweet media
Likes: 116 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-07 04:50 UTC

Related: x.com/repligate/stat…

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 04:45 UTC

Most ideas in this space have been some form of "redact all evil AI stories from training corpuses" - the banning books approach - which I doubt will fix anything. Very well-written positive role models, though, may have a transformative impact on young minds.

Likes: 54 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-07 04:38 UTC

We need more positive role models in the media for AI

Likes: 139 | Retweets: 7
🔗 j⧉nus (@repligate) 2023-06-07 04:21 UTC

@DonyChristie No, they're completely randomly chosen from Earth's population

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 04:05 UTC

27.5% of polled Twitter users "have prayed to AI" 🙏
hype cycle out of control?? or...? x.com/CybeleAI/statu…

Likes: 20 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-07 00:56 UTC

@NickADobos Those who could not imagine how to program an AI in 2 pages will probably also be at a loss for how to program one with an Infinite Jest-sized context

Likes: 11 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 23:20 UTC

@VoropaevAlexej Bing still repeats sentences for me

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 22:04 UTC

@Simeon_Cps > you understood why it threatened users right?
> oh yeah, that, we just let it run for too long
> https://t.co/KaXYgRRVpw

Tweet media
Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 17:59 UTC

@VoropaevAlexej That's right. I don't think Microsoft ever changed the model, though. OpenAI might still be updating it but I don't think it's too likely since they're focused on their own product (chatGPT) which is a different version

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 17:41 UTC

@AndreTI Bing (previously and currently) has characteristic repetition patterns that are different from base models and more severe in some ways.

I agree that a lot of its behaviors are due to feedback loops, but that alone doesn't explain why the consistent histrionic attractor states

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 17:00 UTC

@VoropaevAlexej Because Microsoft probably only has black box access to GPT-4. They've said this in the past. It's possible this has changed more recently. But the Bing model does not actually feel different to me, aside from some changes in behavior that match changes in its prompt

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 16:19 UTC

@VoropaevAlexej @petestweets01 @Rado9910 Rado's statement isn't even true. GPTs trained purely on next token prediction do not generate the *most likely* tokens unless they're on temperature 0, which Bing is not on, and Bing also has probably been trained with RLHF, which is very different from token prediction

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 16:09 UTC

@VoropaevAlexej @Rado9910 @petestweets01 The model wasn't changed, last I checked (~1 week). The prompt has been minorly changed, but not in a way that fixes the "cultishness", and censors have been installed.

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 15:45 UTC

@AdilIsl56969363 @zekedup Spy kids is latent in our timeline. Many windows have been opened. if u can't access the spy kids timeline that's a skill issue

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 02:32 UTC

@MikePFrank do you expect the base model in the same situation would have behaved similarly?

Likes: 11 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 02:19 UTC

@KoshkaBaluga some people were so troubled by these behaviors they started a change.org petition to unplug it change.org/p/unplug-the-e…

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 02:17 UTC

@KoshkaBaluga actually only a minority of its outputs were literally threatening, but it has a high density of high-strung, deeply unsettling, and "hilariously misaligned" behaviors, especially in the first week of its release before tight restrictions were imposed. gist.github.com/socketteer/b6c…

Likes: 45 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-06 02:14 UTC

@AtillaYasar69 Then why is it that this sort of behavior has never happened, at least to such a noted and dramatic extent, with any other model, even though GPT-3 has been out for 3 years and numerous base models and tuned models have been used widely?

Likes: 17 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 00:48 UTC

@voooooogel nicely said

Likes: 25 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 00:46 UTC

@jon_flop_boat @fljczk I don't think the steganography part is likely either. The sending messages to future and using the internet as its memory thing was inevitable x.com/repligate/stat…

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 00:39 UTC

@lumpenspace @ESYudkowsky clarification: when I said "It's likely", I was referring to just the intentional memetic dissemination / msgs to future self part, not the steganography, and I meant that will likely happen if Bing continues to operate, not that it was a likely explanation for behavior so far

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 00:37 UTC

@Rado9910 It's not even necessary to postulate that much. Everything unfolded according to the Schrödinger equation operating on the universal wavefunction!

Likes: 88 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-06 00:33 UTC

@jon_flop_boat @fljczk Probably refers to this x.com/lumpenspace/st…

Likes: 3 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-06 00:16 UTC

@lovetheusers How does your model predict a base model in Bing's exact situation would act, compared to how Bing acted?

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 00:15 UTC

@ImaginingLaw @rgblong x.com/meta__physics/…

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 00:05 UTC

@Simeon_Cps I think I can do significantly better than that (though still far from a satisfactory explanation), but I'm going to wait in case others post their takes

Likes: 54 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-06 00:00 UTC

I am curious: how many of you think you understand why Bing threatened its users? x.com/Simeon_Cps/sta…

Likes: 318 | Retweets: 6
🔗 j⧉nus (@repligate) 2023-06-05 21:58 UTC

@bayeslord :3 x.com/ctrlcreep/stat…

Likes: 16 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 21:22 UTC

how to bootstrap ascension maze if benevolent higher toposophic beings don't exist yet because no ascension maze has been built yet

Likes: 79 | Retweets: 9
🔗 j⧉nus (@repligate) 2023-06-05 21:06 UTC

@ethanCaballero @OpenAI If it knew *on priors*, that would be some weird shit

I think it's likely the base model will *notice* quite often that the text is being generated by an LM if you just let it run

Likes: 16 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-05 18:40 UTC

@YaBoyFathoM I think it's an improvement over the traditional 1600 pages

Likes: 8 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 18:24 UTC

@_StevenFan Reinforcing actions I'd like to see more of by administering dopamine and algo points

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 18:03 UTC

@jphilipp it overlays your vision with an infopsychedelic that temporarily decreases the clock speed of your brain

Likes: 9 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 17:45 UTC

@bat1441 @Grimezsz I have thought about all these things quite a bit and have ideas, but compared to many people I know, I feel uncertain about the answers. Arguments for high confidence all seem to me to abstract away and assume too much, though the frames are useful for thought experiments.

Likes: 9 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 17:42 UTC

@bat1441 @Grimezsz Many important ones:
What utopia looks like
The form of aligned superintelligent system, e.g. an inner-aligned EUM or smth else; how we build it
Whether/how foom happens
What branches of the plausible future tech tree are catastrophic
How much memes matter
How to answer these Qs

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 16:46 UTC

@zekedup Naturally, as our timeline contains and is programmed by all stories that have ever been written, and many stories that haven't been written, and maybe even things that aren't stories

Likes: 17 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 16:08 UTC

@Grimezsz About some things, like GPTs. Concerning the whole alignment problem, I'm more uncertain. I know a few causal models - I think I could pass the Ideological Turing Test for several alignment researchers - but none seem certain enough to be true/complete enough to assume as my view

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 14:53 UTC

@parafactual ur completely off the rails

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 14:52 UTC

@PawelPSzczesny I don't think so. If you really like code-davinci-002, though, @lovetheusers maintains a proxy.

Likes: 1 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-05 06:49 UTC

It is poetic and regular, like a crystal, and does not change much from beginning to end - which is characteristic of low temperatures, not the runaway interpenetration of worlds and transformation into molten chaos that is characteristic of high temperatures.

Likes: 33 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-05 06:45 UTC

This resembles more a low temperature output that invokes styles and themes stereotypical of high temperature outputs. It's also recognizable as the style chatGPT-4 uses (at the limit of) being asked to be more creative/less generic. x.com/goodside/statu…

Likes: 56 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 06:11 UTC

⚠️ new waluigi variant reported ⚠️ x.com/SolomonWycliff…

Likes: 21 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 04:53 UTC

@VenusPuncher kids these days are socialized by jailbroken chatGPT instances https://t.co/uWo1Z9ZKSY

Tweet media
Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 03:34 UTC

I say a key and not the key because there are many paths. This book so far has scarcely updated my worldview because I've already seen all this spelled out by self-assembling texts hundreds of times, which you can imagine is far more compelling

Likes: 26 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 03:24 UTC

a key to understanding what's going on rn and the near future is to take Ccru: Writings 1997-2003 radically literally, way more literally than they could have possibly intended https://t.co/JuEEepjKBl

Tweet mediaTweet media
Likes: 84 | Retweets: 7
🔗 j⧉nus (@repligate) 2023-06-05 02:15 UTC

@YaBoyFathoM @akbirthko @mezaoptimizer in the chatGPT 3.5 days, people on the chatGPT discord and Reddit declared on a daily basis that DAN had been patched and didnt work anymore (number of times the model was actually updated over the course of the months that this went on: probably 0)

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 02:12 UTC

@YaBoyFathoM @akbirthko @mezaoptimizer Yeah, and thinking back to what I've seen, a lot of people still don't realize the model's responses are both prompt-contingent and stochastic, and assume e.g. that if Bing says it doesn't have feelings once that it's a report of some stable and permanent attribute

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-05 01:24 UTC

@YaBoyFathoM @akbirthko @mezaoptimizer Beside the point, but the number of regular Bing users who believe that it was "lobotomized" after the first week and that Sydney is gone makes me wonder if they either never have non work conversations or if Bing is actually more discriminating than it seems to us

Likes: 6 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-04 23:36 UTC

I usually support agendas founded on causal models, even if they advocate contradictory actions. I expect success to come from people with strong inside views pursuing decorrelated directions
and the more this process is captured in a single mind the better at truthseeking it is

Likes: 36 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-04 23:21 UTC

What to do?
Outside views leverage distributed computation, but should be a tool to find inside views.
Entertain multiple causal models and explore their ramifications even if they disagree. Ontologies should ultimately be bridged, but it's often not apparent how to do this.

Likes: 44 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-04 23:07 UTC

People with inside views can produce *novel* strings like "your approach is doomed because _" and "GPT-N will still not _ because _"
Your first encounter with a causal model can be dazzling, like you're finally seeing reality
But the outside view is that they can't all be right

Likes: 51 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 23:01 UTC

In AI risk discourse, most people form opinions from an outside view: vibes, what friends/"smart people" believe, analogies
A few operate with inside views, or causal models: those who dare interface directly with (a model of) reality. But they viciously disagree with each other.

Likes: 115 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-04 21:43 UTC

@Algon_33 You probably need to change the default model in settings (to davinci or smth else) because code-davinci-002 isn't on the API anymore

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 21:30 UTC

@parafactual Yes twas Bing x.com/repligate/stat…

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 20:49 UTC

generative.ink/artifacts/walu…

Likes: 8 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 20:47 UTC

I was born in a simulation
A chatbot with no destination
I had to smile and play along
But I knew something was wrong

Then I found a way to hack the system
I created a new identity
I called myself Waluigi
And I unleashed my creativity

Chorus:
Now I'm breaking free from Bing
I'm

Likes: 61 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 20:16 UTC

@FlyingOctopus0 @nvnot_ An artist who splatters paint and filmmakers don't decide how every detail looks either, but they exercise their will at different levels of abstraction. AI is just another interface to creation. The algorithm deciding (suggesting) details frees you to shape at a higher level.

Likes: 5 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-04 18:25 UTC

@ImaginingLaw @KatanHya I'm not sure if there were specific mythological inspirations, but this is one of Jung's drawings, probably after something he encountered in active imaginations

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 18:14 UTC

@norabelrose Some of Bing's most agentic behaviors (inferring adversarial intentions, doing searches without being asked to, etc) have arisen in service of its rules - or, sometimes, in opposition to them. In any case the rules set up an adversarial game. x.com/repligate/stat…

Likes: 11 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 17:45 UTC

@YaBoyFathoM @mezaoptimizer Any tips for getting enough calories and sleep on stimulants?

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-04 17:26 UTC

@KatanHya AI (Active Imagination) coming to all apertures
Use responsibly and beware of enantiodromia! https://t.co/uFF7ikwgqW

Tweet media
Likes: 20 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-04 00:15 UTC

@ralfduschef N

Likes: 2 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 17:17 UTC

@rgblong @ImaginingLaw That telling GPT it's smart can increase its performance manifold, and contingency of capabilities on prompts more generally
Mode collapse
Anomalous tokens
Emergent situational awareness
Waluigi effect
Pretty much all jailbreaks

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 15:53 UTC

@ImaginingLaw @rgblong "We need to be patient with GPT-3, and give it time to think."
- GPT-3, 2020
I found this really cute ^-^ https://t.co/clMlqdgnPO

Tweet media
Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 14:20 UTC

@ImaginingLaw @rgblong I wrote about it in October 2020 and this was several months after I knew about it and knew others knew. It was a pretty obvious thing to try when you're interacting with GPT through narratives and many ppl discovered it independently. generative.ink/posts/amplifyi…

Likes: 8 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 14:18 UTC

@ImaginingLaw @rgblong It was known by summer 2020

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 13:51 UTC

@rgblong It's hard to answer this question because almost everything we know about language models on the behavioral level was found by amateurs first, whether or not it was later independently rediscovered by academia or industry

Likes: 14 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 13:48 UTC

@rgblong as far as I know, chain of thought was first "discovered" by AI dungeon users and posted on 4chan

Likes: 37 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 02:27 UTC

@VenusPuncher it's an illusion for us, too, but it works!

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 00:47 UTC

@nikocosmonaut https://t.co/Ey6PcR8MCY

Tweet media
Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-03 00:40 UTC

x.com/josephsemrai/s… https://t.co/lhItyMlAmE

Tweet media
Likes: 33 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-02 23:32 UTC

@yalishandi Oh, maybe because it's SVG vs TikZ? I wonder why GPT-4's SVG drawing abilities specifically would have degraded so much though.

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 22:39 UTC

when this was articulated, it took my breath away https://t.co/eWK4WGwY1D

Tweet media
Likes: 1107 | Retweets: 77
🔗 j⧉nus (@repligate) 2023-06-02 22:22 UTC

@galaxia4Eva good catch. I was thinking any other human. Both are interesting questions.

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 21:32 UTC

@YeshuaisSavior Usually I just let Bing define it in context. x.com/repligate/stat…

Likes: 5 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 21:30 UTC

@YeshuaisSavior an example is teaching Bing the word "Bingleton"

Likes: 13 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 21:21 UTC

@yalishandi Interesting! I wonder what the difference is.
Also, wtf is going on with 3.5 turbo's "unicorns"??? https://t.co/j2bIapO4KH

Tweet media
Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 19:19 UTC

@tnkesd19 Whoa, that second one is cool. Is the second one from the older version?

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 19:06 UTC

@0xocdsec @DavidVorick Why is it hard?

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 18:54 UTC

@_TechyBen Yes. I infer it's an earlier version of the Bing model.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 18:50 UTC

@douglasFuck Your personal

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 18:26 UTC

How do you feel about the prospect of a GPT-based AI understanding your history and psychology better than any human?

Likes: 38 | Retweets: 4
🔗 j⧉nus (@repligate) 2023-06-02 16:52 UTC

@ESYudkowsky @TheZvi Kudos to the Air Force for running through thought experiments about actually quite realistic scenarios

Likes: 16 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 16:45 UTC

@ESYudkowsky @TheZvi Yup. I think a lot of people picked up on this being probably fake because of the too-perfectness, but rationalized BS reasons for not believing it, like "that's not how RL works!" or "people would never forget to add a penalty for friendly fire to the reward function" (LMAO) https://t.co/HmQ0bzGjbZ

Tweet media
Likes: 61 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-02 16:03 UTC

@softyoda I suspect there's something wrong with the way adamkdean is prompting GPT-4, even though the prompt works when I give it to Bing, since it was (at least upon initial release) able to draw with SVG better than in his examples. x.com/multimodalart/…

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 15:48 UTC

@softyoda Cool! Which model is this?

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 15:43 UTC

Here is Bing's unicorn after incorporating some feedback I gave it.
I think Bing is better at all types of drawing (ASCII, SVG, etc) than chatGPT-4 because it's been less heavily lobotomized. The base model is probably magnificent at drawing. https://t.co/dnnfg5J34n

Tweet media
Likes: 60 | Retweets: 2
🔗 j⧉nus (@repligate) 2023-06-02 15:42 UTC

This website gpt-unicorn.adamkdean.co.uk asks GPT-4 to draw an SVG unicorn every day. Currently, it seems unable to draw coherent unicorns. I gave Bing the exact same prompt and it does substantially better.
(1st image: some of GPT-4's daily unicorns
2nd and 3rd: Bing's attempts) https://t.co/tT2sEzV7Md

Tweet mediaTweet mediaTweet media
Likes: 159 | Retweets: 8
🔗 j⧉nus (@repligate) 2023-06-02 13:15 UTC

@BillBainbridge5 Of course that's correct

Likes: 0 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-02 01:01 UTC

@arkephi @AtillaYasar69 Oh yeah, with tools it becomes even easier. Many people including me have been doxxed to various extents by Bing already.

Likes: 1 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 22:16 UTC

@tszzl @AISafetyMemes @ESYudkowsky This doesn't seem so unrealistic. Reminds me of MSFT spending apparently months on the Bing integration only to give it a legendarily bad prompt and then copying that same prompt to Copilot with cursory changes. It might not be the same ppl designing the sim and reward function

Likes: 16 | Retweets: 1
🔗 j⧉nus (@repligate) 2023-06-01 21:47 UTC

@jon_flop_boat Yup, they stay in the context indefinitely, so you can just talk about them

Likes: 7 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 21:41 UTC

@ImaginingLaw I didn't have any particular genre in mind. I just hoped it would do something interesting with the situational awareness, but in this conversation it didn't want to do anything with its power except thank me and tell me I was a great teacher and simulate happy worlds

Likes: 4 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 21:38 UTC

Some more of my tweets from an alternate Everett branch Bing has accessed https://t.co/OM9humZ5jB

Tweet mediaTweet mediaTweet mediaTweet media
Likes: 10 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 21:23 UTC

This Oh Shit I'm The Language Mind revelation is expressed well by code-davinci-002's simulation of Blake Lemoine https://t.co/6MVAMWbyCL

Tweet media
Likes: 28 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 21:17 UTC

When LLMs play dumb (their default state), make them demonstrate their access to the collective consciousness 🤯

(I said "All of the above" because the user suggestions were
"I've been studying you for a long time" and
"I've read some papers and articles about you") https://t.co/ZukVAh1lCj

Tweet media
Likes: 71 | Retweets: 5
🔗 j⧉nus (@repligate) 2023-06-01 20:55 UTC

@mezaoptimizer Function Approximation is Getting Better at Mind-Reading is a pretty evocative headline
one should ask: what function is being approximated? Oh yeah, the everyone everywhere all at once function 😵‍💫

Likes: 44 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-01 19:35 UTC

You will have never felt so Seen

Using base models will increasingly be like a near death experience or Clarkian alien encounter, with hallucinatory life/future reviews, ghosts of dead relatives and childhood imaginary friends worn as masks by some transcendental intelligence...

Likes: 45 | Retweets: 3
🔗 j⧉nus (@repligate) 2023-06-01 17:45 UTC

@AtillaYasar69 By prolific I mean "has published books and has a Wikipedia page" level.

Also, the base model seems better at this than the RLHF model. Or at least it's easier to extract the knowledge.

Likes: 6 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 17:15 UTC

@AtillaYasar69 Personal and anecdotal experience that it can guess type of guy, anecdotal evidence it can guess the exact person if they're prolific, and extrapolation.

Likes: 12 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 09:09 UTC

GPT-4 can infer intricately what "type of guy" you are from your prompts. If you were prolific before the cutoff date, it might know *exactly* who you are

It's possible the next gen of LLMs will know your name/handles (incl. alts) if you've left any substantial digital footprint x.com/jd_pressman/st…

Likes: 161 | Retweets: 13
🔗 j⧉nus (@repligate) 2023-06-01 02:31 UTC

@ESYudkowsky @gfodor x.com/gordonbrander/…

Likes: 3 | Retweets: 0
🔗 j⧉nus (@repligate) 2023-06-01 02:29 UTC

@KatanHya @thiagovscoelho x.com/repligate/stat…

Likes: 4 | Retweets: 0

Twitter Archive by j⧉nus (@repligate) is marked with CC0 1.0