Things have coalesced near the amphitheater. When the music kicks off again, we'll go northeast to... approximately here. 47.6309473, -122.3165802 JMJM+99F Seattle, Washington

Reply

Seattle, Washington, USA – ACX Meetups Everywhere Fall 2023

Optimization Process8mo10

Announcement 1: I, the organizer, will be 5-10min late. Announcement 2: apparently there's some music thing happening at the amphitheater! I'll set up somewhere northeast of the amphitheater when I get there, and post more precise coordinates when I have.

Reply

Seattle, Washington, USA – ACX Meetups Everywhere Fall 2023

Optimization Process8mo10

$10 bounty for anybody coming / passing through Capitol Hill: pick up a blind would-be attendee outside the Zeek's Pizza by 19th and Mercer. DM me your contact information, and I'll put you in touch, and I'll pay you on your joint arrival.

Reply

Book Club: Thomas Schelling's "The Strategy of Conflict"

Optimization Process10mo10

Update: the library is unexpectedly closed due to staffing issues. The event is now at Fuel Coffee, one block south and across the street.

Reply

What can we learn from Bayes about reasoning?

Optimization Process1y175

Almost all the evidence necessary to make you accept a very-unlikely-on-priors hypothesis, is required to even raise it to conscious consideration from a field of other absurdities.

Reply

Seattle, Washington, USA – ACX Meetups Everywhere Spring 2023

Optimization Process1y10

If the chance of rain is dissuading you: fear not, there's a newly constructed roof over the amphitheater!

Reply

Seattle, Washington, USA – ACX Meetups Everywhere Spring 2023

Optimization Process1y20

Hey, folks! PSA: looks like there's a 50% chance of rain today. Plan A is for it to not rain; plan B is to meet in the rain.

See you soon, I hope!

Reply

How can I help inflammation-based nerve damage be temporary?

Optimization Process1y10

You win both of the bounties I precommitted to!

Reply

What's the Least Impressive Thing GPT-4 Won't be Able to Do

Optimization Process1y10

Lovely! Yeah, that rhymes and scans well enough for me!

Here are my experiments; they're pretty good, but I don't count them as "reliably" scanning. So I think I'm gonna count this one as a win!

(I haven't tried testing my chess prediction yet, but here it is on ASCII-art mazes.)

Reply

Models Don't "Get Reward"

Optimization Process1y8-1

I found this lens very interesting!

Upon reflection, though, I begin to be skeptical that "selection" is any different from "reward."
Consider the description of model-training:

To motivate this, let's view the above process not from the vantage point of the overall training loop but from the perspective of the model itself. For the purposes of demonstration, let's assume the model is a conscious and coherent entity. From it's perspective, the above process looks like:
Waking up with no memories in an environment.
Taking a bunch of actions.
Suddenly falling unconscious.
Waking up with no memories in an environment.
Taking a bunch of actions.
and so on.....
The model never "sees" the reward. Each time it wakes up in an environment, its cognition has been altered slightly such that it is more likely to take certain actions than it was before.

What distinguishes this from how my brain works? The above is pretty much exactly what happens to my brain every millisecond:

It wakes up in an environment, with no memories^[1]; just a raw causal process mapping inputs to outputs.
It receives some inputs, and produces some outputs.
It's replaced with a new version -- almost identical to the old version, but with some synapse weights and activation states tweaked via simple, local operations.
It wakes up in an environment...
and so on...

Why say that I "see" reward, but the model doesn't?

^{^}
Is it cheating to say this? I don't think so. Both I and GPT-3 saw the sentence "Paris is the capital of France" in the past; both of us had our synapse weights tweaked as a result; and now both of us can tell you the capital of France. If we're saying that the model doesn't "have memories," then, I propose, neither do I.

Reply