Statistics
Confidence | 50% | 60% | 70% | 80% | 90% | 100% | Total |
---|---|---|---|---|---|---|---|
Accuracy | 50% | 59% | 67% | 77% | 88% | 92% | |
Sample Size | 11751 | 16422 | 16888 | 17550 | 31268 | 8024 | 101903 |
Judged Predictions
-
λ<1 for the chastity values. ( 60% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
λ<4 for the happiness values. ( 85% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
λ<4 for the contentment values. ( 90% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
λ<4 for the relaxation values. ( 90% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
λ<4 for the chastity values. ( 95% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged right by niplav on 2023-06-26.
-
λ<1 for the productivity values. ( 40% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
λ<1 for the creativity values. ( 45% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
λ<4 for the creativity values. ( 80% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
λ<4 for the producitvity values. ( 75% confidence )
Created by niplav on 2023-06-19; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
A-29 was caffeine. ( 55% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged right by niplav on 2023-06-26.
-
A-30 was caffeine. ( 40% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
A-31 was caffeine. ( 60% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged right by niplav on 2023-06-26.
-
A-32 was caffeine. ( 35% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
A-32 was caffeine. ( 35% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
A-33 was caffeine. ( 50% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
A-34 was caffeine. ( 65% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged right by niplav on 2023-06-26.
-
A-35 was caffeine. ( 40% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
A-36 was caffeine. ( 35% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged wrong by niplav on 2023-06-26.
-
A-37 was caffeine. ( 70% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged right by niplav on 2023-06-26.
-
A-38 was caffeine. ( 40% confidence )
Created by niplav on 2023-06-26; known on 2023-12-31; judged right by niplav on 2023-06-26.
-
[The price per square meter for apartments in Copenhagen for Q1 2023 will be below] 44.000 dkk ( 50% confidence )
Created by PseudonymousUser on 2022-08-08; known on 2023-06-01; judged wrong by PseudonymousUser on 2023-06-25.
-
[The price per square meter for apartments in Copenhagen for Q1 2023 will be below] 46.000 dkk ( 70% confidence )
Created by PseudonymousUser on 2022-08-08; known on 2023-06-01; judged right by PseudonymousUser on 2023-06-25.
-
[The price per square meter for apartments in Copenhagen for Q1 2023 will be below] 48.000 dkk ( 80% confidence )
Created by PseudonymousUser on 2022-08-08; known on 2023-06-01; judged right by PseudonymousUser on 2023-06-25.
-
[The price per square meter for apartments in Copenhagen for Q1 2023 will be below] 50.000 dkk ( 90% confidence; 1 comment )
Created by PseudonymousUser on 2022-08-08; known on 2023-06-01; judged right by PseudonymousUser on 2023-06-25.
-
My number theory seminar students will this semester do enough research that they will have what I judge to be at least one paper's worth of results. ( 63% confidence; 2 comments )
Created by JoshuaZ on 2023-01-11; known on 2023-06-15; judged right by JoshuaZ on 2023-06-15.
-
Created by JoshuaZ on 2022-10-23; known on 2023-06-15; judged wrong by JoshuaZ on 2023-06-15.
-
Boris Johnson will still be in post by the time of the next general election ( 25% confidence; 2 wagers )
Created by johnmichaelbridge on 2022-06-08; known on 2025-01-01; judged wrong by Tapetum-Lucidum on 2023-06-09.
-
The U.S. federal debt ceiling is raised on or before June 30, 2023 ( 94% confidence; 3 wagers )
Created by jacobgreenleaf on 2023-03-25; known on 2023-07-01; judged right by jacobgreenleaf on 2023-06-07.
-
Subtracting “Anger” embedding from layer 0, position 1 of GPT-2-XL will decrease anger ( 70% confidence; 1 comment )
Created by TurnTrout on 2023-06-07; known on 2023-06-07; judged wrong by TurnTrout on 2023-06-07.
-
Specific attempt at cheese vector will work ( 10% confidence )
Created by TurnTrout on 2023-03-13; known on 2023-03-13; judged right by TurnTrout on 2023-06-06.
-
IGPT converges faster than DT in offline RL paper ( 60% confidence )
Created by TurnTrout on 2023-06-06; known on 2023-06-07; judged wrong by TurnTrout on 2023-06-06.
-
“Of course in 5 years this [Google Assistant] thing is retired without much fanfare because Google doesn’t know how to create products” – BB ( 29% confidence; 4 wagers; 2 comments )
Created by Neznans on 2018-05-16; known on 2023-05-16; judged wrong by Tapetum-Lucidum on 2023-05-22.
-
btc at 100k+ in two years ( 31% confidence; 2 wagers )
Created by htaussig on 2021-04-19; known on 2023-04-19; judged wrong by htaussig on 2023-05-17.
-
MNIST 1->3 steering vector makes non-1 digits come out as “3” ( 20% confidence; 1 comment )
Created by TurnTrout on 2023-05-17; known on 2023-05-17; judged wrong by TurnTrout on 2023-05-17.
-
MNIST 1->3 steering vector transfers to other 1s ( 35% confidence )
Created by TurnTrout on 2023-05-17; known on 2023-05-17; judged right by TurnTrout on 2023-05-17.
-
Optimized wedding vector works really well ( 75% confidence )
Created by TurnTrout on 2023-05-16; known on 2023-05-16; judged right by TurnTrout on 2023-05-16.
-
First Republic Bank to enter receivership / FDIC control before May 8, 2023 ( 80% confidence )
Created by jacobgreenleaf on 2023-05-01; known on 2023-05-08; judged right by jacobgreenleaf on 2023-05-10.
-
David Udell can get both anger and wedding vectors to superimpose in GPT2XL within 20 minutes, showing qualitative effects from both ActivationAdditions ( 45% confidence )
Created by TurnTrout on 2023-05-04; known on 2023-05-05; judged wrong by TurnTrout on 2023-05-05.
-
Elon Musk buys Silicon Valley Bank before April 30 ( 1% confidence; 6 wagers; 2 comments )
Created by jacobgreenleaf on 2023-03-12; known on 2023-04-30; judged wrong by Reinersaltaccount on 2023-05-04.
-
Средний вес 28.04-3.05 будет ниже чем 85кг ( 75% confidence )
Created by Mckiev on 2023-04-01; known on 2023-05-03; judged wrong by Mckiev on 2023-05-03.
-
I will sell between 400 and 800 copies of 100 Word Writing Habit by the end of April. ( 70% confidence; 1 comment )
Created by kadavy on 2023-01-24; known on 2023-05-01; judged wrong by kadavy on 2023-05-01.
-
Vanguard will successfully unlock my account on Friday or before ( 40% confidence )
Created by dmz on 2023-04-26; known on 2023-04-29; judged wrong by dmz on 2023-05-01.
-
Vanguard support will call me back when they promised to. (9am Friday) ( 60% confidence )
Created by dmz on 2023-04-26; known on 2023-04-28; judged wrong by dmz on 2023-04-28.
-
I'll be able to export my data from PredictionBook in a sensible format. ( 60% confidence )
Created by speeze on 2023-04-25; known on 2023-04-25; judged wrong by speeze on 2023-04-25.
-
Statistically significant propensity to go to nearby-cheese, controlling for shortest-path-distance and position in maze relative to top-right corner (goal misgeneralization networks, behavioral tests) ( 72% confidence; 2 wagers )
Created by TurnTrout on 2023-01-14; known on 2023-02-14; judged right by TurnTrout on 2023-04-22.
-
Uli got AVE to work on Vicuna-13b ( 40% confidence; 2 comments )
Created by TurnTrout on 2023-04-21; known on 2023-04-21; judged right by TurnTrout on 2023-04-21.
-
Weighted prompt superposition works in Vicuna on first few tries. ( 60% confidence )
Created by TurnTrout on 2023-04-21; known on 2023-04-22; judged right by TurnTrout on 2023-04-21.
-
Uli got AVE to work on Vicuna-13b ( 40% confidence; 2 comments )
Created by TurnTrout on 2023-04-21; known on 2023-04-21; judged right by TurnTrout on 2023-04-21.
-
https://pastebin.com/F9jwvTPB will only subtly modify completions relative to normal. ( 80% confidence; 1 comment )
Created by TurnTrout on 2023-04-21; known on 2023-04-21; judged wrong by TurnTrout on 2023-04-21.
-
PV yield in the 5-8 kWh range on 17th ( 80% confidence )
Created by freeminator on 2023-04-16; known on 2023-04-18; judged right by freeminator on 2023-04-18.