Page history

Jump to navigation Jump to search

Double Thompson Sampling Algorithm

8 May 2024

Gmelli
Text replacement - ". "" to ". “"
m
04:37
+6

13 February 2024

Gmelli
Text replacement - "[[::" to "[["
m
05:30
−10

12 February 2024

Gmelliapi
no edit summary
12:30
+21
Maintenance script
ContinuousReplacement
10:24
+1
Gmelli
no edit summary
10:23
+189
Maintenance script
ContinuousReplacement
10:23
+1
Gmelli
no edit summary
10:22
+340
Gmelli
Created page with "A Double Thompson Sampling Algorithm is a multi-armed bandit algorithm that extends the Thompson Sampling strategy by maintaining two separate probability models for each action, aimed at reducing variance in the action selection process and improving exploration efficiency. * <B>Context:</B> ** It can (typically) be applied in online learning and decision-making processes where the goal is to balance exploration of new actions with exploitation o..."
10:22
+3,258

Retrieved from "http://www.gabormelli.com/RKB/Special:History/Double_Thompson_Sampling_Algorithm"