Feature/serow sac #65

MichaelMarav · 2025-11-25T14:40:47Z

Per contact frame SAC training framework.
Key features:

Random initialization point on reset
Training does not stop when cf loses contact with ground. It returns 0 reward and the last observation until the foot is in contact again
Training stops and reset() is called when filter is diverged (error greater than a threshold
Incorporated prestepdqn in the time variable I pass from step to reset and vice versa
Added serow convergence in reset so the agent begins the tuning after EKF has converged (using .filter until it converges)
Eval function that stops training after reward hits a certain hyperparam threshold (not sure if this is good)
Reward computes Mahalanobis distance on SE(3) between estimated and GT pose or normalized SE(3) geodesic metric

GL

…he contact is lost

…ctions

…e velocity on reset()

… of timesteps on reset()

MichaelMarav and others added 22 commits October 31, 2025 15:30

Added working SAC for serow covariance tuning

07f4a7a

Added a function to set the base state pose

9e8513b

Fixed random initialization seed

3b3376b

Added set_base_pose binding

cd94b87

Changed plot structure

19b8454

Changed hyperparams for training and added plotting of rewards

a245f9d

Added correct seed for reset, changed the RL step to not reset when t…

55d8426

…he contact is lost

Changed the plotting to include orientation and removed the plot of a…

0bee276

…ctions

Added function to set the base velocity

3a49b15

Changed the way the bias is removed from GT and added setting GT bas…

47b91e7

…e velocity on reset()

Added set base linear velocity pybind

0ed762f

Added random seet to SAC model

0113c5a

Changed the way GT bias is removed

46fd354

Fixed bug that trained only on one foot_frame

73bbe55

changed the reward

c4a37a4

added convergence cycles

bab1214

Added function to check if state is valid and added re-initialization…

fb250f2

… of timesteps on reset()

added check for valid state

079a8ab

wip

f36553e

Added function to set base pose covariance

057a0ab

wip + cleanup

de6e527

removed reward weights

ad97c3b

MichaelMarav requested a review from mrsp November 25, 2025 14:40

MichaelMarav self-assigned this Nov 25, 2025

MichaelMarav added 4 commits December 8, 2025 14:19

added plots for per axis force

57351c3

Changed the observation

7700795

Modified to fit the state

f2a8dd6

changed plots to plot the correct rewards per episode

31940c8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/serow sac #65

Feature/serow sac #65

Uh oh!

MichaelMarav commented Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feature/serow sac #65

Are you sure you want to change the base?

Feature/serow sac #65

Uh oh!

Conversation

MichaelMarav commented Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants