Repository graph

Select Git revision
  • c8a36fb1854be984ad958e8bc827633a0be37e55
  • master default
  • betterInstructions
  • tact_agent
Commits, newest first:
  • Updated format string in DataShopLogger to use uppercase %S (branch: master)
  • updated run script to restrict range of inputs, updated knowledge to just have conceptual knowledge, working out pseudo code for rule updating. (branch: tact_agent)
  • added code to run tact agent on fractions, simplified fractions output representation to remove the above, left, etc.
  • changing version for error (branch: betterInstructions)
  • making pbr false (tag: v0.1)
  • fixing name
  • changing name to tutorenvs
  • setup file
  • adding instructions
  • comment and instruction changes
  • added multi ppo-operator tuned hyperparams
  • fixed multicolumn bug where agent can receive reward for submitting empty string to a field that should be empty
  • removed py_rete dependency
  • added multicolumn run_al
  • set fractions back to normal difficulty
  • added operator multicolumn model
  • mostly working version of ppo tuning/training for fraction operator model
  • set lower min eval for fraction ppo tuning, disabled logging for multicolumn, added dual decision tree model for fractions
  • trying to get ppo model working with fractions, had to reduce problem difficulty
  • updated multicolumn to rename carrys to the column they align with.
  • latest updates
  • changed sampling space to be dynamic to match up batches and steps better
  • added ability to do multiple processes for study in parallel
  • adjustment to tune ppo
  • code for tuning ppo
  • got the dual ppo working, with some of the classes for optuna tuning"
  • working version of fractions PPO model and decision tree model
  • working with decision tree and cobweb for tutor env
  • working version of request_demo for multicolumn addition; datashop logging
  • working env with eye / perception
  • working pixel representation for multicolumn
  • working version of multicolumn
  • added in soem rendering
  • working v1 for fractions that abstracts away specific numbers in fields
  • forgot to include ppo example
  • working RL model
  • renamed to lower case
  • working example with AL
  • working version of fractions environment