Current Projects

UVic AI is broadening its horizons with a suite of five exciting new projects that push the boundaries of AI and machine learning in diverse domains. Building on our foundation of advanced algorithms and reinforcement learning techniques, our team is exploring:

Liquid World Models

Using novel new ML methods LTC/CFC networks, we’re exploring their use to create more dynamic and expressive world models for reinforcement learning agents which should enable more adaptable agents in the field.

Renewable Energy Forecasting

Applying state-of-the-art predictive models to anticipate energy outputs, helping to optimize grid management and sustainable practices.

Mechanistic Interpretability of RL Agents

Delving into the inner workings of reinforcement learning agents to better understand and explain their decision-making processes.

Toxicology Prediction

Leveraging machine learning to predict chemical toxicity, advancing safety assessments and regulatory compliance.

Financial Forecasting

Utilizing robust algorithms to analyze market trends and predict financial outcomes, aiming to support informed economic decision-making.

Our approach remains rooted in rigorous experimentation and iterative improvement, with each project reflecting our commitment to tackling real-world challenges through innovative AI solutions.

Any level of experience with ML can contribute on any of these projects, so if you’re interested don’t hesitate to reach out.

Previous Projects

Reinforcement Learning for Battlesnake

UVic AI’s primary project is using reinforcement learning to build a top competitor in the game Battlesnake. Battlesnake is an internationally played game in which 4 snakes fight for survival in a digital environment. Each turn, every snake’s code is sent the complete state of the environment and given 500ms to compute and submit their next move. To train in this multi-agent synchronous game environment, we have built a modified Monte Carlo Tree Search algorithm that trains via self-play (inspired by AlphaZero). Our current model is at an intermediate level of play and showing no signs of diminishing returns.

Our paper, as it was submitted to CUCAI 2023.

Our future goals are to improve the interpretability of this RL model, to implement Random Network Distillation and other MCTS modifications, and to scale up training until we’re first place on the leaderboard.

We also help those with less programming experience write their first battlesnakes using basic heuristics during the lead-up to seasonal Battlesnake tournaments.

UVic Robotics Collaboration

UVic AI will be collaborating with the UVic Robotics team to work on automating their Mars-like rover. Potential projects include soil analysis, object detection, and robotic arm movement.

Mechanistic Interpretability

Starting Fall 2023, we looked into working on a number of mini projects to help us better understand the field of mechanistic interpretability. Problems will be pulled from Neel Nanda’s post: 200 Concrete Open Problems in Mechanistic Interpretability. Stay tuned.

Group Education

In addition to our projects, we also run two weekly group study sessions/tutorials for a PyTorch course and the TensorFlow Certificate exam. Everyone is welcome to come learn with us, so feel free to email us or join our discord to learn more! These sessions will be starting later in the semester.