Monitoring
Currently in production
Automated instance-level e-mail alerts for excessive memory use
Daily automated e-mail report on instance disk usage
Daily e-mail reports on successful model runs for each concept and each model, including the latest data points for data integrity review
Model recalibration reports
Data integrity reports
Automated e-mail alerts on any exception during model runs or model recalibration runs
Unit testing
Instance connections restricted by static IP and GCP security group
GCP 2FA
All running software continuously logs information
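As an illustration of how the disk usage report, the exception alerts, and the continuous logging listed above can fit together, here is a minimal Python sketch. It is not the production implementation: the function names `disk_usage_report` and `run_with_alert`, the `send_alert` callback (standing in for whatever e-mail sender is actually used), and the alert subject format are all assumptions made for this example.

```python
import logging
import shutil
import traceback
from typing import Callable

logger = logging.getLogger("monitoring")


def disk_usage_report(path: str = "/") -> str:
    """Return a one-line disk usage summary for the given mount point."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    pct_used = 100.0 * usage.used / usage.total
    free_gib = usage.free // 2**30
    return f"{path}: {pct_used:.1f}% used ({free_gib} GiB free)"


def run_with_alert(
    job: Callable[[], object],
    name: str,
    send_alert: Callable[[str, str], None],
) -> bool:
    """Run a job; on any exception, log it and send an alert.

    `send_alert(subject, body)` is a hypothetical callback, e.g. a thin
    wrapper around an e-mail client. Returns True if the job succeeded.
    """
    try:
        job()
    except Exception:
        # Log the full traceback, then forward it through the alert channel.
        logger.exception("Job %s failed", name)
        send_alert(f"[ALERT] {name} failed", traceback.format_exc())
        return False
    logger.info("Job %s succeeded", name)
    return True
```

A daily report job could call `disk_usage_report` once per instance and mail the collected lines, while each model run or recalibration run would be wrapped in `run_with_alert` so that a failure produces both a log entry and an alert.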
Planned for version 2
Automated CI/CD with integrity testing
Daily data visualization reports at multiple levels
Hard back-ups
Disk encryption
Slack status reports on quant data pipeline and quant infrastructure
Slack and phone alerts for server and model run failures and critical exceptions
Server and data back-up strategy
DevOps to design security policy (intrusion/hacking, firewalls, etc.)
Production will be split across multiple servers (e.g., 3 servers each carrying 33% of the load), some serving production and some serving as beta. When the code changes, the beta server will be turned over first. Conditional on successful testing, the production servers will then be turned over one at a time.
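The beta-first turnover described above can be sketched as follows. This is only an illustration of the ordering logic under stated assumptions: the function name `rolling_turnover`, the `deploy` and `passes_tests` callbacks, and the server names in the usage note are hypothetical, and the real deployment and testing steps are not specified in this plan.

```python
from typing import Callable, List, Sequence


def rolling_turnover(
    beta: str,
    production: Sequence[str],
    deploy: Callable[[str], None],
    passes_tests: Callable[[str], bool],
) -> List[str]:
    """Turn over the beta server first, then production servers one at a time.

    Stops at the first server whose tests fail, so the remaining servers
    keep running the previous code. Returns the servers that were deployed
    and passed testing.
    """
    updated: List[str] = []
    for server in [beta, *production]:
        deploy(server)
        if not passes_tests(server):
            break  # leave the remaining servers untouched
        updated.append(server)
    return updated
```

For example, `rolling_turnover("beta", ["prod-1", "prod-2"], deploy, passes_tests)` touches `prod-1` only after `beta` has passed its tests, and `prod-2` only after `prod-1` has. If the beta tests fail, no production server is changed.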