A lightweight tool for versioning data alongside source code and building data pipelines