I am currently available for consulting in the following areas:
- Data Engineering - scraping, web log processing, ETL, etc...
- Basic Data Science/Machine Learning - nothing too complex yet, but I'm working on it.
Languages & frameworks
- Python - numpy, sklearn
- Apache Spark
Continuous Integration & Testing
- Circleci & Travis
- BDD with Behat, Cucumber & selenium
- unit and module testing with phpunit & python unittest
- Load testing with Apache Bench & loader.io
A hodgepodge of various other technologies I've worked on/with.
- AWS Redshift
- AWS Lambda - mostly Python some node.js
- AWS EC2 Container Service (ECS)
- AWS SQS
- AWS EMR (Spark)
- AWS Kinesis
- AWS IAM credential Management - Temp Session Tokens
- Algolia Search
- Continuous Integration & Testing - Circleci
- Webscraping scrapy + custom scrapers
- Redis - caching and to store web crawler state.
- Loadbalancing & reverse proxy setup - Nginx, Haproxy, ELBs
- Frontend Dev