Skills
Comprehensive inventory of my technical skills, tools, and methodologies.
Machine Learning & AI
Supervised Learning
- Random Forests, Gradient Boosting
- Support Vector Machines (SVM), Logistic Regression
- Decision Trees, k-Nearest Neighbors, Naive Bayes
Unsupervised Learning
- Clustering: k-Means, DBSCAN, Hierarchical, etc.
- Dimensionality Reduction: PCA, t-SNE, etc.
- Autoencoders
Deep Learning
- Deep Neural Networks, CNNs, RNNs, LSTMs
- Transformers, Attention Mechanisms
- Graph Neural Networks (GNNs)
- Generative Adversarial Networks (GANs), Variational Autoencoders
Model Interpretability & Explainable AI
- SHAP, LIME
- Feature importance analysis
- Model-agnostic explanations
- Local explanations (individual predictions)
- Global explanations (model behavior)
- Counterfactual explanations
- Decision trees for interpretability
Ensemble Methods
- Bagging, Boosting, Voting classifiers
Feature Engineering & Hyperparameter Tuning
- Feature selection & extraction
- Grid Search, Random Search, Bayesian Optimization
ML Frameworks
- scikit-learn, PyTorch, TensorFlow, Keras
- Hugging Face Transformers
Generative AI & Agentic Workflows
LLM Architectures & Models
- Decoder-only architectures (GPT-style)
- Encoder-decoder architectures (T5-style)
- Mixture of Experts (MoE) models
- Small Language Models (SLMs)
- Multimodal models (vision-language)
AI Agents & Agentic Workflows
- AI agents / autonomous agents
- Tool use / function calling
- Chain-of-Thought (CoT) prompting
- Multi-agent systems
- Agent orchestration
- Memory systems for agents
Prompt Engineering
- Few-shot prompting
- Zero-shot prompting
- System prompts
- Prompt templates
LLM Application Frameworks
- LangChain, LangGraph
Model Context Protocol (MCP)
- MCP servers
- MCP clients
- Custom MCP tools/resources
Fine-tuning & Training
- Instruction tuning
- Supervised fine-tuning (SFT)
- Parameter-efficient fine-tuning (PEFT)
- LoRA, QLoRA
- Quantization
LLM Platforms & APIs
- OpenAI API, OpenAI Assistants API
- Amazon Bedrock
- Hugging Face Inference API
- Ollama
AI-Assisted Development
- AI code assistants (GitHub Copilot, Cursor, Amazon Q Developer)
- Code generation with LLMs
- Code review with AI
- AI-assisted debugging
Safety & Quality
- Content filtering
- Output validation
- Hallucination detection
Structured Output
- JSON mode / structured outputs
Multimodal AI
- Vision-language models (GPT-4V, Claude 3, etc.)
- Image generation (DALL-E, Stable Diffusion, Midjourney)
Statistical Analysis & Data Science
Statistical Methods
- Hypothesis Testing, Bayesian Statistics
- Experimental Design, A/B Testing
- Multivariate Analysis, Dimensionality Reduction
- Causal Inference, Bootstrap/Resampling Methods
Data Science Tools
- pandas, NumPy
- SQL
- Jupyter Notebooks, JupyterLab, Google Colab, Marimo
Statistical Libraries
- scipy.stats, PyMC
Visualization
- matplotlib, seaborn, plotly
- Streamlit, Grafana
Data Imputation
- Missing data analysis
- Mean/median/mode imputation
- Forward fill / backward fill
- Interpolation methods
- k-Nearest Neighbors (KNN) imputation
- Multiple imputation
- Deep learning-based imputation
- scikit-learn SimpleImputer / IterativeImputer
Data Linkage & Entity Resolution
- Probabilistic record linkage
- Entity resolution / deduplication
- Splink
Data Privacy & Anonymization
Anonymization Techniques
- Data anonymization, Data pseudonymization
- k-anonymity, l-diversity, t-closeness
- Differential privacy
- Data masking, Tokenization
Privacy-Preserving Methods
- Synthetic data generation
- PII (Personally Identifiable Information) detection
- GDPR compliance techniques
Software Engineering & Development
Programming Languages
- Python, C++, C
- Fortran, Shell scripting (Bash)
Python Ecosystem
- Package Management: pip, pip-tools, uv, Poetry, conda, virtualenv/venv
- Type Checking, Linting & Formatting: mypy, pylint, black
- Testing: pytest, unittest, doctest
- Type hints/annotations, pydantic
Version Control
- Git, GitHub, GitLab, Bitbucket
- Git workflows (feature branches, gitflow)
IDEs & Editors
- VS Code, PyCharm, Cursor
- Vim
Software Practices
- Object-Oriented Programming
- Design Patterns: Factory, Decorator
- Code Review, Refactoring
- Performance Optimization, Debugging & Profiling
DevOps & CI/CD
Containerization
- Docker, Podman, Docker Compose
- Dockerfile optimization
CI/CD Platforms
- GitHub Actions, GitLab CI, Jenkins
Monitoring & Observability
- Prometheus, Grafana, OpenTelemetry
Security
- Security Scanning
Cloud Computing
- AWS EC2
Physics & Scientific Computing
Physics Domains
- Particle Detection & Instrumentation
- Monte Carlo Simulations
- Quantum Mechanics, Relativity
- Nuclear Physics, Cosmology, Dark Matter Research
Physics Software
- ROOT (CERN), Geant4, CMSSW
- HTCondor, WLCG (Worldwide LHC Computing Grid)
Scientific Computing
- Numerical Methods, Scientific Visualization
- High-Performance Computing (HPC)
- Simulation Software, SciPy
GPU Computing
- NVIDIA GPUs
Natural Language Processing
NLP Libraries
- spaCy, NLTK
NLP Tasks
- Text Classification, Named Entity Recognition
- Sentiment Analysis, Text Generation
- Tokenization
Specialized Areas
Computer Vision
- Image Classification
Distributed Computing
- Multi-threading, Multi-processing
- GPU Computing, Grid Computing
Hardware/Embedded Systems
- FPGA Programming, Data Acquisition Systems
Domain Expertise
Financial Services
- Financial Data Analysis
- Regulatory Compliance
- Back-office workflow automation
- AI-powered reconciliation systems
- Exception management & resolution
Quality Assurance & Testing
Testing Practices
- Unit Testing, Integration Testing, End-to-End Testing
- Test-Driven Development, Performance Testing
- Test Automation
Testing Tools
- pytest (fixtures, parametrize)
- mocking/patching, coverage.py, tox
Project Management & Leadership
Leadership Skills
- Mentoring
- Cross-functional Team Coordination
Agile/Scrum
- Scrum ceremonies
- Backlog Management & Prioritization
- Kanban
Tools
- Jira, Confluence, Trello
Communication & Documentation
Technical Writing
- Scientific Papers, Technical Specifications
- User Guides, Code Documentation
Documentation Tools
- Sphinx, Read the Docs
- Docstrings (NumPy/Google style)
- LaTeX, Markdown, reStructuredText
- Obsidian (Knowledge Management)
Public Speaking
- Conference Presentations
- Technical Training, Stakeholder Presentations
Operating Systems & System Administration
- Linux/Unix system administration (Ubuntu, CentOS)
- Windows
- Shell scripting & automation (Bash)
- System monitoring & process management
- Remote server management (SSH, terminal multiplexers)
- Build systems (Make, CMake)