Skills

Comprehensive inventory of my technical skills, tools, and methodologies.

Sebastian in the office

Machine Learning & AI

Supervised Learning

  • Random Forests, Gradient Boosting
  • Support Vector Machines (SVM), Logistic Regression
  • Decision Trees, k-Nearest Neighbors, Naive Bayes

Unsupervised Learning

  • Clustering: k-Means, DBSCAN, Hierarchical,...
  • Dimensionality Reduction: PCA, t-SNE,...
  • Autoencoders

Deep Learning

  • Deep Neural Networks, CNNs, RNNs, LSTMs
  • Transformers, Attention Mechanisms
  • Graph Neural Networks (GNNs)
  • Generative Adversarial Networks (GANs), Variational Autoencoders

Model Interpretability & Explainable AI

  • SHAP, LIME
  • Feature importance analysis
  • Model-agnostic explanations
  • Local explanations (individual predictions)
  • Global explanations (model behavior)
  • Counterfactual explanations
  • Decision trees for interpretability

Ensemble Methods

  • Bagging, Boosting, Voting classifiers

Feature Engineering & Hyperparameter Tuning

  • Feature selection & extraction
  • Grid Search, Random Search, Bayesian Optimization

ML Frameworks

  • scikit-learn, PyTorch, TensorFlow, Keras
  • Hugging Face Transformers

Generative AI & Agentic Workflows

LLM Architectures & Models

  • Decoder-only architectures (GPT-style)
  • Encoder-decoder architectures (T5-style)
  • Mixture of Experts (MoE) models
  • Small Language Models (SLMs)
  • Multimodal models (vision-language)

AI Agents & Agentic Workflows

  • AI agents / autonomous agents
  • Tool use / function calling
  • Chain-of-Thought (CoT) prompting
  • Multi-agent systems
  • Agent orchestration
  • Memory systems for agents

Prompt Engineering

  • Few-shot prompting
  • Zero-shot prompting
  • System prompts / instruction tuning
  • Prompt templates

LLM Application Frameworks

  • LangChain, LangGraph

Model Context Protocol (MCP)

  • MCP (Model Context Protocol)
  • MCP servers
  • MCP clients
  • Custom MCP tools/resources

Fine-tuning & Training

  • Instruction tuning
  • Supervised fine-tuning (SFT)
  • Parameter-efficient fine-tuning (PEFT)
  • LoRA, QLoRA
  • Quantization

LLM Platforms & APIs

  • OpenAI API, OpenAI Assistants API
  • Amazon Bedrock
  • Hugging Face Inference API
  • Ollama

AI-Assisted Development

  • AI code assistants (GitHub Copilot, Cursor, Amazon Q Developer)
  • Code generation with LLMs
  • Code review with AI
  • AI-assisted debugging

Safety & Quality

  • Content filtering
  • Output validation
  • Hallucination detection

Structured Output

  • JSON mode / structured outputs

Multimodal AI

  • Vision-language models (GPT-4V, Claude 3, etc.)
  • Image generation (DALL-E, Stable Diffusion, Midjourney)

Statistical Analysis & Data Science

Statistical Methods

  • Hypothesis Testing, Bayesian Statistics
  • Experimental Design, A/B Testing
  • Multivariate Analysis, Dimensionality Reduction
  • Causal Inference, Bootstrap/Resampling Methods

Data Science Tools

  • pandas, NumPy
  • SQL
  • Jupyter Notebooks, JupyterLab, Google Colab, Marimo

Statistical Libraries

  • scipy.stats, PyMC

Visualization

  • matplotlib, seaborn, plotly
  • Streamlit, Grafana

Data Imputation

  • Missing data analysis
  • Mean/median/mode imputation
  • Forward fill / backward fill
  • Interpolation methods
  • K-Nearest Neighbors (KNN) imputation
  • Multiple imputation
  • Deep learning-based imputation
  • scikit-learn SimpleImputer / IterativeImputer

Data Linkage & Entity Resolution

  • Probabilistic record linkage
  • Entity resolution / deduplication
  • Splink

Data Privacy & Anonymization

Anonymization Techniques

  • Data anonymization, Data pseudonymization
  • k-anonymity, l-diversity, t-closeness
  • Differential privacy
  • Data masking, Tokenization

Privacy-Preserving Methods

  • Synthetic data generation
  • PII (Personally Identifiable Information) detection
  • GDPR compliance techniques

Software Engineering & Development

Programming Languages

  • Python, C++, C
  • Fortran, Shell scripting (Bash)

Python Ecosystem

  • Package Management: pip, pip-tools, uv, Poetry, conda, virtualenv/venv
  • Type Checking & Linting: mypy, pylint, black
  • Testing: pytest, unittest, doctest
  • Type hints/annotations, pydantic

Version Control

  • Git, GitHub, GitLab, Bitbucket
  • Git workflows (feature branches, gitflow)

IDEs & Editors

  • VS Code, PyCharm, Cursor
  • Vim

Software Practices

  • Object-Oriented Programming
  • Design Patterns: Factory, Decorator
  • Code Review, Refactoring
  • Performance Optimization, Debugging & Profiling

DevOps & CI/CD

Containerization

  • Docker, Podman, Docker Compose
  • Dockerfile optimization

CI/CD Platforms

  • GitHub Actions, GitLab CI, Jenkins

Monitoring & Observability

  • Prometheus, Grafana, OpenTelemetry

Security

  • Security Scanning

Cloud Computing

  • AWS EC2

Physics & Scientific Computing

Physics Domains

  • Particle Detection & Instrumentation
  • Monte Carlo Simulations
  • Quantum Mechanics, Relativity
  • Nuclear Physics, Cosmology, Dark Matter Research

Physics Software

  • ROOT (CERN), Geant4, CMSSW
  • HTCondor, WLCG (Worldwide LHC Computing Grid)

Scientific Computing

  • Numerical Methods, Scientific Visualization
  • High-Performance Computing (HPC)
  • Simulation Software, SciPy

GPU Computing

  • NVIDIA GPUs

Natural Language Processing

NLP Libraries

  • spaCy, NLTK

NLP Tasks

  • Text Classification, Named Entity Recognition
  • Sentiment Analysis, Text Generation
  • Tokenization

Specialized Areas

Computer Vision

  • Image Classification

Distributed Computing

  • Multi-threading, Multi-processing
  • GPU Computing, Grid Computing

Hardware/Embedded Systems

  • FPGA Programming, Data Acquisition Systems

Domain Expertise

Financial Services

  • Financial Data Analysis
  • Regulatory Compliance
  • Back-office workflow automation
  • AI-powered reconciliation systems
  • Exception management & resolution

Quality Assurance & Testing

Testing Practices

  • Unit Testing, Integration Testing, End-to-End Testing
  • Test-Driven Development, Performance Testing
  • Test Automation

Testing Tools

  • pytest (fixtures, parametrize)
  • mocking/patching, coverage.py, tox

Project Management & Leadership

Leadership Skills

  • Mentoring
  • Cross-functional Team Coordination

Agile/Scrum

  • Scrum ceremonies
  • Backlog Management & Prioritization
  • Kanban

Tools

  • Jira, Confluence, Trello

Communication & Documentation

Technical Writing

  • Scientific Papers, Technical Specifications
  • User Guides, Code Documentation

Documentation Tools

  • Sphinx, Read the Docs
  • Docstrings (NumPy/Google style)
  • LaTeX, Markdown, reStructuredText
  • Obsidian (Knowledge Management)

Public Speaking

  • Conference Presentations
  • Technical Training, Stakeholder Presentations

Operating Systems & System Administration

  • Linux/Unix system administration (Ubuntu, CentOS)
  • Windows
  • Shell scripting & automation (Bash)
  • System monitoring & process management
  • Remote server management (SSH, terminal multiplexers)
  • Build systems (make, cmake)