> Casey Harford █
Senior Software Engineer – GPU Infrastructure
Terrebonne, Oregon
Email: casey@caseyharford.com
LinkedIn |
GitHub
> Summary
Seasoned engineer specializing in GPU infrastructure, distributed compute, and high‑scale deployment tooling.
Extensive experience building internal frameworks, performance-critical systems, and automation used across large engineering organizations.
Comfortable designing end-to-end solutions spanning hardware, firmware, kernel-level behavior, and distributed cloud services.
> Skills
- GPU Infrastructure, Distributed Systems, HPC Workloads
- Go, Python, C/C++, Linux systems programming
- Kubernetes, container orchestration, service deployment frameworks
- Automation tooling, internal developer platforms, CI/CD
- Observability, performance profiling, system optimization
- Hardware bring‑up, low‑level debugging, protocol analysis
> Experience
Senior GPU Infrastructure Engineer – LinkedIn
Apr 2024 – Present
- Lead engineering work on GPU fleet lifecycle, provisioning, orchestration, and performance systems.
- Developed internal frameworks and automation tooling supporting large-scale compute clusters.
- Improved operational efficiency, observability, and reliability across multiple teams.
Senior Site Reliability Engineer – LinkedIn
Sep 2016 – Apr 2024
- Built and maintained large-scale production systems, automated workflows, and led incident response processes.
Weebly | San Francisco, CA
Automation Engineer & Software Engineer III
- Developed automation frameworks, CI/CD pipelines, and operational tools for web hosting services.
Build.com | Chico, CA
Lead Support Engineer & QA Automation Engineer
- Provided escalated support and implemented test plans across software lifecycle.
Best Buy
Counter Intelligence Agent
- Troubleshot and resolved client computer issues.
Pacific Gas and Electric Company
Linux System Administrator Intern
- Deployed and supported Red Hat Enterprise Linux servers.
> Education
Bachelor of Science in Computer Information Systems | CSU Chico
> Selected Contributions
- Architected and maintained internal GPU infrastructure systems and operational frameworks.
- Improved reliability, observability, and efficiency of large-scale compute clusters.
- Led cross-team architectural decisions and RFC-driven design discussions.
- Developed tooling and frameworks to streamline deployments and monitoring workflows.
> Additional Interests
- Linux, embedded systems, and microcomputers
- RC vehicles, drones, and FPV systems
- Home-lab, virtualization, networking
- Automation and custom tooling