Aki Ariga

Principal Software Engineer

Treasure Data

Biography

Aki Ariga is a Vancouver-based Principal Software Engineer at Treasure Data. His interests include developing production Machine Learning systems, Machine Learning products, and MLOps. He aims to leverage the power of Machine Learning technologies for business and social good.

He leads several communities in Tokyo, such as Machine Learning Casual Talks and Kawasaki.rb, and he was also one of the organizers of the “Working Group of Machine Learning Systems and Operations for Productionization” in the Special Interest Group on Machine Learning System Engineering.

Interests

Machine Learning
MLOps
Natural Language Processing

Education

MEng in Electrical Engineering and Computer Science, 2008
Nagoya University
BSc in Electrical Engineering and Computer Science, 2006
Nagoya University

Recent Posts

Machine Learning Project and Scrum

I’ve worked on several machine learning projects, and intuitively, I’ve felt that Scrum is not well suited to machine learning. However, during an internal discussion, a colleague said, “If we use Technical Stories, we should be able to break any task down to fit within two weeks.”

2025-05-02

Migrated From Netlify to Cloudflare Pages

Netlify is a great service, but it is also known to be slow in Japan. I had been using Netlify to host my blog for a long time, but I decided to migrate to Cloudflare Pages to improve access speed to my blog from Japan.

2024-02-02

Scrape Notion and convert into PDF

I love VanGohan, a Japanese meal kit provider in Vancouver. Their meal kits are really tasty, authentic Japanese food. I can’t live without them. When I visited Japan last year, I wasn’t too eager to find nice Japanese restaurants because of them.

2024-01-26 python

tabula-py 2.8.0 now uses jpype to launch JVM

Recently, I released tabula-py 2.8.0. It is a major release because it uses jpype to launch the JVM. This reduces JVM launch time, since jpype reuses the JVM via JNI. How fast is it? I measured the execution time of the read_pdf_with_template function, which repeatedly launched a Java process in the previous version.

2023-09-09 python

4 Steps to Release a CLI in Python

This is what I learned from creating a Python CLI (digdaglog2sql) in a day. In just 4 steps, you can easily release a CLI written in Python. Create a project by using poetry. Poetry is a modern Python packaging and dependency management tool.

2022-05-20 python

Recent Posts (in Japanese)

Machine Learning Projects and Scrum

I have worked on several machine learning projects, and intuitively, on whether Scrum is suited to machine learning …

2025-05-02 10 min read

Looking Back on 2024

Things were hectic throughout 2024 and I never got around to writing this, so I am writing it in 2025. Work …

2024-12-31 4 min read misc

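I Read 「海外生活経験ゼロからカナダでソフトウェアエンジニアになった話〜英語勉強＆就活対策〜」 (How I Became a Software Engineer in Canada with Zero Experience Living Abroad: English Study & Job Hunting Tips)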

@nappan23’s “How I Became a Software Engineer in Canada with Zero Experience Living Abroad: English Study …

2024-06-20 2 min read book review

Looking Back on 2023

There is only one day left in this year, and as I did last year, I would like to look back on the year. OSS …

2023-12-31 6 min read misc

I Deleted Slack from My Phone

I had long been searching for a way to control work notifications during my time off. This is something that, if left alone, …

2023-08-20 2 min read

OSS / notebooks

2023-09-01 OSS

Notion scraper to generate PDFs of VanGohan’s printable recipes

2022-05-05 OSS

Extract SQLs from digdag log to visualize SQL lineage

2019-12-01 OSS

Extract your tables in PDF into pandas DataFrame

2019-12-01 Machine Learning

Machine Learning in Production Wiki

Machine Learning infrastructure/architecture/operation for productionization

2019-12-01 document

Docker Sphinx Recommonmark

Sphinx documentation toolchain, including latex and recommonmark in an Ubuntu docker container

2019-12-01 Jupyter notebook

Tutorials on machine learning or data science, written in Japanese

2019-12-01 workflow, digdag

cookiecutter-digdag

A template that generates digdag workflows for SQL and Python

2019-12-01 TreasureData, workflow, digdag

Unofficial Treasure Workflow Client

2019-12-01 client

Simple R client for Treasure Data

2017-12-01 NLP

Python/Ruby wrapper for KyTea

Recent Publications

MLOpsの歩き方 (Beginners Guide to MLOps)

This article is a beginner’s guide to MLOps, covering questions such as: What is MLOps? How do tech giants make Machine Learning systems? What …

仕事ではじめる機械学習 (Machine Learning for Business)

The first book on how to design Machine Learning systems and how to proceed with Machine Learning projects. This book was originally written in …

Aki Ariga, Shinta Nakayama, Takashi Nishibayashi