mariokostelac [1] has done a fantastic job with sagemaker-setup [2]. Highly recommend taking a look.
Useful scripts for making AWS SageMaker better
References:
[1]: https://github.com/mariokostelac
[2]: https://github.com/mariokostelac/sagemaker-setup
Published
All published posts
2493 posts
latest post 2026-05-11
Publishing rhythm
I like pypeaday’s [1] project aoc-2020 [2].
Advent of Code 2020
References:
[1]: https://github.com/pypeaday
[2]: https://github.com/pypeaday/aoc-2020
I’m really excited about auto-editor [1], an amazing project by WyattBlue [2]. It’s worth exploring!
Auto-Editor: Efficient media analysis and rendering
References:
[1]: https://github.com/WyattBlue/auto-editor
[2]: https://github.com/WyattBlue
Adding Audio to my blog posts
This is episode 1 of the Waylon Walker Audio experience, posts from
waylonwalker.com [1]{.hoverlink} in audio form.
So I have had this idea for awhile to add audio to my blog posts. The idea
partly comes from the aws blog, if you have ever been on their blog you will
have noticed that they have a voiced by amazon polly section.
What to Expect # [2]
Honestly I don’t know this is all new to me and I dont have much to go off of.
For now its a test that may or may not work out.
I will say that the time that I have available for clean audio is a bit limited
so expect these to come out in batches as I get time to go back and record.
What Not to Expect # [3]
One thing that makes the aws blog really hard to listen to is the robotic
voice, I definitely don’t want that. This will be voiced by a real human, Me.
At the same time written text doesn’t translate directly to audio well so don’t
necessarily expect the audio to be
word for word.
Code blocks # [4]
There are a lot of code block...
Check out yetudada [1] and their project yetudada [2].
No description available.
References:
[1]: https://github.com/yetudada
[2]: https://github.com/yetudada/yetudada
I’m impressed by quickpython [1] from timothycrosley [2].
A retro interactive coding environment powered by Python and nostalgia
References:
[1]: https://github.com/timothycrosley/quickpython
[2]: https://github.com/timothycrosley
gatsby-remark-embedder
Inspired by discourse’s link expansion I am rolling out expansions for one line
links on the blog waylonwalker [1]. I was able to find
a gatsby plugin
gatsby-remark-embedder [2]
that expands one line links for social cards for popular platforms like twitter
and YouTube through a repose from Kyle Mathews to my tweet.
https://twitter.com/kylemathews/status/1329817928666005504
Use Cases # [3]
This covers a couple of use cases I have with very little effort.
- Twitter
- YouTube
install # [4]
npm i gatsby-remark-embedder gatsby-plugin-twitter
This was super quick and simple to setup, the only thing that was extra was to
install the gatsby-plugin-twitter plugin as well as the
gatsby-remark-embedder.
enable # [5]
// In your gatsby-config.js
module.exports = {
// Find the 'plugins' array
plugins: [
`gatsby-plugin-twitter`,
{
resolve: `gatsby-transformer-remark`,
options: {
plugins: [
{
resolve: `gatsby-remark-embedder`,
options: {
customTransformers: [
// Your custom t...
Expand One Line Links
I wanted a super simple way to cross-link blog posts that require as little effort as possible, yet still looks good in vanilla markdown in GitHub. I have been using a snippet that puts HTML [1] into the markdown. While this works, it’s more manual/difficult for me does not look the best, and does not read well as
Goals for new card # [2]
The new card should be fully automated to expand with title, description, and
cover image. Bonus if I am able to attach a comment behind it.
- fully automated
- card expansion
- Title
- description
- cover image
Old Card # [3]
If you can call it a card 🤣. This card was just an image wrapped in an anchor tag and a paragraph tag. I found this was the most consistent way to get an image narrower and centered in both GitHub and dev.to.
<p style='text-align: center'>
<a href='https://waylonwalker.com/notes/eight-years-cat/'>
<img
style='width:500px; max-width:80%; margin: auto;'
src="https://images.waylonwalker.com/eight-years-cat.png"
al...
astronomer [1] has done a fantastic job with dag-factory [2]. Highly recommend taking a look.
Dynamically generate Apache Airflow DAGs from YAML configuration files
References:
[1]: https://github.com/astronomer
[2]: https://github.com/astronomer/dag-factory
orchest [1] by orchest [2] is a game-changer in its space. Excited to see how it evolves.
Build data pipelines, the easy way 🛠️
References:
[1]: https://github.com/orchest/orchest
[2]: https://github.com/orchest
Find and Replace in the Terminal.
grepr # [1]
grepr() {grep -iRl "$1" | xargs sed -i "s/$1/$2/g"}
```bash
grepr() {grep -iRl "$1" | xargs sed -i "s/$1/$2/g"}
grepd # [2]
grepd() {grep -iRl "$1" | xargs sed -i "/^$1/d"}
CocSearch # [3]
:CocSearch published: false -g *.md
References:
[1]: #grepr
[2]: #grepd
[3]: #cocsearch
gvanrossum [1] has done a fantastic job with patma [2]. Highly recommend taking a look.
Pattern Matching
References:
[1]: https://github.com/gvanrossum
[2]: https://github.com/gvanrossum/patma
Resume Tips
- customize for the job
- Why are you a good fit?
- What will you bring to the role?
- Give real outcomes
- give real experience
- Stop tech vomiting
- if you link to GitHub
- Make a profile readme
- Guide me to your best work
- have some activity
- if you link to LinkedIn
- Provide some benefit that is not on your resume
- Have a logical flow of experience (dont make me hunt for past experience)
- Keep it under 2 pages
- Who you know.
- Reference real experience
- Deployed 12 data pipelines with over 500 nodes to process 200GB of data at a Fortune 100 company
- vs
- Knowledge of Data Engineering methodology with python EC2
- Dont be so fluffy
mingrammer [1] has done a fantastic job with diagrams [2]. Highly recommend taking a look.
🎨 Diagram as Code for prototyping cloud system architectures
References:
[1]: https://github.com/mingrammer
[2]: https://github.com/mingrammer/diagrams
Just starred svelte-actions [1] by swyxio [2]. It’s an exciting project with a lot to offer.
prototype official actions for Svelte
References:
[1]: https://github.com/swyxio/svelte-actions
[2]: https://github.com/swyxio
Codeit Bro Interview
[1]
use this profile image
Please share your professional role as a data scientist? [Also feel free to
share about your personal projects, publications, etc.]
I graduated with a Mechanical Engineering Degree 8 years ago. Much of my work
early in my career [2] was wrapped
around analyzing larger datasets for my group to understand quality, drive
changes to improve quality or prove that quality was already good.
[3]
Three years ago I made the switch to Data Science and have loved every minute of
it. It is a very dynamic field that is continually changing and there are
always a new set of skills to learn and hone in on. I talk a lot about the
mindset of always learning, sharing knowledge, and communicating in my
newsletter [4]
What are the most difficult challenges you faced as a data scientist and how
you resolved them?
Deployment is a high bar to enter. Jupyter notebooks provide a suspiciously simple start into Data Science. Folks with very little coding experience can easily ...
reasons-to-kedro
There are many reasons that you should be using kedro. If you are on a team of
Data Scientists/Data Engineers processing DataFrames from many data sources
should be considering a pipeline framework. Kedro is a great option that
provides many benefits for teams to collaborate, develop, and deploy data
pipelines
What is Kedro [1]
Starter Template # [2]
Kedro makes it super easy to get started with their cli that utilizes
cookiecutter under the hood.
conda create -n my-new-project -y python=3.8
kedro new
kedro install
kedro run
Create New Kedro Project [3]
read more about how to start your first kedro project here
Collaboration # [4]
Kedro provides many tools that help teams collaborate on a single codebase.
While writing monolithic scripts it can be easy to pin yourself in a corner
where it is difficult to have multiple people making changes to the
notebook/script at the same time. Kedro helps guide your team to break your
project down into small pieces that different members o...
Reasons to Kedro
Reasons to Kedro # [1]
- collaboration
- Sharable catalog
- small nodes over monolithic notebooks
- catalog
- easily load anything without needing to run
- No need to write read/write code
- pipeline
- No need to keep execution order in your head
- easily run a slice of a pipeline
- plugins
- pip install
- make your own
- hooks
- flexible expandable cli
Reasons Not to Kedro # [2]
- Already utilizing another DAG framework
- Data is not in a widely supported format
- Micro short-lived project
- Large Project / Deadline
- Use a lower profile project to learn first
- Team not willing to change
- Need minimal dependencies
- God Project - kedro owns everything??
References:
[1]: #reasons-to-kedro
[2]: #reasons-not-to-kedro
Just starred Second-Brain [1] by KasperZutterman [2]. It’s an exciting project with a lot to offer.
A curated list of awesome Public Zettelkastens 🗄️ / Second Brains 🧠 / Digital Gardens 🌱
References:
[1]: https://github.com/KasperZutterman/Second-Brain
[2]: https://github.com/KasperZutterman
Reading List
Latest Post # [1]
latest [2]
STOP LEAVING Browser Tabs open and save them here!
- https://nesbitt.io/2026/03/04/package-managers-need-to-cool-down.html
- https://mariozechner.at/posts/2026-03-25-thoughts-on-slowing-the-fuck-down/
- https://danielmiessler.com/blog/ai-stops-being-artificially-cheap
---
- jbrancha til [3]
- The Video Course Launch that Made Me Think [4]
- photo prism [5]
- box python library [6]
- kedro on hn [7]
- How can a Data Scientist refactor Jupyter notebooks towards production-quality code? [8]
- Sourcing vs executing in Bash [9]
- Should We Follow The Open-Closed Principle? [10]
- Create multi-dimensional arrays in pure Python: The Correct Way [11]
- Beware of These 9 Red Flags in a Developer Interview [12]
- How to Overcome Impostor Syndrome as a Developer [13]
- lazy load youtube videos [14]
- lite youtube embeds [15]
- full subtitle youtube search [16]
---
- Jungle Scout - Kedro Case Study [17]
- Kedro Sessions [18]
- Julia Evans - A...