Drafts

Thanks For Subscribing

✨ You’re Awesome Thank you so much for subscribing to my newsletter. It’s still early days, please let me know what you want to hear about, I would love to get the conversation started! My hope is that you find some wicked sweet content here and stay tuned for more, but if you dont like it you can opt-out at any time. What to expect # [2] - Weekly inspirational message based on my experiences breaking into the industry - monthly-ish message about what is going on new posts, life, updates - Future annouce premium content ( let me know what you think it should be ) References: [1]: /og/ [2]: #what-to-expect

Practice your craft

Show up For Practice # [1] how I practice # [2] What Works for You # [3] References: [1]: #show-up-for-practice [2]: #how-i-practice [3]: #what-works-for-you

Productive one on one

[1]Inspired by https://m.soundcloud.com/front-end-happy-hour/episode-095-shot-for-shot-one-on-ones References: [1]: https://dropper.wayl.one/file/33e9b982-c621-4506-a61c-bfe106d633f1.webp

Long variable names are good

🏷️ Long variable names are a good thing. Self documenting code is more important than poorly documented code. Simply adding a few characters to your variable names can go a long ways. Containers are plural # [1] Aliases are welcome # [2] Scope is important References: [1]: #containers-are-plural [2]: #aliases-are-welcome

What is Refactoring

Make the change easy Make the easy change Martin Fowler When to refactor

automate your deploys

simple click 2

RSS feed for your Gatsby Site

Adding an rss feed to your gatsby js site is super simple. https://www.gatsbyjs.org/packages/gatsby-plugin-feed/ Install # [1] npm install --save gatsby-plugin-feed How to use # [2] // In your gatsby-config.js module.exports = { plugins: [ { resolve: `gatsby-plugin-feed`, options: { query: ` { site { siteMetadata { title description siteUrl site_url: siteUrl } } } `, feeds: [ { serialize: ({ query: { site, allMarkdownRemark } }) => { return allMarkdownRemark.edges.map(edge => { return Object.assign({}, edge.node.frontmatter, { description: edge.node.excerpt, date: edge.node.frontmatter.date, url: site.siteMetadata.siteUrl + edge.node.fields.slug, guid: site.siteMetadata.siteUrl + edge.node.fields.slug, custom_elements: [{ "content:encoded": edge.node.html }], }) }) }, query: ` { allMarkdownRemark( sort: { order: DESC, fields: [frontmatter___date] }, ) { edges { node { excerpt html fields { slug } frontmatter { title date } } } } } ...

Conda Build

Conda Build Install Conda Build # [1] conda update conda conda install conda-build References: [1]: #install-conda-build

What DataScientists Should Know About S3

Pyspark

I have been using pyspark since March 2019, here are my thoughts.

Making good documentation in python

Tools Sphinx # [1] Portray # [2] I just started using portray and it is amazingly simple to use! Methodology References: [1]: #sphinx [2]: #portray

Amazon Web Services

I started using aws in March 2019. Here are some of my notes. aws-cli # [1] ec-2 # [2] SageMaker # [3] S3 # [4] References: [1]: #aws-cli [2]: #ec-2 [3]: #sagemaker [4]: #s3

Minimal Project Structure

TLDR # [1] Use .gitignore.io [2] and consider adding an alias to your terminal to quickly add a .gitignore to any project missing one. alias gitignore='curl https://www.gitignore.io/api/vim,emacs,python,pycharm,sublimetext,visualstudio,visualstudiocode,data > .gitignore' Add a minimal setup.py to the root of your project, and use the following command to install it. pip install -e . consider using **cookiecutter [3] References: [1]: #tldr [2]: https://www.gitignore.io [3]: https://github.com/audreyr/cookiecutter

Python Tips

Dictionaries # [1] Unpacking # [2] - **kwargs - func(**input) - locals().update(d) # [3] References: [1]: #dictionaries [2]: #unpacking [3]: #heading

remove git cruft

inspiration # [1] My original inspiration for this post came from steven ostermiller’s blog post that no longer exists from my last check in May, 2024. https://blog.ostermiller.org/removing-and-purging-files-from-git-history/ I was able to find it on the way back machine though. https://web.archive.org/web/20240222195617/https://blog.ostermiller.org/removing-and-purging-files-from-git-history/ git log --all --pretty=format: --name-only --diff-filter=D | sed -r 's|[^/]+$||g' | sort -u git filter-branch --tag-name-filter cat --index-filter 'git rm -r --cached --ignore-unmatch FILE_LIST' --prune-empty -f -- --all rm -rf .git/refs/original/ git reflog expire --expire=now --all git gc --aggressive --prune=now git push origin --force --all git push origin --force --tags cd MY_LOCAL_GIT_REPO git fetch origin git rebase git reflog expire --expire=now --all git gc --aggressive --prune=now References: [1]: #inspiration

Follow Your Passion

Example # [1] References: [1]: #example

Follow Your Passion

Follow Your Passion # [1] my journey into data science In January 2018 I started work as a full time data scientist turning my passion into a career. It is something that I didn’t see myself doing 5 years ago, but is something that I love to do. It combines my love of data, visualization, story telling, software development, and writing code. Most of all it allows me to work in a space that promotes learning and creativity. As a mechanical engineer for a company that has been building equipment for nearly a century the mechanical engineering is very well established I felt that there was not a lot of room for creativity. Find Your Role # [2] When I first started as a full time mechanical engineer References: [1]: #follow-your-passion [2]: #find-your-role

Generating Readme Tables From Pandas

Generating Readme Tables From Pandas # [1] I commonly have a need to paste the first few lines of a dataset into a markdown file. I use two handy packages to do this, tabulate and pyperclip. Lets say I have a Pandas DataFrame in memory as df already. All I would need to do to convert the first 5 rows to markdown and copy it to the clipboard is the following. from tabulate import tabulate import pyperclip md = tabulate.tabulate(df.head(), df.columns, tablefmt='pipe') pyperclip.copy(md) This is a super handy snippet that I use a lot. Folks really appreciate it when they can see a sample of the data without opening the entire file. References: [1]: #generating-readme-tables-from-pandas

Pycon 2018 Roundup

These are my notes from pycon 2018 videos. I love the python community and especially the conference talks. This year I am going to take some notes from my favorite talks and post them here. This is an Incomplete working post. Jake VanderPlas - Performance Python: Seven Strategies for Optimizing Your Numerical Code [1] # [2] - Always profile before making any optimizations. - Vectorize with Numpy - Looping in python can be slow - Use specialized data structures. - scipy.spacial - pandas - xarray - scipy.sparse - sparse package - scipy.sparce.csgraph - Cython - Add types - Numba - jit - Fortran Like Speed - heavy dependencies - Dask - distributed tasks - Can be executed locally or on a cluster - Look for an existing package - resist the urge to reinvent the wheel https://www.youtube.com/watch?v=zQeYx87mfyw Justin Crown - “WHAT IS THIS MESS?” - Writing tests for pre-existing code bases - PyCon 2018 [3] # [4] This was a great talk about not only test driven de...

`j`	Scroll down
`k`	Scroll up
`g` `g`	Scroll to top
`Shift` `G`	Scroll to bottom
`d`	Half-page down
`u`	Half-page up

`j` / `↓`	Next post (in feeds)
`k` / `↑`	Previous post (in feeds)
`Enter` / `o`	Open highlighted post
`Shift` `O`	Open in new tab
`g` `h`	Go to home
`g` `s`	Focus search
`[`	Previous page
`]`	Next page
`b`	Toggle left sidebar
`Shift` `B`	Toggle right sidebar
`s`	Toggle simple/rich feed view

`/`	Focus search input
`⌘CtrlK`	Focus search (alternative)
`y` `y`	Copy URL to clipboard
`?`	Show this help
`Esc`	Close / clear highlight