Posts tagged: kedro

All posts with the tag "kedro"

kedro catalog create

I use kedro catalog create to boost my productivity by automatically generating yaml catalog entries for me. It will create new yaml files for each pipeline, fill in missing catalog entries, and respect already existing catalog entries. It will also reformat the file and sort it by catalog key.
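
The fill-in, respect-existing, and sort behaviour described above can be sketched in plain Python. This is only an illustration of the idea, not Kedro's implementation: the function name `fill_catalog`, the dataset names, and the file path are hypothetical, and using `MemoryDataSet` as the placeholder type is an assumption.

```python
def fill_catalog(existing, dataset_names):
    """Add a placeholder for every missing dataset and sort by catalog key.

    Hypothetical helper for illustration only; not part of Kedro.
    """
    merged = dict(existing)  # copy so the caller's catalog is not mutated
    for name in dataset_names:
        # setdefault leaves entries that already exist untouched
        merged.setdefault(name, {"type": "MemoryDataSet"})  # assumed placeholder
    # dicts keep insertion order, so rebuilding from sorted items sorts the keys
    return dict(sorted(merged.items()))


# Hypothetical catalog with one hand-written entry
existing = {
    "raw_cars": {"type": "pandas.CSVDataSet", "filepath": "data/01_raw/cars.csv"},
}
# Hypothetical dataset names used by a pipeline
names = ["raw_cars", "model_input", "clean_cars"]

catalog = fill_catalog(existing, names)
for key, entry in catalog.items():
    print(key, entry)
```

The hand-written `raw_cars` entry survives unchanged, while the two missing names get placeholders that can be edited later.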

https://youtu.be/_22ELT4kja4

What is Kedro

...

Kedro-Broken-Urls

Broken Urls: https://github.com/josephhaaga) [ ] https://example.com/file.h5 https://raw.githubusercontent.com/kedro-org/kedro/develop/static/img/pipeline_visualisation.png https://example.com/file.txt https://github.com/jmespath/jmespath.py. https://github.com/tsanikgr) https://example.com/file.csv https://kedro.readthedocs.io/en/latest/04_user_guide/15_hooks.html https://kedro.readthedocs.io/en/stable/07_extend_kedro/04_hooks.html https://github.com/EbookFoundation/free-programming-books/blob/master/books/free-programming-books.md#python https://github.com/quantumblacklabs/private-kedro/blob/develop/docs/source/04_user_guide/04_data_catalog.md http://example.com/api/test https://example.com/file.parquet https://kedro.readthedocs.io/en/stable/11_faq/01_faq.html#how-do-i-upgrade-kedro https://example.com/file.xlsx https://www.datacamp.com/community/tutorials/docstrings-python https://github.com/mmchougule) https://example.com/file.tf https://kedro.readthedocs.io/en/stable/04_user_guide/13_journal.html https://example.com/test_tf http://example.com/test.csv https://www.britishcouncilfoundation.id/en/english/articles/british-and-american-english https://mattturck.com/data2020/ https://example.com/file.yaml https://kedro.readthedocs.io/en/latest/04_user_guide/08_advanced_io.html#partitioned-dataset https://example.com/file https://example.com/file.json https://kedro.readthedocs.io/en/stable/04_user_guide/08_advanced_io.html#incremental-loads-with-incrementaldataset...
1 min read

kedro Virtual Environment

Avoid serious version conflicts by using a virtual environment any time you run python. Here are three ways you can set up a kedro virtual environment.

https://youtu.be/ZSxc5VVCBhM

I prefer to use conda as my virtual environment manager as it gives me both the interpreter and the packages I install. I don’t have to rely on the system version of python or another tool to maintain python versions at all; I get everything in one tool.
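
Whichever manager you pick, it can help to confirm that an environment is actually active before installing anything. Here is a minimal sketch using only the standard library. The helper name `active_environment` is hypothetical, and the check relies on two assumptions: venv-style environments make `sys.prefix` differ from `sys.base_prefix`, and conda exports `CONDA_DEFAULT_ENV` when an environment is activated.

```python
import os
import sys


def active_environment(prefix=sys.prefix, base_prefix=sys.base_prefix, environ=os.environ):
    """Return a short label for the active environment, or None.

    Hypothetical helper for illustration; the arguments default to the
    running interpreter's values so it can be called with no arguments.
    """
    # venv and recent virtualenv point sys.prefix at the environment directory
    if prefix != base_prefix:
        return "venv"
    # conda sets this variable on activation (it is "base" for the base env)
    name = environ.get("CONDA_DEFAULT_ENV")
    if name:
        return "conda:" + name
    return None


print(active_environment())
```

If this prints `None`, you are most likely running the system interpreter.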

...

Kedro Pipeline Create

Kedro pipeline create is a command that makes creating new pipelines much easier. There is much less boilerplate that you need to write yourself.

https://youtu.be/HtyIKqlEoNw

The kedro cli comes with the following command to scaffold out new pipelines. Note that it will not add the new pipeline to your pipeline_registry (to be covered later); you will need to add it yourself.

...

Kedro Install

Kedro comes with an install command to install and manage all of your project's dependencies.

https://youtu.be/IWimEs-hHQg

You must start by having your kedro project either cloned down from an existing project or created from kedro new. Then activate your environment.

...

Kedro Git Init

Immediately after kedro new, before you run kedro install or write your first line of code, the first thing you should always do with a freshly created kedro template is git init.

https://youtu.be/IGba3ytf_6U

It's as simple as these three commands to get started.

...

Kedro New

https://youtu.be/uqiv5LAiJe0

Kedro new is simply a wrapper around the cookiecutter templating library. The kedro team maintains a ready-made template that has everything you need for a kedro project. They also maintain a few kedro starters, which are very similar to the base template.

What is Kedro

...

What is Kedro

Kedro is an unopinionated Data Engineering framework that comes with a somewhat opinionated template. It gives the user a way to build pipelines that automatically take care of io through the use of abstract DataSets that the user specifies through Catalog entries. Nodes load these Catalog entries, run them through a function, and save the results. The order in which these Nodes are executed is determined by the Pipeline, which is a DAG. It’s the runner’s job to manage the execution of the Nodes.
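
To make the moving parts concrete, here is a toy model of that flow in plain Python. It deliberately does not use Kedro's API: `node` and `run` are hypothetical stand-ins, the catalog is just an in-memory dict rather than real DataSets that load from and save to storage, and `graphlib` requires Python 3.9 or newer.

```python
from graphlib import TopologicalSorter


def node(func, inputs, output):
    """Toy stand-in for a node: a function plus the catalog keys it reads and writes."""
    return {"func": func, "inputs": inputs, "output": output}


def run(nodes, catalog):
    """Toy stand-in for a runner: execute nodes in dependency (DAG) order."""
    # Map each output name to the index of the node that produces it
    producers = {n["output"]: i for i, n in enumerate(nodes)}
    # A node depends on whichever nodes produce its inputs
    graph = {
        i: {producers[name] for name in n["inputs"] if name in producers}
        for i, n in enumerate(nodes)
    }
    for i in TopologicalSorter(graph).static_order():
        n = nodes[i]
        args = [catalog[name] for name in n["inputs"]]  # "load"
        catalog[n["output"]] = n["func"](*args)         # run, then "save"
    return catalog


def clean(raw):
    return [x for x in raw if x is not None]


def total(cleaned):
    return sum(cleaned)


# Nodes are listed out of order on purpose; the DAG decides execution order
nodes = [
    node(total, ["cleaned"], "total"),
    node(clean, ["raw"], "cleaned"),
]
result = run(nodes, {"raw": [1, None, 2, 3]})
print(result["total"])  # → 6
```

The point of the sketch is that the functions never touch io themselves; they only receive and return data, and the names in the catalog wire them together.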

https://youtu.be/Wf4rnFsaFFU

What is Kedro

...

How I Kedro

https://youtu.be/bw5_FWDVRpU

I recently switched over to using Ubuntu, and it works well pretty much out of the box for me. I am using gnome with a dark theme.

I am still using the built-in default gnome terminal; it just works. It does all the things that I need it to do. It supports transparency, renders my fonts, and allows me to highlight things well.

...

3 min read

Incremental Versioned Datasets in Kedro

Kedro versioned datasets can be mixed with incremental and partitioned datasets to do some timeseries analysis on how our dataset changes over time. Kedro is a very extensible and composable framework that allows us to build solutions from the individual components that it provides. This article is a great example of how you can combine these components in unique ways to achieve some powerful results with very little work.

What is Kedro

👆 Unsure what kedro is? Check out this post.

...