written by Eric J. Ma on 2018-07-27 | tags: open source pyjanitor data science
A new release of pyjanitor
is out!
Two new features that I have added in include:
Both of these tasks come up frequently in data preparation.
For example, concatenating a few columns together oftentimes lets us create an unique index based sample properties.
On the other hand, deconcatenating columns into multiple columns can be useful when our index is used to store metadata. (This really shouldn't be happening, but... sometimes that's just how the world works right now...)
Here's an example of how it works:
To install pyjanitor
, grab it from PyPI:
$ pip install pyjanitor
The conda-forge build will be coming soon!
@article{
ericmjl-2018-pyjanitor-released,
author = {Eric J. Ma},
title = {pyjanitor 0.3 released!},
year = {2018},
month = {07},
day = {27},
howpublished = {\url{https://ericmjl.github.io}},
journal = {Eric J. Ma's Blog},
url = {https://ericmjl.github.io/blog/2018/7/27/pyjanitor-0-3-released},
}
I send out a newsletter with tips and tools for data scientists. Come check it out at Substack.
If you would like to sponsor the coffee that goes into making my posts, please consider GitHub Sponsors!
Finally, I do free 30-minute GenAI strategy calls for teams that are looking to leverage GenAI for maximum impact. Consider booking a call on Calendly if you're interested!