User Tools

Site Tools


pandas_notes

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
pandas_notes [2020/12/30 02:33] rajupandas_notes [2023/11/12 09:06] (current) – [links I came across] raju
Line 1: Line 1:
 +==== common pandas data types ====
 +
 +^ data type ^ description ^ supports missing values ^
 +| float | The NumPy float type | Yes |
 +| int | The NumPy integer type | No |
 +| 'Int64' | pandas nullable integer type | Yes |
 +| object | The NumPy type for storing strings (and mixed types) | |
 +| 'category' | pandas categorical type | Yes |
 +| bool | The NumPy Boolean type | No. \\ None becomes False, np.nan becomes True. |
 +| 'boolean' | pandas nullable Boolean type | Yes |
 +| datetime64[ns] | The NumPy date type | Yes (NaT) |
 +
 +Ref:- (Pandas 1.x Cookbook, by Matt Harrison and Theodore Petrou, second edition, published in 2020) -> Chapter 1 -> page-7
 +
 +
 +==== What packages does pandas depend on? ====
 +  * Dependencies - https://pandas.pydata.org/pandas-docs/stable/getting_started/install.html#dependencies
 +  * Recommended dependencies - https://pandas.pydata.org/pandas-docs/stable/getting_started/install.html#recommended-dependencies
 +  * Optional dependencies - https://pandas.pydata.org/pandas-docs/stable/getting_started/install.html#optional-dependencies
 +
 ==== links I came across ==== ==== links I came across ====
 +  * https://github.com/pandas-dev/pandas/releases - pandas release history
 +  * http://pandas.pydata.org/pandas-docs/stable/getting_started/install.html - pandas installation page. Contains instructions to install pandas in various ways.
 +  * http://pandas.pydata.org/pandas-docs/stable/ - pandas official documentation
   * https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html - good reference to learn about iloc, slicing ranges   * https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html - good reference to learn about iloc, slicing ranges
 +  * DuckDB can run SQL queries directly on Parquet files and automatically take advantage of the advanced features of the Parquet format.
 +    * https://duckdb.org/2021/06/25/querying-parquet.html - gives more details
 +    * tags | pandas, streaming
 +
 +==== EmptyDataError ====
 +Instead of
 +  pandas.io.common.EmptyDataError
 +use
 +  pandas.errors.EmptyDataError
 +
 +data point | as of pandas 1.1.2, pandas.io.common.EmptyDataError does not work.
 +
 +Ref:-
 +  * https://pandas.pydata.org/docs/whatsnew/v0.20.0.html#pandas-errors
 +  * <WRAP indent>
 +> We are adding a standard public module for all pandas exceptions & warnings pandas.errors. (GH14800). Previously these exceptions & warnings could be imported from pandas.core.common or pandas.io.common. These exceptions and warnings will be removed from the *.common locations in a future release. (GH15541)
 +</WRAP>
 +  * https://pandas.pydata.org/docs/reference/api/pandas.errors.EmptyDataError.html - documentation from latest stable version
 +  * https://pandas.pydata.org/pandas-docs/version/1.4/reference/api/pandas.errors.EmptyDataError.html - documentation from version 1.4
 +  * https://pandas.pydata.org/pandas-docs/version/0.20/generated/pandas.errors.EmptyDataError.html - documentation from version 0.20
 +
 +==== assert_frame_equal ====
 +Instead of
 +  from pandas.util.testing import assert_frame_equal
 +use
 +  from pandas.testing import assert_frame_equal
 +
 +data point:
 +
 +Using ''from pandas.util.testing import assert_frame_equal'' in pandas 1.1.2, I get
 +
 +  FutureWarning: pandas.util.testing is deprecated. Use the functions in the public API at pandas.testing instead.
  
pandas_notes.1609295604.txt.gz · Last modified: 2020/12/30 02:33 by raju