mirror of
https://github.com/m-lamonaca/dev-notes.git
synced 2025-06-09 03:07:13 +00:00
Rename all file to kebab-case
This commit is contained in:
parent
ffc085d78a
commit
68ca4e4f86
117 changed files with 0 additions and 1260 deletions
328
python/libs/math/numpy.md
Normal file
328
python/libs/math/numpy.md
Normal file
|
@ -0,0 +1,328 @@
|
|||
# NumPy Lib
|
||||
|
||||
## MOST IMPORTANT ATTRIBUTES
|
||||
|
||||
```py
|
||||
array.ndim # number of axes (dimensions) of the array
|
||||
array.shape # dimensions of the array, tuple of integers
|
||||
array.size # total number of elements in the array
|
||||
array.itemsize # size in bytes of each element
|
||||
array.data # buffer containing the array elements
|
||||
```
|
||||
|
||||
## ARRAY CREATION
|
||||
|
||||
Unless explicitly specified `np.array` tries to infer a good data type for the array that it creates.
|
||||
The data type is stored in a special dtype object.
|
||||
|
||||
```py
|
||||
var = np.array(sequence) # creates array
|
||||
var = np.asarray(sequence) # convert input to array
|
||||
var = np.ndarray(*sequence) # creates multidimensional array
|
||||
var = np.asanyarray(*sequence) # convert the input to an ndarray
|
||||
# nested sequences will be converted to multidimensional array
|
||||
|
||||
var = np.zeros(ndarray.shape) # array with all zeros
|
||||
var = np.ones(ndarray.shape) # array with all ones
|
||||
var = np.empty(ndarray.shape) # array with uninitialized (arbitrary) values
|
||||
var = np.identity(n) # identity array (n x n)
|
||||
|
||||
var = np.arange(start, stop, step) # creates an array with parameters specified
|
||||
var = np.linspace(start, stop, num_of_elements) # step of elements calculated based on parameters
|
||||
```
|
||||
|
||||
## DATA TYPES FOR NDARRAYS
|
||||
|
||||
```py
|
||||
var = array.astype(np.dtype) # copy of the array, cast to a specified type
|
||||
# return TypeError if casting fails
|
||||
```
|
||||
|
||||
The numerical `dtypes` are named the same way: a type name followed by a number indicating the number of bits per element.
|
||||
|
||||
| TYPE | TYPE CODE | DESCRIPTION |
|
||||
|-----------------------------------|--------------|--------------------------------------------------------------------------------------------|
|
||||
| int8, uint8 | i1, u1 | Signed and unsigned 8-bit (1 byte) integer types |
|
||||
| int16, uint16 | i2, u2 | Signed and unsigned 16-bit integer types |
|
||||
| int32, uint32 | i4, u4 | Signed and unsigned 32-bit integer types |
|
||||
| int64, uint64 | i8, u8 | Signed and unsigned 64-bit integer types |
|
||||
| float16 | f2 | Half-precision floating point |
|
||||
| float32 | f4 or f | Standard single-precision floating point. Compatible with C float |
|
||||
| float64 | f8 or d | Standard double-precision floating point. Compatible with C double and Python float object |
|
||||
| float128 | f16 or g | Extended-precision floating point |
|
||||
| complex64, complex128, complex256 | c8, c16, c32 | Complex numbers represented by two 32, 64, or 128 floats, respectively |
|
||||
| bool | ? | Boolean type storing True and False values |
|
||||
| object | O | Python object type |
|
||||
| string_ | `S<num>` | Fixed-length string type (1 byte per character), `<num>` is string length |
|
||||
| unicode_ | `U<num>` | Fixed-length unicode type, `<num>` is length |
|
||||
|
||||
## OPERATIONS BETWEEN ARRAYS AND SCALARS
|
||||
|
||||
Any arithmetic operations between equal-size arrays applies the operation element-wise.
|
||||
|
||||
array `+` scalar --> element-wise addition (`[1, 2, 3] + 2 = [3, 4, 5]`)
|
||||
array `-` scalar --> element-wise subtraction (`[1 , 2, 3] - 2 = [-2, 0, 1]`)
|
||||
array `*` scalar --> element-wise multiplication (`[1, 2, 3] * 3 = [3, 6, 9]`)
|
||||
array / scalar --> element-wise division (`[1, 2, 3] / 2 = [0.5 , 1 , 1.5]`)
|
||||
|
||||
array_1 `+` array_2 --> element-wise addition (`[1, 2, 3] + [1, 2, 3] = [2, 4, 6]`)
|
||||
array_1 `-` array_2 --> element-wise subtraction (`[1, 2, 4] - [3 , 2, 1] = [-2, 0, 2]`)
|
||||
array_1 `*` array_2 --> element-wise multiplication (`[1, 2, 3] * [3, 2, 1] = [3, 4, 3]`)
|
||||
array_1 `/` array_2 --> element-wise division (`[1, 2, 3] / [3, 2, 1] = [0.33, 1, 3]`)
|
||||
|
||||
## SHAPE MANIPULATION
|
||||
|
||||
```py
|
||||
np.reshape(array, new_shape) # changes the shape of the array
|
||||
np.ravel(array) # returns the array flattened
|
||||
array.resize(shape) # modifies the array itself
|
||||
array.T # returns the array transposed
|
||||
np.transpose(array) # returns the array transposed
|
||||
np.swapaxes(array, first_axis, second_axis) # interchange two axes of an array
|
||||
# if array is an ndarray, then a view of it is returned; otherwise a new array is created
|
||||
```
|
||||
|
||||
## JOINING ARRAYS
|
||||
|
||||
```py
|
||||
np.vstack((array1, array2)) # takes tuple, vertical stack of arrays (column wise)
|
||||
np.hstack((array1, array2)) # takes a tuple, horizontal stack of arrays (row wise)
|
||||
np.dstack((array1, array2)) # takes a tuple, depth wise stack of arrays (3rd dimension)
|
||||
np.stack(*arrays, axis) # joins a sequence of arrays along a new axis (axis is an int)
|
||||
np.concatenate((array1, array2, ...), axis) # joins a sequence of arrays along an existing axis (axis is an int)
|
||||
```
|
||||
|
||||
## SPLITTING ARRAYS
|
||||
|
||||
```py
|
||||
np.split(array, indices) # splits an array into equally long sub-arrays (indices is int), if not possible raises error
|
||||
np.vsplit(array, indices) # splits an array equally into sub-arrays vertically (row wise) if not possible raises error
|
||||
np.hsplit(array, indices) # splits an array equally into sub-arrays horizontally (column wise) if not possible raises error
|
||||
np.dsplit(array, indices) # splits an array into equally sub-arrays along the 3rd axis (depth) if not possible raises error
|
||||
np.array_split(array, indices) # splits an array into sub-arrays, arrays can be of different lengths
|
||||
```
|
||||
|
||||
## VIEW()
|
||||
|
||||
```py
|
||||
var = array.view() # creates a new array that looks at the same data
|
||||
# slicing returns a view
|
||||
# view shapes are separated but assignment changes all arrays
|
||||
```
|
||||
|
||||
## COPY()
|
||||
|
||||
```py
|
||||
var = array.copy() # creates a deep copy of the array
|
||||
```
|
||||
|
||||
## INDEXING, SLICING, ITERATING
|
||||
|
||||
1-dimensional --> sliced, iterated and indexed as standard
|
||||
n-dimensional --> one index per axis, index given in tuple separated by commas `[i, j] (i, j)`
|
||||
dots (`...`) represent as many colons as needed to produce complete indexing tuple
|
||||
|
||||
- `x[1, 2, ...] == [1, 2, :, :, :]`
|
||||
- `x[..., 3] == [:, :, :, :, 3]`
|
||||
- `x[4, ..., 5, :] == [4, :, :, 5, :]`
|
||||
iteration on first index, use .flat() to iterate over each element
|
||||
- `x[*bool]` returns row with corresponding True index
|
||||
- `x[condition]` return only elements that satisfy condition
|
||||
- x`[[*index]]` return rows ordered by indexes
|
||||
- `x[[*i], [*j]]` return elements selected by tuple (i, j)
|
||||
- `x[ np.ix_( [*i], [*j] ) ]` return rectangular region
|
||||
|
||||
## UNIVERSAL FUNCTIONS (ufunc)
|
||||
|
||||
Functions that perform element-wise operations (vectorization).
|
||||
|
||||
```py
|
||||
np.abs(array) # vectorized abs(), return element absolute value
|
||||
np.fabs(array) # faster abs() for non-complex values
|
||||
np.sqrt(array) # vectorized square root (x^0.5)
|
||||
np.square(array) # vectorized square (x^2)
|
||||
np.exp(array) # vectorized natural exponentiation (e^x)
|
||||
np.log(array) # vectorized natural log(x)
|
||||
np.log10(array) # vectorized log10(x)
|
||||
np.log2(array) # vectorized log2(x)
|
||||
np.log1p(array) # vectorized log(1 + x)
|
||||
np.sign(array) # vectorized sign (1, 0, -1)
|
||||
np.ceil(array) # vectorized ceil()
|
||||
np.floor(array) # vectorized floor()
|
||||
np.rint(array) # vectorized round() to nearest int
|
||||
np.modf(array) # vectorized divmod(), returns the fractional and integral parts of element
|
||||
np.isnan(array) # vectorized x == NaN, return boolean array
|
||||
np.isinf(array) # vectorized test for positive or negative infinity, return boolean array
|
||||
np.isfinite(array) # vectorized test for finiteness, returns boolean array
|
||||
np.cos(array) # vectorized cos(x)
|
||||
np.sin(array) # vectorized sin(x)
|
||||
np.tan(array) # vectorized tan(x)
|
||||
np.cosh(array) # vectorized cosh(x)
|
||||
np.sinh(array) # vectorized sinh(x)
|
||||
np.tanh(array) # vectorized tanh(x)
|
||||
np.arccos(array) # vectorized arccos(x)
|
||||
np.arcsinh(array) # vectorized arcsinh(x)
|
||||
np.arctan(array) # vectorized arctan(x)
|
||||
np.arccosh(array) # vectorized arccosh(x)
|
||||
np.arcsin(array) # vectorized arcsin(x)
|
||||
np.arctanh(array) # vectorized arctanh(x)
|
||||
np.logical_not(array) # vectorized not(x), equivalent to ~array
|
||||
|
||||
np.add(x_array, y_array) # vectorized addition
|
||||
np.subtract(x_array, y_array) # vectorized subtraction
|
||||
np.multiply(x_array, y_array) # vectorized multiplication
|
||||
np.divide(x_array, y_array) # vectorized division
|
||||
np.floor_divide(x_array, y_array) # vectorized floor division
|
||||
np.power(x_array, y_array) # vectorized power
|
||||
np.maximum(x_array, y_array) # vectorized maximum
|
||||
np.minimum(x_array, y_array) # vectorized minimum
|
||||
np.fmax(x_array, y_array) # vectorized maximum, ignores NaN
|
||||
np.fmin(x_array, y_array) # vectorized minimum, ignores NaN
|
||||
np.mod(x_array, y_array) # vectorized modulus
|
||||
np.copysign(x_array, y_array) # vectorized copy sign from y_array to x_array
|
||||
np.greater(x_array, y_array) # vectorized x > y
|
||||
np.less(x_array, y_array) # vectorized x < y
|
||||
np.greater_equal(x_array, y_array) # vectorized x >= y
|
||||
np.less_equal(x_array, y_array) # vectorized x <= y
|
||||
np.equal(x_array, y_array) # vectorized x == y
|
||||
np.not_equal(x_array, y_array) # vectorized x != y
|
||||
np.logical_and(x_array, y_array) # vectorized x & y
|
||||
np.logical_or(x_array, y_array) # vectorized x | y
|
||||
np.logical_xor(x_array, y_array) # vectorized x ^ y
|
||||
```
|
||||
|
||||
## CONDITIONAL LOGIC AS ARRAY OPERATIONS
|
||||
|
||||
```py
|
||||
np.where(condition, x, y) # return x if condition == True, y otherwise
|
||||
```
|
||||
|
||||
## MATHEMATICAL AND STATISTICAL METHODS
|
||||
|
||||
`np.method(array, args)` or `array.method(args)`.
|
||||
Boolean values are coerced to 1 (`True`) and 0 (`False`).
|
||||
|
||||
```py
|
||||
np.sum(array, axis=None) # sum of array elements over a given axis
|
||||
np.median(array, axis=None) # median along the specified axis
|
||||
np.mean(array, axis=None) # arithmetic mean along the specified axis
|
||||
np.average(array, axis=None) # weighted average along the specified axis
|
||||
np.std(array, axis=None) # standard deviation along the specified axis
|
||||
np.var(array, axis=None) # variance along the specified axis
|
||||
|
||||
np.min(array, axis=None) # minimum value along the specified axis
|
||||
np.max(array, axis=None) # maximum value along the specified axis
|
||||
np.argmin(array, axis=None) # indices of the minimum values along an axis
|
||||
np.argmax(array, axis=None) # indices of the maximum values
|
||||
np.cumsum(array, axis=None) # cumulative sum of the elements along a given axis
|
||||
np.cumprod(array, axis=None) # cumulative product of the elements along a given axis
|
||||
```
|
||||
|
||||
## METHODS FOR BOOLEAN ARRAYS
|
||||
|
||||
```py
|
||||
np.all(array, axis=None) # test whether all array elements along a given axis evaluate to True
|
||||
np.any(array, axis=None) # test whether any array element along a given axis evaluates to True
|
||||
```
|
||||
|
||||
## SORTING
|
||||
|
||||
```py
|
||||
array.sort(axis=-1) # sort an array in-place (axis = None applies on flattened array)
|
||||
np.sort(array, axis=-1) # return a sorted copy of an array (axis = None applies on flattened array)
|
||||
```
|
||||
|
||||
## SET LOGIC
|
||||
|
||||
```py
|
||||
np.unique(array) # sorted unique elements of an array
|
||||
np.intersect1d(x, y) # sorted common elements in x and y
|
||||
np.union1d(x, y) # sorted union of elements
|
||||
np.in1d(x, y) # boolean array indicating whether each element of x is contained in y
|
||||
np.setdiff1d(x, y) # Set difference, elements in x that are not in y
|
||||
np.setxor1d() # Set symmetric differences; elements that are in either of the arrays, but not both
|
||||
```
|
||||
|
||||
## FILE I/O WITH ARRAYS
|
||||
|
||||
```py
|
||||
np.save(file, array) # save array to binary file in .npy format
|
||||
np.savez(file, *array) # save several arrays into a single file in uncompressed .npz format
|
||||
np.savez_compressed(file, *args, *kwargs) # save several arrays into a single file in compressed .npz format
|
||||
# *ARGS: arrays to save to the file. arrays will be saved with names "arr_0", "arr_1", and so on
|
||||
# **KWARGS: arrays to save to the file. arrays will be saved in the file with the keyword names
|
||||
|
||||
np.savetxt(file, X, fmt="%.18e", delimiter=" ") # save array to text file
|
||||
# X: 1D or 2D
|
||||
# FMT: Python Format Specification Mini-Language
|
||||
# DELIMITER: {str} -- string used to separate values
|
||||
|
||||
np.load(file, allow_pickle=False) # load arrays or pickled objects from .npy, .npz or pickled files
|
||||
np.loadtxt(file, dtype=float, comments="#", delimiter=None)
|
||||
# DTYPE: {data type} -- data-type of the resulting array
|
||||
# COMMENTS: {str} -- characters used to indicate the start of a comment. None implies no comments
|
||||
# DELIMITER: {str} -- string used to separate values
|
||||
```
|
||||
|
||||
## LINEAR ALGEBRA
|
||||
|
||||
```py
|
||||
np.diag(array, k=0) # extract a diagonal or construct a diagonal array
|
||||
# K: {int} -- k>0 diagonals above main diagonal, k<0 diagonals below main diagonal (main diagonal k = 0)
|
||||
|
||||
np.dot(x ,y) # matrix dot product
|
||||
np.trace(array, offset=0, dtype=None, out=None) # return the sum along diagonals of the array
|
||||
# OFFSET: {int} -- offset of the diagonal from the main diagonal
|
||||
# dtype: {dtype} -- determines the data-type of the returned array
|
||||
# OUT: {ndarray} -- array into which the output is placed
|
||||
|
||||
np.linalg.det(A) # compute the determinant of an array
|
||||
np.linalg.eig(A) # compute the eigenvalues and right eigenvectors of a square array
|
||||
np.linalg.inv(A) # compute the (multiplicative) inverse of a matrix
|
||||
# A_inv satisfies dot(A, A_inv) = dot(A_inv, A) = eye(A.shape[0])
|
||||
|
||||
np.linalg.pinv(A) # compute the (Moore-Penrose) pseudo-inverse of a matrix
|
||||
np.linalg.qr() # factor the matrix a as qr, where q is orthonormal and r is upper-triangular
|
||||
np.linalg.svd(A) # Singular Value Decomposition
|
||||
np.linalg.solve(A, B) # solve a linear matrix equation, or system of linear scalar equations AX = B
|
||||
np.linalg.lstsq(A, B) # return the least-squares solution to a linear matrix equation AX = B
|
||||
```
|
||||
|
||||
## RANDOM NUMBER GENERATION
|
||||
|
||||
```py
|
||||
np.random.seed()
|
||||
np.random.rand()
|
||||
np.random.randn()
|
||||
np.random.randint()
|
||||
np.random.Generator.permutation(x) # randomly permute a sequence, or return a permuted range
|
||||
np.random.Generator.shuffle(x) # Modify a sequence in-place by shuffling its contents
|
||||
|
||||
np.random.Generator.beta(a, b, size=None) # draw samples from a Beta distribution
|
||||
# A: {float, array floats} -- Alpha, > 0
|
||||
# B: {int, tuple ints} -- Beta, > 0
|
||||
|
||||
np.random.Generator.binomial(n, p, size=None) # draw samples from a binomial distribution
|
||||
# N: {int, array ints} -- parameter of the distribution, >= 0
|
||||
# P: {float, array floats} -- Parameter of the distribution, >= 0 and <= 1
|
||||
|
||||
np.random.Generator.chisquare(df, size=None)
|
||||
# DF: {float, array floats} -- degrees of freedom, > 0
|
||||
|
||||
np.random.Generator.gamma(shape, scale=1.0, size=None) # draw samples from a Gamma distribution
|
||||
# SHAPE: {float, array floats} -- shape of the gamma distribution, != 0
|
||||
|
||||
np.random.Generator.normal(loc=0.0, scale=1.0, size=None) # draw random samples from a normal (Gaussian) distribution
|
||||
# LOC: {float, all floats} -- mean ("centre") of distribution
|
||||
# SCALE: {float, all floats} -- standard deviation of distribution, != 0
|
||||
|
||||
np.random.Generator.poisson(lam=1.0, size=None) # draw samples from a Poisson distribution
|
||||
# LAM: {float, all floats} -- expectation of interval, >= 0
|
||||
|
||||
np.random.Generator.uniform(low=0.0,high=1.0, size=None) # draw samples from a uniform distribution
|
||||
# LOW: {float, all floats} -- lower boundary of the output interval
|
||||
# HIGH: {float, all floats} -- upper boundary of the output interval
|
||||
|
||||
np.random.Generator.zipf(a, size=None) # draw samples from a Zipf distribution
|
||||
# A: {float, all floats} -- distribution parameter, > 1
|
||||
```
|
646
python/libs/math/pandas.md
Normal file
646
python/libs/math/pandas.md
Normal file
|
@ -0,0 +1,646 @@
|
|||
# Pandas
|
||||
|
||||
## Basic Pandas Imports
|
||||
|
||||
```py
|
||||
import numpy as np
|
||||
import pandas as pd
|
||||
from pandas import Series, DataFrame
|
||||
```
|
||||
|
||||
## SERIES
|
||||
|
||||
1-dimensional labelled array, axis label referred as INDEX.
|
||||
Index can contain repetitions.
|
||||
|
||||
```py
|
||||
s = Series(data, index=index, name='name')
|
||||
# DATA: {python dict, ndarray, scalar value}
|
||||
# NAME: {string}
|
||||
s = Series(dict) # Series created from python dict, dict keys become index values
|
||||
```
|
||||
|
||||
### INDEXING / SELECTION / SLICING
|
||||
|
||||
```py
|
||||
s['index'] # selection by index label
|
||||
s[condition] # return slice selected by condition
|
||||
s[ : ] # slice endpoint included
|
||||
s[ : ] = *value # modify value of entire slice
|
||||
s[condition] = *value # modify slice by condition
|
||||
```
|
||||
|
||||
## MISSING DATA
|
||||
|
||||
Missing data appears as NaN (Not a Number).
|
||||
|
||||
```py
|
||||
pd.isnull(array) # return a Series index-bool indicating which indexes don't have data
|
||||
pd.notnull(array) # return a Series index-bool indicating which indexes have data
|
||||
array.isnull()
|
||||
array.notnull()
|
||||
```
|
||||
|
||||
### SERIES ATTRIBUTES
|
||||
|
||||
```py
|
||||
s.values # NumPy representation of Series
|
||||
s.index # index object of Series
|
||||
s.name = "Series name" # renames Series object
|
||||
s.index.name = "index name" # renames index
|
||||
```
|
||||
|
||||
### SERIES METHODS
|
||||
|
||||
```py
|
||||
pd.Series.isin(self, values) # boolean Series showing whether elements in Series matches elements in values exactly
|
||||
|
||||
# Conform Series to new index, new object produced unless the new index is equivalent to current one and copy=False
|
||||
pd.Series.reindex(self, index=None, **kwargs)
|
||||
# INDEX: {array} -- new labels / index
|
||||
# METHOD: {none (don't fill gaps), pad (fill or carry values forward), backfill (fill or carry values backward)}-- hole filling method
|
||||
# COPY: {bool} -- return new object even if index is same -- DEFAULT True
|
||||
# FILLVALUE: {scalar} --value to use for missing values. DEFAULT NaN
|
||||
|
||||
pd.Series.drop(self, index=None, **kwargs) # return Series with specified index labels removed
|
||||
# INPLACE: {bool} -- if true do operation in place and return None -- DEFAULT False
|
||||
# ERRORS: {ignore, raise} -- If "ignore", suppress error and existing labels are dropped
|
||||
# KeyError raised if not all of the labels are found in the selected axis
|
||||
|
||||
pd.Series.value_counts(self, normalize=False, sort=True, ascending=False, bins=None, dropna=True)
|
||||
# NORMALIZE: {bool} -- if True then object returned will contain relative frequencies of unique values
|
||||
# SORT: {bool} -- sort by frequency -- DEFAULT True
|
||||
# ASCENDING: {bool} -- sort in ascending order -- DEFAULT False
|
||||
# BINS: {int} -- group values into half-open bins, only works with numeric data
|
||||
# DROPNA: {bool} -- don't include counts of NaN
|
||||
```
|
||||
|
||||
## DATAFRAME
|
||||
|
||||
2-dimensional labeled data structure with columns of potentially different types.
|
||||
Index and columns can contain repetitions.
|
||||
|
||||
```py
|
||||
df = DataFrame(data, index=row_labels, columns=column_labels)
|
||||
# DATA: {list, dict (of lists), nested dicts, series, dict of 1D ndarray, 2D ndarray, DataFrame}
|
||||
# INDEX: {list of row_labels}
|
||||
# COLUMNS: {list of column_labels}
|
||||
# outer dict keys interpreted as index labels, inner dict keys interpreted as column labels
|
||||
# INDEXING / SELECTION / SLICING
|
||||
df[col] # column selection
|
||||
df.at[row, col] # access a single value for a row/column label pair
|
||||
df.iat[row, col] # access a single value for a row/column pair by integer position
|
||||
|
||||
df.column_label # column selection
|
||||
|
||||
df.loc[label] # row selection by label
|
||||
df.iloc[loc] # row selection by integer location
|
||||
|
||||
df[ : ] # slice rows
|
||||
df[bool_vec] # slice rows by boolean vector
|
||||
df[condition] # slice rows by condition
|
||||
|
||||
df.loc[:, ["column_1", "column_2"]] # slice columns by names
|
||||
df.loc[:, [bool_vector]] # slice columns by boolean vector
|
||||
|
||||
df[col] = *value # modify column contents, if column is missing it will be created
|
||||
df[ : ] = *value # modify rows contents
|
||||
df[condition] = *value # modify contents
|
||||
|
||||
del df[col] # delete column
|
||||
```
|
||||
|
||||
### DATAFRAME ATTRIBUTES
|
||||
|
||||
```py
|
||||
df.index # row labels
|
||||
df.columns # column labels
|
||||
df.values # NumPy representation of DataFrame
|
||||
df.index.name = "index name"
|
||||
df.columns.index.name = "columns name"
|
||||
df.T # transpose
|
||||
```
|
||||
|
||||
### DATAFRAME METHODS
|
||||
|
||||
```py
|
||||
pd.DataFrame.isin(self , values) # boolean DataFrame showing whether elements in DataFrame matches elements in values exactly
|
||||
|
||||
# Conform DataFrame to new index, new object produced unless the new index is equivalent to current one and copy=False
|
||||
pd.DataFrame.reindex(self, index=None, columns=None, **kwargs)
|
||||
# INDEX: {array} -- new labels / index
|
||||
# COLUMNS: {array} -- new labels / columns
|
||||
# METHOD: {none (don't fill gaps), pad (fill or carry values forward), backfill (fill or carry values backward)}-- hole filling method
|
||||
# COPY: {bool} -- return new object even if index is same -- DEFAULT True
|
||||
# FILLVALUE: {scalar} --value to use for missing values. DEFAULT NaN
|
||||
|
||||
pd.DataFrame.drop(self, index=None, columns=None, **kwargs) # Remove rows or columns by specifying label names
|
||||
# INPLACE: {bool} -- if true do operation in place and return None -- DEFAULT False
|
||||
# ERRORS: {ignore, raise} -- If "ignore", suppress error and existing labels are dropped
|
||||
# KeyError raised if not all of the labels are found in the selected axis
|
||||
```
|
||||
|
||||
## INDEX OBJECTS
|
||||
|
||||
Holds axis labels and metadata, immutable.
|
||||
|
||||
### INDEX TYPES
|
||||
|
||||
```py
|
||||
pd.Index # immutable ordered ndarray, sliceable. stores axis labels
|
||||
pd.Int64Index # special case of Index with purely integer labels
|
||||
pd.MultiIndex # multi-level (hierarchical) index object for pandas objects
|
||||
pd.PeriodIndex # immutable ndarray holding ordinal values indicating regular periods in time
|
||||
pd.DatetimeIndex # nanosecond timestamps (uses Numpy datetime64)
|
||||
```
|
||||
|
||||
### INDEX ATTRIBUTES
|
||||
|
||||
```py
|
||||
pd.Index.is_monotonic_increasing # Return True if the index is monotonic increasing (only equal or increasing) values
|
||||
pd.Index.is_monotonic_decreasing # Return True if the index is monotonic decreasing (only equal or decreasing) values
|
||||
pd.Index.is_unique # Return True if the index has unique values.
|
||||
pd.Index.hasnans # Return True if the index has NaNs
|
||||
```
|
||||
|
||||
### INDEX METHODS
|
||||
|
||||
```py
|
||||
pd.Index.append(self, other) # append a collection of Index options together
|
||||
|
||||
pd.Index.difference(self, other, sort=None) # set difference of two Index objects
|
||||
# SORT: {None (attempt sorting), False (don't sort)}
|
||||
|
||||
pd.Index.intersection(self, other, sort=None) # set intersection of two Index objects
|
||||
# SORT: {None (attempt sorting), False (don't sort)}
|
||||
|
||||
pd.Index.union(self, other, sort=None) # set union of two Index objects
|
||||
# SORT: {None (attempt sorting), False (don't sort)}
|
||||
|
||||
pd.Index.isin(self, values, level=None) # boolean array indicating where the index values are in values
|
||||
pd.Index.insert(self, loc, item) # make new Index inserting new item at location
|
||||
pd.Index.delete(self, loc) # make new Index with passed location(-s) deleted
|
||||
|
||||
pd.Index.drop(self, labels, errors='raise') # Make new Index with passed list of labels deleted
|
||||
# ERRORS: {ignore, raise} -- If 'ignore', suppress error and existing labels are dropped
|
||||
# KeyError raised if not all of the labels are found in the selected axis
|
||||
|
||||
pd.Index.reindex(self, target, **kwargs) # create index with target's values (move/add/delete values as necessary)
|
||||
# METHOD: {none (don't fill gaps), pad (fill or carry values forward), backfill (fill or carry values backward)}-- hole filling method
|
||||
```
|
||||
|
||||
## ARITHMETIC OPERATIONS
|
||||
|
||||
NumPy arrays operations preserve labels-value link.
|
||||
Arithmetic operations automatically align differently indexed data.
|
||||
Missing values propagate in arithmetic computations (NaN `<operator>` value = NaN)
|
||||
|
||||
### ADDITION
|
||||
|
||||
```py
|
||||
self + other
|
||||
pd.Series.add(self, other, fill_value=None) # add(), supports substitution of NaNs
|
||||
pd.Series.radd(self, other, fill_value=None) # radd(), supports substitution of NaNs
|
||||
pd.DataFrame.add(self, other, axis=columns, fill_value=None) # add(), supports substitution of NaNs
|
||||
pd.DataFrame.radd(self, other, axis=columns, fill_value=None) # radd(), supports substitution of NaNs
|
||||
# OTHER: {scalar, sequence, Series, DataFrame}
|
||||
# AXIS: {0, 1, index, columns} -- whether to compare by the index or columns
|
||||
# FILLVALUE: {None, float} -- fill missing value
|
||||
```
|
||||
|
||||
### SUBTRACTION
|
||||
|
||||
```py
|
||||
self - other
|
||||
pd.Series.sub(self, other, fill_value=None) # sub(), supports substitution of NaNs
|
||||
pd.Series.rsub(self, other, fill_value=None) # rsub(), supports substitution of NaNs
|
||||
pd.DataFrame.sub(self, other, axis=columns, fill_value=None) # sub(), supports substitution of NaNs
|
||||
pd.DataFrame.rsub(self, other, axis=columns, fill_value=None) # rsub(), supports substitution of NaNs
|
||||
# OTHER: {scalar, sequence, Series, DataFrame}
|
||||
# AXIS: {0, 1, index, columns} -- whether to compare by the index or columns
|
||||
# FILLVALUE: {None, float} -- fill missing value
|
||||
```
|
||||
|
||||
### MULTIPLICATION
|
||||
|
||||
```py
|
||||
self * other
|
||||
pd.Series.mul(self, other, fill_value=None) # mul(), supports substitution of NaNs
|
||||
pd.Series.rmul(self, other, fill_value=None) # rmul(), supports substitution of NaNs
|
||||
pd.DataFrame.mul(self, other, axis=columns, fill_value=None) # mul(), supports substitution of NaNs
|
||||
pd.DataFrame.rmul(self, other, axis=columns, fill_value=None) # rmul(), supports substitution of NaNs
|
||||
# OTHER: {scalar, sequence, Series, DataFrame}
|
||||
# AXIS: {0, 1, index, columns} -- whether to compare by the index or columns
|
||||
# FILLVALUE: {None, float} -- fill missing value
|
||||
```
|
||||
|
||||
### DIVISION (float division)
|
||||
|
||||
```py
|
||||
self / other
|
||||
pd.Series.div(self, other, fill_value=None) # div(), supports substitution of NaNs
|
||||
pd.Series.rdiv(self, other, fill_value=None) # rdiv(), supports substitution of NaNs
|
||||
pd.Series.truediv(self, other, fill_value=None) # truediv(), supports substitution of NaNs
|
||||
pd.Series.rtruediv(self, other, fill_value=None) # rtruediv(), supports substitution of NaNs
|
||||
pd.DataFrame.div(self, other, axis=columns, fill_value=None) # div(), supports substitution of NaNs
|
||||
pd.DataFrame.rdiv(self, other, axis=columns, fill_value=None) # rdiv(), supports substitution of NaNs
|
||||
pd.DataFrame.truediv(self, other, axis=columns, fill_value=None) # truediv(), supports substitution of NaNs
|
||||
pd.DataFrame.rtruediv(self, other, axis=columns, fill_value=None) # rtruediv(), supports substitution of NaNs
|
||||
# OTHER: {scalar, sequence, Series, DataFrame}
|
||||
# AXIS: {0, 1, index, columns} -- whether to compare by the index or columns
|
||||
# FILLVALUE: {None, float} -- fill missing value
|
||||
```
|
||||
|
||||
### FLOOR DIVISION
|
||||
|
||||
```py
|
||||
self // other
|
||||
pd.Series.floordiv(self, other, fill_value=None) # floordiv(), supports substitution of NaNs
|
||||
pd.Series.rfloordiv(self, other, fill_value=None) # rfloordiv(), supports substitution of NaNs
|
||||
pd.DataFrame.floordiv(self, other, axis=columns, fill_value=None) # floordiv(), supports substitution of NaNs
|
||||
pd.DataFrame.rfloordiv(self, other, axis=columns, fill_value=None) # rfloordiv(), supports substitution of NaNs
|
||||
# OTHER: {scalar, sequence, Series, DataFrame}
|
||||
# AXIS: {0, 1, index, columns} -- whether to compare by the index or columns
|
||||
# FILLVALUE: {None, float} -- fill missing value
|
||||
```
|
||||
|
||||
### MODULO
|
||||
|
||||
```py
|
||||
self % other
|
||||
pd.Series.mod(self, other, fill_value=None) # mod(), supports substitution of NaNs
|
||||
pd.Series.rmod(self, other, fill_value=None) # rmod(), supports substitution of NaNs
|
||||
pd.DataFrame.mod(self, other, axis=columns, fill_value=None) # mod(), supports substitution of NaNs
|
||||
pd.DataFrame.rmod(self, other, axis=columns, fill_value=None) # rmod(), supports substitution of NaNs
|
||||
# OTHER: {scalar, sequence, Series, DataFrame}
|
||||
# AXIS: {0, 1, index, columns} -- whether to compare by the index or columns
|
||||
# FILLVALUE: {None, float} -- fill missing value
|
||||
```
|
||||
|
||||
### POWER
|
||||
|
||||
```py
|
||||
self ** other
|
||||
pd.Series.pow(self, other, fill_value=None) # pow(), supports substitution of NaNs
|
||||
pd.Series.rpow(self, other, fill_value=None) # rpow(), supports substitution of NaNs
|
||||
pd.DataFrame.pow(self, other, axis=columns, fill_value=None) # pow(), supports substitution of NaNs
|
||||
pd.DataFrame.rpow(self, other, axis=columns, fill_value=None) # rpow(), supports substitution of NaNs
|
||||
# OTHER: {scalar, sequence, Series, DataFrame}
|
||||
# AXIS: {0, 1, index, columns} -- whether to compare by the index or columns
|
||||
# FILLVALUE: {None, float} -- fill missing value
|
||||
```
|
||||
|
||||
## ESSENTIAL FUNCTIONALITY
|
||||
|
||||
### FUNCTION APPLICATION AND MAPPING
|
||||
|
||||
NumPy ufuncs work fine with pandas objects.
|
||||
|
||||
```py
|
||||
pd.DataFrame.applymap(self, func) # apply function element-wise
|
||||
|
||||
pd.DataFrame.apply(self, func, axis=0, args=()) # apply a function along an axis of a DataFrame
|
||||
# FUNC: {function} -- function to apply
|
||||
# AXIS: {0, 1, index, columns} -- axis along which the function is applied
|
||||
# ARGS: {tuple} -- positional arguments to pass to func in addition to the array/series
|
||||
# SORTING AND RANKING
|
||||
pd.Series.sort_index(self, ascending=True **kwargs) # sort Series by index labels
|
||||
pd.Series.sort_values(self, ascending=True, **kwargs) # sort series by the values
|
||||
# ASCENDING: {bool} -- if True, sort values in ascending order, otherwise descending -- DEFAULT True
|
||||
# INPLACE: {bool} -- if True, perform operation in-place
|
||||
# KIND: {quicksort, mergesort, heapsort} -- sorting algorithm
|
||||
# NA_POSITION {first, last} -- 'first' puts NaNs at the beginning, 'last' puts NaNs at the end
|
||||
|
||||
pd.DataFrame.sort_index(self, axis=0, ascending=True, **kwargs) # sort object by labels along an axis
|
||||
pd.DataFrame.sort_values(self, axis=0, ascending=True, **kwargs) # sort object by values along an axis
|
||||
# AXIS: {0, 1, index, columns} -- the axis along which to sort
|
||||
# ASCENDING: {bool} -- if True, sort values in ascending order, otherwise descending -- DEFAULT True
|
||||
# INPLACE: {bool} -- if True, perform operation in-place
|
||||
# KIND: {quicksort, mergesort, heapsort} -- sorting algorithm
|
||||
# NA_POSITION {first, last} -- 'first' puts NaNs at the beginning, 'last' puts NaNs at the end
|
||||
```
|
||||
|
||||
## DESCRIPTIVE AND SUMMARY STATISTICS
|
||||
|
||||
### COUNT
|
||||
|
||||
```py
|
||||
pd.Series.count(self) # return number of non-NA/null observations in the Series
|
||||
pd.DataFrame.count(self, numeric_only=False) # count non-NA cells for each column or row
|
||||
# NUMERIC_ONLY: {bool} -- Include only float, int or boolean data -- DEFAULT False
|
||||
```
|
||||
|
||||
### DESCRIBE
|
||||
|
||||
Generate descriptive statistics summarizing central tendency, dispersion and shape of dataset's distribution (exclude NaN).
|
||||
|
||||
```py
|
||||
pd.Series.describe(self, percentiles=None, include=None, exclude=None)
|
||||
pd.DataFrame.describe(self, percentiles=None, include=None, exclude=None)
|
||||
# PERCENTILES: {list-like of numbers} -- percentiles to include in output, between 0 and 1 -- DEFAULT [.25, .5, .75]
|
||||
# INCLUDE: {all, None, list of dtypes} -- white list of dtypes to include in the result. ignored for Series
|
||||
# EXCLUDE: {None, list of dtypes} -- black list of dtypes to omit from the result. ignored for Series
|
||||
```
|
||||
|
||||
### MAX - MIN
|
||||
|
||||
```py
|
||||
pd.Series.max(self, skipna=None, numeric_only=None) # maximum of the values for the requested axis
|
||||
pd.Series.min(self, skipna=None, numeric_only=None) # minimum of the values for the requested axis
|
||||
pd.DataFrame.max(self, axis=None, skipna=None, numeric_only=None) # maximum of the values for the requested axis
|
||||
pd.DataFrame.min(self, axis=None, skipna=None, numeric_only=None) # minimum of the values for the requested axis
|
||||
# SKIPNA: {bool} -- exclude NA/null values when computing the result
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
```
|
||||
|
||||
### IDXMAX - IDXMIN
|
||||
|
||||
```py
|
||||
pd.Series.idxmax(self, skipna=True) # row label of the maximum value
|
||||
pd.Series.idxmin(self, skipna=True) # row label of the minimum value
|
||||
pd.DataFrame.idxmax(self, axis=0, skipna=True) # Return index of first occurrence of maximum over requested axis
|
||||
pd.DataFrame.idxmin(self, axis=0, skipna=True) # Return index of first occurrence of minimum over requested axis
|
||||
# AXIS:{0, 1, index, columns} -- row-wise or column-wise
|
||||
# SKIPNA: {bool} -- exclude NA/null values. if an entire row/column is NA, result will be NA
|
||||
```
|
||||
|
||||
### QUANTILE
|
||||
|
||||
```py
|
||||
pd.Series.quantile(self, q=0.5, interpolation='linear') # return values at the given quantile
|
||||
pd.DataFrame.quantile(self, q=0.5, axis=0, numeric_only=True, interpolation='linear') # return values at the given quantile over requested axis
|
||||
# Q: {float, array} -- value between 0 <= q <= 1, the quantile(s) to compute -- DEFAULT 0.5 (50%)
|
||||
# NUMERIC_ONLY: {bool} -- if False, quantile of datetime and timedelta data will be computed as well
|
||||
# INTERPOLATION: {linear, lower, higher, midpoint, nearest} -- SEE DOCS
|
||||
```
|
||||
|
||||
### SUM
|
||||
|
||||
```py
|
||||
pd.Series.sum(self, skipna=None, numeric_only=None, min_count=0) # sum of the values
|
||||
pd.DataFrame.sum(self, axis=None, skipna=None, numeric_only=None, min_count=0) # sum of the values for the requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values when computing the result
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
# MIN_COUNT: {int} -- required number of valid values to perform the operation. if fewer than min_count non-NA values are present the result will be NA
|
||||
```
|
||||
|
||||
### MEAN
|
||||
|
||||
```py
|
||||
pd.Series.mean(self, skipna=None, numeric_only=None) # mean of the values
|
||||
pd.DataFrame.mean(self, axis=None, skipna=None, numeric_only=None) # mean of the values for the requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values when computing the result
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
```
|
||||
|
||||
### MEDIAN
|
||||
|
||||
```py
|
||||
pd.Series.median(self, skipna=None, numeric_only=None) # median of the values
|
||||
pd.DataFrame.median(self, axis=None, skipna=None, numeric_only=None) # median of the values for the requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values when computing the result
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
```
|
||||
|
||||
### MAD (mean absolute deviation)
|
||||
|
||||
```py
|
||||
pd.Series.mad(self, skipna=None) # mean absolute deviation
|
||||
pd.DataFrame.mad(self, axis=None, skipna=None) # mean absolute deviation of the values for the requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values when computing the result
|
||||
```
|
||||
|
||||
### VAR (variance)
|
||||
|
||||
```py
|
||||
pd.Series.var(self, skipna=None, numeric_only=None) # unbiased variance
|
||||
pd.DataFrame.var(self, axis=None, skipna=None, ddof=1, numeric_only=None) # unbiased variance over requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values. if an entire row/column is NA, the result will be NA
|
||||
# DDOF: {int} -- Delta Degrees of Freedom. divisor used in calculations is N - ddof (N represents the number of elements) -- DEFAULT 1
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
```
|
||||
|
||||
### STD (standard deviation)
|
||||
|
||||
```py
|
||||
pd.Series.std(self, skipna=None, ddof=1, numeric_only=None) # sample standard deviation
|
||||
pd.DataFrame.std(self, axis=None, skipna=None, ddof=1, numeric_only=None) # sample standard deviation over requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values. if an entire row/column is NA, the result will be NA
|
||||
# DDOF: {int} -- Delta Degrees of Freedom. divisor used in calculations is N - ddof (N represents the number of elements) -- DEFAULT 1
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
```
|
||||
|
||||
### SKEW
|
||||
|
||||
```py
|
||||
pd.Series.skew(self, skipna=None, numeric_only=None) # unbiased skew, normalized by N-1
|
||||
pd.DataFrame.skew(self, axis=None, skipna=None, numeric_only=None) # unbiased skew over requested axis, normalized by N-1
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values when computing the result
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
```
|
||||
|
||||
### KURT
|
||||
|
||||
Unbiased kurtosis over requested axis using Fisher's definition of kurtosis (kurtosis of normal == 0.0). Normalized by N-1.
|
||||
|
||||
```py
|
||||
pd.Series.kurt(self, skipna=None, numeric_only=None)
|
||||
pd.DataFrame.kurt(self, axis=None, skipna=None, numeric_only=None)
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values when computing the result
|
||||
# NUMERIC_ONLY: {bool} -- include only float, int, boolean columns, not implemented for Series
|
||||
```
|
||||
|
||||
### CUMSUM (cumulative sum)
|
||||
|
||||
```py
|
||||
pd.Series.cumsum(self, skipna=True) # cumulative sum
|
||||
pd.DataFrame.cumsum(self, axis=None, skipna=True) # cumulative sum over requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values. if an entire row/column is NA, the result will be NA
|
||||
```
|
||||
|
||||
### CUMMAX - CUMMIN (cumulative maximum - minimum)
|
||||
|
||||
```py
|
||||
pd.Series.cummax(self, skipna=True) # cumulative maximum
|
||||
pd.Series.cummin(self, skipna=True) # cumulative minimum
|
||||
pd.DataFrame.cummax(self, axis=None, skipna=True) # cumulative maximum over requested axis
|
||||
pd.DataFrame.cummin(self, axis=None, skipna=True) # cumulative minimum over requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values. if an entire row/column is NA, the result will be NA
|
||||
```
|
||||
|
||||
### CUMPROD (cumulative product)
|
||||
|
||||
```py
|
||||
pd.Series.cumprod(self, skipna=True) # cumulative product
|
||||
pd.DataFrame.cumprod(self, axis=None, skipna=True) # cumulative product over requested axis
|
||||
# AXIS: {0, 1, index, columns} -- axis for the function to be applied on
|
||||
# SKIPNA: {bool} -- exclude NA/null values. if an entire row/column is NA, the result will be NA
|
||||
```
|
||||
|
||||
### DIFF
|
||||
|
||||
Calculates the difference of a DataFrame element compared with another element in the DataFrame.
|
||||
(default is the element in the same column of the previous row)
|
||||
|
||||
```py
|
||||
pd.Series.diff(self, periods=1)
|
||||
pd.DataFrame.diff(self, periods=1, axis=0)
|
||||
# PERIODS: {int} -- Periods to shift for calculating difference, accepts negative values -- DEFAULT 1
|
||||
# AXIS: {0, 1, index, columns} -- Take difference over rows or columns
|
||||
```
|
||||
|
||||
### PCT_CHANGE
|
||||
|
||||
Percentage change between the current and a prior element.
|
||||
|
||||
```py
|
||||
pd.Series.pct_change(self, periods=1, fill_method='pad', limit=None, freq=None)
|
||||
pd.DataFrame.pct_change(self, periods=1, fill_method='pad', limit=None)
|
||||
# PERIODS:{int} -- periods to shift for forming percent change
|
||||
# FILL_METHOD: {str} -- how to handle NAs before computing percent changes -- DEFAULT pad
|
||||
# LIMIT: {int} -- number of consecutive NAs to fill before stopping -- DEFAULT None
|
||||
```
|
||||
|
||||
## HANDLING MISSING DATA
|
||||
|
||||
### FILTERING OUT MISSING DATA
|
||||
|
||||
```py
|
||||
pd.Series.dropna(self, inplace=False) # return a new Series with missing values removed
|
||||
pd.DataFrame.dropna(axis=0, how='any', thresh=None, subset=None, inplace=False) # return a new DataFrame with missing values removed
|
||||
# AXIS: {tuple, list} -- tuple or list to drop on multiple axes. only a single axis is allowed
|
||||
# HOW: {any, all} -- determine if row or column is removed from DataFrame (ANY = if any NA present, ALL = if all values are NA). DEFAULT any
|
||||
# THRESH: {int} -- require that many non-NA values
|
||||
# SUBSET: {array} -- labels along other axis to consider
|
||||
# INPLACE: {bool} -- if True, do operation inplace and return None -- DEFAULT False
|
||||
```
|
||||
|
||||
### FILLING IN MISSING DATA
|
||||
|
||||
Fill NA/NaN values using the specified method.
|
||||
|
||||
```py
|
||||
pd.Series.fillna(self, value=None, method=None, inplace=False, limit=None)
|
||||
pd.DataFrame.fillna(self, value=None, method=None, axis=None, inplace=False, limit=None)
|
||||
# VALUE: {scalar, dict, Series, DataFrame} -- value to use to fill holes, dict/Series/DataFrame specifying which value to use for each index or column
|
||||
# METHOD: {backfill, pad, None} -- method to use for filling holes -- DEFAULT None
|
||||
# AXIS: {0, 1, index, columns} -- axis along which to fill missing values
|
||||
# INPLACE: {bool} -- if true fill in-place (will modify views of object) -- DEFAULT False
|
||||
# LIMIT: {int} -- maximum number of consecutive NaN values to forward/backward fill -- DEFAULT None
|
||||
```
|
||||
|
||||
## HIERARCHICAL INDEXING (MultiIndex)
|
||||
|
||||
Enables storing and manipulation of data with an arbitrary number of dimensions.
|
||||
In lower dimensional data structures like Series (1d) and DataFrame (2d).
|
||||
|
||||
### MULTIINDEX CREATION
|
||||
|
||||
```py
|
||||
pd.MultiIndex.from_arrays(*arrays, names=None) # convert arrays to MultiIndex
|
||||
pd.MultiIndex.from_tuples(*arrays, names=None) # convert tuples to MultiIndex
|
||||
pd.MultiIndex.from_frame(df, names=None) # convert DataFrame to MultiIndex
|
||||
pd.MultiIndex.from_product(*iterables, names=None) # MultiIndex from cartesian product of iterables
|
||||
pd.Series(*arrays) # Index constructor makes MultiIndex from Series
|
||||
pd.DataFrame(*arrays) # Index constructor makes MultiIndex from DataFrame
|
||||
```
|
||||
|
||||
### MULTIINDEX LEVELS
|
||||
|
||||
Vector of label values for requested level, equal to the length of the index.
|
||||
|
||||
```py
|
||||
pd.MultiIndex.get_level_values(self, level)
|
||||
```
|
||||
|
||||
### PARTIAL AND CROSS-SECTION SELECTION
|
||||
|
||||
Partial selection "drops" levels of the hierarchical index in the result in a completely analogous way to selecting a column in a regular DataFrame.
|
||||
|
||||
```py
|
||||
pd.Series.xs(self, key, axis=0, level=None, drop_level=True) # cross-section from Series
|
||||
pd.DataFrame.xs(self, key, axis=0, level=None, drop_level=True) # cross-section from DataFrame
|
||||
# KEY: {label, tuple of label} -- label contained in the index, or partially in a MultiIndex
|
||||
# AXIS: {0, 1, index, columns} -- axis to retrieve cross-section on -- DEFAULT 0
|
||||
# LEVEL: -- in case of key partially contained in MultiIndex, indicate which levels are used. Levels referred by label or position
|
||||
# DROP_LEVEL: {bool} -- If False, returns object with same levels as self -- DEFAULT True
|
||||
```
|
||||
|
||||
### INDEXING, SLICING
|
||||
|
||||
Multi index keys take the form of tuples.
|
||||
|
||||
```py
|
||||
df.loc[('lvl_1', 'lvl_2', ...)] # selection of single row
|
||||
df.loc[('idx_lvl_1', 'idx_lvl_2', ...), ('col_lvl_1', 'col_lvl_2', ...)] # selection of single value
|
||||
|
||||
df.loc['idx_lvl_1':'idx_lvl_1'] # slice of rows (aka partial selection)
|
||||
df.loc[('idx_lvl_1', 'idx_lvl_2') : ('idx_lvl_1', 'idx_lvl_2')] # slice of rows with levels
|
||||
|
||||
```
|
||||
|
||||
### REORDERING AND SORTING LEVELS
|
||||
|
||||
```py
|
||||
pd.MultiIndex.swaplevel(self, i=-2, j=-1) # swap level i with level j
|
||||
pd.Series.swaplevel(self, i=-2, j=-1) # swap levels i and j in a MultiIndex
|
||||
pd.DataFrame.swaplevel(self, i=-2, j=-1, axis=0) # swap levels i and j in a MultiIndex on a particular axis
|
||||
|
||||
pd.MultiIndex.sortlevel(self, level=0, ascending=True, sort_remaining=True) # sort MultiIndex at requested level
|
||||
# LEVEL: {str, int, list-like} -- DEFAULT 0
|
||||
# ASCENDING: {bool} -- if True, sort values in ascending order, otherwise descending -- DEFAULT True
|
||||
# SORT_REMAINING: {bool} -- sort by the remaining levels after level
|
||||
```
|
||||
|
||||
## DATA LOADING, STORAGE FILE FORMATS
|
||||
|
||||
```py
|
||||
pd.read_fwf(filepath, colspecs='infer', widths=None, infer_nrows=100) # read a table of fixed-width formatted lines into DataFrame
|
||||
# FILEPATH: {str, path object} -- any valid string path is acceptable, could be a URL. Valid URLs: http, ftp, s3, and file
|
||||
# COLSPECS: {list of tuple (int, int), 'infer'} -- list of tuples giving extents of fixed-width fields of each line as half-open intervals { [from, to) }
|
||||
# WIDTHS: {list of int} -- list of field widths which can be used instead of "colspecs" if intervals are contiguous
|
||||
# INFER_NROWS: {int} -- number of rows to consider when letting parser determine colspecs -- DEFAULT 100
|
||||
|
||||
pd.read_excel() # read an Excel file into a pandas DataFrame
|
||||
pd.read_json() # convert a JSON string to pandas object
|
||||
pd.read_html() # read HTML tables into a list of DataFrame objects
|
||||
pd.read_sql() # read SQL query or database table into a DataFrame
|
||||
|
||||
pd.read_csv(filepath, sep=',', *args, **kwargs ) # read a comma-separated values (csv) file into DataFrame
|
||||
pd.read_table(filepath, sep='\t', *args, **kwargs) # read general delimited file into DataFrame
|
||||
# FILEPATH: {str, path object} -- any valid string path is acceptable, could be a URL. Valid URLs: http, ftp, s3, and file
|
||||
# SEP: {str} -- delimiter to use -- DEFAULT \t (tab)
|
||||
# HEADER {int, list of int, 'infer'} -- row numbers to use as column names, and the start of the data -- DEFAULT 'infer'
|
||||
# NAMES:{array} -- list of column names to use -- DEFAULT None
|
||||
# INDEX_COL: {int, str, False, sequence of int/str, None} -- Columns to use as row labels of DataFrame, given as string name or column index -- DEFAULT None
|
||||
# SKIPROWS: {list-like, int, callable} -- Line numbers to skip (0-indexed) or number of lines to skip (int) at the start of the file
|
||||
# NA_VALUES: {scalar, str, list-like, dict} -- additional strings to recognize as NA/NaN. if dict passed, specific per-column NA values
|
||||
# THOUSANDS: {str} -- thousand separator
|
||||
# *ARGS, **KWARGS -- SEE DOCS
|
||||
|
||||
# write object to a comma-separated values (csv) file
|
||||
pd.DataFrame.to_csv(self, path_or_buf, sep=',', na_rep='', columns=None, header=True, index=True, encoding='utf-8', line_terminator=None, decimal='.', *args, **kwargs)
|
||||
# SEP: {str len 1} -- Field delimiter for the output file
|
||||
# NA_REP: {str} -- missing data representation
|
||||
# COLUMNS: {sequence} -- columns to write
|
||||
# HEADER: {bool, list of str} -- write out column names. if a list of strings is given, it is assumed to be aliases for the column names
|
||||
# INDEX: {bool, list of str} -- write out row names (index)
|
||||
# ENCODING: {str} -- string representing encoding to use -- DEFAULT "utf-8"
|
||||
# LINE_TERMINATOR: {str} -- newline character or character sequence to use in the output file -- DEFAULT os.linesep
|
||||
# DECIMAL: {str} -- character recognized as decimal separator (in EU ,)
|
||||
|
||||
pd.DataFrame.to_excel()
|
||||
pd.DataFrame.to_json()
|
||||
pd.DataFrame.to_html()
|
||||
pd.DataFrame.to_sql()
|
||||
```
|
218
python/libs/math/seaborn.md
Normal file
218
python/libs/math/seaborn.md
Normal file
|
@ -0,0 +1,218 @@
|
|||
# Seaborn Lib
|
||||
|
||||
## Basic Imports For Seaborn
|
||||
|
||||
```python
|
||||
import numpy as np
|
||||
import pandas as pd
|
||||
import matplotlib.pyplot as plt
|
||||
import seaborn as sns
|
||||
|
||||
# set aesthetic parameters in one step
|
||||
sns.set(style='darkgrid')
|
||||
#STYLE: {None, darkgrid, whitegrid, dark, white, ticks}
|
||||
```
|
||||
|
||||
## RELPLOT (relationship)
|
||||
|
||||
```python
|
||||
sns.relplot(x='name_in_data', y='name_in_data', hue='point_color', size='point_size', style='point_shape', data=data)
|
||||
# HUE, SIZE and STYLE: {name in data} -- used to differentiate points, a sort-of 3rd dimension
|
||||
# hue behaves differently if the data is categorical or numerical, numerical uses a color gradient
|
||||
# SORT: {False, True} -- avoid sorting data in function of x
|
||||
# CI: {None, sd} -- avoid computing confidence intervals or plot standard deviation
|
||||
# (aggregate multiple measurements at each x value by plotting the mean and the 95% confidence interval around the mean)
|
||||
# ESTIMATOR: {None} -- turn off aggregation of multiple observations
|
||||
# MARKERS: {True, False} -- mark observations with dots
|
||||
# DASHES: {True, False} -- mark observations with dashes
|
||||
# COL, ROW: {name in data} -- categorical variables that will determine the grid of plots
|
||||
# COL_WRAP: {int} -- "Wrap" the column variable at this width, so that the column facets span multiple rows. Incompatible with a row facet.
|
||||
# SCATTERPLOT
|
||||
# depicts the joint distribution of two variables using a cloud of points
|
||||
# kind can be omitted since scatterplot is the default for relplot
|
||||
sns.relplot(kind='scatter') # calls scatterplot()
|
||||
sns.scatterplot() # underlying axis-level function of relplot()
|
||||
```
|
||||
|
||||
### LINEPLOT
|
||||
|
||||
Using semantics in lineplot will determine the aggregation of data.
|
||||
|
||||
```python
|
||||
sns.relplot(ci=None, sort=bool, kind='line')
|
||||
sns.lineplot() # underlying axis-level function of relplot()
|
||||
```
|
||||
|
||||
## CATPLOT (categorical)
|
||||
|
||||
Categorical: divided into discrete groups.
|
||||
|
||||
```python
|
||||
sns.catplot(x='name_in_data', y='name_in_data', data=data)
|
||||
# HUE: {name in data} -- used to differentiate points, a sort-of 3rd dimension
|
||||
# COL, ROW: {name in data} -- categorical variables that will determine the grid of plots
|
||||
# COL_WRAP: {int} -- "Wrap" the column variable at this width, so that the column facets span multiple rows. Incompatible with a row facet.
|
||||
# ORDER, HUE_ORDER: {list of strings} -- order of categorical levels of the plot
|
||||
# ROW_ORDER, COL_ORDER: {list of strings} -- order to organize the rows and/or columns of the grid in
|
||||
# ORIENT: {'v', 'h'} -- Orientation of the plot (can also swap x&y assignment)
|
||||
# COLOR: {matplotlib color} -- Color for all of the elements, or seed for a gradient palette
|
||||
# CATEGORICAL SCATTERPLOT - STRIPPLOT
|
||||
# adjust the positions of points on the categorical axis with a small amount of random “jitter”
|
||||
sns.catplot(kind='strip', jitter=float)
|
||||
sns.stripplot()
|
||||
# SIZE: {float} -- Diameter of the markers, in points
|
||||
# JITTER: {False, float} -- magnitude of points jitter (distance from axis)
|
||||
```
|
||||
|
||||
### CATEGORICAL SCATTERPLOT - SWARMPLOT
|
||||
|
||||
Adjusts the points along the categorical axis preventing overlap.
|
||||
|
||||
```py
|
||||
sns.catplot(kind='swarm')
|
||||
sns.swarmplot()
|
||||
# SIZE: {float} -- Diameter of the markers, in points
|
||||
# CATEGORICAL DISTRIBUTION - BOXPLOT
|
||||
# shows the three quartile values of the distribution along with extreme values
|
||||
sns.catplot(kind='box')
|
||||
sns.boxplot()
|
||||
# HUE: {name in data} -- box for each level of the semantic moved along the categorical axis so they don’t overlap
|
||||
# DODGE: {bool} -- whether elements should be shifted along the categorical axis if hue is used
|
||||
```
|
||||
|
||||
### CATEGORICAL DISTRIBUTION - VIOLINPLOT
|
||||
|
||||
Combines a boxplot with the kernel density estimation procedure.
|
||||
|
||||
```py
|
||||
sns.catplot(kind='violin')
|
||||
sns.violinplot()
|
||||
```
|
||||
|
||||
### CATEGORICAL DISTRIBUTION - BOXENPLOT
|
||||
|
||||
Plot similar to boxplot but optimized for showing more information about the shape of the distribution.
|
||||
It is best suited for larger datasets.
|
||||
|
||||
```py
|
||||
sns.catplot(kind='boxen')
|
||||
sns.boxenplot()
|
||||
```
|
||||
|
||||
### CATEGORICAL ESTIMATE - POINTPLOT
|
||||
|
||||
Show point estimates and confidence intervals using scatter plot glyphs.
|
||||
|
||||
```py
|
||||
sns.catplot(kind='point')
|
||||
sns.pointplot()
|
||||
# CI: {float, sd} -- size of confidence intervals to draw around estimated values, sd -> standard deviation
|
||||
# MARKERS: {string, list of strings} -- markers to use for each of the hue levels
|
||||
# LINESTYLES: {string, list of strings} -- line styles to use for each of the hue levels
|
||||
# DODGE: {bool, float} -- amount to separate the points for each hue level along the categorical axis
|
||||
# JOIN: {bool} -- if True, lines will be drawn between point estimates at the same hue level
|
||||
# SCALE: {float} -- scale factor for the plot elements
|
||||
# ERRWIDTH: {float} -- thickness of error bar lines (and caps)
|
||||
# CAPSIZE: {float} -- width of the "caps" on error bars
|
||||
```
|
||||
|
||||
### CATEGORICAL ESTIMATE - BARPLOT
|
||||
|
||||
Show point estimates and confidence intervals as rectangular bars.
|
||||
|
||||
```py
|
||||
sns.catplot(kind='bar')
|
||||
sns.barplot()
|
||||
# CI: {float, sd} -- size of confidence intervals to draw around estimated values, sd -> standard deviation
|
||||
# ERRCOLOR: {matplotlib color} -- color for the lines that represent the confidence interval
|
||||
# ERRWIDTH: {float} -- thickness of error bar lines (and caps)
|
||||
# CAPSIZE: {float} -- width of the "caps" on error bars
|
||||
# DODGE: {bool} -- whether elements should be shifted along the categorical axis if hue is used
|
||||
```
|
||||
|
||||
### CATEGORICAL ESTIMATE - COUNTPLOT
|
||||
|
||||
Show the counts of observations in each categorical bin using bars.
|
||||
|
||||
```py
|
||||
sns.catplot(kind='count')
|
||||
sns.countplot()
|
||||
# DODGE: {bool} -- whether elements should be shifted along the categorical axis if hue is used
|
||||
```
|
||||
|
||||
## UNIVARIATE DISTRIBUTIONS
|
||||
|
||||
### DISTPLOT
|
||||
|
||||
Flexibly plot a univariate distribution of observations
|
||||
|
||||
```py
|
||||
# A: {series, 1d-array, list}
|
||||
sns.distplot(a=data)
|
||||
# BINS: {None, arg for matplotlib hist()} -- specification of hist bins, or None to use Freedman-Diaconis rule
|
||||
# HIST: {bool} - whether to plot a (normed) histogram
|
||||
# KDE: {bool} - whether to plot a gaussian kernel density estimate
|
||||
# HIST_KWD, KDE_KWD, RUG_KWD: {dict} -- keyword arguments for underlying plotting functions
|
||||
# COLOR: {matplotlib color} -- color to plot everything but the fitted curve in
|
||||
```
|
||||
|
||||
### RUGPLOT
|
||||
|
||||
Plot datapoints in an array as sticks on an axis.
|
||||
|
||||
```py
|
||||
# A: {vector} -- 1D array of observations
|
||||
sns.rugplot(a=data) # -> axes obj with plot on it
|
||||
# HEIGHT: {scalar} -- height of ticks as proportion of the axis
|
||||
# AXIS: {'x', 'y'} -- axis to draw rugplot on
|
||||
# AX: {matplotlib axes} -- axes to draw plot into, otherwise grabs current axes
|
||||
```
|
||||
|
||||
### KDEPLOT
|
||||
|
||||
Fit and plot a univariate or bivariate kernel density estimate.
|
||||
|
||||
```py
|
||||
# DATA: {1D array-like} -- input data
|
||||
sns.kdeplot(data=data)
|
||||
# DATA2 {1D array-like} -- second input data. if present, a bivariate KDE will be estimated.
|
||||
# SHADE: {bool} -- if True, shade-in the area under KDE curve (or draw with filled contours is bivariate)
|
||||
```
|
||||
|
||||
## BIVARIATE DISTRIBUTION
|
||||
|
||||
### JOINTPLOT
|
||||
|
||||
Draw a plot of two variables with bivariate and univariate graphs.
|
||||
|
||||
```py
|
||||
# X, Y: {string, vector} -- data or names of variables in data
|
||||
sns.jointplot(x=data, y=data)
|
||||
# DATA:{pandas DataFrame} -- DataFrame when x and y are variable names
|
||||
# KIND: {'scatter', 'reg', 'resid', 'kde', 'hex'} -- kind of plot to draw
|
||||
# COLOR: {matplotlib color} -- color used for plot elements
|
||||
# HEIGHT: {numeric} -- size of figure (it will be square)
|
||||
# RATIO: {numeric} -- ratio of joint axes height to marginal axes height
|
||||
# SPACE: {numeric} -- space between the joint and marginal axes
|
||||
# JOINT_KWD, MARGINAL_KWD, ANNOT_KWD: {dict} -- additional keyword arguments for the plot components
|
||||
```
|
||||
|
||||
## PAIR-WISE RELATIONSHIPS IN DATASET
|
||||
|
||||
### PAIRPLOT
|
||||
|
||||
Plot pairwise relationships in a dataset.
|
||||
|
||||
```py
|
||||
# DATA: {pandas DataFrame} -- tidy (long-form) dataframe where each column is a variable and each row is an observation
|
||||
sns.pairplot(data=pd.DataFrame)
|
||||
# HUE: {string (variable name)} -- variable in data to map plot aspects to different colors
|
||||
# HUE_ORDER: {list of strings} -- order for the levels of the hue variable in the palette
|
||||
# VARS: {list of variable names} -- variables within data to use, otherwise every column with numeric datatype
|
||||
# X_VARS, Y_VARS: {list of variable names} -- variables within data to use separately for rows and columns of figure
|
||||
# KIND: {'scatter', 'reg'} -- kind of plot for the non-identity relationships
|
||||
# DIAG_KIND: {'auto', 'hist', 'kde'} -- Kind of plot for the diagonal subplots. default depends hue
|
||||
# MARKERS: {matplotlib marker or list}
|
||||
# HEIGHT:{scalar} -- height (in inches) of each facet
|
||||
# ASPECT: {scalar} -- aspect * height gives the width (in inches) of each facet
|
||||
```
|
Loading…
Add table
Add a link
Reference in a new issue