Skip to content

updates from upstream #3

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 388 commits into from
Mar 15, 2020
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
388 commits
Select commit Hold shift + click to select a range
2eca9e8
CLN/REF: Split up / clean Categorical constructor tests (#32211)
dsaxton Feb 25, 2020
786bfd9
TST: show_versions test with unmerged commits (#32139)
jbrockmendel Feb 26, 2020
c5e3e25
implement test_select_dtypes (#32250)
jbrockmendel Feb 26, 2020
5f9fb1d
test_combine.py (#32253)
jbrockmendel Feb 26, 2020
972ab61
Collect Series.append tests (#32254)
jbrockmendel Feb 26, 2020
7c7048c
REF: test_to_timestamp (#32248)
jbrockmendel Feb 26, 2020
f3a9b78
CLN: generic tests (#32256)
jbrockmendel Feb 26, 2020
a152c30
PERF: lazify consolidate_check and dont consolidate in get_slice (#32…
jbrockmendel Feb 26, 2020
89ed095
CLN: simplify+annotate _shallow_copy (#32244)
jbrockmendel Feb 26, 2020
7c5d3d5
REF: include CategoricalIndex in index_cached parametrization (#32215)
jbrockmendel Feb 26, 2020
2fc8559
CLN: simplify CategoricalIndex._simple_new (#32204)
jbrockmendel Feb 26, 2020
9bf3a28
ENH: Timestamp constructor now raises more explanatory error message …
fujiaxiang Feb 26, 2020
8b200c1
ENH: Implement DataFrame.value_counts (#31247)
dsaxton Feb 26, 2020
6c74f88
use ExtensionIndex._concat_same_dtype (#32232)
jbrockmendel Feb 26, 2020
cf957ad
REF: simplify IntervalIndex/IntervalArray _shallow_copy (#32247)
jbrockmendel Feb 26, 2020
2885b30
CLN/TST: parametrize some tests in tests.indexing.test_float (#32187)
jbrockmendel Feb 26, 2020
a713063
Add link to pandas-tfrecords (#32143)
schipiga Feb 26, 2020
1b8b428
CLN: remove dtype kwarg from _simple_new (#32260)
jbrockmendel Feb 26, 2020
6e04264
ENH: add fold support to Timestamp constructor (#31563)
AlexKirko Feb 26, 2020
27f0000
REGR: preserve freq in DTI/TDI outer join (#32166)
jbrockmendel Feb 26, 2020
35537dd
BUG: Cast pd.NA to pd.NaT in to_datetime (#32214)
dsaxton Feb 26, 2020
9535246
Added message to pytest raises for test_constructor_dict (#32236)
Dom-L-G Feb 26, 2020
e39cd30
Fix exception causes in 14 modules (#32235)
cool-RR Feb 26, 2020
e6bd49f
use numexpr for Series comparisons (#32047)
jbrockmendel Feb 26, 2020
d8115ce
DOC: Fix pandas.index.copy summary documentation (#32006)
asepwhite Feb 26, 2020
9a8e83a
DOC: Improve documentation for Index.where (#32009)
laymonage Feb 26, 2020
52a63ab
DOC: Fix errors in pandas.Series.argmax (#32019)
farhanreynaldo Feb 26, 2020
ed7adcd
CI: Remove docs build from pipelines (#32074)
datapythonista Feb 26, 2020
2c060b4
DOC: add missing links to introduction to pandas (#32198)
raisadz Feb 27, 2020
8621970
TST: move misplaced to_datetime test (#32273)
jbrockmendel Feb 27, 2020
97c0ce9
Troubleshoot CI (#32284)
jbrockmendel Feb 27, 2020
eb6f8d3
test_droplevel (#32272)
jbrockmendel Feb 27, 2020
66c6e8e
Use defaultdict for optimization (#32278)
jaketae Feb 27, 2020
922f932
PERF: pass through to numpy validation for iloc setitem (#32257)
jbrockmendel Feb 27, 2020
736761f
TST: misplaced arithmetic tests (#32275)
jbrockmendel Feb 27, 2020
cad3f1c
CLN: Follow-up to #32158 (#32290)
simonjayhawkins Feb 27, 2020
0233a55
TST: implement test_first (#32274)
jbrockmendel Feb 27, 2020
674dd69
CI: Temporary fix to the docs build while we fix the ssh problems (#3…
datapythonista Feb 27, 2020
217a428
TST: test_to_time (#32285)
jbrockmendel Feb 27, 2020
1d18e95
implement test_to_period (#32270)
jbrockmendel Feb 27, 2020
86f0468
fixed minor docstring typo (#32298)
mankoff Feb 27, 2020
d5d6834
REF/TST: misplaced tests in test_timeseries, test_timezones (#32300)
jbrockmendel Feb 27, 2020
b4cbc19
Added in a error message (#32105)
Samira-g-js Feb 27, 2020
4800ab4
DOC: Add example for multiindex series and dataframe merge (#32068)
ryankarlos Feb 27, 2020
ea1d8fa
BUG: fix in categorical merges (#32079)
MarcoGorelli Feb 27, 2020
1e7dc49
WEB: Add greeting note to CoC (#32024)
MarcoGorelli Feb 27, 2020
5fa9860
API/BUG: raise only KeyError failed on geitem/loc lookups (#31867)
jbrockmendel Feb 27, 2020
edcf1c8
DOC: Reorder 1.0 releases in whatsnew/index.rst (#32309)
mroeschke Feb 28, 2020
fc60870
STY: spaces in wrong place (#32323)
ShaharNaveh Feb 28, 2020
24827e5
Fix exception causes all over the code (#32322)
cool-RR Feb 28, 2020
c20528e
REF: test_first_valid_index (#32317)
jbrockmendel Feb 28, 2020
77ee2bf
REF/TST: misplaced MultiIndex tests (#32314)
jbrockmendel Feb 28, 2020
4e5e73e
TST: remove invalid internals tests (#32297)
jbrockmendel Feb 28, 2020
bf613c1
TST: Use `sort` fixture in more places (#32292)
SaturnFromTitan Feb 28, 2020
26de103
CLN: Removed unused variables defenition (#32328)
ShaharNaveh Feb 28, 2020
ece0517
CLN: some code cleanups (#32177)
ShaharNaveh Feb 28, 2020
b512ed5
CLN: _libs.interval looping with cdef index (#32329)
ShaharNaveh Feb 28, 2020
48abf99
TST/CLN: Follow-up to #31867 (#32324)
simonjayhawkins Feb 29, 2020
7d9c200
DOC: Fixed reference to `convert_dtypes` in `to_numeric` (#32295) (#3…
jqmviegas Feb 29, 2020
aeeca53
TYP: Update type naming in formatter (#32345)
xcz011 Feb 29, 2020
89f3f4e
print merged df result (#32370)
ryankarlos Feb 29, 2020
43321fa
DOC: Fix SS06 formatting errors in merge_asof docstrings (#32351)
Nael-Nathanael Feb 29, 2020
712000e
Changed kind parameter from integer to int, Added example (#32361)
zakybilfagih Feb 29, 2020
edc27e4
solve 'check_category_order' description ends with '.' (#32358)
tolhassianipar Feb 29, 2020
183129f
Silence warnings when compiling pandas/_libs/parsers.pyx (#32368)
ShaharNaveh Feb 29, 2020
01f3998
TST/REF: move tools test files (#32338)
jbrockmendel Feb 29, 2020
0ce463d
DOC: Minor typo fixes for code style guide (#32379)
dan1261 Mar 1, 2020
7e6b583
REF: collect+parametrize reorder_levels tests (#32373)
jbrockmendel Mar 2, 2020
8b347ea
TST: Allow definition of `pd.CategoricalDtype` with a specific `categ…
rushabh-v Mar 2, 2020
a1f9ae2
TYP: annotations for internals, set_axis (#32376)
jbrockmendel Mar 2, 2020
f25ed6f
misplaced DataFrame.join test (#32375)
jbrockmendel Mar 2, 2020
78c1a74
DOC: Fixed ES01, PR07, SA04 error in pandas.core.groupby.DataFrameGro…
Iqrar99 Mar 2, 2020
3e5fe8e
BUG: Pickle NA objects (#32104)
TomAugspurger Mar 2, 2020
68603b1
DOC: Fix SA04 errors in docstrings #28792 (#32182)
AdrianMastronardi Mar 3, 2020
712fd01
CLN: remove _igetitem_cache (#32319)
jbrockmendel Mar 3, 2020
08deb10
Avoid unnecessary values_from_object (#32398)
jbrockmendel Mar 3, 2020
d219c2c
ENH: infer freq in timedelta_range (#32377)
jbrockmendel Mar 3, 2020
bdcb5da
BUG: 2D DTA/TDA arithmetic with object-dtype (#32185)
jbrockmendel Mar 3, 2020
f2a1325
TST: broken off from #32187 (#32258)
jbrockmendel Mar 3, 2020
d012623
REF: simplify PeriodIndex._shallow_copy (#32280)
jbrockmendel Mar 3, 2020
604beff
CLN: setitem_with_indexer cleanups (#32341)
jbrockmendel Mar 3, 2020
4ba48f0
BUG: None / Timedelta incorrectly returning NaT (#32340)
jbrockmendel Mar 3, 2020
116f8d2
TST: Using more fixtures in of tests/base/test_ops.py (#32313)
SaturnFromTitan Mar 3, 2020
e2f1bc0
CLN: remove unused values from interpolate call (#32400)
jbrockmendel Mar 3, 2020
fcadff3
CLN: some code cleanups to pandas/_libs/missing.pyx (#32367)
ShaharNaveh Mar 3, 2020
861df91
BUG: fixes bug when using sep=None and comment keyword for read_csv (…
s-scherrer Mar 3, 2020
94fdc30
Don't create _join_functions (#32336)
dsaxton Mar 3, 2020
ebf9668
API: replace() should raise an exception if invalid argument is given…
a-y-khan Mar 3, 2020
821aa25
BUG: Fix __ne__ comparison for Categorical (#32304)
dsaxton Mar 3, 2020
f6b6c15
CLN: clean-up show_versions and consistently use null for json output…
simonjayhawkins Mar 3, 2020
4018550
Add missing newline (#32404)
Mar 3, 2020
0dc93cd
TST: add message check to pytest.raises (tests/arrays/test_boolean.py…
ShilpaSugan Mar 3, 2020
283f81c
TYP: internals (#32403)
jbrockmendel Mar 3, 2020
706e642
DOC: Fix SA04 errors in docstrings xref #28792 (#32180)
AdrianMastronardi Mar 3, 2020
d206479
REF: remove _convert_scalar_indexer (#31962)
jbrockmendel Mar 3, 2020
ce7670c
Implement BlockManager.iset (#32350)
jbrockmendel Mar 3, 2020
d33b002
CLN: remove unreachable branch (#32405)
jbrockmendel Mar 3, 2020
dcd86e5
REF: avoid using internals methods for to_timestamp, to_period (#32347)
jbrockmendel Mar 4, 2020
c5f0ebf
TYP/cln: generic._make_*_function (#32363)
topper-123 Mar 4, 2020
0d04683
TST: Split and simplify test_value_counts_unique_nunique (#32281)
SaturnFromTitan Mar 4, 2020
67aae80
CLN: avoid values_from_object in NDFrame (#32422)
jbrockmendel Mar 4, 2020
3242b4e
CLN: move away from .values, _ndarray_values (#32419)
jbrockmendel Mar 4, 2020
fd8384e
Series append raises TypeError (#32090)
hvardhan20 Mar 4, 2020
9aaaf1b
Fix BUG: overflow on pd.Timedelta(nanoseconds=) constructor (#32424)
roberthdevries Mar 4, 2020
86ed2b6
PERF: lazify blknos and blklocs (#32261)
jbrockmendel Mar 4, 2020
609b8da
Fix PR08, RT02, RT03, and SA01 on pandas.Index.fillna (#32355)
RafifEL Mar 4, 2020
bdb4a08
DOC: Fix EX02 in pandas.Index.get_loc (#32429)
farhanreynaldo Mar 4, 2020
690e382
CI: fix test_matplotlib_scatter_datetime64 (#32442)
WillAyd Mar 4, 2020
c5f11ab
CI: mypy fixup for #32261 (#32438)
WillAyd Mar 4, 2020
d4cc3cc
CI: ax.rowNum and ax.colNum attributes deprecated in Matplotlib 3.2 (…
simonjayhawkins Mar 5, 2020
79c9ec9
CLN: avoid _internal_get_values in groupby.generic (#32427)
simonjayhawkins Mar 5, 2020
f7bed05
DOC: correct issue number for PR #32424 (#32436)
simonjayhawkins Mar 5, 2020
8914c01
TYP/CLN: Optional[Hashable] -> pandas._typing.Label (#32371)
simonjayhawkins Mar 6, 2020
f6b3e82
DOC: Fix errors in pandas.Series.argmin (#32286)
farhanreynaldo Mar 6, 2020
015c1c1
Made apt changes to the documentation for pandas.Series.str.replace()…
ashwinpn Mar 6, 2020
96a40ed
Fixing RT02 pandas.Index.dropna and PR08 pandas.Index.fillna (#32359)
zaki-indra Mar 6, 2020
54c5e9e
CLN: imports in pandas/io/excel/_base.py (#32481)
ShaharNaveh Mar 6, 2020
970499d
fix errors docstrings pandas.to_numeric (#32354)
isoletslicer Mar 6, 2020
45d412f
CI: Fix flaky test_value_counts_null (#32449)
SaturnFromTitan Mar 7, 2020
2e0e013
CLN: assorted cleanups, annotations (#32475)
jbrockmendel Mar 7, 2020
7217393
CLN: remove is_period_arraylike, is_datetime_arraylike (#32406)
jbrockmendel Mar 7, 2020
777c0f9
CLN: remove unreachable _internal_get_values in blocks (#32472)
jbrockmendel Mar 7, 2020
ed8df2d
REF: de-nest Series.__setitem__ (#32078)
jbrockmendel Mar 7, 2020
edae9d1
Implement __array__ on ExtensionIndex (#32255)
jbrockmendel Mar 7, 2020
8f51c99
TST: Removed import of itertools (#32364)
ShaharNaveh Mar 7, 2020
3d08aa5
TST: refactored test_factorize (#32311)
SaturnFromTitan Mar 7, 2020
20474f5
DOC: DataFrame.ewm docstring clean-up (#32212)
sursu Mar 7, 2020
2a2258d
CI: Adding script to validate consistent and correct capitalization a…
Mar 7, 2020
6852012
DOC: Fix examples in documentation (#31472)
ShaharNaveh Mar 7, 2020
684a291
DOC: Update the pandas.DatetimeIndex docstring (#32360)
Hori75 Mar 7, 2020
00f1825
BUG: PeriodIndex.asof_locs (#32310)
jbrockmendel Mar 8, 2020
aa27b9a
CLN: avoid _ndarray_values, values in MultiIndex (#32452)
jbrockmendel Mar 8, 2020
957fc3c
BUG/DEPR: loc.__setitem__ incorrectly accepting positional slices (#3…
jbrockmendel Mar 8, 2020
19ae087
PERF: do DataFrame.op(series, axis=0) blockwise (#31296)
jbrockmendel Mar 8, 2020
6f5287b
CLN: avoid values_from_object in nanops (#32508)
jbrockmendel Mar 8, 2020
90bb1aa
API: allow step!=1 slice with IntervalIndex (#31658)
jbrockmendel Mar 8, 2020
08c6597
CLN: remove check_series_type (#32513)
jbrockmendel Mar 8, 2020
edb863e
CLN: avoid values_from_object in construction (#32504)
jbrockmendel Mar 8, 2020
a3985f8
Fix failure to convert string "uint64" to NaN (#32541)
roberthdevries Mar 9, 2020
3e1275e
CLN: to_dense->np.asarray (#32545)
jbrockmendel Mar 9, 2020
787dc8a
BUG: Fixed bug, where pandas._libs.lib.maybe_convert_objects function…
AnnaDaglis Mar 9, 2020
3e8f32c
TST: separate out pd.crosstab tests from test_pivot (#32536)
jbrockmendel Mar 9, 2020
ae79bb2
CLN: remove Categorical.put (#32554)
jbrockmendel Mar 9, 2020
76a1710
DOC: Add missing question mark icon (#32564)
stijnvanhoey Mar 10, 2020
3d591cb
DOC: add redirects from Rolling to rolling.Rolling (#31875)
rushabh-v Mar 10, 2020
c143f08
TST: Fix bare pytest raises in generic/test_frame.py (#32565)
andresmcneill Mar 10, 2020
dc4de58
TST: add test.indexes.common.Base.create_index and annotate .create_i…
topper-123 Mar 10, 2020
5000420
Fix warning in unit test (#32563)
roberthdevries Mar 10, 2020
00bb09d
CLN: remove unused in pd._testing (#32534)
jbrockmendel Mar 10, 2020
5b45f4b
TST: remove unused kwargs in assert_sp_array_equal (#32525)
jbrockmendel Mar 10, 2020
db9a50c
CLN: unused code in reshape.merge (#32509)
jbrockmendel Mar 10, 2020
113c255
CLN: remove unnecessary to_dense (#32533)
jbrockmendel Mar 10, 2020
37659d4
BUG: string methods with NA (#31684)
prakhar987 Mar 10, 2020
a77ad8b
ENH: `Styler.highlight_null` can accepts `subset` argument (#31350)
immaxchen Mar 10, 2020
79d3b08
DOC: cleanup 1.0.2 whatsnew (#32592)
TomAugspurger Mar 10, 2020
09a46a4
DOC: Add extended summary, update parameter types desc, update return…
sagungrp Mar 10, 2020
1951c8e
CLN: Avoid bare pytest.raises in computation/test_eval.py (#32507)
Vlek Mar 10, 2020
ff6c8a0
DOC: Fix EX01 in pandas.DataFrame.idxmax (#32551)
farhanreynaldo Mar 11, 2020
650cf74
Better error message for OOB result (#32499)
TomAugspurger Mar 11, 2020
a2acd1b
BUG: Fix bug, where BooleanDtype columns are converted to Int64 (#32490)
AnnaDaglis Mar 11, 2020
5fadbe4
Avoid bare pytest.raises in dtypes/cast/test_upcast.py (#32603)
Vlek Mar 11, 2020
8bc471a
BUG: Fix rolling functions with variable windows on decreasing index …
leftys Mar 11, 2020
f33120c
DOC: Fixed errors in pandas.DataFrame.asfreq PR07, RT02, RT03, SA04 (…
tolhassianipar Mar 11, 2020
d4815a5
CLN: use _values_for_argsort for join_non_unique, join_monotonic (#32…
jbrockmendel Mar 11, 2020
5ebbb56
CLN: avoid _internal_get_values in pandas._testing (#32570)
jbrockmendel Mar 11, 2020
8e74325
CLN: remove SingleBlockManager.get_values (#32522)
jbrockmendel Mar 11, 2020
b9762eb
REG: Restore read_csv function for some file-likes (#32577)
gfyoung Mar 11, 2020
9a31bdd
CLN: rename get_block_values, simplify (#32521)
jbrockmendel Mar 11, 2020
c910ec3
TST: stricter tests, avoid check_categorical=False, check_less_precis…
jbrockmendel Mar 11, 2020
8c38283
CLN: avoid values_from_object in Series (#32426)
jbrockmendel Mar 11, 2020
b8ee1e1
TYP: enforce annotation on SingleBlockManager.__init__ (#32421)
jbrockmendel Mar 11, 2020
03b1f19
CLN: remove unnecessary values_from_objects in groupby.ops (#32547)
jbrockmendel Mar 11, 2020
4b99525
CLN: values_from_object in computation.pytables (#32557)
jbrockmendel Mar 11, 2020
33f67d9
BUG: iloc.__setitem__ with duplicate columns (#32477)
jbrockmendel Mar 11, 2020
a0972f4
CLN: avoid values_from_object in Index.equals (#32505)
jbrockmendel Mar 11, 2020
4867415
REF: Remove BlockManager.rename_axis (#32349)
jbrockmendel Mar 11, 2020
2579484
CLN: simplify get_values usage in groupby (#32523)
jbrockmendel Mar 11, 2020
3984df1
CLN: avoid Block.get_values in io.sql (#32524)
jbrockmendel Mar 11, 2020
eac5cd1
TST: fixturize skipna in test_nanops (#32607)
jbrockmendel Mar 11, 2020
e8e02c0
PERF: copy cached attributes on index shallow_copy (#32568)
topper-123 Mar 11, 2020
362aef1
DOC: Fix link to monthly meeting calendar (#32602)
datapythonista Mar 11, 2020
817d57d
TYP: Add type hint for DataFrame.T and certain array types (#32532)
qwhelan Mar 11, 2020
7fa8ee7
CLN: Split and fixturized test_fillna in tests/base/test_ops.py (#32483)
SaturnFromTitan Mar 11, 2020
f4f1b69
CI: Update web and docs to OVH with the right structure (#32530)
datapythonista Mar 11, 2020
c1fd95b
DOC: fix formatting / links of API refs in 1.0.2 whatsnew (#32620)
jorisvandenbossche Mar 11, 2020
983fae6
BUG: non-iterable value in meta raise error in json_normalize (#31524)
charlesdong1991 Mar 11, 2020
8ac101d
TST: revert parts of #32571 (#32630)
jbrockmendel Mar 11, 2020
ecb5b57
Ensure valid Block mutation in SeriesBinGrouper. (#32561)
TomAugspurger Mar 11, 2020
a3642ec
TYP: Remove _ensure_type (#32633)
simonjayhawkins Mar 11, 2020
1f2385e
DOC: Reorganize Getting Started documentation pages (#32389)
mroeschke Mar 11, 2020
5e27d0a
Clarify pivot_table fill_value description (#32618)
taljaards Mar 11, 2020
1b76440
BUG: Add extra check for failing UTF-8 conversion (#32548)
roberthdevries Mar 12, 2020
9bc3ee0
REG: dt64 shift with integer fill_value (#32591)
jbrockmendel Mar 12, 2020
9e7cb7c
BUG: Fix DateFrameGroupBy.mean error for Int64 dtype (#32223)
dsaxton Mar 12, 2020
ef7e720
BUG: GroupBy aggregation of DataFrame with MultiIndex columns breaks …
MarcoGorelli Mar 12, 2020
c85f5f2
CLN: trim unnecessary checks (#32643)
jbrockmendel Mar 12, 2020
27ad779
TST: tighten check_categorical=False tests (#32636)
jbrockmendel Mar 12, 2020
fa48f5f
REF: implement _get_engine_target (#32611)
jbrockmendel Mar 12, 2020
da67e1e
CLN: avoid values_from_object in reshape.merge (#32537)
jbrockmendel Mar 12, 2020
7f6a22c
CLN: Suppres compile warnings of pandas/io/sas/sas.pyx (#32517)
ShaharNaveh Mar 12, 2020
bb4395e
ENH: Move corrwith from transformation to reduction kernels in groupb…
fujiaxiang Mar 12, 2020
f3859cb
DOC: Fix capitalization of the word pandas in the docs (#32439)
joybh98 Mar 12, 2020
940bbb5
TST/VIZ: add test for legend colors for DataFrame with duplicate colu…
gabrielvf1 Mar 12, 2020
ec77341
DOC: Remove absolute urls from the docs (#32539)
ArkadeepAdhikari Mar 12, 2020
9060d88
PERF: copy cached attributes on extension index shallow_copy (#32640)
topper-123 Mar 12, 2020
9c85af8
BUG: pd.ExcelFile closes stream on destruction (#32544)
roberthdevries Mar 12, 2020
4aba9ed
DOC: fix announce formtting (#32656)
TomAugspurger Mar 12, 2020
7147c38
DOC: Organize regressions (#32658)
TomAugspurger Mar 12, 2020
05925d2
CLN: removed unused import (#32518)
ShaharNaveh Mar 12, 2020
25e2f86
fix infer_freq raises section (#32642)
wholmgren Mar 12, 2020
059f9bf
CLN: Remove PY2 compat code (#32677)
ShaharNaveh Mar 13, 2020
2b37609
DOC: filter method example is more clear (#32690)
epizzigoni Mar 13, 2020
c5987d6
Fix wrong docstring in qcut (#32679)
masterpiga Mar 13, 2020
f676a21
REF: make DatetimeIndex._simple_new actually simple (#32282)
jbrockmendel Mar 14, 2020
8d654ca
CLN: remove unused int_block (#32695)
jbrockmendel Mar 14, 2020
2bc24d5
CLN: avoid _ndarray_values in reshape.merge (#32693)
jbrockmendel Mar 14, 2020
d8f60c3
REF: put all post-processing at end of DataFrame._reduce (#32671)
jbrockmendel Mar 14, 2020
ea9942b
CLN: remove unused kwargs from BlockManager.downcast (#32691)
jbrockmendel Mar 14, 2020
fa26bce
values->_values (#32662)
jbrockmendel Mar 14, 2020
6a3f030
ENH: implement EA.size (#32644)
jbrockmendel Mar 14, 2020
44b3855
CLN: remove Block.array_dtype, SingleBlockManager.array_dtype (#32612)
jbrockmendel Mar 14, 2020
3c16cea
REF: implement nanops.na_accum_func (#32597)
jbrockmendel Mar 14, 2020
011f411
ENH: IntegerArray.astype(dt64) (#32538)
jbrockmendel Mar 14, 2020
3b66021
TST: make tests stricter (#32527)
jbrockmendel Mar 14, 2020
f35bcac
TST: fix test creating invalid CategoricalBlock (#32519)
jbrockmendel Mar 14, 2020
5c7a901
PERF: Using Numpy C-API arange (#32681)
ShaharNaveh Mar 14, 2020
8111d64
PERF: MultiIndex._shallow_copy (#32669)
topper-123 Mar 14, 2020
029f707
Disallow lossy SparseArray conversion (#32501)
jbrockmendel Mar 14, 2020
98a6b3c
ENH: implement ExtensionIndex.insert (#32476)
jbrockmendel Mar 14, 2020
fd2e002
ENH: Categorical.fillna allow Categorical/ndarray (#32420)
jbrockmendel Mar 14, 2020
aa9a6dc
DOC: Fix EX01 in pandas.DataFrame.idxmin (#32697)
farhanreynaldo Mar 14, 2020
427af4d
CI: Update pipelines config to trigger on PRs (#32706)
datapythonista Mar 14, 2020
b32c4e8
skip 32 bit linux (#32708)
WillAyd Mar 14, 2020
d691633
CLN: Clean frame/test_constructors.py (#32610)
dsaxton Mar 14, 2020
ff7e2fa
CLN: Remove redundant index test from tests/base/test_ops.py (#32484)
SaturnFromTitan Mar 14, 2020
1a5b11d
BUG: Fix DataFrame.apply(..., raw=True) not calling with raw array (#…
kernc Mar 14, 2020
810a4e5
BUG: assignment to multiple columns when some column do not exist (#2…
howsiwei Mar 14, 2020
4603c63
BUG: retain tz in to_records (#32535)
jbrockmendel Mar 14, 2020
9e9785b
CLN: avoid Block.get_values where it is unnecessary (#32707)
jbrockmendel Mar 14, 2020
e734449
Deprecate Aliases as orient Argument in DataFrame.to_dict (#32516)
elmonsomiat Mar 14, 2020
0ed6d53
BUG: pivot_table losing tz (#32558)
jbrockmendel Mar 14, 2020
bab3b1f
REF: avoid runtime import of Index (#32710)
jbrockmendel Mar 15, 2020
e7e5b61
BUG: DatetimeArray._from_sequence accepting bool dtype (#32668)
jbrockmendel Mar 15, 2020
41322cf
Simplified get_blkno_indexers (#32645)
WillAyd Mar 15, 2020
7a24c35
DEPR: Categorical.to_dense (#32639)
jbrockmendel Mar 15, 2020
2e114ce
BUG: Fix file descriptor leak (#32598)
roberthdevries Mar 15, 2020
74c5306
TST: Add extra test for pandas.to_numeric() for issue #32394 (#32560)
roberthdevries Mar 15, 2020
74f6579
BUG: Don't multiply sets during construction (#32594)
dsaxton Mar 15, 2020
2b34275
CLN: Remove redundant tests for .duplicated and .drop_duplicates in t…
SaturnFromTitan Mar 15, 2020
6620dc6
ENH: Add support for dataclasses in the DataFrame constructor (#27999)
asosnovsky Mar 15, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
43 changes: 20 additions & 23 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -125,35 +125,32 @@ jobs:
- name: Check ipython directive errors
run: "! grep -B1 \"^<<<-------------------------------------------------------------------------$\" sphinx.log"

- name: Merge website and docs
run: |
mkdir -p pandas_web/docs
cp -r web/build/* pandas_web/
cp -r doc/build/html/* pandas_web/docs/
if: github.event_name == 'push'

- name: Install Rclone
run: sudo apt install rclone -y
if: github.event_name == 'push'

- name: Set up Rclone
run: |
RCLONE_CONFIG_PATH=$HOME/.config/rclone/rclone.conf
mkdir -p `dirname $RCLONE_CONFIG_PATH`
echo "[ovh_cloud_pandas_web]" > $RCLONE_CONFIG_PATH
echo "type = swift" >> $RCLONE_CONFIG_PATH
echo "env_auth = false" >> $RCLONE_CONFIG_PATH
echo "auth_version = 3" >> $RCLONE_CONFIG_PATH
echo "auth = https://auth.cloud.ovh.net/v3/" >> $RCLONE_CONFIG_PATH
echo "endpoint_type = public" >> $RCLONE_CONFIG_PATH
echo "tenant_domain = default" >> $RCLONE_CONFIG_PATH
echo "tenant = 2977553886518025" >> $RCLONE_CONFIG_PATH
echo "domain = default" >> $RCLONE_CONFIG_PATH
echo "user = w4KGs3pmDxpd" >> $RCLONE_CONFIG_PATH
echo "key = ${{ secrets.ovh_object_store_key }}" >> $RCLONE_CONFIG_PATH
echo "region = BHS" >> $RCLONE_CONFIG_PATH
CONF=$HOME/.config/rclone/rclone.conf
mkdir -p `dirname $CONF`
echo "[ovh_host]" > $CONF
echo "type = swift" >> $CONF
echo "env_auth = false" >> $CONF
echo "auth_version = 3" >> $CONF
echo "auth = https://auth.cloud.ovh.net/v3/" >> $CONF
echo "endpoint_type = public" >> $CONF
echo "tenant_domain = default" >> $CONF
echo "tenant = 2977553886518025" >> $CONF
echo "domain = default" >> $CONF
echo "user = w4KGs3pmDxpd" >> $CONF
echo "key = ${{ secrets.ovh_object_store_key }}" >> $CONF
echo "region = BHS" >> $CONF
if: github.event_name == 'push'

- name: Sync web with OVH
run: rclone sync --exclude pandas-docs/** web/build ovh_host:prod
if: github.event_name == 'push'

- name: Sync web
run: rclone sync pandas_web ovh_cloud_pandas_web:dev
- name: Sync dev docs with OVH
run: rclone sync doc/build/html ovh_host:prod/pandas-docs/dev
if: github.event_name == 'push'
14 changes: 14 additions & 0 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,20 @@ repos:
- id: flake8
language: python_venv
additional_dependencies: [flake8-comprehensions>=3.1.0]
- id: flake8
name: flake8-pyx
language: python_venv
files: \.(pyx|pxd)$
types:
- file
args: [--append-config=flake8/cython.cfg]
- id: flake8
name: flake8-pxd
language: python_venv
files: \.pxi\.in$
types:
- file
args: [--append-config=flake8/cython-template.cfg]
- repo: https://github.com/pre-commit/mirrors-isort
rev: v4.3.21
hooks:
Expand Down
91 changes: 35 additions & 56 deletions asv_bench/benchmarks/algorithms.py
Original file line number Diff line number Diff line change
Expand Up @@ -31,83 +31,62 @@ def time_maybe_convert_objects(self):

class Factorize:

params = [[True, False], ["int", "uint", "float", "string"]]
param_names = ["sort", "dtype"]

def setup(self, sort, dtype):
N = 10 ** 5
data = {
"int": pd.Int64Index(np.arange(N).repeat(5)),
"uint": pd.UInt64Index(np.arange(N).repeat(5)),
"float": pd.Float64Index(np.random.randn(N).repeat(5)),
"string": tm.makeStringIndex(N).repeat(5),
}
self.idx = data[dtype]

def time_factorize(self, sort, dtype):
self.idx.factorize(sort=sort)


class FactorizeUnique:

params = [[True, False], ["int", "uint", "float", "string"]]
param_names = ["sort", "dtype"]
params = [
[True, False],
[True, False],
["int", "uint", "float", "string", "datetime64[ns]", "datetime64[ns, tz]"],
]
param_names = ["unique", "sort", "dtype"]

def setup(self, sort, dtype):
def setup(self, unique, sort, dtype):
N = 10 ** 5
data = {
"int": pd.Int64Index(np.arange(N)),
"uint": pd.UInt64Index(np.arange(N)),
"float": pd.Float64Index(np.arange(N)),
"float": pd.Float64Index(np.random.randn(N)),
"string": tm.makeStringIndex(N),
}
self.idx = data[dtype]
assert self.idx.is_unique

def time_factorize(self, sort, dtype):
"datetime64[ns]": pd.date_range("2011-01-01", freq="H", periods=N),
"datetime64[ns, tz]": pd.date_range(
"2011-01-01", freq="H", periods=N, tz="Asia/Tokyo"
),
}[dtype]
if not unique:
data = data.repeat(5)
self.idx = data

def time_factorize(self, unique, sort, dtype):
self.idx.factorize(sort=sort)


class Duplicated:

params = [["first", "last", False], ["int", "uint", "float", "string"]]
param_names = ["keep", "dtype"]

def setup(self, keep, dtype):
N = 10 ** 5
data = {
"int": pd.Int64Index(np.arange(N).repeat(5)),
"uint": pd.UInt64Index(np.arange(N).repeat(5)),
"float": pd.Float64Index(np.random.randn(N).repeat(5)),
"string": tm.makeStringIndex(N).repeat(5),
}
self.idx = data[dtype]
# cache is_unique
self.idx.is_unique

def time_duplicated(self, keep, dtype):
self.idx.duplicated(keep=keep)


class DuplicatedUniqueIndex:

params = ["int", "uint", "float", "string"]
param_names = ["dtype"]
params = [
[True, False],
["first", "last", False],
["int", "uint", "float", "string", "datetime64[ns]", "datetime64[ns, tz]"],
]
param_names = ["unique", "keep", "dtype"]

def setup(self, dtype):
def setup(self, unique, keep, dtype):
N = 10 ** 5
data = {
"int": pd.Int64Index(np.arange(N)),
"uint": pd.UInt64Index(np.arange(N)),
"float": pd.Float64Index(np.random.randn(N)),
"string": tm.makeStringIndex(N),
}
self.idx = data[dtype]
"datetime64[ns]": pd.date_range("2011-01-01", freq="H", periods=N),
"datetime64[ns, tz]": pd.date_range(
"2011-01-01", freq="H", periods=N, tz="Asia/Tokyo"
),
}[dtype]
if not unique:
data = data.repeat(5)
self.idx = data
# cache is_unique
self.idx.is_unique

def time_duplicated_unique(self, dtype):
self.idx.duplicated()
def time_duplicated(self, unique, keep, dtype):
self.idx.duplicated(keep=keep)


class Hashing:
Expand Down
Loading