DEPR: DataFrame.lookup #35224

erfannariman · 2020-07-11T00:05:41Z

xref DEPR: let's deprecate #18262
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

erfannariman · 2020-07-11T00:07:47Z

This probably needs more discussion, but I thought I'd open this PR as a starting point.

jreback

you need to change the existing tests to assert that there is a FutureWarning otherwise tests will fail
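As a rough illustration of what asserting the warning can look like, here is a minimal sketch using only the standard library. The `lookup` function below is a hypothetical stand-in, not the pandas method; pandas' own test suite would normally use its `tm.assert_produces_warning(FutureWarning)` helper instead.

```python
import warnings


def lookup(values, row, col):
    """Hypothetical stand-in for a deprecated method (not pandas' code)."""
    warnings.warn(
        "lookup is deprecated and will be removed in a future version.",
        FutureWarning,
        stacklevel=2,
    )
    return values[row][col]


def test_lookup_deprecated():
    # Record every warning raised inside the block so it can be inspected.
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        result = lookup([[1, 2], [3, 4]], 1, 0)

    # The call still works, but it must emit a FutureWarning.
    assert result == 3
    assert any(issubclass(w.category, FutureWarning) for w in caught)


test_lookup_deprecated()
```

Any existing test that calls the deprecated function without such a wrapper would surface the warning, which is why those tests need updating too.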

pandas/core/frame.py

erfannariman · 2020-07-11T00:24:37Z

In the xref you mention "Maybe a standalone function somewhere", is this still the idea or are we going to mention melt and loc? @jreback

jreback · 2020-07-11T00:25:52Z

no just put a nice example in the indexing docs - reference this in the deprecation
you can melt and use loc there

don’t need a standalone function

doc/source/user_guide/indexing.rst

pandas/core/frame.py

pep8speaks · 2020-07-15T20:26:47Z

Hello @erfannariman! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-09-16 17:58:12 UTC

erfannariman · 2020-07-15T20:55:50Z

@WillAyd I put Label-based "fancy indexing" on top, but it still fails with the same message. The message is not really informative, "warning before the summary" really does not make sense to me.

pandas/core/frame.py

doc/source/user_guide/indexing.rst

jreback · 2020-09-13T22:15:45Z

doc/source/user_guide/indexing.rst

+                       'B': [80, 55, 76, 67]})
+    df
+    melt = df.melt('col')
+    df['lookup'] = melt.query('col == variable')['value'].to_numpy()


rather than query, use .loc here as it's more idiomatic (plus it's a single statement). Do we actually need to convert to numpy? (e.g. to avoid alignment)

The reason I used to_numpy is indeed for the index alignment. With .loc and without to_numpy, I would need reset_index. So that would look like:

    melt = df.melt('col')
    melt = melt.loc[melt['col'] == melt['variable'], 'value']
    df['lookup'] = melt.reset_index(drop=True)
    print(df)

      col     A   B  lookup
    0   A  80.0  80    80.0
    1   A  23.0  55    23.0
    2   B   NaN  76    76.0
    3   B  22.0  67    67.0

Which one do you prefer? I can understand that this is more idiomatic, although it's a bit more code.

Btw, now that I'm typing it, I realize this method would fail if the index was not 0, ..., n-1
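To make the alignment concern concrete, here is a small sketch of both variants on a frame whose index is deliberately not 0, ..., n-1. The data is reconstructed from the printed output above; the custom index and the names `picked`, `by_position` and `by_label` are assumptions added for illustration.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(
    {
        "col": ["A", "A", "B", "B"],
        "A": [80, 23, np.nan, 22],
        "B": [80, 55, 76, 67],
    },
    index=[10, 11, 12, 13],  # assumed non-default index, to show the pitfall
)

# melt produces one row per (original row, value column) pair;
# keep the rows where the requested column matches the melted column.
melt = df.melt("col")
picked = melt.loc[melt["col"] == melt["variable"], "value"]

# Variant 1: to_numpy() assigns by position, so the index does not matter.
by_position = df.assign(lookup=picked.to_numpy())

# Variant 2: reset_index(drop=True) yields labels 0..3, which do not match
# the labels 10..13, so label alignment fills the new column with NaN.
by_label = df.assign(lookup=picked.reset_index(drop=True))

print(by_position)
print(by_label)
```

With a default RangeIndex both variants give the same column, which is why the problem is easy to miss.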

This is a horrible choice as an alternative for lookup. Essentially, melt = df.melt('col') duplicates the data (not to mention the extra variable column), while one could resort to NumPy indexing instead.

I'm not sure how lookup is implemented, but from what I could see on SO it answers a real need. To me pivot and pivot_table are more redundant than lookup.
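One way the NumPy-indexing idea could be written is sketched below. This is an assumed formulation, not necessarily what the commenter had in mind: `pd.factorize` turns the labels in `col` into integer codes, and those codes are used as column positions in a fancy-indexing step. It assumes every value in `col` names an existing column; a missing value would get code -1 and silently select the last column, which this sketch does not handle.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(
    {
        "col": ["A", "A", "B", "B"],
        "A": [80, 23, np.nan, 22],
        "B": [80, 55, 76, 67],
    }
)

# codes[i] is the position of df["col"][i] within `labels`.
codes, labels = pd.factorize(df["col"])

# Order the value columns to match `labels`, then pick one cell per row
# with paired row/column positions (no reshaped copy of the frame needed).
values = df.reindex(labels, axis=1).to_numpy()
df["lookup"] = values[np.arange(len(df)), codes]

print(df)
```

Because the assignment is a plain array, the result is positional and does not depend on the frame's index.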

@quanghm my response is to the tone of the argument

i am not averse to the function of lookup but it's not going to be a top level api

> i am not averse to the function of lookup but it's not going to be a top level api

Yet another reason why I don't go and make a pull request. I don't understand where, other than the top-level API, I could implement the lookup functionality, and I still don't understand what is meant by performant.

> @quanghm my response is to the tone of the argument

Can you please be more specific? The way I see it, @erfannariman was basically saying that unless I made a pull request, I cannot criticize the suggested alternative as horrible, even when I spent time to analyze, test and show that it is, in fact, horrible. OK, if horrible is an offensive word, my apologies, English is not my first language. Let me rephrase: it is very bad.

I do feel that my effort to contribute here is not valued. That was the best I could afford to do to improve pandas. Not everyone can afford to make a pull request, modify the code, and run and pass all the unit tests.

Thanks for all the work maintaining and developing Pandas anyway.

From what I learned from the other discussion, you are not the one that has a say on bringing back lookup, and those that do don't seem to be willing to review the issue.

Opening a ticket with everything you wrote here has a better chance of being reviewed by the core devs than writing all this on a closed PR. That way the other devs can see it as well, since I am quite sure they are not seeing this whole discussion.

If you don't feel comfortable opening an issue, please let me know and I will open one, since Jeff mentioned he's open to a discussion about re-implementing lookup.

Actually I already opened the ticket since it's better to have the discussion there: see #40140 @quanghm @jreback

doc/source/whatsnew/v1.1.0.rst

pandas/core/frame.py

WillAyd

lgtm @jreback

doc/source/user_guide/indexing.rst

doc/source/whatsnew/v1.1.2.rst

jreback · 2020-09-17T02:39:47Z

thanks @erfannariman very nice!

erfannariman added 5 commits July 11, 2020 02:01

DEPR - 18262 - deprecate lookup

5bbddce

DEPR - 18262 - changes black

02b50ac

Add test for deprecation lookup

ca8bf05

Added deprecation to whatsnew

dea45b9

Add test for deprecation lookup

5f116ad

jreback requested changes Jul 11, 2020


pandas/core/frame.py (outdated, resolved)

jreback added the Deprecate and Indexing labels Jul 11, 2020

erfannariman added 9 commits July 11, 2020 19:15

FIX - 18262 - add warnings to other tests

0c04e90

Merge remote-tracking branch 'upstream/master' into 18262-depr-lookup

758468d

DOC - 18262 - added example to lookup values

7ac3b32

DOC - 18262 - point to example in depr

2ab80cf

FIX - 18262 - deprecation warning before summary

94a6c0f

FIX - 18262 - whitespaces after comma

a339131

FIX - 18262 - removed double line break

269e4cc

Fix variable in ipython

0c40d69

18262 - removed linebreak

1ca23bc

WillAyd reviewed Jul 13, 2020


doc/source/user_guide/indexing.rst (outdated, resolved)

pandas/core/frame.py (outdated, resolved)

erfannariman added 2 commits July 15, 2020 22:21

18262 - Fix merge conflict

9681a3d

18262 - replaced depr message

6342ad2

erfannariman added 2 commits July 15, 2020 22:28

18262 - line break too long line

3dfe19d

18262 - set header size

187d47b

jreback requested changes Jul 17, 2020


pandas/core/frame.py (outdated, resolved)

erfannariman added 2 commits July 28, 2020 21:23

[FIX] - 18262 - Merge conflict

db63df7

[FIX] - 18262 - removed extra dash header

227fad5

erfannariman added 4 commits September 13, 2020 20:53

moved depr version to 1.2

dc2d367

test with linking to user guide

293bd7a

Remove line break

cbca163

Merge branch 'master' into 18262-depr-lookup

90fa6a9

jreback requested changes Sep 13, 2020


erfannariman added 7 commits September 14, 2020 23:03

Merge branch 'master' into 18262-depr-lookup

3eefd8e

Revert whatsnew v1.1.0

4c3c163

Added depr message in whatsnew v1.2.0

b5a34e3

replace query with loc

ba4fb8a

add melt and loc to depr msg

6b91db6

add dot

ff7724f

added colon hyperlink

104e3cb

erfannariman requested review from jreback and WillAyd September 15, 2020 13:52

WillAyd approved these changes Sep 15, 2020


jreback requested changes Sep 15, 2020


doc/source/user_guide/indexing.rst (outdated, resolved)

doc/source/whatsnew/v1.1.2.rst (outdated, resolved)

updates

25e78dd

erfannariman requested a review from jreback September 16, 2020 18:05

erfannariman mentioned this pull request Sep 16, 2020

CLN: sql static method #36410

Closed

3 tasks

jreback added this to the 1.2 milestone Sep 17, 2020

jreback mentioned this pull request Sep 17, 2020

DEPR: let's deprecate #18262

Closed

34 tasks

jreback approved these changes Sep 17, 2020


jreback merged commit 1a3a2c1 into pandas-dev:master Sep 17, 2020

erfannariman deleted the 18262-depr-lookup branch September 17, 2020 09:22

rhshadrach pushed a commit to rhshadrach/pandas that referenced this pull request Sep 17, 2020

DEPR: DataFrame.lookup (pandas-dev#35224)

5309fa7

kesmit13 pushed a commit to kesmit13/pandas that referenced this pull request Nov 2, 2020

DEPR: DataFrame.lookup (pandas-dev#35224)

d0e95b4

jreback mentioned this pull request Jan 14, 2021

ENH: Reimplement and undeprecate DataFrame.lookup #39171

Closed

jreback mentioned this pull request Jan 26, 2021

DOC: link to correct PR #39406

Merged

erfannariman mentioned this pull request Mar 1, 2021

ENH: re-implement DataFrame.lookup. #40140

Open


