Skip to content

why doesn't \p{Emoji}+ match all emoji? #947

Answered by BurntSushi
phungleson asked this question in Q&A
Discussion options

You must be logged in to vote

In the future, when filing issues, it would be helpful to provide code that I can run. That way, I can be sure I know what it is you're talking about.

With that said, my guess here is that you're conflating \p{Emoji} with the concept of "emoji." \p{Emoji} is a Unicode property, and it is one component of the Unicode emoji technical standard. Actually detecting and extracting emoji requires implementing that UTS, which will use \p{Emoji} for sure, but it is nowhere close to sufficient to implement. Appendix A of UTS#51 outlines the various properties related to emoji, and Emoji is merely one of them. Presumably one would need to use all of them to implement proper emoji extraction. (I note…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@phungleson
Comment options

Answer selected by phungleson
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #946 on January 18, 2023 12:25.