Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add unic-segment component #174

Merged
merged 2 commits into from
Oct 11, 2017
Merged

Add unic-segment component #174

merged 2 commits into from
Oct 11, 2017

Conversation

behnam
Copy link
Member

@behnam behnam commented Oct 6, 2017

Make all unic-ucd-segment properties Total, as turns out using these properties is easier as a total property, instead of partial, as we implemented initially. The reason for it is the fact that the default value (called Other in this context) is usually used in the same level as other values.

The algorithm implementation is based on unicode-segmentation, with changes to use unic-ucd-segment.

The API is kept minimal at the moment, with only segment boundry objects and their indices objects. As we expand our string-level APIs, we can add on what's needed.

The tests for the algorithm are imported. Also new test is created for unic-ucd-segment, using the same source.

Closes #135

Turns out using these properties is easier as a total property, instead
of partial, as we implemented initially. The reason for it is the fact
that the default value (called `Other` in this context) is usually used
in the same level as other values.

Update variant name for `WordBreak::DoubleQuote`, which was using the
abbr name instead.

Add missing component version test against `unic-ucd-core`.
@behnam behnam requested a review from CAD97 October 6, 2017 09:10
@behnam behnam added A: lib-impl Library Implementation C: segmentation Unicode Text Segmentation labels Oct 6, 2017
@behnam behnam added this to the UNIC-0.7 milestone Oct 6, 2017
The algorithm implementation is based on
[`unicode-segmentation`](https://github.com/unicode-rs/unicode-segmentation).

The API is kept minimal at the moment, with only segment boundry objects
and their indices objects. As we expand our string-level APIs, we can
add on what's needed.
@behnam
Copy link
Member Author

behnam commented Oct 11, 2017

bors: r+

bors bot added a commit that referenced this pull request Oct 11, 2017
174: Add unic-segment component r=behnam a=behnam

Make all `unic-ucd-segment` properties Total, as turns out using these properties is easier as a total property, instead of partial, as we implemented initially. The reason for it is the fact that the default value (called `Other` in this context) is usually used in the same level as other values.

The algorithm implementation is based on [`unicode-segmentation`](https://github.com/unicode-rs/unicode-segmentation), with changes to use `unic-ucd-segment`.

The API is kept minimal at the moment, with only segment boundry objects and their indices objects. As we expand our string-level APIs, we can add on what's needed.

The tests for the algorithm are imported. Also new test is created for `unic-ucd-segment`, using the same source.

Closes <#135>
@bors
Copy link
Contributor

bors bot commented Oct 11, 2017

Build succeeded

@bors bors bot merged commit 029f7d3 into master Oct 11, 2017
@behnam behnam deleted the segment branch October 14, 2017 20:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A: lib-impl Library Implementation C: segmentation Unicode Text Segmentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant