Skip to content

Conversation

@oneby-wang
Copy link

  1. Add List<Character> punctuationMarks in TokenTextSplitter.
  2. Add protected method: getLastPunctuationIndex() in TokenTextSplitter.
  3. Add unit test in TokenTextSplitterTest.

Signed-off-by: oneby-wang <onebywang@qq.com>
@oneby-wang oneby-wang force-pushed the token_text_splitter_expose_punctuation_mark branch from 4a50283 to 632df5f Compare November 24, 2025 00:53
@ilayaperumalg ilayaperumalg self-assigned this Dec 2, 2025
@ilayaperumalg ilayaperumalg added the enhancement New feature or request label Dec 2, 2025
@ilayaperumalg ilayaperumalg added this to the 2.0.0.M1 milestone Dec 2, 2025
@ilayaperumalg ilayaperumalg self-assigned this Dec 10, 2025
@ilayaperumalg
Copy link
Member

@oneby-wang Thanks for the PR! Rebased and merged as 9773099 and added documentation via e59be78. Backported into 1.1.x as c0e279a and 7abe9a0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants