Add incremental TF-IDF learning fix for chatbot sample (issue #157) by Hardikrepo · Pull Request #189 · microsoft/AI

Hardikrepo · 2026-07-01T18:32:44Z

Summary

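Adds a community sample that fixes the O(n^2) growth issue raised in Chatbot #157: the original chatbot's learn_from_pair refit the entire TF-IDF vectorizer on every call, so the total work over n learned pairs grew quadratically.
The fix reuses the already-fitted vectorizer's transform() for each new example and appends the resulting row via scipy.sparse.vstack, so a learn call no longer re-tokenizes or refits the existing corpus (vstack itself still copies the stored matrix, so the append is cheap but not strictly O(1)). A full rebuild runs periodically (default: every 20 additions) to resync the vocabulary and IDF weights; until then, terms outside the fitted vocabulary are ignored in newly added rows.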
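The approach described above can be sketched roughly as follows. This is a minimal illustration, not the code in the PR: the class name `IncrementalTfidfBot`, the `respond` method, the `pending` counter, and the use of `cosine_similarity` for lookup are all assumptions made for the example; only `learn_from_pair`, `transform()`, `scipy.sparse.vstack`, and the rebuild-every-20 default come from the description.

```python
from scipy.sparse import vstack
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


class IncrementalTfidfBot:
    """Hypothetical stand-in for the sample's chatbot (names are illustrative)."""

    def __init__(self, pairs, rebuild_every=20):
        self.questions = [q for q, _ in pairs]
        self.answers = [a for _, a in pairs]
        self.rebuild_every = rebuild_every  # default taken from the PR summary
        self._rebuild()

    def _rebuild(self):
        # Full refit: refreshes the vocabulary and IDF weights for all questions.
        self.vectorizer = TfidfVectorizer()
        self.matrix = self.vectorizer.fit_transform(self.questions)
        self.pending = 0  # additions since the last full refit

    def learn_from_pair(self, question, answer):
        self.questions.append(question)
        self.answers.append(answer)
        self.pending += 1
        if self.pending >= self.rebuild_every:
            self._rebuild()
            return
        # Incremental path: vectorize only the new question with the
        # already-fitted vectorizer and append it as one more row.
        # Words outside the fitted vocabulary are ignored until the next rebuild.
        row = self.vectorizer.transform([question])
        self.matrix = vstack([self.matrix, row], format="csr")

    def respond(self, text):
        # Return the answer whose stored question is most similar to the input.
        query = self.vectorizer.transform([text])
        scores = cosine_similarity(query, self.matrix)[0]
        return self.answers[int(scores.argmax())]
```

A short usage example, with a small `rebuild_every` so that the rebuild path is exercised:

```python
bot = IncrementalTfidfBot(
    [("hello there", "hi"), ("what is your name", "I am a bot")],
    rebuild_every=3,
)
bot.learn_from_pair("hello name", "x")            # incremental append
bot.learn_from_pair("what is your bot", "Bot")    # incremental append
bot.learn_from_pair("good morning", "morning!")   # third addition triggers a rebuild
print(bot.respond("good morning"))
```

The trade-off in this sketch is that rows added between rebuilds use stale IDF weights and drop unseen words, which is why the periodic refit is kept.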

Context

See discussion in #157 for the original script and review comments identifying the O(n^2) issue and suggesting an incremental approach.

Test plan

Run community-samples/tfidf-chatbot-incremental-fix/simple_chatbot.py interactively and confirm learn: commands work and responses remain correct after several incremental learns.

Hardikrepo · 2026-07-01T18:35:48Z

@microsoft-github-policy-service agree

Add incremental TF-IDF learning fix for chatbot in issue microsoft#157

a5a7d28

Avoids O(n^2) growth from refitting the vectorizer on every learn_from_pair call by appending via scipy.sparse.vstack and rebuilding only periodically.

Hardikrepo wants to merge 1 commit into microsoft:master from Hardikrepo:fix/chatbot-incremental-tfidf-157
