Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

vectorizer_test.py 448 B

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
  1. from bald.vectorizer import Vectorizer, LabelVectorizer
  2. from bald.vocab import Vocab
  3. def test_char_vectorizer():
  4. vocab = Vocab()
  5. vocab.add_token("a")
  6. vocab.add_token("b")
  7. ve = Vectorizer(vocab)
  8. seq = ["a","c","b"]
  9. assert ve.pre_vectorize(seq) == [1,3,0,4,2]
  10. def test_label_vectorizer():
  11. sequence = ["O", "I-PER", "B-ORG", "I-ORG"]
  12. seq_v = LabelVectorizer.pre_vectorize(sequence)
  13. assert seq_v == [0,0,1,2,2,0]
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...