AI/NLP 🗣

    [NLP] BoW(Bag of words)

    Bag of Words is the simplest technique for representing words and documents in numeric form, and it is said to have been widely used in text mining before deep learning techniques were applied.

    Step 1. Constructing the vocabulary containing unique words

    Example sentences: "John really really loves this movie", "Jane really likes this song"

    In these sentences, "really" and "this" appear more than once, so each needs to be included in the vocabulary only once.

    Vocabulary: {"John", "really", "loves", "this", "movie", "Jane", "likes", "song"}

    Step 2. Encoding unique words ..
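
The vocabulary construction in Step 1 above can be sketched in a few lines of Python. This is only a minimal illustration: the function name `build_vocabulary` is hypothetical, and splitting on whitespace is an assumption, since the post does not say how the sentences are tokenized. Words are kept in order of first appearance so that the result matches the vocabulary listed above.

```python
def build_vocabulary(sentences):
    """Collect the unique words of all sentences, in order of first appearance."""
    vocabulary = []
    seen = set()
    for sentence in sentences:
        # Assumption: plain whitespace tokenization, case-sensitive.
        for word in sentence.split():
            if word not in seen:
                seen.add(word)
                vocabulary.append(word)
    return vocabulary


sentences = [
    "John really really loves this movie",
    "Jane really likes this song",
]

vocabulary = build_vocabulary(sentences)
print(vocabulary)
# ['John', 'really', 'loves', 'this', 'movie', 'Jane', 'likes', 'song']
```

The repeated "really" and "this" are skipped on their second occurrence, which leaves eight unique words.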