๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

๐ŸŒด Course Review (Master)/๐Ÿš€ ๊ธฐ๊ณ„ ํ•™์Šต2

MLE์˜ ๊ตฌ์ฒด์ ์ธ ๋œป๊ณผ ์ถ”์ •๋ฐฉ๋ฒ•. KL Divergence์™€ MLE์˜ ์ƒ๊ด€๊ด€๊ณ„ Maximum Likelihood Estimation Likelihood function์ด๋ž€ ๋ฌด์—‡์ธ๊ฐ€? ํŒŒ๋ผ๋ฏธํ„ฐ ํ•ด์„์— ๋”ฐ๋ผ ์–ผ๋งˆ๋‚˜ ๊ด€์ธก์น˜๊ฐ€ ์–ผ๋งˆ๋‚˜ ๊ฐ€๋Šฅ์„ฑ์ด ์žˆ๋Š”์ง€๋ฅผ ํ‘œํ˜„ํ•˜๋Š” ๊ฒƒ์„ ์˜๋ฏธํ•ฉ๋‹ˆ๋‹ค. ์ฆ‰, ์ €ํฌ๊ฐ€ ์ง‘์ค‘ํ•ด์•ผ ํ•˜๋Š” ๊ฒƒ์€ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ์–ด๋–ป๊ฒŒ ํ•ด์„ํ•˜๋Š๋ƒ์ธ๋ฐ ํŒŒ๋ผ๋ฏธํ„ฐ๋กœ๋ถ€ํ„ฐ ๋„์ถœ๋˜๋Š” ๊ฐ’(์–ผ๋งˆ๋‚˜ ๊ฐ€๋Šฅ์„ฑ์ด ์žˆ๋Š”์ง€์— ๋Œ€ํ•œ ๊ฐ’ == Likelihood)์ด ๊ฐ€์žฅ ํฐ ๊ฒƒ์„ ์ฐพ๊ธฐ ์œ„ํ•œ ๋ฐฉ๋ฒ•์ด MLE ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค. ๋ณดํ†ต MLE๋ฅผ ๊ตฌํ•  ๋•Œ Log ๋ฅผ ๋ถ™์ด๊ฒŒ ๋˜๋Š”๋ฐ ๊ทธ ์ด์œ ๋Š” Logํ•จ์ˆ˜๋Š” ๋‹จ์ผ ์ฆ๊ฐ€ํ•จ์ˆ˜ ์ด๊ธฐ ๋•Œ๋ฌธ์— ๊ธฐ์กด ์ตœ๋Œ€๊ฐ’์„ ๊ฐ–๊ฒŒ๋˜๋Š” ์„ธํƒ€์— ์˜ํ–ฅ์„ ๋ฏธ์น˜์ง€ ์•Š๊ณ  ๊ณ„์‚ฐ์„ ๋” ์‰ฝ๊ฒŒํ•  ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค. ๋˜ํ•œ Log ์—ฐ์‚ฐ์˜ ๊ฒฝ์šฐ ๊ณฑ์…ˆ์ด ๋ชจ๋‘ ํ•ฉ์œผ๋กœ ํ‘œํ˜„๋˜๋Š” ์žฅ์ ์ด ์žˆ์Šต๋‹ˆ๋‹ค. Binomial Distribution์˜ ๊ฒฝ์šฐ๋ฅผ ์‚ดํŽด๋ณด.. 2024. 3. 7.
๊ธฐ๊ณ„ํ•™์Šต์—์„œ Estimation์„ ํ•˜๋Š” ๊ทผ๋ณธ์  ์ด์œ . Parametric Estimation์ด๋ž€? Motivation for MLE ๋งŒ์ผ ๋™์ „์„ ๋˜์กŒ์„ ๋•Œ ๋™์ „์˜ ์•ž๋ฉด์ด ๋‚˜์˜ฌ ํ™•๋ฅ ์„ x ๋ผ๊ณ  ํ•˜์ž. 10๋ฒˆ ๋˜์กŒ์„ ๊ฒฝ์šฐ ์•ž๋ฉด์ด 7๋ฒˆ ๋‚˜์˜ค๊ณ  ๋’ท๋ฉด์ด 3๋ฒˆ ๋‚˜์˜จ ๊ฒฝ์šฐ x๋Š” 0.7 ์ด๋ผ๊ณ  ๋‹ตํ•  ๊ฒƒ์ด๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ์ด ํ™•๋ฅ ์„ ์–ด๋–ป๊ฒŒ ์ถ”๋ก ํ•œ ๊ฒƒ์ธ๊ฐ€? ์ด ์ถ”๋ก  ๊ณผ์ •์€ ๊ด€์ธก(Observation) ์œผ๋กœ๋ถ€ํ„ฐ ํŒŒ๋ผ๋ฏธ๋” (Parameter)๋ฅผ ๋„์ถœํ•˜๋Š” ๊ณผ์ •์œผ๋กœ ์ƒ๊ฐํ•  ์ˆ˜ ์žˆ๋‹ค. ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ํŒจํ„ด(Pattern)์„ ์ฐพ์•„๋‚ด๋Š” ๋ฐฉ๋ฒ• ์ค‘ ํ•˜๋‚˜๋Š” The Most likely probability model์„ ์ฐพ๋Š” ๊ฒƒ์ด๋‹ค. ๋Œ€ํ‘œ์ ์œผ๋กœ ๋‘ ๊ฐ€์ง€ ๋ฐฉ์‹์œผ๋กœ ํŒจํ„ด์„ ์ฐพ์•„๋‚ด๋Š”๋ฐ ํ•˜๋‚˜๋Š” Supervised Learning์œผ๋กœ ์ •๋‹ต label ์—†์ด ์ž…๋ ฅ๊ฐ’ x์™€ ์ถœ๋ ฅ๊ฐ’ y์˜ ๊ฒฝํ–ฅ์„ฑ์„ ์ฐพ์•„๋‚ด๋Š” ๊ฒƒ์ด๊ณ  ๋‹ค๋ฅธ ํ•˜๋‚˜๋Š” Unsupervised Learning.. 2024. 3. 7.