Njengoba umthamo wedatha engahlelekile uqhubeka ukhula ngokuphawulekayo, isidingo samathuluzi okuhlaziya umbhalo anembile nasebenza kahle siye saba bucayi kakhulu ezimbonini ezihlukahlukene, njengokumaketha, ezezimali, ukunakekelwa kwezempilo, kanye nesayensi yezenhlalo.
Ngokwesiko, ukuhlaziywa kombhalo kuye kwenziwa kusetshenziswa izindlela ezisekelwe emithethweni nezindlela zokufunda zomshini ezifana ne-SpaCY kanye nenqubo ye-transformer. Nakuba lezi zindlela seziphumelele, zidinga umzamo omkhulu kanye nobuchwepheshe ukuze ziphelele.
Ngokufika kwezinhlobo zezilimi ezinkulu (LLM) ezifana I-ChatGPT di I-OpenAI. Ibonise amakhono amangalisayo ekukhiqizeni umbhalo ofana nomuntu kanye nesimo sokuqonda, okuwenza ube yithuluzi elithembisayo lemisebenzi yokuhlaziya umbhalo njenge entity recognition
, sentiment analysis
, futhi topic modeling
.
Ake sibone manje ukuthi singakwenza kanjani ukuhlukanisa umbhalo sisebenzisa i-ChatGPT.
Esikhathini esidlule, besihlala sisebenzisa amamodeli ahlukene emisebenzini ehlukene ekufundeni komshini. Isibonelo, uma ngifuna ukukhipha ulwazi embhalweni, ngizodinga ukusebenzisa imodeli yokuqashelwa kwebhizinisi (NER - Named Entity Recognition
), uma ngidinga ukuhlukanisa umbhalo wami ube amakilasi ahlukene, ngizodinga imodeli yokuhlukanisa. Umsebenzi ngamunye ohlukene ubudinga ukuthi amamodeli aqeqeshwe ngendlela ehlukene ngomsebenzi ngamunye, ngokudlulisela ukufunda noma ngokuqeqeshwa.
Ngokwethulwa kwe Large Language Models (LLM), imodeli ye-LLM izokwazi ukwenza imisebenzi eminingi ye-NLP ngokuqeqeshwa noma ngaphandle kokuqeqeshwa. Noma yimuphi umsebenzi kungaba defikuqedwe kalula ngokushintsha imiyalo kumiyalo.
Manje ake sibone ukuthi ungawenza kanjani umsebenzi we-NLP wendabuko I-ChatGPT futhi uyiqhathanise nendlela yesintu. Imisebenzi ye-NLP ezokwenziwa ngu I-ChatGPT kulesi sihloko kukhona:
Sentiment analysis
I-Entity Entity Recognition (NER) ibhekisela emsebenzini wokuhlonza ngokuzenzakalelayo amabhulokhi edatha yombhalo. Isetshenziselwa kakhulu ukukhipha izigaba zebhizinisi ezibalulekile njengamagama ezidakamizwa kumanothi omtholampilo, imigomo ehlobene nengozi evela ezimangalweni zomshwalense, neminye imigomo eqondene nesizinda kumarekhodi.
Qaphela ukuthi lo msebenzi uqondene ngqo nesizinda sezokwelapha. Bekuvame ukudinga ukuthi sichasise futhi siqeqeshe imigqa yedatha engaphezu kuka-10.000 kumodeli eyodwa ukuze yazi isigaba esithile kanye nethemu embhalweni. I-ChatGPT ingakwazi ukuhlonza kahle igama ngaphandle kwanoma yimuphi umbhalo oqeqeshwe kusengaphambili noma ukucushwa kahle, okuwumphumela omuhle uma kuqhathaniswa!
Ukuhlukaniswa kombhalo kubhekisela enqubweni ezenzakalelayo yokuthola nokuhlukanisa umbhalo ngezigaba kusuka kudatha enkulu, idlala indima ebalulekile ekubuyiseni nasekukhishweni kwedatha yombhalo. Izibonelo zezinhlelo zokusebenza zokuhlukanisa umbhalo zifaka izexwayiso zomtholampilo noma ukuhlukaniswa kwezinto eziyingozi, ukuhlukaniswa ngezigaba okuzenzakalelayo kokuxilonga, nokutholwa kogaxekile.
Sentiment analysis
Sentiment analysis
kubandakanya ukunquma imizwa noma imizwa evezwa esiqeshini sombhalo. Ihlose ukuhlukanisa umbhalo ngokwezigaba zangaphambilidefinite, njengephozithivu, engemihle, noma engathathi hlangothi, ngokusekelwe emuzweni owumsuka ovezwe umlobi.
Izicelo zokuhlaziya imizwa zifaka:
Izifinyezo ezizenzakalelayo zibhekisela enqubweni lapho izihloko eziyinhloko zombhalo owodwa noma ngaphezulu zibonwa futhi zethulwe ngendlela emfushane nenembile. Lokhu kuvumela umsebenzisi ukuthi abheke izingcezu ezinkulu zedatha ngesikhathi esifushane. Isibonelo sezinhlelo zokusebenza zifaka isistimu yesifinyezo evumela ukukhiqizwa okuzenzakalelayo kwezifinyezo ezivela kuma-athikili ezindaba kanye nokufingqwa kolwazi ngokukhipha imisho ezifushaneni zephepha locwaningo.
I-ChatGPT iyithuluzi elihle kakhulu lokufingqa, ikakhulukazi lezindatshana ezinde nezibuyekezo eziyinkimbinkimbi. Ngokunamathisela ukubuyekezwa ku-ChatGPT, singakwazi kalula ukwazi isifinyezo sokubuyekezwa komkhiqizo shazi.
Njengoba inhloso yalesi sihloko iwukuhlola ikhono lama-LLM okwenza imisebenzi yokuhlaziya umbhalo, kubalulekile ukuqaphela ukulinganiselwa kwawo. Eminye yemikhawulo eyinhloko yama-LLM ihlanganisa:
Ercole Palmeri
I-Coveware ye-Veeam izoqhubeka nokuhlinzeka ngezinsizakalo zokuphendula izigameko zokuntshontshwa kwe-inthanethi. I-Coveware izohlinzeka ngama-forensics kanye nekhono lokulungisa…
Ukulungiswa okuqagelayo kuguqula umkhakha kawoyela negesi, ngendlela emisha nesebenzayo yokuphatha izitshalo.…
I-CMA yase-UK ikhiphe isexwayiso mayelana nokuziphatha kwe-Big Tech emakethe yezobunhloli bokwenziwa. Lapho…
Isinqumo esithi "Case Green", esakhiwe yi-European Union ukuze kuthuthukiswe ukusebenza kahle kwamandla ezakhiwo, siphothule inqubo yaso yomthetho ngokuthi...