Buyini ubuchwepheshe be-OCR?
I-Optical Character Recognition (IsiNgisi: I-Optical Character Recognition, OCR) ibhekisela enqubweni yokuhlaziya nokubona amafayela ezithombe zezinto zombhalo ukuze kutholwe ulwazi lombhalo nesakhiwo.
Ngokufana nokuqashelwa kwesithombe kanye nobuchwepheshe bombono womshini, inqubo yokucubungula yobuchwepheshe be-OCR nayo ihlukaniswe yaba okokufaka, ukucubungula kwangaphambili, ukucutshungulwa kwaphakathi nethemu, ukucutshungulwa kwangemuva kanye nenqubo yokukhipha.
ngena
Kumafomethi ezithombe ahlukene, kunamafomethi okugcina ahlukene kanye nezindlela zokuminyanisa ezihlukene.Okwamanje, kukhona i-OpenCV, i-CxImage, njll.
Ukucutshungulwa kwangaphambili - ukwenza i-binarization
Iningi lezithombe ezithathwe amakhamera edijithali namuhla ziyizithombe ezinemibala, equkethe inani elikhulu lolwazi futhi ayifanele ubuchwepheshe be-OCR.
Ngokuqukethwe kwesithombe, singamane sikuhlukanise ngaphambili nangemuva.Ukuze senze ikhompuyutha isheshe futhi yenze kangcono izibalo ezihlobene ne-OCR, sidinga ukucubungula isithombe sombala kuqala, ukuze kuhlale kuphela ulwazi lwangaphambili nolwazi lwangemuva esithombeni.I-binarization ingaqondwa kalula ngokuthi "okumnyama nokumhlophe".
ukuncishiswa komsindo wesithombe
Ezithombeni ezihlukene, incazelo yomsindo ingase yehluke, futhi inqubo yokukhipha umsindo ngokwezimpawu zomsindo ibizwa ngokuthi ukunciphisa umsindo.
ukulungiswa kokutsheka
Ngenxa yokuthi abasebenzisi abajwayelekile, lapho bethatha izithombe zemibhalo, kunzima ukudubula ngokuphelele ngokuhambisana nokuqondanisa okuvundlile nokuma mpo, ngakho-ke izithombe ezithathiwe zizogwenywa nakanjani, okudinga isofthiwe yokucubungula izithombe ukuze ilungiswe.
Ukucutshungulwa kwethemu emaphakathi – ukuhlaziywa kwesakhiwo
Inqubo yokuhlukanisa izithombe zemibhalo zibe izigaba namagatsha ibizwa ngokuthi ukuhlaziywa kwesakhiwo.Ngenxa yokuhlukahluka nobunkimbinkimbi bamadokhumenti angempela, lesi sinyathelo sisadinga ukuthuthukiswa.
ukusika izinhlamvu
Ngenxa yemikhawulo yemibandela yokuthwebula nokubhala, izinhlamvu zivame ukunamathela futhi amapeni aphukile.Ukusebenzisa ngokuqondile izithombe ezinjalo ekuhlaziyeni kwe-OCR kuzokhawulela kakhulu ukusebenza kwe-OCR.Ngakho-ke, ukuhlukaniswa kwezinhlamvu kuyadingeka, okungukuthi, ukuhlukanisa izinhlamvu ezihlukene.
Ukuqashelwa kwezinhlamvu
Esigabeni sokuqala, ukufaniswa kwezifanekiso kwakusetshenziswa kakhulu, futhi esigabeni sakamuva, ukukhishwa kwesici kwakusetshenziswa kakhulu.Ngenxa yethonya lezici ezinjengokususwa kombhalo, ukujiya kwe-stroke, ipeni eliphukile, ukunamathela, ukuzungezisa, njll., ubunzima bokukhishwa kwesici buthinteka kakhulu.
Ukubuyiselwa kwesakhiwo
Abantu bathemba ukuthi umbhalo owaziwayo usahlelwa njengesithombe sombhalo wokuqala, futhi izigaba, izikhundla, nokuhleleka kuphuma kumadokhumenti e-Word, imibhalo ye-PDF, njll., futhi le nqubo ibizwa ngokuthi ukubuyiselwa kwesakhiwo.
ukucubungula okuthunyelwe
Ngokuhambisana nengqikithi yolimi oluthile, umphumela wokuqashelwa uyalungiswa.
okukhiphayo
Khipha izinhlamvu ezaziwayo njengombhalo ngefomethi ethile.
Yiziphi izinhlelo zokusebenza zamatheminali aphathwayo asuselwa kubuchwepheshe be-OCR?
Ngetheminali ephathwayo ye-PDA elayishwe isofthiwe yokuqaphela uhlamvu ye-OCR, izicelo eziningi zesigcawu zingenziwa, njengalezi: ukuqashelwa kwepuleti lemoto, ukuqashelwa kwenombolo yesiqukathi, ukuqashelwa kwelebula lesisindo senyama yenkomo kanye nesemvu, ukuqashelwa kwendawo efundeka umshini wokudlula, ukuqashelwa kokufunda imitha kagesi. , ikhoyili yensimbi Ukuqashelwa kwezinhlamvu ezifuthwe.
Isikhathi sokuthumela: Nov-16-2022