Implementation of Tesseract OCR for Automated Payment Validation in E-Commerce
Abstract
The rapid expansion of e-commerce in Indonesia has resulted in a significant increase in digital transactions, necessitating expedited and precise payment verification. Administrators at the SweetJab hijab e-commerce platform must manually verify bank transfer receipts, a process that is time-consuming and susceptible to errors. This study utilises Optical Character Recognition (OCR) with the Tesseract engine as a supplementary approach for verifying transfer payments on the SweetJab website. The methodology encompasses image preprocessing (resizing to 200%, converting to greyscale, and enhancing contrast), employing Tesseract OCR with PSM 6 and an LSTM model for character recognition, and utilising regular expressions (regex) to extract structured transaction data. We employed Black Box Testing and Character Error Rate (CER) computations on 40 preliminary test samples and 40 post-implementation samples to assess the system. The initial test demonstrated an accuracy of 89.5%, which increased to 92.5% upon complete system integration. This study demonstrates that OCR is an effective method for extracting information from payment receipts, while maintaining security through a final manual verification by the administrator.
References
Bahar, B., Raban, R. D. Y., & Arnie. (2023). Model pendeteksi nominal uang kertas rupiah menggunakan teknologi optical character recognition. Jurnal Ticom: Technology of Information and Communication, 12(1), 8–13. https://doi.org/10.70309/ticom.v12i1.101
Banu, K., Andreas, D., Anggoro, W., & Setiawan, A. (2023). OCR: Masa depan pengenalan karakter optik dan dampaknya pada kehidupan modern. Jurnal Teknologi Informasi, 9(2), 147–156.
Dijaya, R. (2023). Buku ajar pengolahan citra digital. Penerbit Akademik.
Ente, N. H., Rohandi, M., & Yusuf, R. (2023). Sistem informasi pengarsipan menggunakan optical character recognition berbasis web di kantor notaris dan PPAT. Diffusion: Jurnal Sistem Informasi dan Teknologi, 3(2), 195–203.
Fahimah & Harsono. (2023). Literature review of the evolution of payment system paradigms: From cash to cashless with digital payment. Social, Humanities, and Education Studies Conference Series, 6(3), 11–18.
Hamidah, W. (2022). Deteksi nominal uang kertas menggunakan OCR (Optical Character Recognition). Jurnal Sistem Informasi, 7(2), 72–76.
Hartanto, S., Sugiharto, A., & Endah, S. N. (2012). Optical character recognition menggunakan algoritma template matching correlation. Jurnal Masyarakat Informatika, 5(9), 1–12.
Maharsi. (2024). The growth of digital payments in Indonesia: Harnessing its influence for SMEs and Indonesia's competitiveness. Global South Review, 6(2), 54.
Panggabean, A. W., Lubis, F. H., & Julia, R. R. (2025). Peningkatan keterbacaan teks pada citra struk pembayaran menggunakan segmentasi OTSU. DEVICE: Journal of Information System, Computer Science and Information Technology, 6(1), 155–164.
Prananta, F., Atqiya, M. Z., & Zamhariry, M. F. (2025). Pengembangan aplikasi web untuk pemrosesan gambar dan ekstraksi teks menggunakan Tesseract OCR berbasis web. Jurnal Sistem Informasi Aplikasi Teknologi Informasi, 2(1), 38–44.
Rahmadani, T., et al. (2023). Pengukuran akurasi OCR menggunakan character error rate. Jurnal Rekayasa Sistem Komputer, 6(2).
Rahmawati, A. N., Wibowo, S. A., & Sunarya, U. (2021). Analisis sistem optical character recognition (OCR) pada dokumen digital menggunakan metode Tesseract. Jurnal Informatika Telekomunikasi Elektronika, 8(5), 4777–4785.
Susanty, M., & Setiawan, E. (2024). Implementasi optical character recognition dan text similarity untuk pelaporan uang muka kerja. Jurnal Informatika Universitas Pamulang, 9(4), 187–193.





