Classifying Tenant Legal Inquiries: A Comparative Study of Traditional and Deep Learning Approaches

Main Article Content

Hanfei Zhang

Abstract

Legal aid organizations face increasing demand for tenant protection services amid limited resources. Accurate classification of tenant inquiries enables efficient case routing and volunteer attorney matching. This study compares traditional machine learning methods (Naive Bayes, Support Vector Machines) with deep learning approaches (BERT fine-tuning) for classifying tenant legal inquiries across four categories: illegal eviction, housing repairs, security deposit disputes, and rent disagreements. Experiments on 450 de-identified tenant assistance requests reveal that sample size critically impacts method selection. Traditional approaches demonstrate robust performance with fewer than 150 samples, while BERT achieves superior accuracy (F1-score 0.89 vs 0.81) with datasets exceeding 300 samples. Mixed-issue cases involving multiple complaint types pose consistent challenges across all methods. Results inform practical deployment strategies for legal aid intake workflows.

Article Details

Section

Articles

How to Cite

Classifying Tenant Legal Inquiries: A Comparative Study of Traditional and Deep Learning Approaches. (2026). Journal of Science, Innovation & Social Impact, 1(1), 452-462. https://sagespress.com/index.php/JSISI/article/view/66

References

1. Q. Li, H. Peng, J. Li, C. Xia, R. Yang, L. Sun, and L. He, "A survey on text classification: From traditional to deep learning," ACM Transactions on Intelligent Systems and Technology (TIST), vol. 13, no. 2, pp. 1-41, 2022. doi: 10.1145/3495162

2. Y. Wahba, N. Madhavji, and J. Steinbacher, "A comparison of svm against pre-trained language models (plms) for text classification tasks," In International Conference on Machine Learning, Optimization, and Data Science, September, 2022, pp. 304-313. doi: 10.1007/978-3-031-25891-6_23

3. I. Chalkidis, M. Fergadiotis, P. Malakasiotis, N. Aletras, and I. Androutsopoulos, "LEGAL-BERT: The muppets straight out of law school," arXiv preprint arXiv:2010.02559, 2020. doi: 10.18653/v1/2020.findings-emnlp.261

4. L. Tunstall, N. Reimers, U. E. S. Jo, L. Bates, D. Korat, M. Wasserblat, and O. Pereg, "Efficient few-shot learning without prompts," arXiv preprint arXiv:2209.11055, 2022.

5. V. Carneiro-Diaz, A. Grille-Zallas, and D. Lage-Etchart, "Automated legal analysis of rental contract clauses using large language models," SoftwareX, vol. 31, p. 102337, 2025. doi: 10.1016/j.softx.2025.102337

6. X. Chen, C. Ren, and T. A. Thomas, "Evaluating tenant-landlord tensions using generative ai on online tenant forums," Journal of Computational Social Science, vol. 8, no. 2, p. 50, 2025. doi: 10.1007/s42001-025-00378-8

7. I. Chalkidis, A. Jana, D. Hartung, M. Bommarito, I. Androutsopoulos, D. Katz, and N. Aletras, "LexGLUE: A benchmark dataset for legal language understanding in English," In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), May, 2022, pp. 4310-4330. doi: 10.18653/v1/2022.acl-long.297

8. L. E. Dor, A. Halfon, A. Gera, E. Shnarch, L. Dankin, L. Choshen, and N. Slonim, "Active learning for BERT: an empirical study," In Proceedings of the 2020 conference on empirical methods in natural language processing (EMNLP), November, 2020, pp. 7949-7962.

9. C. M. Greco, and A. Tagarelli, "Bringing order into the realm of Transformer-based language models for artificial intelligence and law," Artificial Intelligence and Law, vol. 32, no. 4, pp. 863-1010, 2024. doi: 10.1007/s10506-023-09374-7

10. X. Tang, H. Mou, J. Liu, and X. Du, "Research on automatic labeling of imbalanced texts of customer complaints based on text enhancement and layer-by-layer semantic matching," Scientific Reports, vol. 11, no. 1, p. 11849, 2021. doi: 10.1038/s41598-021-91189-0

11. M. Hagan, "Towards human-centred standards for legal help AI," Philosophical Transactions of the Royal Society A, vol. 382, no. 2270, p. 20230157, 2024. doi: 10.1098/rsta.2023.0157

12. D. Song, A. Vold, K. Madan, and F. Schilder, "Multi-label legal document classification: A deep learning-based approach with label-attention and domain-specific pre-training," Information Systems, vol. 106, p. 101718, 2022. doi: 10.1016/j.is.2021.101718

13. Y. Xiong, G. Chen, and J. Cao, "Research on Public Service Request Text Classification Based on BERT-BiLSTM-CNN Feature Fusion," Applied Sciences (2076-3417), vol. 14, no. 14, 2024. doi: 10.3390/app14146282

14. T. Mashiat, A. DiChristofano, P. J. Fowler, and S. Das, "Beyond eviction prediction: Leveraging local spatiotemporal public records to inform action," In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, June, 2024, pp. 1383-1394. doi: 10.1145/3630106.3658978

15. G. Gauthier-Melançon, O. M. Ayala, L. Brin, C. Tyler, F. Branchaud-Charron, J. Marinier, and D. Le, "Azimuth: Systematic error analysis for text classification," arXiv preprint arXiv:2212.08216, 2022.