Chinese title: | 历代史志目录的数据集成与可视化 | ||||||
English title: | Diachronic Data Integration and Visualization of Ancient Book Catalogs Compiled for Imperial Collections | ||||||
Authors: | 李文琦,王凤翔,孙显斌,黄芷欣,李芃蓓 | ||||||
Journal: | 中国图书馆学报 | ||||||
Publication date: | 2023-01-09 | ||||||
Volume: | 49 | ||||||
Issue: | 01 | ||||||
Pages: | 82-98 | ||||||
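Chinese abstract (translated): | Ancient book catalogs and their classification systems have significant academic value, and the development of digital scholarship offers new opportunities for the digital preservation and use of these catalogs and for bibliographic research supported by digital tools. Taking eight catalogs from the official histories, spanning more than two thousand years, as data sources, this paper applies a human-machine iterative approach that combines machine pre-processing with expert proofreading to carry out record splitting and field extraction, data completion, normalization, and book identification, ultimately producing a structured, normalized integration of more than 110,000 bibliographic records. On the basis of this dataset, and starting from the research needs of domain experts, the authors combine statistics, visualization, and retrieval methods and use human-computer interaction techniques to build a visual analysis system for ancient book catalogs across the dynasties. The system has two main parts: bibliographic statistics and classification evolution analysis. On the one hand, it provides fine-grained statistics and visual presentation of the bibliographic data, helping scholars clearly compare and trace the growth and decline of categories; on the other hand, it visualizes the classification trajectories of all books across the catalogs of successive dynasties, as well as the origins and descent of the books collected under each category, so as to better identify patterns in how categories split, merge, and transform. This study provides a data foundation and analytical tools for bibliographic research in the context of digital scholarship; it not only saves scholars a great deal of time on data collection and collation but also supports interpretive research such as analysis and comparison through new techniques and perspectives. 8 figures. 3 tables. 36 references. |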
||||||
English abstract: | Ancient book catalogs record and classify a large number of ancient Chinese books. They are of great academic value for studying both ancient literature and traditional knowledge organization. The development of digital scholarship has shed new light on the digital preservation and reuse of these ancient book catalogs as well as on domain research supported by digital tools. Digital scholarship facilitates the digitization and datafication of ancient book catalogs. Moreover, new methods and computational tools enable the exploration of large collections, and new research questions can be raised from fresh perspectives. Recent studies have introduced computational methods to analyze the abstracts and classification systems of ancient book catalogs, but these studies were based on only one catalog or a particular category. It is imperative to integrate catalogs from across history and to provide digital tools for scholars to explore and analyze them diachronically and holistically. In this study, we selected eight representative catalogs, mostly from official histories, as sources. They are Hanshu Yiwenzhi, Suishu Jingjizhi, Jiutangshu Jingjizhi, Xintangshu Yiwenzhi, Songshi Yiwenzhi, Mingshi Yiwenzhi, Qingshigao Yiwenzhi and Siku Quanshu Zongmu. These catalogs cover the major dynasties in Chinese history, with a time span of more than two thousand years. We adopted a semi-automated data processing approach to integrate the book entries in the eight catalogs. The integration process iterated between machine pre-processing and expert manual correction and comprised three main steps: record splitting and field segmentation, field completion and normalization, and book identification. Eventually we obtained more than 110 000 structured data records and identified over 7 000 books that were recorded in at least two catalogs.
Based on the integrated data, we designed and developed an interactive visual analysis system that includes statistics, visualization and record query features. The system is designed mainly to meet two research requirements proposed by expert users. First, the system provides granular statistics and graphs that help scholars compare and trace changes in book volumes across categories and catalogs. Second, it provides an interactive visualization tool for exploring how books are classified differently in each catalog, and thus reveals changes in knowledge organization as well as the origin and evolution of academic thought. In conclusion, this study provides a data foundation and analytic tools for the study of ancient book catalogs in the context of digital scholarship, which not only saves the effort of manual data collection and collation, but also provides new perspectives for identifying and solving hermeneutic problems with new techniques. 8 figs. 3 tabs. 36 refs. |
||||||