Published online by Cambridge University Press: 12 August 2020
DocuSky is a personal digital humanities platform for humanities scholars, which aims to become a platform on which a scholar can satisfy all her digital needs with no direct IT assistance. To this end, DocuSky provides tools for a scholar to download material from the Web and prepare (annotating, building metadata) her material, a one-click function to build a full-text searchable database, and tools for analysis and visualization. DocuSky advocates the separation of digital content and tools. Being an open platform, it encourages IT developers to build tools to suit scholars’ needs, and it has already incorporated several popular Web resources and external tools into its environment. Interoperability is ensured through the format DocuXML. In addition to describing the design principles of DocuSky, we will show its main features, together with several important tools and examples. DocuSky was originally developed for Sinological studies. We are enriching it to work in other languages.
Department of Computer Science and Information Engineering, National Taiwan University, and Research Center for Digital Humanities, National Taiwan University.
2 Donald Sturgeon, “Chinese Text Project,” 2020, https://ctext.org.
3 Chinese Buddhist Electronic Text Association, “Cbeta,” 2020, https://www.cbeta.org/.
4 Research Center for Digital Humanities of NTU, “Taiwan History Digital Library (THDL),” 2020, doi:10.6681/NTURCDH.DB_THDL/Text.
5 Hsieh-Chang Tu and Jieh Hsiang, “Docusky Collaboration Platform,” Research Center for Digital Humanities of NTU, 2018–2020, https://docusky.org.tw.
6 Research Center for Digital Humanities of NTU, “DocuXML 1.2 Scheme,” http://docusky.org.tw/DocuSky/documentation/docs/DocuXml-1.2-Scheme.html.
7 Hou Ieong Brent Ho and Hilde De Weerdt, MARKUS. Text Analysis and Reading Platform. 2014–, https://dh.chinese-empires.eu/beta/. Funded by the European Research Council and the Digging into Data Challenge.
8 J. Hsiang and C.A. Weng. “Multiple-Contextualization: Problems and Challenges on Digital Archives,” Essential Digital Humanities: Defining Patterns and Paths, edited by Jieh Hsiang (Taipei: NTU Press, 2012), pp. 25–60.
9 Jieh Hsiang, Shih-Pei Chen, Hou-Ieong Ho, and Hsieh-Chang Tu, “Discovering Relationships from Imperial Court Documents of Qing Dynasty.” International Journal of Humanities and Arts Computing 6.1–2 (2012), 22–41.
10 Shih-Pei Chen, et al, “Discovering Land Transaction Relations from Land Deeds of Taiwan,” Literary and Linguistic Computing 28.2 (2013), 257–270. Available at www.researchgate.net/publication/260021449_Discovering_land_transaction_relations_from_land_deeds_of_Taiwan.
11 Michael Stanley-Baker, “Medicine and Religion in China,” 2020, https://michaelstanley-baker.com/posts/.
12 https://zh.wikisource.org/zh-hant/ zh-hant/本草經集注.
13 China Biographical Database Harvard University, 2005, https://projects.iq.harvard.edu/chinesecbdb/home.
14 DILA, “Buddhist Studies Authority Database Project, DILA, 2008, https://authority.dila.edu.tw/.
15 CHGIS Harvard Office and HGIS Center, Fudan University, “Chinese Historical GIS,” 2001, http://chgis.fas.harvard.edu/.
16 The API (with the conversion to DocuXML) was done by JenJou Hong of DILA (Dharma Dram Institute of Liberal Arts) and CBETA. His help is gratefully acknowledged.
17 Designed and implemented by Nungyao Lin. See Nung Yao Lin, Textual Geographic Information System and Applications, Graduate Institute of Networking and Multimedia, National Taiwan University, 2018. (In Chinese).
18 This can be done by simply clicking on the top-left corner of the Excel sheet. CSV, or Comma-Separated Values, is a plain text file in which each record (of the original Excel file) occupies a line, with the records of different fields separated by commas.
19 Courtesy of the GIS Center of Academia Sinica.
20 H. C. Tu, “Semi-Automatic Term Extraction with Simplified Term-Clips Method.” Digital Humanities: Between Past, Present, and Future, edited by J. Hsiang (Taipei: NTU Press, 2016), 171–206.
21 All three were developed by Po-Yu Hsieh. See P.Y. Hsieh, “Development and Deployment of Tools Based on Docusky Platform.” The 7th international Conference of Digital Archives and Digital Humanities, The Research Center for Digital Humanities of NTU, 2016.
22 Developed by Hsing Hsuan Song. See Hsin-Hsuan Sung, et al, “Finding Documents Related to Taiwan in the Veritable Records of Qing Using Relevance Feedback,” TPDL, 2019, 280–87.
23 Courtesy of Michael Stanley-Baker.
24 Humanities + Design, Stanford University, “Palladio,” Stanford University, 2013, https://hdlab.stanford.edu/palladio/.
25 Michael Stanley-Baker, “Health and Philosophy in Pre- and Early Imperial China,” Health: A History, edited by Peter Adamson (Oxford: Oxford University Press, 2019), 7–42.
26 Research Center for Digital Humanities of NTU, “DocuXML 1.2 Scheme,” http://docusky.org.tw/DocuSky/documentation/docs/DocuXml-1.2-Scheme.html.
27 Implementing a converter is usually significantly easier than creating a tool.
28 H.C. Tu, “A Platform for Constructing and Analyzing Personal Databases,” Journal of Digital Archives and Digital Humanities, 2.1 (2018), 71–90, doi:10.6853/DADH.201810_2.0004.