July 7-8th, 2010, Paris, France, c/o CICM 2010
News | Objectives | Programme | Deadlines | Submissions | Proceedings | Keynote | Topics | Committees | Accommodation | Travel | Registration | Feedback
Mathematicians dream of a digital archive containing all peer-reviewed mathematical literature ever published, properly linked and validated/verified. It is estimated that the entire corpus of mathematical knowledge published over the centuries does not exceed 100,000,000 pages, an amount easily manageable by current information technologies.
Following success of DML 2008 and DML 2009, workshop's objectives are to formulate the strategy and goals of a global mathematical digital library and to summarize the current successes and failures of ongoing technologies and related projects, asking such questions as:
Every submission will be refereed by three to four PC members on the basis of technical quality, novelty, potential impact for building DML, and clarity.
Papers should conform to the Springer LNCS style, preferably using LaTeX2e and the Springer llncs class files.
Full paper: 5-15 LNCS pages
Short paper/poster/demo/work in progress report: 2-5 LNCS pages.
Paper length is not strict for both categories.
Via Easychair conference system.
has been published (viii+135 pages with author, name and subject indexes) by Masaryk University Press, ISBN 978-80-210-5242-0. All DML proceedings have been indexed by Thomson Reuters in Conference Proceedings Citation Index CPCI and Google Scholar and are available in digital form from electronic archive DML-CZ. You may order printed copy from this e-shop. Best papers will be chosen for a postconference book published by renowned publisher or for a journal special issue [as in 2008, cf. MCS Vol 3, issue 3].
Masakazu Suzuki (Project Infty, Kyushu University, JP): Mathematical Formulae Recognition and Logical Structure Analysis of Mathematical Papers
Abstract: In most cases the current on-line journals in mathematics are supplied in the form of PDF with print images of papers in the front and OCR'ed hidden texts behind to provide with search facilily using key words. The embedded hidden texts usually does not include good information about mathematical formulae in the papers. We can say that, for the future development of DML, it is desirable to include, in the digitised journals, more structured information of the content of mathematical papers, e.g. tag information to indicate logical structure of papers such as hedding of sections, definitions, theorems, lemmas, etc., together with mathematical formulae structures included.
In the talk, I will present the current stage of our technology to extract such information from the scanned images in the retro-digitised mathematical papers. Mechanically-prepared new journals in the form of PDF are also the target of our research since it is not an easy task to get uniform structure description of mathematical formulae for example from the original LaTeX source with various styles and macro commands depending on authors. Although there are many methods presented in literature to recognize mathematical formulae, very few applications appeared to do this task in practical sense. One of the major problem in the development of math OCR is to avoid fatal effects caused by mis-recognition and mis-segmentation of characters and symbols. In the talk, I will explain first the method we took to overcome this difficulty. Some demonstration of our software InftyReader to recognize mathematical documents will also be given in the lecture. Secondly, as a better approach to recognize a large number of pages like the case of DML, our adaptive method to improve the recognition rates of characters/symbols, mathematical formulae structures and logical structures of articles will also be presented.
(include, but are not limited to)
Petr Sojka, Michal Růžička, in addition to the CICM local/general chairs Renaud Rioboo and Laurence Rideau
-- Indeed I enjoyed the workshop. And I enjoyed a lot the friendly atmosphere - we didn't really feel like outsiders. So thanks for that. -- I liked the DML workshop, and I hope we will be able to contribute to EuDML in the near future.
|
|
|
|
|
|
Do you want to know more about DML 2010?
Comments/questions/inquiries: to be sent to:
dml2010 at easychair dot org.