Challenges in eBook Production Presented by Bruce D. Rosenblum CEO Inera Incorporated STM International, 1 December 2011 Copyright 2011 Inera Incorporated. All Rights Reserved
Copyright 2011 Inera Incorporated. All Rights Reserved
Copyright 2011 Inera Incorporated. All Rights Reserved
So You Want an e-Book Workflow… Introduction
DTD
selection
XML
and Composition
File
organization and naming
Metadata Fronts
Body
and backs
content challenges Copyright 2011 Inera Incorporated. All Rights Reserved
Journals Look… Well… Plain
Copyright 2011 Inera Incorporated. All Rights Reserved
Books Look… Well… "Designy"
Copyright 2011 Inera Incorporated. All Rights Reserved
Book XML: Why Now? Market
demand
Multi-platform Production Delivery
delivery
costs
time
Copyright 2011 Inera Incorporated. All Rights Reserved
Book XML Rationale Journal
XML workflow focus
Online delivery Metadata delivery Book
XML workflow
Production efficiency ePub creation
Copyright 2011 Inera Incorporated. All Rights Reserved
Journal/Book Issues Comparison Journals
Standard designs 3B2, XPP batch pagination
Automation, automation, automation
Books
Designers rule InDesign centric Let's do this all by hand
XML production requirements inherently contradict traditional book production. We are starting to address this issue… Copyright 2011 Inera Incorporated. All Rights Reserved
DTD Selection Unlike
journals, no single standard DTD for books Some choices
TEI DocBook NLM/JATS DITA Roll your own proprietary DTD
How
do I choose? Copyright 2011 Inera Incorporated. All Rights Reserved
DTD Selection Based
on
Your content Front list vs. back list vs. historical content Discipline(s), e.g. Humanities vs. Life sciences
Your XML use-cases Tools you may want to use
Copyright 2011 Inera Incorporated. All Rights Reserved
TEI DTD Origins:
Widely Great
Academic community (Brown University)
used in humanities
for historical materials
E.g. preserving line break/pagination information Poetry Least-known
Weakest
by suppliers
commercial tool support Copyright 2011 Inera Incorporated. All Rights Reserved
DocBook DTD Origins:
Great Lots
Technical publication (O’Reilly)
for technical and trade books
of commercial tool support
FrameMaker, ArborText Well-known OASIS
by suppliers
standard
Copyright 2011 Inera Incorporated. All Rights Reserved
NLM/JATS DTD Origins: Scholarly journal archiving & publication Widely used by journal publishers Great for
Science publications, multi-author works Publishers doing books and journals Content with structured references
Well-known by suppliers NISO standard (journal tag suite) Book model not as mature; BITS revision 2012
But 3.0 is useable Copyright 2011 Inera Incorporated. All Rights Reserved
DTD Commonalities Any
of these DTDs work well for simple monographs
All
of these DTDs are designed for customization, if necessary
Copyright 2011 Inera Incorporated. All Rights Reserved
XML Creation
Author – Really?
After PDF Common, but less robust Final XML is not proofed
Before copy-editing – requires XML editors
After copy-editing, before composition Best compromise
Content edited in Word Proofed PDF created from XML Copyright 2011 Inera Incorporated. All Rights Reserved
Typesetting Books
But
are an InDesign world
InDesign is a limited XML platform
Difficult to import/export richly tagged XML Automate
InDesign XML-driven page layout
Custom scripts Customizable off-the-shelf commercial software Allows automatic layout and manual tweaking
Copyright 2011 Inera Incorporated. All Rights Reserved
XML, Composition, & Corrections Easy
(relatively speaking): XML InDesign
Hard
Corrections in InDesign InDesign richly tagged XML Options
Correct in InDesign; export IDML; XSLT final XML Correct in XML and re-flow content to InDesign Works best when InDesign page layout is automated
Copyright 2011 Inera Incorporated. All Rights Reserved
File Organization
Is an XML book One large file? One chapter per file?
It depends… Can individual chapters stand alone? Will you sell individual chapters? Will you re-package individual chapters?
The linking problem Inter-chapter references "See chapter 9" Back-of-the-book reference list Copyright 2011 Inera Incorporated. All Rights Reserved
File Naming
Bad No system; Random editor's choice Author names Accented letters, disambiguation
Good Book ID Internal ID, e.g. BK12345.xml ISBN, e.g. 978-1-4094-1940-2.xml
Chapters E.g. 978-1-4094-1940-2_C001.xml, 978-1-4094-1940-2_intro.xml
No special characters or spaces Copyright 2011 Inera Incorporated. All Rights Reserved
Artwork File Names
Design a system that is
Logical and consistent Unambiguous Robust Minimizes manual work in file preparation Bad: "Insert BK12345-Fig2.5.jpg" Good: "Figure 2.5: Figure title" Transform auto-generates: Avoids possible mis-typing of figure name
File extensions: XML is better without E.g. TIFF for PDF, GIF for online delivery Script adds appropriate extension when rendering Copyright 2011 Inera Incorporated. All Rights Reserved
Example Image Names Follow
and extend book conventions
Figure: 978-1-4094-1940-2_C001_FG003 But
what about other types
Plate: 978-1-4094-1940-2_C001_PL003
Map: 978-1-4094-1940-2_C001_MP003 Exhibit: 978-1-4094-1940-2_C001_EX003 Equation: 978-1-4094-1940-2_C001_EQ003 Consider
all artwork types across publications Copyright 2011 Inera Incorporated. All Rights Reserved
Special Artwork Naming Unnumbered
images
Figure: 978-1-4094-1940-2_C001_UN003 Special
cases
When the "name" is better
Lots of unnumbered images Field guide
Copyright 2011 Inera Incorporated. All Rights Reserved
Equations Format
options
MathML TeX Images Consider composition and delivery requirements InDesign works best with images eBook readers can't render MathML or TeX
Future
proof
MathML + images Copyright 2011 Inera Incorporated. All Rights Reserved
Metadata
Identifiers ISBN Print, PDF, ePub… How many? Consult www.bisg.org
Book ID (internal publisher ID) DOI Chapters, figures, tables, too?
Book "type" (publisher classification) Publisher and imprint information Copyright, publication date, edition Authors, editors, translators Book-level Chapter-level
More… Each publisher has unique requirements Copyright 2011 Inera Incorporated. All Rights Reserved
Fronts Title and half-title page Copyright page Tables of contents
Regular and Expanded List of figures, tables, etc.
Prelims
Preface Introduction Dedication Acknowledgements Copyright 2011 Inera Incorporated. All Rights Reserved
Title and Half Title Page The Instrumental Music of Schmeltzer, Biber, Muffat and their Contemporaries Brewer Charles E. Associate Professor of Musicology, The Florida State University, USA British Library Cataloguing in Publication Data 784'.09032-dc22
Copyright 2011 Inera Incorporated. All Rights Reserved
Tables of Contents To
XML or Not?
Included in XML Data redundancy and corrections Simple, but risk of error
Excluded from XML Build automatically from chapters Expanded TOC information in each chapter Requires some script expertise, but more robust
Copyright 2011 Inera Incorporated. All Rights Reserved
Prelims Introduction,
Preface, etc.
"Mini-chapters" Usually
very simple
But…
Unnumbered artwork End signature One
file or many? Copyright 2011 Inera Incorporated. All Rights Reserved
Backs Bibliography
and References
Notes Glossary Index
Copyright 2011 Inera Incorporated. All Rights Reserved
Bibliography and References Linking
to CrossRef
Not required, but desirable Linking
from chapters
Consider chapter-level reference lists Back-of-the-book bibliography
Copyright 2011 Inera Incorporated. All Rights Reserved
Notes Best
to place in each chapter
Avoids Script
linking problems
can collect all for back-of-book
Copyright 2011 Inera Incorporated. All Rights Reserved
Glossary XML
setup
Back-chapter Inline definition Allows
marginalia presentation
Script
can collect for back-of-book
Copyright 2011 Inera Incorporated. All Rights Reserved
Glossary XML Example Cognitions represent any "knowledge…"
Cognitions A person's knowledge, opinions, or beliefs.
Copyright 2011 Inera Incorporated. All Rights Reserved
Indexes
The hand-curated index, not auto-generated index Vestige of print or useful scholarly tool?
Print index Used to find useful information Used to evaluate book contents
Electronic index Used in a "search" world? Perhaps to evaluate book contents
The index isn't dead yet, is it? Copyright 2011 Inera Incorporated. All Rights Reserved
The Index Workflow Problem Integrated
creation in Word
Authors not index specialists Hard to integrate into editorial Can't create until book fully paginated Index How
specialists are not XML experts
to markup richly linked index?
It's not easy Perhaps unlinked text index is OK? Copyright 2011 Inera Incorporated. All Rights Reserved
Body Content Challenges Complex
Table
boxes
formatting
Discontinuous "See
Lists
page…"
Copyright 2011 Inera Incorporated. All Rights Reserved
Complex Boxes Box 1: Box Title This is some text in a sidebar box. Boxes may also contain figures, lists, equations, or even sub-boxes inside
Copyright 2011 Inera Incorporated. All Rights Reserved
Text in Boxes Exhibit 1Factors
conditions high
significant
Copyright 2011 Inera Incorporated. All Rights Reserved
Table Formatting Shaded
cells
CSS attributes in HTML model table Cell content Special
cell borders
E.g. double-underline in financial tables CSS attributes in HTML model table CALS requires custom setup Copyright 2011 Inera Incorporated. All Rights Reserved
Discontinuous Lists 1. Item 1
2. Item 2 3. Item 3 Some interesting text in the middle of a list, but not part of a list item 4. Item 4 5. Item 5
1list-item>Item 1 2list-item>Item 2 3list-item>Item 3 Some interesting text in the middle of a list, but not part of a list item 4list-item>Item 4…
Images courtesy CFA Institute Copyright 2011 Inera Incorporated. All Rights Reserved
"See page" Problem In
print: "See page 253"
What Link
does this mean in an eBook?
to
A paragraph A section head An arbitrary point No
good solution
Except, perhaps, author education? Copyright 2011 Inera Incorporated. All Rights Reserved
Afterthought: Book Errata
We all make mistakes… Provide Errata URL in front of book (DOI is better) Form to report errors
Update errata page as errors are found Discussion: http://www.linkedin.com/groupAnswers?viewQuestionAndAnswer s=&discussionID=81521719&gid=65026&trk=eml-anet_dig-b_ndpst_ttle-cn&ut=0u5NFofoH3hl01 Example: http://www.berkshirepublishing.com/brw/product.asp?projID=65 Copyright 2011 Inera Incorporated. All Rights Reserved
Conclusions eBooks
are here, now
Production
more complex than journals
XML requirements Workflow requirements
InDesign Limitations But
all can be overcome
While adding XML as a product driver And gaining production efficiencies and cost savings Copyright 2011 Inera Incorporated. All Rights Reserved
Questions? Bruce Rosenblum Inera Incorporated +1 (617) 932 - 1932
[email protected] www.inera.com
Copyright 2011 Inera Incorporated. All Rights Reserved