Challenges in ebook Production

Challenges in eBook Production Presented by Bruce D. Rosenblum CEO Inera Incorporated STM International, 1 December 2011 Copyright  2011 Inera Incorp...
Author: Pierce Perry
1 downloads 0 Views 1MB Size
Challenges in eBook Production Presented by Bruce D. Rosenblum CEO Inera Incorporated STM International, 1 December 2011 Copyright  2011 Inera Incorporated. All Rights Reserved

Copyright  2011 Inera Incorporated. All Rights Reserved

Copyright  2011 Inera Incorporated. All Rights Reserved

So You Want an e-Book Workflow…  Introduction

 DTD

selection

 XML

and Composition

 File

organization and naming

 Metadata  Fronts

 Body

and backs

content challenges Copyright  2011 Inera Incorporated. All Rights Reserved

Journals Look… Well… Plain

Copyright  2011 Inera Incorporated. All Rights Reserved

Books Look… Well… "Designy"

Copyright  2011 Inera Incorporated. All Rights Reserved

Book XML: Why Now?  Market

demand

 Multi-platform  Production  Delivery

delivery

costs

time

Copyright  2011 Inera Incorporated. All Rights Reserved

Book XML Rationale  Journal

XML workflow focus

 Online delivery  Metadata delivery  Book

XML workflow

 Production efficiency  ePub creation

Copyright  2011 Inera Incorporated. All Rights Reserved

Journal/Book Issues Comparison  Journals

 Standard designs  3B2, XPP batch pagination

 Automation, automation, automation

 Books

 Designers rule  InDesign centric  Let's do this all by hand

XML production requirements inherently contradict traditional book production. We are starting to address this issue… Copyright  2011 Inera Incorporated. All Rights Reserved

DTD Selection  Unlike

journals, no single standard DTD for books  Some choices     

TEI DocBook NLM/JATS DITA Roll your own proprietary DTD

 How

do I choose? Copyright  2011 Inera Incorporated. All Rights Reserved

DTD Selection  Based

on

 Your content  Front list vs. back list vs. historical content  Discipline(s), e.g. Humanities vs. Life sciences

 Your XML use-cases  Tools you may want to use

Copyright  2011 Inera Incorporated. All Rights Reserved

TEI DTD  Origins:

 Widely  Great

Academic community (Brown University)

used in humanities

for historical materials

 E.g. preserving line break/pagination information  Poetry  Least-known

 Weakest

by suppliers

commercial tool support Copyright  2011 Inera Incorporated. All Rights Reserved

DocBook DTD  Origins:

 Great  Lots

Technical publication (O’Reilly)

for technical and trade books

of commercial tool support

 FrameMaker, ArborText  Well-known  OASIS

by suppliers

standard

Copyright  2011 Inera Incorporated. All Rights Reserved

NLM/JATS DTD Origins: Scholarly journal archiving & publication  Widely used by journal publishers  Great for 

 Science publications, multi-author works  Publishers doing books and journals  Content with structured references

Well-known by suppliers  NISO standard (journal tag suite)  Book model not as mature; BITS revision 2012 

 But 3.0 is useable Copyright  2011 Inera Incorporated. All Rights Reserved

DTD Commonalities  Any

of these DTDs work well for simple monographs

 All

of these DTDs are designed for customization, if necessary

Copyright  2011 Inera Incorporated. All Rights Reserved

XML Creation 

Author – Really?



After PDF  Common, but less robust  Final XML is not proofed



Before copy-editing – requires XML editors



After copy-editing, before composition  Best compromise

 Content edited in Word  Proofed PDF created from XML Copyright  2011 Inera Incorporated. All Rights Reserved

Typesetting  Books

 But

are an InDesign world

InDesign is a limited XML platform

 Difficult to import/export richly tagged XML  Automate

InDesign XML-driven page layout

 Custom scripts  Customizable off-the-shelf commercial software  Allows automatic layout and manual tweaking

Copyright  2011 Inera Incorporated. All Rights Reserved

XML, Composition, & Corrections  Easy

(relatively speaking): XML  InDesign

 Hard

 Corrections in InDesign  InDesign  richly tagged XML  Options

 Correct in InDesign; export IDML; XSLT  final XML  Correct in XML and re-flow content to InDesign  Works best when InDesign page layout is automated

Copyright  2011 Inera Incorporated. All Rights Reserved

File Organization 

Is an XML book  One large file?  One chapter per file?



It depends…  Can individual chapters stand alone?  Will you sell individual chapters?  Will you re-package individual chapters?



The linking problem  Inter-chapter references  "See chapter 9"  Back-of-the-book reference list Copyright  2011 Inera Incorporated. All Rights Reserved

File Naming 

Bad  No system; Random editor's choice  Author names  Accented letters, disambiguation



Good  Book ID  Internal ID, e.g. BK12345.xml  ISBN, e.g. 978-1-4094-1940-2.xml

 Chapters  E.g. 978-1-4094-1940-2_C001.xml, 978-1-4094-1940-2_intro.xml

 No special characters or spaces Copyright  2011 Inera Incorporated. All Rights Reserved

Artwork File Names 

Design a system that is    

Logical and consistent Unambiguous Robust Minimizes manual work in file preparation  Bad: "Insert BK12345-Fig2.5.jpg"  Good: "Figure 2.5: Figure title" Transform auto-generates: Avoids possible mis-typing of figure name



File extensions: XML is better without  E.g. TIFF for PDF, GIF for online delivery  Script adds appropriate extension when rendering Copyright  2011 Inera Incorporated. All Rights Reserved

Example Image Names  Follow

and extend book conventions

 Figure: 978-1-4094-1940-2_C001_FG003  But

what about other types

 Plate: 978-1-4094-1940-2_C001_PL003

 Map: 978-1-4094-1940-2_C001_MP003  Exhibit: 978-1-4094-1940-2_C001_EX003  Equation: 978-1-4094-1940-2_C001_EQ003  Consider

all artwork types across publications Copyright  2011 Inera Incorporated. All Rights Reserved

Special Artwork Naming  Unnumbered

images

 Figure: 978-1-4094-1940-2_C001_UN003  Special

cases

 When the "name" is better

 Lots of unnumbered images  Field guide 

Copyright  2011 Inera Incorporated. All Rights Reserved

Equations  Format

options

 MathML  TeX  Images  Consider composition and delivery requirements  InDesign works best with images  eBook readers can't render MathML or TeX

 Future

proof

 MathML + images Copyright  2011 Inera Incorporated. All Rights Reserved

Metadata 

Identifiers  ISBN  Print, PDF, ePub… How many? Consult www.bisg.org

 Book ID (internal publisher ID)  DOI  Chapters, figures, tables, too?



  

Book "type" (publisher classification) Publisher and imprint information Copyright, publication date, edition Authors, editors, translators  Book-level  Chapter-level



More… Each publisher has unique requirements Copyright  2011 Inera Incorporated. All Rights Reserved

Fronts Title and half-title page  Copyright page  Tables of contents 

 Regular and Expanded  List of figures, tables, etc. 

Prelims    

Preface Introduction Dedication Acknowledgements Copyright  2011 Inera Incorporated. All Rights Reserved

Title and Half Title Page The Instrumental Music of Schmeltzer, Biber, Muffat and their Contemporaries Brewer Charles E. Associate Professor of Musicology, The Florida State University, USA British Library Cataloguing in Publication Data 784'.09032-dc22

Copyright  2011 Inera Incorporated. All Rights Reserved

Tables of Contents  To

XML or Not?

 Included in XML  Data redundancy and corrections  Simple, but risk of error

 Excluded from XML  Build automatically from chapters  Expanded TOC information in each chapter  Requires some script expertise, but more robust

Copyright  2011 Inera Incorporated. All Rights Reserved

Prelims  Introduction,

Preface, etc.

 "Mini-chapters"  Usually

very simple

 But…

 Unnumbered artwork  End signature  One

file or many? Copyright  2011 Inera Incorporated. All Rights Reserved

Backs  Bibliography

and References

 Notes  Glossary  Index

Copyright  2011 Inera Incorporated. All Rights Reserved

Bibliography and References  Linking

to CrossRef

 Not required, but desirable  Linking

from chapters

 Consider  chapter-level reference lists  Back-of-the-book bibliography

Copyright  2011 Inera Incorporated. All Rights Reserved

Notes  Best

to place in each chapter

 Avoids  Script

linking problems

can collect all for back-of-book

Copyright  2011 Inera Incorporated. All Rights Reserved

Glossary  XML

setup

 Back-chapter  Inline definition  Allows

marginalia presentation

 Script

can collect for back-of-book

Copyright  2011 Inera Incorporated. All Rights Reserved

Glossary XML Example Cognitions represent any "knowledge…"

Cognitions A person's knowledge, opinions, or beliefs.

Copyright  2011 Inera Incorporated. All Rights Reserved

Indexes 

The hand-curated index, not auto-generated index  Vestige of print or useful scholarly tool?



Print index  Used to find useful information  Used to evaluate book contents



Electronic index  Used in a "search" world?  Perhaps to evaluate book contents



The index isn't dead yet, is it? Copyright  2011 Inera Incorporated. All Rights Reserved

The Index Workflow Problem  Integrated

creation in Word

 Authors not index specialists  Hard to integrate into editorial  Can't create until book fully paginated  Index  How

specialists are not XML experts

to markup richly linked index?

 It's not easy  Perhaps unlinked text index is OK? Copyright  2011 Inera Incorporated. All Rights Reserved

Body Content Challenges  Complex

 Table

boxes

formatting

 Discontinuous  "See

Lists

page…"

Copyright  2011 Inera Incorporated. All Rights Reserved

Complex Boxes Box 1: Box Title This is some text in a sidebar box. Boxes may also contain figures, lists, equations, or even sub-boxes inside

Copyright  2011 Inera Incorporated. All Rights Reserved

Text in Boxes Exhibit 1Factors

conditions high

significant

Copyright  2011 Inera Incorporated. All Rights Reserved

Table Formatting  Shaded

cells

 CSS attributes in HTML model table  Cell content  Special

cell borders

 E.g. double-underline in financial tables  CSS attributes in HTML model table  CALS requires custom setup Copyright  2011 Inera Incorporated. All Rights Reserved

Discontinuous Lists 1. Item 1

2. Item 2 3. Item 3 Some interesting text in the middle of a list, but not part of a list item 4. Item 4 5. Item 5

1list-item>Item 1 2list-item>Item 2 3list-item>Item 3 Some interesting text in the middle of a list, but not part of a list item 4list-item>Item 4…

Images courtesy CFA Institute Copyright  2011 Inera Incorporated. All Rights Reserved

"See page" Problem  In

print: "See page 253"

 What  Link

does this mean in an eBook?

to

 A paragraph  A section head  An arbitrary point  No

good solution

 Except, perhaps, author education? Copyright  2011 Inera Incorporated. All Rights Reserved

Afterthought: Book Errata  

We all make mistakes… Provide  Errata URL in front of book (DOI is better)  Form to report errors

 



Update errata page as errors are found Discussion: http://www.linkedin.com/groupAnswers?viewQuestionAndAnswer s=&discussionID=81521719&gid=65026&trk=eml-anet_dig-b_ndpst_ttle-cn&ut=0u5NFofoH3hl01 Example: http://www.berkshirepublishing.com/brw/product.asp?projID=65 Copyright  2011 Inera Incorporated. All Rights Reserved

Conclusions  eBooks

are here, now

 Production

more complex than journals

 XML requirements  Workflow requirements

 InDesign Limitations  But

all can be overcome

 While adding XML as a product driver  And gaining production efficiencies and cost savings Copyright  2011 Inera Incorporated. All Rights Reserved

Questions? Bruce Rosenblum Inera Incorporated +1 (617) 932 - 1932 [email protected] www.inera.com

Copyright  2011 Inera Incorporated. All Rights Reserved