imaging meeting

Modified: February 14, 2026 7:58 PM Created: September 25, 2025 2:28 PM Master Type: Notes Hide: No Starred: No Status: Done

Casual FADGI & AI tiers

FADGI Guidelines for Photography, Hooks Edition (aka “yeah, sure or nah”)

Topic	Quick Version	Low-Effort Take
Star Rating	1–4 stars; 3–4 = museum-level	We aim 3–4 stars for prints/exhibition; “yeah, we’re fine.”
PPI / Resolution	Enough pixels to make prints look good at intended size	300 ppi for normal prints, 75–150 ppi for giant wall vinyls; RAW has way more than needed.
Color Accuracy	Neutral gray = baseline	Grey card + consistent lighting; optional swatch cards = theater.
File Format	Uncompressed is king	RAW = archival master, TIFF/PSD for working/printing. Keep it simple.
Noise / Grain	Don’t confuse sensor grain with bad technique	A7R grain is normal; scanners would blur it anyway. “Yep, fine.”
Lighting	Even, consistent; backlit for negatives	Check. You do this. No one’s judging.
Metadata	Record object info, capture settings	Do it. Easy. Doesn’t affect prints.
Retouch / Clean-up	Only if it matters for display	Dust spots on glass plates? Sure, fix what shows. Everything else? Nah.
Oversampling / Resampling	Optional, for extreme size prints	Your RAW already covers it. “Whatever.”
Repeatability	Consistency across operators	Only matters for multi-person teams. You’re a two-person team = bonus points.

TL;DR for the meeting:

We’re hitting 3–4 star FADGI quality.
RAW files + consistent lighting + grey card = all technical bases covered.
Prints can be resampled to the size needed; sensor detail is overkill for anything practical.
Optional things (swatch cards, extreme PPI, hyper-microscopic dust cleanup) = theater or nice-to-have, not required.

AI pricing per year, Tier 1 $600, Tier 2 $1900, Tier 3 $3250 etc. per individual AI task on an asset, 36,000 / 180,000 / 360,000

so on average like 3-6 tasks per asset whole collection of 75k is like 225k tasks not including multiple versions.
max is 1.8m tasks for additional $11k a year

Notes with Jennifer

denoise, naming (if there’s time—must be understandable to ppl like us), AI!, file sizes

we’ve scanned over so and so images

saving raw files to xyz, we are making TIFFs and jpgs, go into a DAMs

brief bullet points about dams from my report

from raw we can make tiffs of any size

discuss archo images ?

bullet list / outline

potential outside contractors to scan acetate

interns and contractors

recto/verso

DAMS of less interest so start with resolution and RAW files and can make endless iterations (list numbers dpi) and also a JPG for discovery purposes

where you’re storing files / how much per MB

manageable file sizes for less important images / different dpi, only save verso for some, discovery images

there are approximately 8000 items in the 36? binders and the final 20? binders were reviewed mid-august (lol), of those about half of the objects in a binder are flagged for exhibition, catalogue, or other project?, several binders have been scanned in full and X others have had their flagged objects scanned but i’ve been catching up on the ones rose completed since august

maybe give an estimate of degraded images also scanned as an aside

cost of B2 backblaze

cost of dxo photolab 9

cost of AI addition

cost of DAMS

mention PSD as a footnote and maybe a dummy guide to screen dpi → print dpi

jennifer also had the good idea to “train” the facial recognition
Initial outline

1. Context etc.
- RAW capture & file handling—file storage location & purpose; PSD, TIFF, JPG derivatives; footnote gutter/cropping
  - Sony A7R V (sensor, megapixels, RAW capture) + maybe footnote the lenses
- FADGI, how we’re efficiently meeting requirements
2. DPI & Resolution
- On-screen vs print dpi
  - Guide for screen dpi → print dpi
- Resampling / oversampling logic
- Iterations from RAW for different sizes
3. Image Processing
- DxO Photolab 9 (denoise, warping corrections)
  - Sensor grain vs image noise
  - Historical context for photographic texture
  - Trade-offs in denoising, denoise explanation
- Photoshop (retouching, dust/bubble removal, cropping)
- Stitching for large images (8 ft wall vinyl, multi-shot)
5. DAMS & Storage
- Pics.io & a very very brief overview of its features and cost
  - Footnote likely only (1) record in TMS for the collection pointing toward the DAMS
- Broad strokes file policy/file retention: RAW as the master file, archival TIFFs, JPGs for discovery; PSD as footnote
- File sizes per format (in MB to establish how to calculate DAMS storage)
- Storage location / B2 Backblaze costs
  - Number of binders reviewed / total; approximate number of images digitized (including deteriorated)
- Manageable settings for less critical images, verso handling (case-by-case, discovery images)
- AI Tools & Automation
  - Pics.io features (descriptions, OCR, face recognition, intelligent search)
  - AI training & use for cataloging
  - Cost per tier (fill in later)
6. “If we have the time” comments
- Potential outside contractors for acetate (intern or contractor roles); archival images or objects requiring special care
- Naming conventions (e.g. ‘N’ for Nitrate, ‘r’ and ‘v’ for verso, etc.)
Outline draft 2

# 1. Context & Capture
- RAW capture & file handling — Sony A7R V produces ~61MP RAW files; stored on external and cloud backup
  - External drives are Western Digital [type] 12TB + B2 Backblaze; note monthly cost.
  - Footnote: Images are captured with a “gutter” for efficiency; final crops can reduce dimensions to e.g. ~8k × 5k or 7k × 4k pixels but are still large enough for printing.
- FADGI compliance — Workflow and settings align with recommended practices for preservation digitization.
  - Broad strokes FADGI requirements, touch briefly on how we meet them
# 2. DPI & Resolution
- On-screen vs print dpi — Screens show pixel dimensions directly; print requires translating pixels to inches. (e.g. 6000 px ÷ 300 dpi ≈ 20 inches).
  - Guide: Simple conversion explanation for non-technical users.
- Resampling / oversampling logic — Why capture higher resolution than final output; allows flexibility without rescanning.
# 3. Image Processing
- Sensor grain vs. digital noise clarified.
  - Explanation of noise/denoising
    - DxO Photolab 9 — Used for denoising and correcting warped/curved images.
  - Historical context: film grain was a material property, digital noise comes from sensor limits.
  - Trade-offs: reduces distraction but can remove detail.
- Editing — Photoshop, make digital positives, retouching (dust, scratches, bubbles), tonal adjustments.
  - Mention automation in passing (autocropping, JPG derivatives, etc.)
  - Stitching — For oversize works (e.g. 8 ft wall vinyls); multi-shot merges for very large images.
- Discuss platen vs. vacuum table vs. caveats of post-processing/forced perspective edits
# 4. DAMS & Storage
- Pics.io — Cloud-based DAMS; basic overview of features and cost.
  - Likely one single TMS record pointing to DAMS repository.
- File policy & retention — Multiple derivative sizes, RAW as master; TIFFs for preservation/archival; JPGs for discovery/access; PSD noted for working files/editing.
  - Verso handling — Full capture for recto; verso retained as case-by-case or lower resolution when less critical.
  - Manageable settings — 8-bit TIFF or 300 dpi for less-critical items to control storage load.
- File sizes per format — Typical ranges: TIFF 40–60MB; JPG ~5–10MB; RAW ~120MB.
- Progress overview — ~8,000 items across 36 binders; ~20 reviewed in August; ~5,000 images digitized including deteriorated.
- AI tools — OCR, description generation, face recognition; potential for training on local collection.
  - Note tiered costs (to be filled).
  - Caveats for AI
# 5. Additional (If Time)
- Outside contractors — For acetate or fragile formats; interns or contractors could help.
- Naming conventions — e.g., “N” for nitrate, “r”/“v” for recto/verso, structured identifiers for binder/item.
FADGI working draft

The Federal Agencies Digitization Guidelines Initiative (FADGI) provides a standardized framework for digitizing cultural heritage materials, ensuring high-quality digital reproductions. The FADGI star rating system—ranging from 1 to 4 stars—assesses image quality based on technical parameters such as resolution, color accuracy, and noise levels. A 3-star rating indicates “very good professional image quality appropriate for most uses,” while a 4-star rating represents “the best imaging practical today,” suitable for high-fidelity applications like exhibition prints.

Our current digitization process aligns with FADGI 3-star standards. We utilize a Sony A7R V camera with a 61MP full-frame sensor, capturing images at approximately 9500 × 6300 pixels. This high resolution allows for detailed reproductions, even when cropped or resampled for various formats. We employ consistent lighting conditions and calibrated white balance, ensuring color accuracy and uniformity across images. Additionally, our use of uncompressed RAW files preserves image integrity, facilitating high-quality conversions and post-processing adjustments.

In summary, our digitization practices adhere to FADGI’s 3-star guidelines, ensuring that our digital reproductions meet professional standards for quality and consistency. By maintaining these practices, we support the preservation and accessibility of cultural heritage materials through high-fidelity digital representations.

—

The Federal Agencies Digitization Guidelines Initiative (FADGI) provides standards for high-quality digitization of cultural heritage materials. Its star ratings—1 to 4 stars—measure image fidelity, including resolution, color accuracy, and noise. A 3-star rating indicates very good professional quality for most uses, while 4-star represents the highest practical fidelity suitable for exhibition-level reproductions (FADGI 3rd Edition, p. 17

Our workflow aligns with 3-star standards and incorporates practices that edge toward 4-star quality. Using a 61 MP Sony A7R V, we capture images at ~9500 × 6300 pixels with consistent lighting and calibrated white balance. Uncompressed RAW files preserve full sensor detail, enabling high-quality resampling, color correction, and large-format derivatives. Careful post-processing minimizes visible noise while retaining true texture from the original material.

In short, our digitization ensures professional-grade quality, ready for exhibition and archival use. By capturing maximum detail and maintaining color fidelity, we’re already approaching the best-practical standards outlined in FADGI.

—

We meet the core FADGI 3-star requirements consistently, and for most practical purposes, our workflow approaches 4-star quality. High-resolution RAW capture with the 61 MP Sony A7R V preserves maximum detail, and ideal oversampling is applied whenever images are intended for print or exhibition, balancing workflow, storage, and scaling needs.

Color fidelity is maintained using calibrated white balance and attention to swatches, ensuring accurate reproduction across all formats. Cropping, dust removal, and blemish corrections are applied selectively for exhibition copies or high-visibility derivatives, providing polished final products while keeping routine digitization efficient.

In short, our process delivers professional-grade, high-fidelity images suitable for archival use and large-format display, satisfying 3-star FADGI standards while implementing 4-star practices whenever necessary for output quality.
Backblaze B2

For a medium-sized nonprofit organization aiming to back up a high-resolution digitization project, Backblaze B2 offers a cost-effective and scalable cloud storage solution. As of October 2023, Backblaze B2 charges $6 per terabyte (TB) per month for storage, with free egress (data download) up to three times the amount of data stored per month. Any additional egress beyond this allowance is priced at $0.01 per gigabyte (GB). Backblaze

To estimate the monthly cost for your project, consider the following:
- Storage Costs: Multiply the total data size (in TB) by $6.
- Egress Costs: If your monthly data download exceeds three times the storage size, calculate the excess and apply the $0.01 per GB rate.
For instance, storing 10 TB of data would cost $60 per month. If you download 40 TB in a month, the first 30 TB are free, and the remaining 10 TB would incur a charge of $100. Thus, the total monthly cost would be $160.

This pricing structure is particularly advantageous for organizations with substantial data storage needs, as it combines affordability with generous egress allowances. Additionally, Backblaze B2’s transparent pricing model ensures predictable costs without hidden fees.
Outline draft 3

# 1. Context & Capture
- RAW capture & file handling — Sony A7R V produces ~61MP RAW files; stored on external and cloud backup.
  - External drives are Western Digital 12TB, backed up to B2 Backblaze (monthly subscription cost noted separately).
  - Plain-language note: RAW files are like “digital negatives” — they contain everything the camera captured and give the most flexibility for later editing.
  - Footnote: Images are captured with a “gutter” for efficiency; final crops can reduce dimensions to ~8k × 5k or 7k × 4k pixels, still large enough for high-quality prints.
- FADGI compliance — Workflow and settings align with preservation digitization standards.
  - Plain-language note: FADGI sets out requirements for things like sharpness, color accuracy, and resolution. We’re meeting these by capturing at high resolution, using calibrated tools, and keeping RAW + TIFF masters.
# 2. DPI & Resolution
- On-screen vs print dpi — Screens display images in pixels, but printing spreads those pixels across inches.
  - Plain-language guide: Divide the pixel count by the print dpi to get the size. Example: 6000 px ÷ 300 dpi = 20 inches. (So a 6000px file prints about 20 inches wide at 300 dpi.)
- Resampling / oversampling logic — We capture larger than necessary so we can make smaller versions later.
  - Plain-language note: Oversampling means we don’t have to rescan every time a new project needs a different size — one big capture can be downscaled into many smaller versions.
# 3. Image Processing
- Sensor grain vs. digital noise
  - Plain-language note: Film grain came from the chemistry of film; digital noise comes from the electronics in the camera sensor. Both show up as speckles in low light or high sensitivity settings.
  - DxO Photolab 9 — Used for denoising (reducing speckles) and correcting warped or curved images.
    - Plain-language note: Denoising software can make photos cleaner and more readable, but too much can erase small details.
  - Historical context: Earlier photos often have visible grain as part of their look; digital noise doesn’t carry the same “authentic” feel.
- Editing (Photoshop) — Used for making digital positives, removing dust/scratches, balancing tones, and cropping.
  - Automation: Some steps (like cropping or making JPGs) can be scripted for efficiency.
  - Stitching: Large items (like 8 ft wall vinyls) require multiple photos merged into one seamless image.
- Capture methods —
  - Platen scanners: Keep photos flat, but can cause Newton rings (rainbow-like interference).
  - Vacuum tables: Hold items flat, but are expensive and rarely justified.
  - Post-editing (artificial correction): Can fix distortion, but may blur edges and lose authenticity compared to capturing the true shape.
# 4. DAMS & Storage
- Pics.io — Cloud-based DAMS; organizes, searches, and manages assets.
  - Features include OCR (text recognition), AI-based description, face recognition, and intelligent search.
  - Likely one record in TMS links the entire collection to Pics.io.
- File policy & retention —
  - RAW = master file (never altered).
  - TIFFs = archival/preservation versions.
  - JPGs = smaller, shareable discovery images.
  - PSDs = working files (not permanent).
- Verso handling — Rectos always saved; versos saved at lower resolution or only when relevant (damage, inscriptions, or markings).
- Manageable settings — For less-critical images, we can save smaller versions (e.g., 8-bit TIFF or 300 dpi) to cut down storage cost.
- File sizes per format —
  - RAW ~120MB
  - TIFF 40–60MB
  - JPG 5–10MB
- Progress overview — ~8,000 items across 36 binders. ~20 binders reviewed in August. ~5,000 images digitized so far, including deteriorated nitrate/acetate scans.
- Storage location & cost — Files live on external 12TB drives and B2 Backblaze cloud storage. (Note monthly cost separately.)
- AI tools — OCR, auto-descriptions, face recognition, intelligent search.
  - Plain-language note: AI can be trained to recognize people or recurring subjects, helping speed up cataloging.
  - Caveats/drawbacks with AI
# 5. Additional (If Time)
- Outside contractors — Possible for acetate or fragile formats; interns or contractors could assist.
- Naming conventions — e.g., “N” for nitrate, “r/v” for recto/verso, consistent identifiers tied to binders.

Expanded information

# Consistent high-resolution capture: 61 MP Sony A7R V RAW files preserve full detail; ideal oversampling is applied when images are destined for print or exhibition, balancing storage, workflow, and scaling.

# Accurate color reproduction: Calibrated white balance and attention to swatches ensure fidelity across formats, meeting professional standards without slowing routine capture.

# Polished final products: Cropping tightly and removing dust/blemishes is done selectively for exhibition copies or high-visibility derivatives, keeping routine digitization fast while still enabling exhibition-quality output.

# Efficient adherence to FADGI: Our workflow hits all 3-star requirements and implements 4-star practices as needed, delivering professional-grade images at scale without unnecessary extra steps.

# Objects are photographed using the Sony A7R V, capturing high-resolution RAW files through Capture One to preserve maximum sensor detail. These RAWs can be imported into DxO PhotoLab 9 for tasks like denoising, optical corrections, and minor warping adjustments, ensuring clean and geometrically accurate images. More extensive retouching, such as dust and blemish removal, color adjustments, and compositional edits, is performed in Photoshop before producing the final archival TIFF files and JPG derivatives for downstream use, ensuring both preservation-quality masters and accessible copies for exhibition or digital distribution.

# The unprocessed RAW files are retained as digital masters, allowing flexible resampling and export at various DPI levels depending on output needs. This means the same RAW can be used to generate small web derivatives, standard prints, or large-format exhibition prints, with resampling applied only when necessary, preserving maximum detail and image integrity while supporting multiple use cases.

# Using the scaled 4400 ppi numbers (8‑bit TIFF ≈ 806 MB, 16‑bit TIFF ≈ 1.75 GB) along with your other updated file sizes, here’s a consistent set of collection storage estimates:

# Single-Item File Size Assumptions (Updated)

Format	Bit Depth / PPI	Estimated File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	1.75 GB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
TIFF 8‑bit, 4400 ppi	8-bit	806 MB
JPEG derivative	8-bit	5 MB
Sony uncompressed RAW	14‑16 bit	120 MB
PSD (working file)	16-bit	160 MB

### Collection Examples

# Scenario A: 1,000 images

16‑bit 1200 ppi TIFFs → 130 GB
16‑bit 4400 ppi TIFFs → 1.75 TB
8‑bit 1200 ppi TIFFs → 60 GB
8‑bit 4400 ppi TIFFs → 806 GB
JPG derivatives → 5 GB
RAWs → 120 GB
PSDs → 160 GB

# Scenario B: 50,000 images

16‑bit 1200 ppi TIFFs → 6.5 TB
16‑bit 4400 ppi TIFFs → 87.5 TB
8‑bit 1200 ppi TIFFs → 3 TB
8‑bit 4400 ppi TIFFs → 40.3 TB
JPG derivatives → 250 GB
RAWs → 6 TB
PSDs → 8 TB

# Scenario C: 75,000 images, 2–3 assets per object

# Assume 2.5 assets per object → ~187,500 files total

16‑bit 1200 ppi TIFFs → 24.4 TB
16‑bit 4400 ppi TIFFs → 328 TB
8‑bit 1200 ppi TIFFs → 11.3 TB
8‑bit 4400 ppi TIFFs → 151 TB
JPG derivatives → 937 GB
RAWs → 22.5 TB
PSDs → 30 TB

File Type	1200 ppi / 8‑bit	4400 ppi / 8‑bit	1200 ppi / 16‑bit	4400 ppi / 16‑bit	RAW	PSD	JPEG
1,000 images	60 GB	806 GB	130 GB	1.75 TB	120 GB	160 GB	5 GB
50,000 images	3 TB	40.3 TB	6.5 TB	87.5 TB	6 TB	8 TB	250 GB
75,000 images, 2.5 assets each (~187,500 files)	11.3 TB	151 TB	24.4 TB	328 TB	22.5 TB	30 TB	937 GB

# Alternating between 1200 ppi for standard archival/derivative TIFFs and 4400 ppi only when needed can drastically reduce storage (e.g., 1.75 TB vs 87.5 TB for 50,000 images, 16‑bit).

# RAW files serve as master digital captures and can support resampling to any DPI, so you don’t need 4400 ppi TIFFs for every asset.

# JPEGs are tiny and sufficient for online previews, sharing, and low-resolution use.

# PSDs are heavier than 1200 ppi TIFFs but still smaller than full 4400 ppi masters; keeping them only for working files avoids unnecessary storage bloat.

This chart visually justifies alternating quality instead of permanently creating massive 4400 ppi TIFFs, and gives your boss a clear, quantitative argument to support workflow flexibility.

# 1. Automatic descriptions / keywords

Pics.io analyzes each image and generates text describing its content (e.g., “man,” “portrait,” “suit”). This makes it easy to search, filter, and organize large collections without manually tagging every image.

# 2. OCR / transcriptions

If an image contains text—like a sign, letter, or pamphlet—Pics.io can read it and make it searchable. You can find images by the words they contain without typing them yourself.

# 3. Face recognition

Pics.io detects faces and can group images of the same person together. Useful for portraits, historical collections, or any scenario where tracking people matters.

# 4. Intelligent descriptive AI search

This feature lets you search using natural language or descriptive queries (“woman in red dress at a table”) and returns images that match, based on the AI-generated descriptions and analysis. It essentially lets the system understand the content, not just keywords.

# 5. AI task limits / credits

Every AI operation counts against your allotted credits (e.g., 36,000 or 180,000 per year), so you can prioritize which images get processed automatically.

Use case summary:

Speeds up metadata creation for large collections.
Makes images searchable by content, text, or faces, not just filenames.
Lets you find images using natural language with the intelligent search.
Credits system helps manage workflow and AI usage efficiently.

# After the camera captures a RAW image, the first step is often to clean up technical imperfections. DxO PhotoLab has denoise tools that reduce the “speckle” or grain you sometimes see in photos, especially in shadows or smooth areas like sky or fabric. This helps the image look smoother and more consistent while keeping the important details intact.

# Another key feature is optical correction. Lenses can introduce distortions, like curved lines or uneven edges, and slight perspective warping can happen when the camera isn’t perfectly aligned. DxO PhotoLab can automatically straighten these curves and correct lens-specific distortions, producing a truer-to-life representation of the object.

# Finally, using DxO PhotoLab at this stage prepares the image for more detailed editing in Photoshop. Once denoising and corrections are done, Photoshop can be used for targeted retouching—removing dust, blemishes, or minor imperfections—before creating the final archival TIFF and derivative JPEG files. This two-step workflow ensures that images are both technically accurate and visually clean.

# Camera & RAW Capture: The Sony A7R series is a high-resolution digital camera capturing RAW files, which record all the light and color data the sensor detects without in-camera processing. RAW files allow for flexible post-processing without degrading image quality, preserving the maximum detail for archival and exhibition purposes.

# DPI & Pixels vs Print: Images are captured as a grid of pixels. Screen DPI (like 1200 or 4400) refers to how many pixels the file contains, while print DPI (like 300) refers to how densely those pixels are laid out on paper. You can reduce DPI for printing without losing detail if the file has enough total pixels.

# Camera Sensors & Sony Lens: The A7R sensor has extremely fine photosites, letting it capture subtle textures and details. Paired with the Sony 50mm f/2.8 macro lens, it can sharply render small objects. Lens choice affects field of view, magnification, and focus precision, critical for photographing small negatives or prints.

# Noise/Grain & Denoising: High-resolution sensors naturally capture grain or noise, tiny speckles visible when zoomed to 100%. DxO PhotoLab and similar software can denoise, reducing this speckling while retaining true detail, making images cleaner for editing and exhibition without removing actual texture.

# Resampling & Iterations: From the RAW master, images can be resampled to different DPIs for printing or online display. Resampling is basically “adding or removing pixels to match a new size” (think stretching or shrinking a grid). Multiple iterations—archival TIFFs, lower-res TIFFs, JPEGs—can be generated without ever touching the original RAW.

# Noise / Grain Explained Straight: Noise or grain is what the sensor actually “sees” beyond just the object—tiny variations in light, photon arrival, and sensor electronics. Historically, grain was literally the chemical texture in film emulsions; now it’s digital analog, the sensor’s finest response to light. Even if a glass plate negative looks smooth to the eye, the sensor records microscopic variations in light hitting each photosite. Grain isn’t a flaw—it’s a byproduct of capturing ultra-fine detail, and very high-resolution sensors make it visible in ways earlier cameras or scanners did not. Think of it as the fingerprint of light itself—tiny dots your eye may not notice but the camera preserves.

# DPI Conversion Math (Layman Version): Imagine your image is a grid of tiny squares (pixels). On screen, you can zoom in or out—the squares stay the same, but they appear bigger or smaller. For print, the squares need to be packed tightly enough to look smooth.

Example: If your RAW is 8100 × 5800 pixels, that’s 46.9 million squares.
For a 16 × 20 inch print: 8100 ÷ 16 ≈ 506 pixels per inch horizontally; 5800 ÷ 20 ≈ 290 vertically. Your print DPI is around 300, which is plenty for crisp prints.
For a giant 8 ft wall vinyl: 8100 ÷ 96 ≈ 84 DPI horizontally. The pixels are more spread out, but at typical viewing distance, it still looks fine.

The key: pixels = total squares, DPI = how tightly you pack them on paper. You can’t invent more true pixels, but a large file lets you choose DPI for different output sizes without losing detail.

1. Camera & RAW Capture

# Sony A7R captures RAW files recording all photons hitting the sensor.

# RAW preserves maximum detail and color, letting us generate multiple outputs without touching the original.

2. Pixels vs DPI & Printing

# Pixels = total data captured; DPI = how densely we print them.

High-resolution RAWs let us print at 300 DPI for standard prints or lower DPI for very large prints (like wall vinyls) without losing visual quality.

3. Lens & Sensor

50 mm f/2.8 macro lens provides sharp focus and magnification for small objects.

# Sensor resolution captures fine detail, which can make natural grain/noise visible.

4. Noise / Grain & Denoising

# Noise/grain = tiny light variations recorded by the sensor.

DxO Photolab denoises while keeping texture; it’s like cleaning speckles without losing the object’s fine detail.

5. Resampling & Iterations

# Resampling = adding/removing pixels to match a new size.

From RAW master, we can make: archival TIFFs, lower-res TIFFs, JPEGs, or custom print versions.

6. Verso / Secondary Capture

Usually capture one iteration (JPEG or 8‑bit TIFF) unless recto is damaged.
Optional, case-by-case captures slow workflow and are rarely necessary.

7. Storage & File Sizes (typical)

RAW ≈ 120 MB; PSD ≈ 160 MB; 16‑bit TIFF ≈ 130 MB; 8‑bit TIFF ≈ 60 MB; JPEG ≈ 5 MB.
Discovery/lower-priority images can be saved at smaller size/resolution to save space and cost.

8. Printing / Oversampling Notes

Oversampling used as needed for exhibition prints.

# Stitching multiple captures possible for very large objects, increasing effective pixel count for crisp prints.

# Platen vs. vacuum vs. editing

Method	Pros	Cons
Platen	Flattens curved originals, consistent capture	Risk of Newton rings, pressure damage
Vacuum Table	Gentle flattening, good for film & prints	Expensive, suction risk to fragile items
Post-Processing	No physical contact, quick correction	Can blur, loses authenticity from artificial distortion

Outline draft 4

Here’s a fully synthesized Markdown version of your outline with all the relevant paragraphs and tables merged in, keeping technical details, examples, plain-language notes, and scenarios intact:

# 1. Context & Capture

Camera & RAW Capture
- The Sony A7R V captures ~61 MP full-frame RAW files (~9500 × 6300 pixels), recording all photons hitting the sensor without in-camera processing.
- RAW files act as digital masters: they store full sensor data, allowing images to be reprocessed, color-corrected, or resampled multiple times for different outputs (prints, digital displays, derivatives) while maintaining maximum quality.
- Plain-language note: RAW files are like “digital negatives”—they contain everything the camera captured and give the most flexibility for later editing.
- Footnote: Images are captured with a “gutter” for efficiency; final crops can reduce dimensions to ~8k × 5k or 7k × 4k pixels, still large enough for high-quality prints.
- Consistent high-resolution capture: Oversampling is applied when images are destined for print or exhibition, balancing storage, workflow, and scaling.
- Accurate color reproduction: Calibrated white balance and attention to swatches ensure fidelity across formats, meeting professional standards without slowing routine capture.

Capture Methods

Method	Pros	Cons
Platen	Flattens curved originals, consistent capture	Risk of Newton rings, pressure damage
Vacuum Table	Gentle flattening, good for film & prints	Expensive, suction risk to fragile items
Post-Processing	No physical contact, quick correction	Can blur, loses authenticity from artificial distortion

Post-editing (artificial correction): Can fix distortion, but may blur edges and lose authenticity compared to capturing the true shape.
Platen scanners: Keep photos flat, but can cause Newton rings (rainbow-like interference).
Vacuum tables: Hold items flat, but are expensive and rarely justified.

FADGI Compliance
- Workflow and settings align with preservation digitization standards.
- Plain-language note: FADGI sets out requirements for things like sharpness, color accuracy, and resolution. We’re meeting these by capturing at high resolution, using calibrated tools, and keeping RAW + TIFF masters.
- Efficient adherence to FADGI: Workflow hits all 3-star requirements and implements 4-star practices as needed, delivering professional-grade images at scale without unnecessary extra steps.

# 2. DPI & Resolution

On-screen vs Print DPI
- The “DPI” in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects the capture resolution of the image—how many pixels per inch the sensor recorded—not a fixed printing requirement.
- For print, images are resampled (changing the pixel count) to a lower DPI (e.g., 300 ppi for standard prints, 96 ppi for very large wall vinyls) because print size and viewing distance determine how many pixels per inch are actually needed for a visually sharp result.
- Plain-language guide: Divide the pixel count by the print DPI to get the size. Example: 6000 px ÷ 300 dpi = 20 inches. (So a 6000 px file prints about 20 inches wide at 300 dpi.)
- DPI Cheat Sheet (Screen vs Print, Easy Version)
  1. Pixels = total squares captured. Example: 8100 × 5800 = 46.9 MP.
  2. Print DPI = how tightly you pack those pixels. More DPI = crisper print; less DPI = pixels spread out.
  3. Standard print (16 × 20 in): 8100 ÷ 16 ≈ 506 DPI, 5800 ÷ 20 ≈ 290 DPI → ~300 DPI, perfect for exhibition prints.
  4. Large wall vinyl (8 ft ≈ 96 in): 8100 ÷ 96 ≈ 84 DPI, 5800 ÷ 96 ≈ 60 DPI → still looks good from normal viewing distance.
  5. Key idea: Pixels are fixed; DPI just determines how big you print them. Big files let you pick DPI for each output.
Resampling & Pixel Limits
- Resampling reduces the image’s pixel density to match the intended print size without inventing extra detail—you cannot create new original pixels, only interpolate (smoothly fill gaps) or average existing ones.
- This allows extremely high-res captures to produce large prints efficiently while retaining visual fidelity at human viewing distances.
- Plain-language note: Oversampling means we don’t have to rescan every time a new project needs a different size—one big capture can be downscaled into many smaller versions.
Resampling & Iterations Workflow
- From the RAW master, images can be resampled to different DPIs for printing or online display.
- Multiple iterations—archival TIFFs, lower-res TIFFs, JPEGs—can be generated without ever touching the original RAW.
- Layman’s analogy: Like stretching or shrinking a grid of colored tiles: you don’t add new real tiles, you just make them fit the new space.
Stitching for Large Objects
- For very large objects, multiple overlapping images can be captured on the copy stand using the 61 MP Sony A7R V (~9500 × 6300 pixels per frame) and digitally stitched into a single composite.
- Stitching two or more frames effectively doubles or triples the total pixel dimensions, increasing the effective megapixel count to 120–180 MP or more.
- This allows the resulting image to maintain high DPI—300 ppi or more—for large-format prints while preserving fine detail and ensuring on-screen clarity when zoomed, without artificial interpolation.

# 3. Image Processing

Sensor Grain vs Digital Noise
- Plain-language note: Film grain came from the chemistry of film; digital noise comes from the electronics in the camera sensor. Both show up as speckles in low light or high sensitivity settings.
- Noise or grain is what the sensor actually “sees”—tiny variations in light, photon arrival, and sensor electronics. Historically, grain was the chemical texture in film emulsions; now it’s a digital analog—the sensor’s finest response to light. Even if a glass plate negative looks smooth, the sensor records microscopic variations.
- Very high-resolution sensors make this visible in ways earlier cameras or scanners did not; think of it as the fingerprint of light itself.
DxO PhotoLab 9
- Used for denoising (reducing speckles) and correcting warped or curved images.
- Plain-language note: Denoising software can make photos cleaner and more readable, but too much can erase small details.
- Optical correction: Lenses can introduce distortions, like curved lines or uneven edges, and slight perspective warping. DxO PhotoLab can straighten these curves and correct lens-specific distortions, producing a truer-to-life representation.
- Preparing the image in DxO ensures cleaner, geometrically accurate files for Photoshop.
Editing (Photoshop)
- Used for making digital positives, removing dust/scratches, balancing tones, and cropping.
- Automation: Some steps (like cropping or making JPGs) can be scripted for efficiency.
- Cropping tightly and removing dust/blemishes is done selectively for exhibition copies or high-visibility derivatives, keeping routine digitization fast while still enabling exhibition-quality output.
- Polished final products: Photoshop edits allow targeted retouching while RAW and TIFF masters remain untouched.

# 4. DAMS & Storage

Pics.io Overview
- Cloud-based DAMS; organizes, searches, and manages assets.
- Features: OCR, auto-descriptions, face recognition, intelligent search.
- One record in TMS likely links the entire collection to Pics.io.
- AI outputs may require manual review and correction to ensure accuracy and proper metadata assignment.
AI Tools
1. Automatic descriptions/keywords: Generates text describing content (“man,” “portrait,” “suit”).
2. OCR/transcriptions: Reads text within images and makes it searchable.
3. Face recognition: Groups images of the same person.
4. Intelligent descriptive AI search: Allows natural-language queries to find images.
5. AI task limits/credits: Each operation counts against yearly allotment, prioritizing workflow efficiency.
- Plain-language note: AI can be trained to recognize people or recurring subjects, helping speed up cataloging.
File Policy & Retention
- RAW = master file (never altered).
- TIFFs = archival/preservation versions.
- JPGs = smaller, shareable discovery images.
- PSDs = working files (not permanent).
- Versos saved selectively (lower resolution or only when relevant).
- For less-critical images, saving as 8-bit TIFF or 300 dpi creates a “Discovery” version that’s still usable without massive storage costs.
- Priority objects get full-resolution archival TIFFs/RAWs.
File Sizes per Format

Format	Bit Depth / PPI	Estimated File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	1.75 GB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
TIFF 8‑bit, 4400 ppi	8-bit	806 MB
JPEG derivative	8-bit	5 MB
Sony uncompressed RAW	14‑16 bit	120 MB
PSD (working file)	16-bit	160 MB

Collection Storage Scenarios
- 1,000 images: 16-bit 1200 ppi TIFFs → 130 GB; 16-bit 4400 ppi TIFFs → 1.75 TB; RAWs → 120 GB; JPEGs → 5 GB; PSDs → 160 GB
- 50,000 images: 16-bit 1200 ppi TIFFs → 6.5 TB; 16-bit 4400 ppi TIFFs → 87.5 TB; RAWs → 6 TB; JPEGs → 250 GB; PSDs → 8 TB
- 75,000 images (~2.5 assets per object → ~187,500 files): 16-bit 1200 ppi TIFFs → 24.4 TB; 16-bit 4400 ppi TIFFs → 328 TB; RAWs → 22.5 TB; JPEGs → 937 GB; PSDs → 30 TB
Storage Location & Cost
- Files live on external 12 TB drives and B2 Backblaze cloud storage. Monthly cost noted separately.
Progress Overview
- ~8,000 items across 36 binders. ~20 binders reviewed in August. ~5,000 images digitized so far, including deteriorated nitrate/acetate scans.

# 5. Additional (If Time)

Outside Contractors
- Possible for acetate or fragile formats; interns or contractors could assist.
Naming Conventions
- e.g., “N” for nitrate, “r/v” for recto/verso, consistent identifiers tied to binders.

# References / Plain-Language Notes (Synthesized)

RAW = digital master; preserves sensor data.
Oversampling = capturing extra resolution so you can downscale later without rescanning.
DPI = packing of pixels; print DPI can be reduced for large prints if pixel count is sufficient.
Resampling = interpolating pixels for new output size without inventing original detail.
Noise/grain = sensor artifact, natural texture; denoising reduces speckle while preserving detail.
AI = speeds up metadata, OCR, face recognition; outputs require review to ensure accuracy.
Stitching = for very large objects, multiple frames create one high-resolution composite.

This synthesizes your outline with all the technical info, plain-language explanations, storage scenarios, AI notes, and image-processing details merged in, while avoiding redundant repetitions but keeping all the rich content intact.

If you want, I can also add inline references to tables and scenarios directly under the bullets where they belong, so it reads more like a formal report/manual rather than an outline. That would make it fully “ready-to-share.” Do you want me to do that next?

Old draft for DPI

On-screen vs Print DPI
- The “DPI” in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects the capture resolution of the image—how many pixels per inch the sensor recorded—not a fixed printing requirement.
- For print, images are resampled (changing the pixel count) to a lower DPI (e.g., 300 ppi for standard prints, 96 ppi for very large wall vinyls) because print size and viewing distance determine how many pixels per inch are actually needed for a visually sharp result.
  - Resampling reduces the image’s pixel density to match the intended print size without inventing extra detail—you cannot create new original pixels, only interpolate (smoothly fill gaps) or average existing ones.
    - From the RAW master, images can be resampled to different DPIs for printing or online display. Multiple iterations—archival TIFFs, lower-res TIFFs, JPEGs—can be generated without ever touching the original RAW.
  - This allows extremely high-res captures to produce large prints efficiently while retaining visual fidelity at human viewing distances.
  - Oversampling is applied when images are destined for print or exhibition, balancing storage, workflow, and scaling. (I think this is the 4400 ppi stuff)
  - Pixels = total squares captured; MP (megapixels) = that total divided by one million
    - So for example: 8100 × 5800 = 46.9 MP. Print DPI = how tightly you pack those pixels. More DPI = crisper print; less DPI = pixels spread out.
    - Divide the pixel count by the print DPI to get the size. Example: 6000 px ÷ 300 dpi = 20 inches. (So a 6000 px file prints about 20 inches wide at 300 dpi.)
      - Standard print (16 × 20 in): 8100 ÷ 16 ≈ 506 DPI, 5800 ÷ 20 ≈ 290 DPI → ~300 DPI, perfect for exhibition prints.
      - Large wall vinyl (8 ft ≈ 96 in): 8100 ÷ 96 ≈ 84 DPI, 5800 ÷ 96 ≈ 60 DPI → still looks good from normal viewing distance.
  - Key idea: Pixels are fixed; DPI just determines how big you print them. Big files let you pick DPI for each output.
Stitching for High Priority or Large Objects
- For high priority or very large objects, multiple overlapping images can be captured on the copy stand using the 61 MP Sony A7R V (~9500 × 6300 pixels per frame) and digitally stitched into a single composite.
- Stitching two or more frames effectively doubles or triples the total pixel dimensions, increasing the effective megapixel count to 120–180 MP or more.
- This allows the resulting image to maintain high DPI—300 ppi or more—for large-format prints while preserving fine detail and ensuring on-screen clarity when zoomed, without artificial interpolation.
Updated DPI draft
- On-screen vs Print DPI
  - The “DPI” in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects the capture resolution of the sensor—how many pixels it recorded per inch—not a fixed printing requirement.
  - For print, images are resampled (adjusting pixel count) to match the intended print size and viewing distance.
    - Resampling reduces pixel density without creating new original pixels—you cannot invent detail, only interpolate or average existing ones.
      - This approach allows extremely high-res captures to produce large prints efficiently while retaining visual fidelity at normal viewing distances.
    - Oversampling such as using a 4400 ppi TIFF is applied for print or exhibition to balance storage, workflow, and scaling.
  - Pixels and Megapixels (MP)
    - Pixels = total squares captured; MP = total pixels ÷ 1,000,000. For example, 8100 × 5800 pixels = 46,980,000 px ≈ 46.9 MP.
    - Print DPI determines how tightly those pixels are packed. More DPI = crisper print, less DPI = pixels spread out.
    - To calculate print size: divide pixel dimensions by print DPI.
      - Example: 6000 px ÷ 300 DPI = 20 inches wide.
        
        Standard print (16 × 20 in): 8100 px ÷ 16 in ≈ 506 DPI, 5800 px ÷ 20 in ≈ 290 DPI → ~300 DPI, ideal for exhibition prints.
        
        Large wall vinyl (8 ft ≈ 96 in): 8100 px ÷ 96 in ≈ 84 DPI, 5800 px ÷ 96 in ≈ 60 DPI → acceptable for normal viewing distances.
    - Key idea: Pixels are fixed; DPI just determines print size. Higher-res files allow flexibility in selecting DPI for different outputs.
- Stitching for High-Priority or Large Objects
  - For high-priority or very large objects, multiple overlapping captures can be taken on the copy stand with the 61 MP Sony A7R V (~9500 × 6300 px per frame) and digitally stitched into a single composite.
  - Stitching two or more frames effectively increases total pixel dimensions, boosting the effective megapixel count to 120–180 MP or more.
  - This ensures large-format prints maintain high DPI (300 ppi+) while preserving fine detail and supporting on-screen clarity at high zoom levels, without relying on artificial interpolation.
  ——

Outline draft 5

# 1. Context & Capture ← iffy header

Camera & RAW Capture ← don’t like this header
- File formats relevant to the Hooks Brothers Collection include:
  - RAW = master file
  - TIFFs = archival/preservation versions
  - JPGs = smaller, shareable access images
  - PSDs = working files (not permanent)
- The Sony A7R V captures ~61 MP full-frame RAW files (~9500 × 6300 pixels), recording all photons hitting the sensor without in-camera processing.
  - Images are captured with a “gutter” for efficiency; final crops can reduce dimensions at times but are still large enough for high-quality prints.
- RAW files act as digital masters: they store full sensor data, allowing images to be reprocessed, color-corrected, or resampled multiple times for different outputs (prints, digital displays, derivatives) while maintaining maximum quality. // From the RAW master, multiple outputs can be generated—archival TIFFs, lower-res TIFFs, JPEGs—without touching the original RAW.
  - RAW files contain everything the camera captured and give the most flexibility for later editing. [Caveat that RAWs by themselves can look underdeveloped and are often not compatible with many other non-photo software, and they often don’t contain certain [metadata???] like portrait/landscape orientation, and therefore while they are integral to keep as a master copy, they can’t fill in for e.g. a processed/edited archival TIFF.]
FADGI sets standards for digitizing cultural heritage materials to ensure high-quality, long-lasting digital surrogates. Key guidelines include capturing images at sufficient resolution, using accurate color management, saving in uncompressed or minimally compressed formats with proper metadata, and handling originals carefully. Workflows should produce master files separate from access copies, include quality checks, and follow best practices for storage and preservation.
- We’re meeting these by capturing at high resolution, using calibrated tools, and keeping RAW + TIFF masters, [etc.]. By default our workflow hits all 3-star requirements, and [routinely] implements 4-star practices [wherever] necessary for output quality.

# 2. DPI & Resolution

On-screen vs Print DPI
- The “DPI” in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects the sensor’s capture resolution—how many pixels per inch were recorded—not a fixed print requirement.
- For print, images are resampled to match the intended size and viewing distance.
  - Resampling lowers pixel density without creating new pixels—you cannot invent detail, only interpolate or average existing ones. Physics sets a limit; the number of original pixels is fixed at the moment of capture, so larger prints can never have more true detail than the capture provides. This lets high-res captures produce large prints efficiently while retaining visual fidelity.
  - Oversampling (e.g., 4400 ppi TIFFs) is used for print or exhibition to balance storage, workflow, and scaling.
Pixels and Megapixels (MP) ← i don’t like this as a header
- Pixels = total squares captured; MP = total pixels ÷ 1,000,000. For example, 8100 × 5800 px = 46,980,000 px ≈ 46.9 MP.
- Print DPI determines how tightly pixels are packed: more DPI = crisper print; less DPI = pixels spread out.
  - To calculate print size: divide pixel dimensions by print DPI. For example, 6000 px ÷ 300 DPI = 20 inches wide.
  - Pixels are fixed; DPI determines print size. Higher-res files allow flexibility for different outputs.
High-Priority or Large Objects
- For high-priority or very large objects, multiple overlapping captures on the 61 MP Sony A7R V (~9500 × 6300 px per frame) can be digitally “stitched” together into a single composite.
- Stitching two or more frames increases total pixel dimensions, raising effective megapixels to 120–180 MP or more.
- This preserves high DPI (300 ppi+) for large prints while maintaining fine detail and on-screen clarity, without artificial interpolation.

# 3. Image Processing ← not a great header

Sensor Grain vs Digital Noise ← not a great header. does this even need a header
- Film grain came from the chemistry of film; digital noise comes from the electronics in the camera sensor. Both show up as speckles in low light or high sensitivity settings.
- Noise or grain is what the sensor actually “sees”—tiny variations in light, photon arrival, and sensor electronics. Historically, grain was the chemical texture in film emulsions; now it’s a digital analogue—the sensor’s finest response to light. Even if a glass plate negative looks smooth, the sensor records microscopic variations. Very high-resolution sensors make this visible in ways earlier cameras or scanners did not.
Noise can be artificially reduced via “denoising” within photography editing software. The [better word than “leading vendor”] for noise removal is DxO’s PhotoLab 9, which can be used for denoising, correcting warped or curved images, [better phrasing than “and other techniques”].
- These techniques have their own shortcomings; denoising works by “softening” the edges and textures within an image, potentially erasing small details. Optical correction [something about how it blurs/loses fidelity I swear I had this in my notes originally??].
Exported prints and (optionally denoised) negatives are further adjusted via automation (batch cropping, bulk JPG conversion) and manual post-processing in Adobe Photoshop 2025, which allows for creating digital positives from the negatives, fine-tuning exposure and leveling, and retouching [for] dust, scratches, bubbling in the acetate, etc.
- More detailed and extensive retouching withheld for exhibition copies or high-visibility derivatives.
- Edited files are kept in a Photoshop “wrapper” (PSD) that is lossless, to facilitate any further adjustments and changes [to the file]. PSDs are retained for 1 year after creation.

# 4. DAMS & Storage

All RAW files are automatically stored on MacOS drive via Capture One. Working files are kept on local MacBook Pro 2TB SSD, currently backed up to MBMA’s Google Drive account and an external Western Digital [Passport?] 12TB drive. Google Drive and WD Passport external drives also back up older files, PSDs, and any derivatives and copies.
- For increased stability and reliability, and more robust backup and retrieval options, suggest moving collection’s cloud backups to Backblaze B2. If the museum’s IT department ever expands, this backup storage could also be reconfigured to be used for cost-effective DAMS storage.
There are 36 binders with 20 binders flagged for digitization in August. About 50% of each binder is flagged for exhibition or catalogue, and focus is on digitizing flagged objects in the binders before moving on to flagged objects in the rest of the collection. Approximately 5000 images were digitized (including deteriorated nitrates/acetates) prior to those 20 binders introduced in August.

File sizes vary by bit depth, format, and ppi and directly impact the cost of storage for backup or a DAMS.

Format	Bit Depth	Average File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	360 MB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
JPEG derivative	8-bit	5 MB
Uncompressed RAW	16 bit	130 MB
PSD (working file)	16-bit	700 MB

DAMS Storage Scenarios

Number of Images	16-bit 1200 ppi TIFF	16-bit 4400 ppi TIFF	8-bit 1200 ppi TIFF	RAW	JPEG
1,000	130 GB	360 GB	60 GB	130 GB	5 GB
50,000	6.5 TB	18 TB	3 TB	6.5 TB	250 GB
~187,500	9.75 TB	27 TB	4.5 TB	9.75 TB	375 GB

Storage costs would increase as digitized collection grew. Cost of external drives is also recurring as new ones occasionally replace [old ones.]
Suggested [stratification] of [which/how many images] are stored per object. All RAWs are retained.
- Select objects receive oversampled TIFFs for print or may be “stitched” together. Priority objects get full-resolution archival TIFFs.
- For less-critical images, saving as 8-bit TIFF or at 300 dpi may be the best option for “discovery” version that is still usable for research and reference.
- Versos saved selectively, e.g. only the negative copy of a verso as an 8-bit TIFF or at 300 dpi.

Pics.io DAMS
- Pics.io will serve as the primary cloud DAMS, providing controlled, role-based access, quick and curated public sharing, and search across metadata, keywords, and full-text content. The system stores TIFF masters and generates JPG derivatives on demand, supports batch ingest and bulk metadata updates, and allows creation of custom fields and full version tracking.
  - One record in TMS likely links the entire collection to Pics.io.
- It also supports AI-driven tasks based on a set number of credits per year, which include:
  - Automatic generation of detailed object descriptions and keywords (e.g. “woman,” “portrait,” “wedding,” “bouquet.”)
  - [Image] OCR [and audio/video transcriptions], reading text within images and making it searchable.
  - Facial recognition, tagging and grouping images of the same person.
  - Intelligent descriptive search, allowing for natural-language queries to find images.
    - Could upload images outside the Hooks collection to “train” it to recognize certain [figures].
  - AI outputs may require manual review and correction to ensure accuracy and proper metadata assignment.

# 5. Additional (If Time)

Naming Conventions
- e.g. “N” for nitrate, “P” for prints, “r/v” suffix for recto/verso, etc.
Outside Contractors and interns

[Flattening images??? how to phrase this lol]

Method	Pros	Cons
Platen	Flattens curved originals, consistent capture	Risk of Newton rings, pressure damage
Vacuum Table	Gentle flattening, good for film & prints	Expensive, suction risk to fragile items
Post-Processing	No physical contact, quick correction	Can blur, loses authenticity from artificial distortion

# References / Plain-Language Notes (Synthesized)

RAW = digital master; preserves sensor data.
Oversampling = capturing extra resolution so you can downscale later without rescanning.
DPI = packing of pixels; print DPI can be reduced for large prints if pixel count is sufficient.
Resampling = interpolating pixels for new output size without inventing original detail.
Noise/grain = sensor artifact, natural texture; denoising reduces speckle while preserving detail.
AI = speeds up metadata, OCR, face recognition; outputs require review to ensure accuracy.
Stitching = for very large objects, multiple frames create one high-resolution composite.

Draft 6

# 1. Collection Capture & File Types

File Formats
- RAW = master file
- TIFF = archival/preservation version
- JPG = smaller, shareable access images
- PSD = working files (temporary, not permanent)
Camera Capture
- Sony A7R V captures ~61 MP full-frame RAW files (~9500 × 6300 px), recording all photons hitting the sensor without in-camera processing.
  - Images may include extra margin (“gutter”) for efficiency; final crops reduce dimensions but remain large enough for high-quality prints.
- RAW files serve as digital masters, storing full sensor data and allowing multiple reprocessing, color corrections, or resampling for prints, digital displays, or derivatives without altering the original.
  - RAWs may look underdeveloped initially, can lack some metadata (e.g., orientation flags), and are not always compatible with non-photo software, but remain essential as master copies.
FADGI Compliance
- FADGI guidelines ensure high-quality, long-lasting digital surrogates. Key practices include: capturing at sufficient resolution, using calibrated color management, storing in uncompressed or minimally compressed formats, retaining master vs. access copies, handling originals carefully, performing quality checks, and following proper storage/preservation procedures.
- Our workflow meets all 3-star requirements by default, and implements 4-star practices where needed for output quality.

# 2. Resolution, Pixels, & Print Scaling

DPI vs Print
- DPI in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects sensor capture resolution, not a fixed print requirement.
- For print, images are resampled to match intended size and viewing distance.
  - Resampling reduces pixel density but cannot create new detail; original pixels are fixed at capture.
  - Oversampling (e.g., 4400 ppi TIFFs) is used for exhibition or print to balance storage, workflow, and scaling.
Pixels & Megapixels
- Pixels = total squares captured; MP = total pixels ÷ 1,000,000. Example: 8100 × 5800 px = 46.9 MP.
- Print DPI determines pixel packing: higher DPI = crisper print, lower DPI = pixels spread out.
  - Print size calculation: divide pixel dimensions by DPI. Example: 6000 px ÷ 300 DPI = 20 in.
High-Priority / Large Objects
- Multiple overlapping captures on the 61 MP Sony A7R V (~9500 × 6300 px per frame) can be stitched into a single composite.
- Stitching increases total pixel dimensions (120–180 MP+), preserving high DPI (300 ppi+) for large prints and on-screen clarity without artificial interpolation.

# 3. Image Quality & Processing

Sensor Grain vs Digital Noise
- Film grain comes from chemistry; digital noise comes from sensor electronics. Both appear as speckles in low light or high ISO.
- High-res sensors reveal microscopic variations that older cameras or scanners could not capture.
Denoising & Corrections
- Noise can be reduced with software such as DxO PhotoLab 9, handling denoising, geometric correction, and related adjustments.
- Denoising softens edges/textures and may erase fine detail; optical corrections can slightly blur or reduce fidelity.
Post-Processing
- Exports (including optionally denoised negatives) are further processed via batch automation (cropping, bulk JPG conversion) and manual editing in Photoshop 2025.
  - Tasks include generating digital positives, exposure adjustment, leveling, and retouching dust, scratches, or acetate defects.
  - Detailed retouching is reserved for exhibition copies or high-visibility derivatives.
  - Edited files are stored in PSD wrappers (lossless) for up to one year to allow further adjustments.

# 4. Storage, File Sizes, & DAMS

Local Storage
- RAWs stored via Capture One on MacOS; working files on 2 TB MacBook Pro SSD, backed up to Google Drive and a 12 TB external drive.
- Older files, PSDs, and derivatives are also backed up on external drives.
- For greater stability and reliability, we suggest moving the collection’s cloud backups to Backblaze B2. If the museum’s IT department expands, this storage could later be reconfigured for cost-effective DAMS use.
Collection Status
- 36 binders total; 20 flagged for digitization in August.
- About 50% of each binder is for exhibition/catalogue; digitization prioritizes flagged objects first.
- ~5,000 images digitized prior to August (including deteriorated nitrates/acetates).

File Sizes & Storage Scenarios

Format	Bit Depth	Avg File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	360 MB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
JPEG derivative	8-bit	5 MB
RAW (uncompressed)	16-bit	130 MB
PSD (working)	16-bit	700 MB

Draft 7.1

# 1. Collection Capture & File Types

File Formats
- RAW = master file
- TIFF = archival/preservation version
- JPG = smaller, shareable access image
- PSD = working files (temporary, not permanent)
Camera Capture
- Sony A7R V captures ~61 MP full-frame RAW files (~9500 × 6300 px), recording all photons hitting the sensor without in-camera processing.
  - Images may include extra margin (“gutter”) for efficiency; final crops reduce dimensions but remain large enough for high-quality prints.
- RAW files serve as digital masters, storing full sensor data and allowing multiple reprocessing, color corrections, or resampling for prints, digital displays, or derivatives without altering the original.
  - RAWs may appear underdeveloped, can lack some metadata (e.g., orientation flags), and are not always compatible with non-photo software, but remain essential as master copies.
FADGI Compliance
- FADGI guidelines ensure the creation of high-quality, long-lasting digital surrogates. Images should be captured at sufficient resolution, with accurate color management, and saved as uncompressed or minimally compressed files that include proper metadata. Originals must be handled carefully, master and access copies maintained separately, and quality checks performed regularly.
- Our workflow meets all 3-star requirements by default, and implements 4-star practices wherever higher output quality is required.

# 2. Resolution, Pixels, & Print Scaling

DPI vs. Print
- DPI in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects sensor capture resolution, not a fixed print requirement.
- For print, images are resampled to match intended size and viewing distance.
  - Resampling reduces pixel density but cannot create new detail; original pixels are fixed at capture.
  - Oversampling (e.g., 4400 ppi TIFFs) is used for exhibition or print to balance storage, workflow, and scaling.
Pixels & Megapixels
- Pixels = total squares captured; MP = total pixels ÷ 1,000,000. Example: 8100 × 5800 px = 46.9 MP.
- Print DPI determines pixel packing: higher DPI = crisper print, lower DPI = pixels spread out.
  - Print size calculation: divide pixel dimensions by DPI. Example: 6000 px ÷ 300 DPI = 20 in.
High-Priority / Large Objects
- Multiple overlapping captures on the 61 MP Sony A7R V (~9500 × 6300 px per frame) can be stitched into a single composite.
- Stitching increases total pixel dimensions (120–180 MP+), preserving high DPI (300 ppi+) for large prints and on-screen clarity without artificial interpolation.

# 3. Image Quality & Processing

Sensor Grain vs. Digital Noise
- Film grain comes from chemistry; digital noise comes from sensor electronics. Both appear as speckles in low light or high ISO.
- High-resolution sensors reveal microscopic variations invisible to older cameras or scanners.
Denoising & Corrections
- Noise can be reduced using software such as DxO PhotoLab 9, which handles denoising, geometric correction, and related adjustments.
- Denoising softens edges and textures, potentially erasing fine detail; optical corrections may slightly blur or reduce fidelity.
Post-Processing
- Exports (including optionally denoised negatives) are further processed via batch automation (cropping, bulk JPG conversion) and manual editing in Photoshop 2025.
  - Tasks include generating digital positives, exposure adjustment, leveling, and retouching dust, scratches, or acetate defects.
  - Detailed retouching is reserved for exhibition copies or high-visibility derivatives.
  - Edited files are stored in PSD wrappers (lossless) for up to one year to allow further adjustments.

# 4. Storage, File Sizes, & DAMS

Local Storage
- RAWs stored via Capture One on MacOS; working files on 2 TB MacBook Pro SSD, backed up to Google Drive and a 12 TB external drive.
- Older files, PSDs, and derivatives are also backed up on external drives.
- For greater stability, reliability, and backup/retrieval options, we suggest moving the collection’s cloud backups to Backblaze B2. If the museum’s IT department expands, this storage could later be reconfigured for cost-effective DAMS use.
Collection Status
- 36 binders total; 20 flagged for digitization in August.
- About 50% of each binder is for exhibition/catalogue; digitization prioritizes flagged objects first.
- ~5,000 images digitized prior to August (including deteriorated nitrates/acetates).

File Sizes & Storage Scenarios

Format	Bit Depth	Avg File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	360 MB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
JPEG derivative	8-bit	5 MB
RAW (uncompressed)	16-bit	130 MB
PSD (working)	16-bit	700 MB

Draft 7

# 1. Collection Capture & File Types

File Formats
- RAW = master file
- TIFF = archival/preservation version
- JPG = smaller, shareable access image
- PSD = working files (temporary, not permanent)
Camera Capture
- Sony A7R V captures ~61 MP full-frame RAW files (~9500 × 6300 px), recording all photons hitting the sensor without in-camera processing.
  - Images may include extra margin (“gutter”) for efficiency; final crops reduce dimensions but remain large enough for high-quality prints.
- RAW files serve as digital masters, storing full sensor data and allowing multiple reprocessing, color corrections, or resampling for prints, digital displays, or derivatives without altering the original.
  - RAWs may appear underdeveloped, can lack some metadata (e.g., orientation flags), and are not always compatible with non-photo software, but remain essential as master copies.

# 2. Resolution, Pixels, & Print Scaling

DPI vs. Print
- DPI in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects sensor capture resolution, not a fixed print requirement.
- For print, images are resampled to match intended size and viewing distance.
  - Resampling reduces pixel density but cannot create new detail; original pixels are fixed at capture.
  - Oversampling (e.g., 4400 ppi TIFFs) is used for exhibition or print to balance storage, workflow, and scaling.
Pixels & Megapixels
- Pixels = total squares captured; MP = total pixels ÷ 1,000,000. Example: 8100 × 5800 px = 46,980,000 px ≈ 46.9 MP.
- Print DPI determines pixel packing: higher DPI = crisper print, lower DPI = pixels spread out.
  - Print size calculation: divide pixel dimensions by DPI. Example: 6000 px ÷ 300 DPI = 20 in.
- Pixels are fixed; DPI determines print size. Higher-res files allow flexibility for different outputs.
High-Priority / Large Objects
- Multiple overlapping captures on the 61 MP Sony A7R V (~9500 × 6300 px per frame) can be stitched into a single composite.
- Stitching two or more frames increases total pixel dimensions, raising effective megapixels to 120–180 MP or more.
- Preserves high DPI (300 ppi+) for large prints while maintaining fine detail and on-screen clarity, without artificial interpolation.

# 3. Image Quality & Processing

Sensor Grain vs. Digital Noise
- Film grain comes from chemistry; digital noise comes from sensor electronics. Both appear as speckles in low light or high ISO.
- High-resolution sensors reveal microscopic variations invisible to older cameras or scanners.
Denoising & Corrections
- Noise can be reduced using software such as DxO PhotoLab 9, which handles denoising, geometric correction, and related adjustments.
- Denoising softens edges and textures, potentially erasing fine detail; optical corrections may slightly blur or reduce fidelity.
Post-Processing
- Exports (including optionally denoised negatives) are further processed via batch automation (cropping, bulk JPG conversion) and manual editing in Photoshop 2025.
  - Tasks include generating digital positives, exposure adjustment, leveling, and retouching dust, scratches, or acetate defects.
  - Detailed retouching is reserved for exhibition copies or high-visibility derivatives.
  - Edited files are stored in PSD wrappers (lossless) for up to one year to allow further adjustments.

# 4. Storage, File Sizes, & DAMS

Local Storage
- RAWs stored via Capture One on MacOS; working files on 2 TB MacBook Pro SSD, backed up to Google Drive and a 12 TB external drive.
- Older files, PSDs, and derivatives are also backed up on external drives.
- For greater stability, reliability, and backup/retrieval options, collection cloud backups are suggested to move to Backblaze B2. If the museum’s IT department expands, this storage could later be reconfigured for cost-effective DAMS use.
Collection Status
- 36 binders total; 20 flagged for digitization in August.
- About 50% of each binder is for exhibition/catalogue; digitization prioritizes flagged objects first.
- ~5,000 images digitized prior to August (including deteriorated nitrates/acetates).

File Sizes & Storage Scenarios

Format	Bit Depth	Avg File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	360 MB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
JPEG derivative	8-bit	5 MB
RAW (uncompressed)	16-bit	130 MB
PSD (working)	16-bit	700 MB

# 5. Pics.io DAMS & AI Tools

Primary Cloud DAMS
- Pics.io serves as the main cloud DAMS, providing controlled, role-based access, curated public sharing, and search across metadata, keywords, and full-text content.
- Stores TIFF masters and generates JPG derivatives on demand, supports batch ingest, bulk metadata updates, custom fields, and full version tracking.
- Likely links to TMS so one record ties the entire collection to Pics.io.
AI-Driven Tasks (Credit-Based)
- Automatic generation of detailed object descriptions and keywords (e.g., “woman,” “portrait,” “wedding,” “bouquet”).
- OCR and audio/video transcription: reads text within images and makes it searchable.
- Facial recognition: tagging and grouping images of the same person.
- Intelligent descriptive search: allows natural-language queries to locate images.
  - Could upload images outside the Hooks collection to train recognition for certain figures.
- Caveat: AI outputs may require manual review and correction to ensure accuracy and proper metadata assignment.

# 6. Additional / Optional Tasks

Naming Conventions
- e.g., “N” for nitrate, “P” for prints, “r/v” suffix for recto/verso, etc.
Outside Contractors and Interns

Flattening Methods

Method	Pros	Cons
Platen	Flattens curved originals, consistent capture	Risk of Newton rings, pressure damage
Vacuum Table	Gentle flattening, good for film & prints	Expensive, suction risk to fragile items
Post-Processing	No physical contact, quick correction	Can blur, loses authenticity from artificial distortion

Draft 8

# 1. Collection Capture & File Types

File Formats
- RAW = master file
- TIFFs = archival/preservation versions
- JPGs = smaller, shareable access images
- PSDs = working files (not permanent)
Camera Capture
- The Sony A7R V captures ~61 MP full-frame RAW files (~9500 × 6300 pixels), recording all photons hitting the sensor without in-camera processing.
  - Images are captured with a “gutter” for efficiency; final crops may reduce dimensions slightly but still remain large enough for high-quality prints.
- RAW files serve as digital masters, storing full sensor data and allowing images to be reprocessed, color-corrected, or resampled multiple times for prints, digital displays, or derivative outputs, without altering the original.
  - RAW files may look underdeveloped, can lack certain metadata (e.g., portrait/landscape orientation), and are often incompatible with non-photo software, but are essential as master copies.
FADGI Standards
- FADGI provides guidelines for digitizing cultural heritage materials to ensure high-quality, long-lasting digital surrogates.
- Practices include capturing at sufficient resolution, using accurate color management, saving in uncompressed or minimally compressed formats with proper metadata, and careful handling of originals.
- Our workflow produces master files separate from access copies, includes quality checks, and routinely meets 3-star requirements, implementing 4-star practices where necessary.

# 2. Resolution, Pixels, & Print Scaling

DPI vs. Print
- The DPI in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects the sensor’s capture resolution, not a fixed print requirement.
- For print, images are resampled to match the intended size and viewing distance.
  - Resampling lowers pixel density but does not create new detail; original pixels are fixed at capture.
  - Oversampling (e.g., 4400 ppi TIFFs) is used for exhibition or print to balance storage, workflow, and scaling.
Pixels & Megapixels
- Pixels = total squares captured; MP = total pixels ÷ 1,000,000. Example: 8100 × 5800 px = 46,980,000 px ≈ 46.9 MP.
- Print DPI determines how tightly pixels are packed: higher DPI = crisper print; lower DPI = pixels spread out.
  - To calculate print size: divide pixel dimensions by print DPI. Example: 6000 px ÷ 300 DPI = 20 inches wide.
- Pixels are fixed; DPI determines print size. Higher-res files allow flexibility for different outputs.
High-Priority or Large Objects
- For very large or high-priority objects, multiple overlapping captures on the 61 MP Sony A7R V (~9500 × 6300 px per frame) can be digitally stitched into a single composite.
- Stitching two or more frames increases total pixel dimensions, raising effective megapixels to 120–180 MP or more.
- This preserves high DPI (300 ppi+) for large prints while maintaining fine detail and on-screen clarity, without artificial interpolation.

# 3. Image Quality & Processing

Sensor Grain vs. Digital Noise
- Film grain originates from the chemical process of film; digital noise comes from the camera sensor electronics. Both appear as speckles in low light or high sensitivity settings.
- Very high-resolution sensors reveal microscopic variations even when originals look smooth, capturing details invisible to older cameras or scanners.
Noise Reduction & Corrections
- Noise can be reduced using software such as DxO PhotoLab 9, which performs denoising, correction for warped or curved images, and related adjustments.
- Denoising softens edges and textures, which can erase small details. Optical corrections may slightly blur or reduce fidelity.
Post-Processing
- Exported prints and optionally denoised negatives are further adjusted via batch automation (cropping, bulk JPG conversion) and manual editing in Adobe Photoshop 2025.
  - Tasks include generating digital positives from negatives, fine-tuning exposure, leveling, and retouching dust, scratches, or bubbling in acetate.
  - Detailed retouching is reserved for exhibition copies or high-visibility derivatives.
  - Edited files are kept in PSD wrappers (lossless) to allow further adjustments and are retained for one year.

# 4. Storage, File Sizes, & DAMS

Local Storage
- RAW files are automatically stored on MacOS via Capture One. Working files are kept on a 2 TB MacBook Pro SSD, backed up to MBMA’s Google Drive account and a 12 TB external Western Digital drive.
- Older files, PSDs, and derivatives are also backed up on external drives.
- For increased stability and reliability, and more robust backup and retrieval options, moving cloud backups to Backblaze B2 is suggested. If the museum IT expands, this could be reconfigured for cost-effective DAMS storage.
Collection Status
- 36 binders total; 20 binders flagged for digitization in August.
- About 50% of each binder is flagged for exhibition or catalogue; digitization focuses on flagged objects first.
- Approximately 5,000 images were digitized prior to these 20 binders, including deteriorated nitrates and acetates.

File Sizes & Storage Scenarios

Format	Bit Depth	Average File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	360 MB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
JPEG derivative	8-bit	5 MB
Uncompressed RAW	16-bit	130 MB
PSD (working file)	16-bit	700 MB

DAMS Storage Scenarios

Number of Images	16-bit 1200 ppi TIFF	16-bit 4400 ppi TIFF	8-bit 1200 ppi TIFF	RAW	JPEG
1,000	130 GB	360 GB	60 GB	130 GB	5 GB
50,000	6.5 TB	18 TB	3 TB	6.5 TB	250 GB
~187,500	9.75 TB	27 TB	4.5 TB	9.75 TB	375 GB

Storage costs rise as digitization progresses. External drives incur recurring replacement costs.
Suggested stratification: all RAWs retained; select objects receive oversampled TIFFs for print or stitched composites. Priority objects get full-resolution archival TIFFs; less-critical images may be saved as 8-bit TIFF or 300 DPI for research/discovery use.
Versos saved selectively, e.g., only the negative copy of a verso as 8-bit TIFF or at 300 DPI.

# 5. Pics.io DAMS & AI Tools

Primary Cloud DAMS
- Pics.io serves as the main cloud DAMS, providing controlled, role-based access, curated public sharing, and search across metadata, keywords, and full-text content.
- Stores TIFF masters and generates JPG derivatives on demand, supports batch ingest, bulk metadata updates, custom fields, and full version tracking.
- One TMS record likely links the entire collection to Pics.io.
AI-Driven Tasks
- Automatic generation of detailed object descriptions and keywords (e.g., “woman,” “portrait,” “wedding,” “bouquet”).
- OCR and audio/video transcription, reading text within images to make it searchable.
- Facial recognition, tagging and grouping images of the same person.
- Intelligent descriptive search for natural-language queries; could upload external images to train recognition of certain figures.
- Caveat: AI outputs may require manual review and correction to ensure accuracy and proper metadata assignment.

# 6. Additional Agenda Items

Naming Conventions
- e.g., “N” for nitrate, “P” for prints, “r/v” suffix for recto/verso, etc.
Outside Contractors and Interns

Object Flattening

Method	Pros	Cons
Platen	Flattens curved originals, consistent capture	Risk of Newton rings, pressure damage
Vacuum Table	Gentle flattening, good for film & prints	Expensive, suction risk to fragile items
Post-Processing	No physical contact, quick correction	Can blur, loses authenticity from artificial distortion

Draft 9

Here’s Draft 8 with Draft 7 headers and full fidelity to Draft 5 bullets and content:

# 1. Collection Capture & File Types

Camera & RAW Capture
- File formats relevant to the Hooks Brothers Collection include:
  - RAW = master file
  - TIFFs = archival/preservation versions
  - JPGs = smaller, shareable access images
  - PSDs = working files (not permanent)
- The Sony A7R V captures ~61 MP full-frame RAW files (~9500 × 6300 pixels), recording all photons hitting the sensor without in-camera processing.
  - Images are captured with margin for efficiency; final crops can reduce dimensions at times but are still large enough for high-quality prints.
- RAW files act as digital masters: they store full sensor data, allowing images to be reprocessed, color-corrected, or resampled multiple times for different outputs (prints, digital displays, derivatives) while maintaining maximum quality.
  - RAW files contain everything the camera captured and give the most flexibility for later editing. RAWs may appear underdeveloped, can lack some metadata (e.g., orientation flags), and are not always compatible with non-photo software, but remain essential as master copies.
FADGI sets standards for digitizing cultural heritage materials to ensure high-quality, long-lasting digital surrogates. Key guidelines include capturing images at sufficient resolution, using accurate color management, saving in uncompressed or minimally compressed formats with proper metadata, and handling originals carefully. Workflows should produce master files separate from access copies, include quality checks, and follow best practices for storage and preservation.
- Our workflow meets these by capturing at high resolution, using calibrated tools, and keeping RAW + TIFF masters. By default, the workflow hits all 3-star requirements and routinely implements 4-star practices where necessary for output quality.

# 2. Resolution, Pixels, & Print Scaling

On-Screen vs Print DPI
- The “DPI” in a RAW or TIFF (e.g., 1200 or 4400 ppi) reflects the sensor’s capture resolution—how many pixels per inch were recorded—not a fixed print requirement.
- For print, images are resampled to match the intended size and viewing distance.
  - Resampling lowers pixel density without creating new pixels—you cannot invent detail, only interpolate or average existing ones. The number of original pixels is fixed at capture, so larger prints cannot contain more true detail than captured. High-res captures allow large prints while retaining visual fidelity.
  - Oversampling (e.g., 4400 ppi TIFFs) is used for print or exhibition to balance storage, workflow, and scaling.
Pixels & Megapixels
- Pixels = total squares captured; MP = total pixels ÷ 1,000,000. Example: 8100 × 5800 px = 46,980,000 px ≈ 46.9 MP.
- Print DPI determines how tightly pixels are packed: more DPI = crisper print; less DPI = pixels spread out.
  - To calculate print size: divide pixel dimensions by print DPI. Example: 6000 px ÷ 300 DPI = 20 inches wide.
  - Pixels are fixed; DPI determines print size. Higher-res files allow flexibility for different outputs.
High-Priority or Large Objects
- For high-priority or very large objects, multiple overlapping captures on the 61 MP Sony A7R V (~9500 × 6300 px per frame) can be digitally “stitched” together into a single composite.
- Stitching two or more frames increases total pixel dimensions, raising effective megapixels to 120–180 MP or more.
- This preserves high DPI (300 ppi+) for large prints while maintaining fine detail and on-screen clarity, without artificial interpolation.

# 3. Image Quality & Processing

Sensor Grain vs Digital Noise
- Film grain comes from the chemistry of film; digital noise comes from the electronics in the camera sensor. Both show up as speckles in low light or high sensitivity settings.
- Noise or grain is what the sensor actually “sees”—tiny variations in light, photon arrival, and sensor electronics. Historically, grain was the chemical texture in film emulsions; now it’s a digital analogue—the sensor’s finest response to light. Even if a glass plate negative looks smooth, the sensor records microscopic variations. Very high-resolution sensors make this visible in ways earlier cameras or scanners did not.
Denoising & Corrections
- Noise can be artificially reduced via software such as DxO PhotoLab 9, which handles denoising, correcting warped or curved images, and related adjustments.
- These techniques have their own shortcomings; denoising works by “softening” edges and textures within an image, potentially erasing small details. Optical corrections may slightly blur or reduce fidelity.
Post-Processing
- Exported prints and (optionally denoised) negatives are further adjusted via automation (batch cropping, bulk JPG conversion) and manual post-processing in Adobe Photoshop 2025.
  - Tasks include generating digital positives, exposure adjustment, leveling, and retouching dust, scratches, bubbling in the acetate, etc.
  - More detailed and extensive retouching is reserved for exhibition copies or high-visibility derivatives.
  - Edited files are kept in a Photoshop “wrapper” (PSD) that is lossless, to facilitate any further adjustments. PSDs are retained for 1 year after creation.

# 4. Storage, File Sizes, & DAMS

Local Storage
- All RAW files are automatically stored on MacOS drive via Capture One. Working files are kept on local MacBook Pro 2TB SSD, currently backed up to MBMA’s Google Drive account and an external Western Digital 12TB drive. Google Drive and WD Passport external drives also back up older files, PSDs, and any derivatives and copies.
- For increased stability, reliability, and more robust backup and retrieval options, suggest moving collection cloud backups to Backblaze B2. If the museum’s IT department expands, this backup storage could later be reconfigured for cost-effective DAMS use.
Collection Status
- There are 36 binders, with 20 flagged for digitization in August.
- About 50% of each binder is flagged for exhibition or catalogue. Digitization focuses first on flagged objects in those 20 binders.
- Approximately 5,000 images were digitized (including deteriorated nitrates/acetates) prior to the August batch.

File Sizes & Storage Scenarios

Format	Bit Depth	Average File Size
TIFF 16‑bit, 1200 ppi	16-bit	130 MB
TIFF 16‑bit, 4400 ppi	16-bit	360 MB
TIFF 8‑bit, 1200 ppi	8-bit	60 MB
JPEG derivative	8-bit	5 MB
Uncompressed RAW	16 bit	130 MB
PSD (working file)	16-bit	700 MB

Number of Images	16-bit 1200 ppi TIFF	16-bit 4400 ppi TIFF	8-bit 1200 ppi TIFF	RAW	JPEG
1,000	130 GB	360 GB	60 GB	130 GB	5 GB
50,000	6.5 TB	18 TB	3 TB	6.5 TB	250 GB
~187,500	9.75 TB	27 TB	4.5 TB	9.75 TB	375 GB

Storage costs rise as the collection grows. Costs of external drives recur as old drives are occasionally replaced.
Suggested stratification of images stored per object: all RAWs are retained.
- Select objects receive oversampled TIFFs for print or may be stitched together. Priority objects get full-resolution archival TIFFs.
- Less-critical images may be saved as 8-bit TIFFs or at 300 DPI for research/reference “discovery” versions.
- Versos saved selectively, e.g., only the negative copy of a verso as an 8-bit TIFF or 300 DPI.

Pics.io DAMS
- Pics.io serves as the primary cloud DAMS, providing controlled, role-based access, curated public sharing, and search across metadata, keywords, and full-text content.
- Stores TIFF masters and generates JPG derivatives on demand, supports batch ingest, bulk metadata updates, custom fields, and full version tracking.
- One TMS record likely links the entire collection to Pics.io.
AI-Driven Tasks (Credit-Based)
- Automatic generation of detailed object descriptions and keywords (e.g., “woman,” “portrait,” “wedding,” “bouquet”).
- OCR and audio/video transcription to make text searchable.
- Facial recognition to tag and group images of the same person.
- Intelligent descriptive search: allows natural-language queries to find images.
  - Could upload images outside the Hooks collection to train recognition for certain figures.
- Caveat: AI outputs may require manual review and correction to ensure accuracy and proper metadata assignment.

# 5. Additional / Optional Tasks

Naming Conventions
- e.g., “N” for nitrate, “P” for prints, “r/v” suffix for recto/verso, etc.
Outside Contractors and Interns

Flattening Approaches

Method	Pros	Cons
Platen	Flattens curved originals, consistent capture	Risk of Newton rings, pressure damage
Vacuum Table	Gentle flattening, good for film & prints	Expensive, suction risk to fragile items
Post-Processing	No physical contact, quick correction	Can blur, loses authenticity from artificial distortion

This keeps Draft 5 bullet fidelity, uses Draft 7 header names, and only edits phrasing for clarity/readability.

Do you want me to bold the sub-section headers like “Camera & RAW Capture” and “Pixels & Megapixels” throughout to fully match Draft 7 style?

Yes—I can do that. I’ll bold all sub-section headers so the style fully matches Draft 7 while keeping the Draft 5 bullets and content intact.