Skip to main content
The Daily Philadelphia

All of Philadelphia, every day

News

Philadelphia's Archives and City Agencies Move to Fix a Yearslong Duplicate Image Problem

A citywide effort to clean up redundant and mislabeled digital photographs in municipal records hit a significant milestone this week, with two major departments completing their first full database audits.

Share

By Philadelphia News Desk · Published 4 July 2026, 2:36 PM

4 min read

Updated 10 h ago· 4 July 2026, 4:56 PM

How we reported this

This article was generated by AI from the linked public sources. The Daily Philadelphia is independently owned and covers Philadelphia news free from advertiser or sponsor influence. Read our editorial standards →

Philadelphia's Archives and City Agencies Move to Fix a Yearslong Duplicate Image Problem
Photo: Breck, Samuel, 1771-1862. [from old catalog] / Public domain (Wikimedia Commons)

Philadelphia's Department of Records and the Office of Innovation and Technology wrapped up a joint audit this week that identified more than 14,000 duplicate image files sitting inside the city's central digital asset management system — redundant photographs, scanned documents, and mislabeled graphics that had accumulated across municipal databases since at least 2019. The cleanup, which officials have been quietly working through since January, now moves into an active deletion and reclassification phase.

The timing matters. Philadelphia is midway through a broader effort to modernize how city agencies store and share public records, a push that gained urgency after a 2024 controller's office review flagged storage inefficiencies costing the city money in cloud infrastructure fees. Duplicate image files, which sound like a minor housekeeping problem, in practice slow down public-records requests, confuse archival searches, and inflate the cost of maintaining servers that now hold records for more than two dozen city departments.

What the Audit Found

The audit covered image libraries maintained by the Department of Licenses and Inspections, the Philadelphia Water Department, and the Parks and Recreation Department, among others. L&I alone had logged more than 4,800 duplicate photographs of building inspections in the Kensington and Fishtown neighborhoods — images uploaded multiple times by different inspectors using the city's PermitPHL platform without a deduplication check built into the workflow. Parks and Recreation had a smaller but still substantial backlog of redundant photographs tied to restoration work at Clark Park in West Philadelphia and Pennypack Park in the Northeast.

The problem was compounded by a software migration the city undertook in early 2022, when several departments moved from legacy on-premise servers to a shared cloud environment managed through a contract with a third-party vendor. During that migration, files were copied rather than moved in many cases, effectively doubling entire folders of images overnight. The Office of Innovation and Technology did not catch the issue until a routine storage review in late 2024.

Philadelphia's digital storage bill for municipal records has grown considerably in recent years. The city's Five-Year Financial Plan, released in March 2026, listed cloud infrastructure and data management as one of the fastest-growing line items in the technology budget, with projected spending reaching roughly $18 million annually by fiscal year 2028. Reducing unnecessary file duplication is one lever city officials have identified to keep those costs from accelerating further.

Next Steps for City Departments

Starting the week of July 7, the Office of Innovation and Technology will deploy an automated deduplication tool across three pilot departments — L&I, the Water Department, and the Revenue Department — before rolling it out citywide in the fall. The tool uses hash-matching to identify identical files regardless of filename, a method that should catch duplicates that manual reviews missed. Departments will have 30 days to flag any files marked for deletion before they are permanently removed.

The Free Library of Philadelphia's Digital Collections team, which manages a separate but adjacent archive of historical city photographs housed at the Parkway Central branch on Vine Street, is not part of this municipal cleanup but has been watching the process closely. Library archivists have their own deduplication backlog involving digitized prints from the now-closed South Philadelphia neighborhood photography collections, and they are in early conversations with the Office of Innovation and Technology about whether the city's new tool could be adapted for library use under a shared services arrangement.

For residents who file public-records requests through the city's Right-to-Know portal, the practical effect of the cleanup may not be immediately visible — but officials say search response times within internal systems should improve once the bulk of duplicates are cleared. Anyone with a pending records request that involves photographs or scanned inspection documents is advised to check the city's records portal at phila.gov/records for status updates as the deletion phase begins next week.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Philadelphia

Covering news in Philadelphia. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Philadelphia news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Philadelphia and accept our Privacy Policy. Unsubscribe anytime.