Skip to main content
The Daily Philadelphia

All of Philadelphia, every day

News

How Philadelphia's Digital Archives Ended Up Full of the Same Image Twice — And What the City Is Doing About It

Duplicate images have quietly bloated city databases and muddied public records for years; here's the paper trail that explains how Philadelphia got here.

Share

By Philadelphia News Desk · Published 4 July 2026, 2:48 PM

4 min read

Updated 4 h ago· 4 July 2026, 11:17 PM

How we reported this

This article was generated by AI from the linked public sources. The Daily Philadelphia is independently owned and covers Philadelphia news free from advertiser or sponsor influence. Read our editorial standards →

How Philadelphia's Digital Archives Ended Up Full of the Same Image Twice — And What the City Is Doing About It
Photo: Brown, David Paul, 1795-1872 / Public domain (Wikimedia Commons)

Philadelphia's Office of Innovation and Technology quietly flagged a persistent problem last year that had been accumulating since at least 2017: thousands of duplicate image files scattered across the city's public-facing digital archives, property databases, and departmental document repositories. The redundancy isn't a minor housekeeping nuisance. It has inflated storage costs, slowed retrieval times for permit records at the Department of Licenses and Inspections, and, in at least a handful of documented cases, caused the wrong property photograph to appear alongside the wrong parcel record in the city's Atlas mapping portal.

The timing matters because Philadelphia is mid-way through a broader digital infrastructure overhaul tied to the administration's five-year technology modernization plan, and duplicated image assets represent one of the messier inherited problems standing between the city and a cleaner, faster public data experience. With Fourth of July events cancelled across the region today due to record heat, city workers are largely off the streets — but inside Municipal Services Building on JFK Boulevard, IT contractors are running deduplication scripts that officials say have been months in the planning.

How the Duplication Problem Built Up Over Time

The roots of the issue go back to a 2014 migration of the city's legacy records systems, when departments transferred documents independently rather than through a unified pipeline. The Department of Records, headquartered at City Hall, and the Planning Commission, based on the 14th floor of 1515 Arch Street, each maintained separate image repositories with overlapping source material. When those repositories were later connected to shared portals, deduplication logic was never uniformly applied. A photograph taken of a rowhouse on Frankford Avenue in Kensington, for example, might exist as three separate files under slightly different filenames — each linked to a different internal workflow, none flagged as redundant.

The Philadelphia Land Bank, established under a 2013 ordinance, compounded the problem when it began ingesting property images from L&I, the Water Department, and community development organizations simultaneously. Each source agency had its own file-naming convention, which meant automated duplicate-detection tools — which typically rely on matching filenames or metadata — missed visually identical images that had been renamed or compressed differently at the source.

By 2022, an internal audit referenced in a City Controller's Office performance review found that roughly 18 percent of image assets in the city's primary property database were duplicates or near-duplicates. Storage costs for the affected systems ran to an estimated six figures annually, though the Controller's review did not publish a precise dollar figure for image storage alone.

What Replacement and Remediation Actually Looks Like

The current remediation effort, contracted through a vendor selected via the city's procurement process on the Pennsylvania eMarketplace platform, uses perceptual hashing — a technique that compares images based on visual content rather than filename — to identify matches. The process is being piloted first on the Atlas portal's property image layer, which is publicly accessible and covers more than 550,000 parcels across all of Philadelphia's neighborhoods, from Chestnut Hill in the northwest to Point Breeze in South Philadelphia.

Residents who use Atlas to research properties before buying or redeveloping them have reported mismatched photographs as far back as 2019 on the city's 311 feedback portal. The Neighborhood Advisory Committee network, which operates through the Planning Commission's civic engagement programs, has relayed complaints from community development corporations in Germantown and Strawberry Mansion specifically, where rapid turnover of vacant lots has made accurate parcel photography especially consequential for block planning efforts.

The practical upshot for Philadelphians is straightforward: the Atlas portal's property images should become more reliable over the coming months as the deduplication pass moves from pilot to full deployment, with the Office of Innovation and Technology targeting completion before the end of the third quarter of 2026. Anyone who spots a mislabeled or duplicated image in Atlas can still flag it through the city's 311 service — online, by phone, or through the Philly311 app — and the reports feed directly into the remediation queue that contractors are currently working through.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Philadelphia

Covering news in Philadelphia. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Philadelphia news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Philadelphia and accept our Privacy Policy. Unsubscribe anytime.