𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝘁 𝗩𝗲𝗿𝘀𝗶𝗼𝗻 𝗖𝗼𝗻𝘁𝗿𝗼𝗹 𝗪𝗶𝘁𝗵𝗼𝘂𝘁 𝗗𝘂𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗻𝗴 𝗙𝗶𝗹𝗲𝘀

📅1 hour ago⏱2 min read

Storing full copies of files for every version or fork wastes space. If you change one line in a project with ten files, you should not save all ten files again.

I faced this problem while building my LaTeX Writer project. I needed a way to handle version control and project forking without high storage costs.

I looked at how GitHub works. GitHub does not store a full repository every time you make a change. It stores content separately and uses references to link files and commits.

I built my system using three main components:

Metadata: This stores IDs for projects, owners, and folders.
File Records: These store file names and links to content.
Blobs: This is where the actual content lives.

The system works through content hashing. When you save a file, the system generates a unique ID based on the content. If the content already exists, the system reuses the existing Blob. It does not create a new one.

This approach makes forking easy and cheap. When you fork a project:

The system creates a new Project ID.
It creates new metadata for files and folders.
It points the new metadata to the existing Blobs.

No actual file content is copied during a fork. You only duplicate the small metadata records.

When you edit a fork, the process stays efficient:

You change the content.
The system hashes the new content.
It creates a new Blob only if that exact content does not exist.
The metadata for your fork points to the new Blob.
The original project still points to the old Blob.

This method provides several benefits:

Content deduplication saves massive amounts of space.
Forking happens instantly.
Version management stays organized.
Database growth stays slow.

You get GitHub-like functionality without the heavy storage overhead.

Source: https://dev.to/prashant_patil_49/building-github-inspired-version-control-and-forking-without-duplicating-project-files-5aap

𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝘁 𝗩𝗲𝗿𝘀𝗶𝗼𝗻 𝗖𝗼𝗻𝘁𝗿𝗼𝗹 𝗪𝗶𝘁𝗵𝗼𝘂𝘁 𝗗𝘂𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗻𝗴 𝗙𝗶𝗹𝗲𝘀

Continue reading

𝗕𝘂𝗶𝗹𝗱 𝗔 𝗗𝗲𝘃𝗲𝗹𝗼𝗽𝗲𝗿 𝗞𝗻𝗼𝘄𝗹𝗲𝗱𝗴𝗲 𝗕𝗮𝘀𝗲

𝗕𝘂𝗶𝗹𝗱 𝗬𝗼𝘂𝗿 𝗗𝗲𝘃𝗲𝗹𝗼𝗽𝗲𝗿 𝗞𝗻𝗼𝘄𝗹𝗲𝗱𝗴𝗲 𝗦𝘆𝘀𝘁𝗲𝗺

𝗔 𝗣𝗥𝗔𝗚𝗠𝗔𝗧𝗜𝗖 𝗚𝗜𝗧 𝗪𝗢𝗥𝗞𝗙𝗟𝗢𝗪 𝗙𝗢𝗥 𝗖𝗥𝗢𝗦𝗦 𝗥𝗘𝗣𝗢𝗦𝗜𝗧𝗢𝗥𝗬 𝗖𝗢𝗟𝗟𝗔𝗕𝗢𝗥𝗔𝗧𝗜𝗢𝗡

𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗮 𝗧𝗶𝗸𝗧𝗼𝗸 𝗩𝗶𝗱𝗲𝗼 𝗔𝗿𝗰𝗵𝗶𝘃𝗲 𝗦𝘆𝘀𝘁𝗲𝗺

𝗔𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗻𝗴 𝗢𝗽𝗲𝗻 𝗦𝗼𝘂𝗿𝗰𝗲 𝗪𝗶𝘁𝗵 𝗚𝗶𝘁𝗛𝘂𝗯