Save downloaded DB archives to disk before unzipping #700

aeisenberg · 2020-12-07T23:37:55Z

This fixes two classes of DBs that can't be installed directly from
downloading:

DBs whose central directories do not align with their file headers.
We need to download and save the entire archive before we can read
the central directory and use that to guide the unzipping.
Large DBs require too much memory so can't be downloaded and unzipped
in a single stream.

We also add proper progress notifications to the download progress
monitor so users are aware of how many more MBs are left to download.

It's not yet possible to do the same for unzipping using the current
unzipper library, since unzipping using the central directory does not
expose a stream.

Fixes #621
Fixes #622

Integration tests are passing and added two more for progress monitoring.

Manually tested on the linux database from lgtm https://lgtm.com/projects/g/torvalds/linux/ci and it is working.

Checklist

CHANGELOG.md has been updated to incorporate all user visible changes made by this pull request.
Issues have been created for any UI or other user-facing changes made by this pull request.
[n/a] @github/docs-content-dsp has been cc'd in all issues for UI or other user-facing changes made by this pull request.

This fixes two classes of DBs that can't be installed directly from downloading: 1. DBs whose central directories do not align with their file headers. We need to download and save the entire archive before we can read the central directory and use that to guide the unzipping. 2. Large DBs require too much memory so can't be downloaded and unzipped in a single stream. We also add proper progress notifications to the download progress monitor so users are aware of how many more MBs are left to download. It's not yet possible to do the same for unzipping using the current unzipper library, since unzipping using the central directory does not expose a stream.

alexet

LGTM. I tried to resolve the merge conflict in the ui but have realised it would have probably been cleaner for you to rebase instead. Feel free to remove my merge commit and rebase instead.

alexet · 2020-12-14T16:26:04Z

Actually squash and merge will do the right thing.

aeisenberg · 2020-12-14T16:26:45Z

Thanks for the review!

This fixes two classes of DBs that can't be installed directly from downloading: 1. DBs whose central directories do not align with their file headers. We need to download and save the entire archive before we can read the central directory and use that to guide the unzipping. 2. Large DBs require too much memory so can't be downloaded and unzipped in a single stream. We also add proper progress notifications to the download progress monitor so users are aware of how many more MBs are left to download. It's not yet possible to do the same for unzipping using the current unzipper library, since unzipping using the central directory does not expose a stream. Co-authored-by: Alexander Eyers-Taylor <[email protected]>

aeisenberg requested a review from adityasharad December 7, 2020 23:37

aeisenberg marked this pull request as draft December 7, 2020 23:38

aeisenberg force-pushed the aeisenberg/download-to-file branch from 9571223 to 747ff45 Compare December 8, 2020 00:25

aeisenberg marked this pull request as ready for review December 8, 2020 00:27

aeisenberg mentioned this pull request Dec 8, 2020

AST Viewing unable to use extracted database bundle's src.zip #699

Closed

alexet self-requested a review December 14, 2020 16:22

Merge branch 'main' into aeisenberg/download-to-file

1e4b984

alexet approved these changes Dec 14, 2020

View reviewed changes

alexet merged commit 9ffb3a1 into github:main Dec 14, 2020

aeisenberg deleted the aeisenberg/download-to-file branch December 14, 2020 16:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Save downloaded DB archives to disk before unzipping #700

Save downloaded DB archives to disk before unzipping #700

aeisenberg commented Dec 7, 2020 •

edited

Loading

alexet left a comment

alexet commented Dec 14, 2020

aeisenberg commented Dec 14, 2020

Save downloaded DB archives to disk before unzipping #700

Save downloaded DB archives to disk before unzipping #700

Conversation

aeisenberg commented Dec 7, 2020 • edited Loading

Checklist

alexet left a comment

Choose a reason for hiding this comment

alexet commented Dec 14, 2020

aeisenberg commented Dec 14, 2020

aeisenberg commented Dec 7, 2020 •

edited

Loading