2017 Winter Project Week/IPFS NoSQL Combination

From NAMIC Wiki
Revision as of 01:47, 13 January 2017 by Hans.meine (talk | contribs) (add Satra)
Jump to: navigation, search
Home < 2017 Winter Project Week < IPFS NoSQL Combination

Key Investigators

  • Hans Meine (University of Bremen, Fraunhofer MEVIS = FME)
  • Steve Pieper (Isomics)
  • Satra Ghosh

Project Description

Objective Approach and Plan Progress and Next Steps
  • Evaluate IPFS / NoSQL combination for MIC databases
  • Evaluate IPFS' PSK feature for "private clouds"
  • Build prototype for scanning images
    • put images / files into IPFS
    • put metadata into NoSQL database (ElasticSearch is what we used at FME, CouchDB is what Steve used in Chronicle)
  • Build prototype for browsing / showing images
    • should update live when images appear in the DB
    • should fetch image data from IPFS
  • IPFS stability and status
  • Performed several experiments that showed disappointing performance (maybe MIT wifi related?), but eventually successful transfers
  • QmPyXW927iBPHVk3hfyzXAPGDpup26WGEh4LYK6da2xMhA is a TCGA subdirectory transferred to several project week participants' computers
  • Discussed interesting deduplication idea: Can original DICOM, anonymized DICOM and e.g. Nifty files share data blocks?
    • Answer: Yes, there's a --chunker argument to ipfs add which defines the chunking algorithm. Apparently, the rabin chunker should already perform well, without any particular knowledge about the file formats, but it does not seem to be the default algorithm. (Valid arguments include: 'rabin' 'rabin-[avg]' or 'rabin-[min]-[avg]-[max]' with integer parameters.)

Background and References