Open Distributed Scientific Annotations Cloud


Each reader of scientific paper can publish their annotations to a distributed public annotations cloud, others can load as they read, and discuss.


So, let's say you are reading a paper, have ideas and annotations during the process of reading. You click (or point) to the location, where you want to add an annotation. The system takes the context of the location on the paper (e.g., the reader extracts large enough context of surrounding words or sentences, which uniquely identifies the location, and allows later display the same annotation around the same text in other formats - be it HTML on the web, or other. If that's a picture, then the picture features are extracted via, and the pixel location, allowing to display the same annotation on top of the same image in other formats). Essentially, we would have the context IDs and coordinates with context associated with feature sets, with 1:1 correspondence between context IDs and feature sets, and 1:many correspondence between context ID and annotation.

Then, who-ever reads the paper, in what-ever reader, they could load public annotations, browse their history. This would be nice to have a conversation per annotation. E.g., each annotation creates a possibility for thread of comments. Inside the comments, you could refer to other annotations.

Moreover, each paper would have its paperid generated based on the feature extraction from the paper's text, especially title, summary, and, if there exists, just use the DOI. It seems good to make such system as widely usable as possible, not just for scientific papers, but for any PDFs in general.

Hopefully, this would make reading papers not a lonely activity at all, and cross-pollination of ideas lead to many new developments.


(別通知) (可選) 請,登錄

是重要的位置還是要註釋的實際文本片段?也許您可以以某種方式散列文本片段,然後處理任何文檔(主要是任何格式,只是提取文本)並生成可以在其中註釋的片段散列集(使用一些巧妙的滾動散列形式) )。然後使用它來獲取使用某種形式的內容尋址系統爲其創建的所有註釋。


Is it the location that is important, or the actual text snippet being annotated? Maybe you can somehow hash the text snippet, and then process any document (mostly in any format, just extracting the text) and produce the set of hashes of snippets that could ever be annotated in it (using some clever form of rolling hash or so). Then use that to fetch all the annotations ever created for it using some form of content-addressing system.

Your idea reminds me a bit of Xanadu.

    : Mindey, 尹與及
    :  -- 
    :  -- 



Maybe this idea could be extended to any content published on the web?