BioTorrents.de’s version of Gazelle
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

upload.php 22KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440
  1. <?php
  2. //Include the header
  3. View::show_header('Uploading Rules', 'rules');
  4. ?>
  5. <!-- Upload -->
  6. <div class="thin">
  7. <div class="header">
  8. <h2>Upload</h2>
  9. </div>
  10. <!-- Uploading Rules Index Links -->
  11. <br />
  12. <form class="search_form" name="rules" onsubmit="return false" action="">
  13. <input type="text" id="search_string" value="Filter (empty to reset)" />
  14. <span id="Index">Searching for <strong>upload</strong> returns all rules containing that term.
  15. Searching for <strong>upload+trump</strong> returns all rules containing both terms</span>
  16. </form>
  17. <br />
  18. <div class="before_rules">
  19. <div class="box pad" style="padding: 10px 10px 10px 20px;">
  20. <ul>
  21. <li id="Introk"><a href="#Intro"><strong>Introduction</strong></a></li>
  22. <li id="defk"><a href="#def"><strong>Definitions</strong></a></li>
  23. <li id="hUk"><a href="#hU"><strong>Upload Rules</strong></a>
  24. <ul>
  25. <li id="h1.1k"><a href="#h1.1">1.1 <strong>General and Formatting</strong></a></li>
  26. <li id="h2.2k"><a href="#h1.2">1.2 <strong>Duplicates and Trumping</strong></a></li>
  27. <li id="h2.1k"><a href="#h1.3">1.3 <strong>Upload Form Walkthrough</strong></a></li>
  28. </ul>
  29. </li>
  30. </ul>
  31. </div>
  32. </div>
  33. <!-- Actual Uploading Rules -->
  34. <div id="actual_rules">
  35. <div class="before_rules">
  36. <h4 id="Intro"><a href="#Introk"><strong>&uarr;</strong></a> Introduction</h4>
  37. <div class="box pad" style="padding: 10px 10px 10px 20px;">
  38. <p>
  39. <?= SITE_NAME ?> is the global DIYbio community's
  40. distributed data repository.
  41. The content includes richly annotated and searchable biological sequence and medical imaging data.
  42. It follows the example of private BitTorrent sites to
  43. <a href="https://www.cambridge.org/core/services/aop-cambridge-core/content/view/2F379FE0CB50DF502F0075119FD3E060/S1744137417000650a.pdf/institutional_solutions_to_freeriding_in_peertopeer_networks_a_case_study_of_online_pirate_communities.pdf"
  44. target="_blank">address the free-rider problem</a>
  45. without recourse to institutional funding.
  46. </p>
  47. <p>
  48. Please read this entire page carefully because it explains how the tracker organizes the content.
  49. Referring to this page often will help you search faster and upload smarter.
  50. I'll also go line-by-line through <a href="upload.php">the upload form</a>.
  51. </p>
  52. <p>
  53. Thanks for taking an interest in this project and contributing to its success.
  54. Please note that <?= SITE_NAME ?> isn't a pirate website.
  55. </p>
  56. </div>
  57. <h4 id="def"><a href="#defk"><strong>&uarr;</strong></a> Definitions</h4>
  58. <div class="box pad" style="padding: 10px 10px 10px 20px;">
  59. <ul>
  60. <li><b>Torrent.</b> Broadly used as a noun to describe a <code>.torrent</code> file, the files associated with
  61. it, and any associated metadata indexed by the site. Used as a verb to describe the act of downloading or
  62. uploading data from or to the swarm.</li>
  63. <li><b>Swarm.</b> All peers associated with a given torrent.</li>
  64. <li><b>Peer.</b> A client that has announced to the tracker and is part of the swarm.</b>
  65. <li><b>Seed.</b> When used as a verb, describes the act of uploading torrent content to other peers. When used
  66. as a noun, describes a peer who has all of content associated with a torrent as is able to upload to peers.
  67. Sometimes referred to as a seeder.</li>
  68. <li><b>Leech.</b> When used as a verb, describes the act of downloading torrent content from another peer.
  69. When used as a noun, describes someone who is downloading or wants to download torrent content from another
  70. peer. Sometimes referred to as a leecher.</li>
  71. <li><b>Metadata.</b> The information we record here on the site for each torrent, such as title, encoding
  72. information, and tags.</li>
  73. <li><b>Hentai.</b> A subgenre of anime, manga, and games characterized by being pornographic.</li>
  74. </ul>
  75. </div>
  76. </div>
  77. <h4 id="hU"><a href="#hUk"><strong>&uarr;</strong></a> <a href="#hU"></a> Upload Rules</h4>
  78. <h5 id="h1"><a href="#h1k"><strong>&uarr;</strong></a> <a href="#h1">1</a> General and Formatting</h5>
  79. <div class="box pad" style="padding: 10px 10px 10px 20px;">
  80. <ul>
  81. <li id="r1.1"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.1">1.1</a>
  82. <strong>Biology Only.</strong>
  83. <?=SITE_NAME?> is an annotated repository of biology data
  84. and a bioinformatics learning community.
  85. Gazelle in its current state requires lots of hardcoded metadata.
  86. I can help you adapt the design, e.g., for physics or astronomy data.
  87. A generalized science tracker is in development.
  88. </li>
  89. <li id="r1.2"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.2">1.2</a>
  90. <strong>Seed Forever.</strong>
  91. Private torrent trackers succeed when they offer quality niche content and a comfy interface.
  92. This isn't an NCBI data dump but a library of annotated info hashes that tomorrow's networks can ingest.
  93. Do not upload a torrent unless you intend to seed until there are at least 3 copies.
  94. Three is a good minimum swarm size.
  95. </li>
  96. <li id="r1.3"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.3">1.3</a>
  97. <strong>No Advertising.</strong>
  98. Please don't "tag" torrents, include ASCII art, or make your torrents look like they came from the Pirate Bay.
  99. These kinds of additions are allowed if they serve a relevant purpose.
  100. Enclosing a GPG-signed hash of your data isn't a bad idea at all.
  101. <ul>
  102. </ul>
  103. </li>
  104. <li id="r1.4"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.4">1.4</a>
  105. <strong>Speak English.</strong>
  106. <?=SITE_NAME?> is an Anglophone site.
  107. Everything but private messages, and especially torrents and the forums, should be in English.
  108. </li>
  109. <li id="r1.5"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.5">1.5</a>
  110. <strong>Good Data.</strong>
  111. Strive to release complete collections of the highest fidelity data in the most sensible format.
  112. Sometimes I wonder whether certain kinds of people are drawn to private torrent trackers, or if the site
  113. design encourages otherwise disinterested people to do well.
  114. </li>
  115. <li id="r1.6"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.6">1.6</a>
  116. <strong>No DRM.</strong>
  117. Archived releases must not be password protected.
  118. DRM of any kind isn't allowed.
  119. </li>
  120. <hr style="margin: 2em auto; opacity: 0.3; width: 50%;" />
  121. <li id="r1.7"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.7">1.7</a>
  122. <strong>Folder Structure.</strong>
  123. Each torrent should be a single folder so we can manage them easier.
  124. Please avoid unnecessary nested folders inside your torrent.
  125. Use one of the examples below for your main folder.
  126. <ul>
  127. <li>One-Shot Project Name</li>
  128. <li>Torrent Title - Accession Number</li>
  129. <li>Department/Lab - Project Name</li>
  130. <li>After the Name - Extra comments as necessary</li>
  131. </ul>
  132. I also strongly recommend you compress only large files or long image series, and not simply compress the
  133. entire dataset.
  134. This makes it easier to partially seed large datasets, work with discrete parts of the data, and know what's
  135. on disk.
  136. </li>
  137. <li id="r1.8"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.8">1.8</a>
  138. <strong>File Organization.</strong>
  139. Please either keep the original filenames from the processing service, or consistently use a legible naming
  140. scheme.
  141. Remove all .DS_Store, Thumbs.db, nfo files, and other junk files before making the torrent.
  142. It should be "clean."
  143. You're encouraged to keep Git repos, structured data reports, readmes, and other useful annotations.
  144. Files should sort appropriately: use leading zeroes.
  145. </li>
  146. <li id="r1.9"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.9">1.9</a>
  147. <strong>Compression.</strong>
  148. "10 GiB or 10,000 files."
  149. Compression is required if your torrent is > 10 GiB or if it contains > 10,000 files.
  150. Otherwise, please compress text files if it reduces the torrent size by > 30%.
  151. Never compress torrents < 2 GiB or binary files < 10 GiB. Multipart archives are only allowed for torrents > 10
  152. GiB.
  153. </li>
  154. <li id="r1.10"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.10">1.10</a>
  155. <strong>Metadata.</strong>
  156. Avoid matching folder names to <?=SITE_NAME?> metadata.
  157. The site design will change but the torrents are evergreen.
  158. When What.cd went down, it was possible to seed most of your old torrents at Redacted.ch because the info
  159. hashes matched.
  160. The <code>.torrent</code> points to cryptographically verified folders and files that are tracked for
  161. convenience.
  162. On the flipside, please add enough metadata so that people can pick it out of a list.
  163. </li>
  164. <li id="r1.11"><a href="#h1"><strong>&uarr;</strong></a> <a href="#r1.11">1.11</a>
  165. <strong>Supplemental Packs.</strong>
  166. I strongly recommend <a href="https://semver.org/" target="_blank">Semantic Versioning</a> for your original
  167. data.
  168. Supplemental packs may include a collection of documents, utilities, protocols, metadata, etc., specifically
  169. prepared for release.
  170. The collection should be a separate torrent if the collection constitutes a project in its own right.
  171. But if you have, e.g., 100 MiB of XML and no other metadata for 5 GB of photos, please include it in the main
  172. torrent.
  173. </li>
  174. </ul>
  175. </div>
  176. <h5 id="h2"><a href="#h2k"><strong>&uarr;</strong></a> <a href="#h2">2</a> Duplicates and Trumping</h5>
  177. <div class="box pad" style="padding: 10px 10px 10px 20px;">
  178. <ul>
  179. <li id="r2.1"><a href="#h2"><strong>&uarr;</strong></a> <a href="#r2.1">2.1</a>
  180. <strong>Multiple Formats Allowed.</strong>
  181. It's fine if there's an EMBL and a FASTA of the same data. If you need to convert a dataset for your analysis,
  182. please upload a quality conversion with supplemetal info.
  183. Remember that small, one-shot metadata are better included with the data, and collections of docs and utils
  184. are better separate from it.
  185. If only the header is different and it follows the "10 GiB or 10,000 files" rule, uncompressed torrents trump
  186. compressed ones.
  187. It should be easy for others with the same data to change a line and check out the new torrent.
  188. </li>
  189. <li id="r2.2"><a href="#h2"><strong>&uarr;</strong></a> <a href="#r2.2">2.2</a>
  190. <strong>SemVer Trumps.</strong>
  191. Versioned data can be trumped at the patch level.
  192. Major and minor releases can coexist.
  193. Please add a link in the old data's Torrent Group Description to the new data.
  194. Git repos can be trumped at the commit and patch levels.
  195. Then normal SemVer rules take effect for properly tagged releases.
  196. </li>
  197. <li id="r2.3"><a href="#h2"><strong>&uarr;</strong></a> <a href="#r2.3">2.3</a>
  198. <strong>Report Trumps and Dupes.</strong>
  199. If you trump a torrent or notice a duplicate torrent, please use the report link [RP] to notify staff to
  200. remove it.
  201. If you are uploading a superior version, e.g., without watermarks, report the older torrent and include a link
  202. to your new torrent.
  203. Your torrent may be deleted as a dupe if the older torrent is not reported.
  204. </li>
  205. <li id="r2.4"><a href="#h2"><strong>&uarr;</strong></a> <a href="#r2.4">2.4</a>
  206. <strong>Two Weeks Unseeded.</strong>
  207. If you have the original torrent files for the inactive torrent, reseed those original files instead of
  208. uploading a new torrent.
  209. Uploading a replacement torrent should be done only when the original files are unavailable.
  210. </li>
  211. <li id="r2.5"><a href="#h2"><strong>&uarr;</strong></a> <a href="#r2.5">2.5</a>
  212. <strong>Watermarks.</strong>
  213. Data without watermarks trumps watermarked data.
  214. </ul>
  215. </div>
  216. <h5 id="h3"><a href="#h3k"><strong>&uarr;</strong></a> <a href="#h3">3</a> Upload Form Walkthrough</h5>
  217. <div class="box pad" style="padding: 10px 10px 10px 20px;">
  218. <ul>
  219. <li id="r3.1"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.1">3.1</a>
  220. <strong>Torrent File.</strong>
  221. Add the announce URL as the only item on the tracker list and click the checkbox marked private.
  222. Please don't share <code>.torrent</code> files because they have your passkey embedded in them.
  223. This passkey lets the tracker know who's uploading and downloading, and leaking it will nuke your ratio.
  224. <br /><br />
  225. </li>
  226. <li id="r3.2"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.2">3.2</a>
  227. <strong>Type.</strong>
  228. The categories loosely follow the central dogma.
  229. It depends on what alphabet the sequence uses, e.g., ACGT vs. ACGU.
  230. All medical imaging data goes in the Imaging category.
  231. All extra documentation, custom scripts, etc., goes in Documents.
  232. Plasmids and things that don't quite fit go in Other.
  233. <br /><br />
  234. </li>
  235. <li id="r3.3"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.3">3.3</a>
  236. <strong>Accession Number.</strong>
  237. Please add accession numbers when the data source issued them.
  238. The number can be any format but it must correspond to the actual torrent data.
  239. Don't add accession numbers just because the metadata matches.
  240. RefSeq and UniProt integration is in development.
  241. <br /><br />
  242. </li>
  243. <li id="r3.4"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.4">3.4</a>
  244. <strong>Torrent Title.</strong>
  245. A short description of the torrent contents.
  246. It doesn't need to match the folders but it should tell you what the data is at a glance.
  247. Please avoid adding other metadata such as Strain, Platform, etc., with a dedicated field.
  248. <br /><br />
  249. </li>
  250. <li id="r3.5"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.5">3.5</a>
  251. <strong>Organism.</strong>
  252. The relevant organism's binomial name and optional subspecies.
  253. Please use <em>Genus species subspecies</em> and no terms such as var. and subsp.
  254. Multiple organisms and a FASTA/GenBank parser are both in development.
  255. <br /><br />
  256. </li>
  257. <li id="r3.6"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.6">3.6</a>
  258. <strong>Strain/Variety.</strong>
  259. The strain's name if known.
  260. This should correspond to a specific cell line or breed.
  261. Otherwise, please omit it.
  262. <br /><br />
  263. </li>
  264. <li id="r3.7"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.7">3.7</a>
  265. <strong>Authors(s).</strong>
  266. The Author field should contain only the author name.
  267. The upload form supports multiple authors.
  268. Do a search before uploading to get their names right.
  269. Consistent author naming makes browsing easier because it groups torrents on a common page.
  270. ORCiD integration is in development.
  271. <br /><br />
  272. </li>
  273. <li id="r3.8"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.8">3.8</a>
  274. <strong>Department/Lab.</strong>
  275. The lab that did the experiments or the last author's home lab.
  276. Please use "Unaffiliated" for anonymous or unknown labs.
  277. <br /><br />
  278. </li>
  279. <li id="r3.9"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.9">3.9</a>
  280. <strong>Location.</strong>
  281. The lab's physical location in one of the below formats.
  282. <ul>
  283. <li>{City}, {State} {Postal Code}</li>
  284. <li>{Postal Code} {City}, {Country}</li>
  285. </ul>
  286. For example, Berkeley, CA 94720 or 10117 Berlin, Germany.
  287. It's okay to use the American style if the foreign address uses the same format.
  288. Please use "Unknown" for anonymous or unknown labs.
  289. <br /><br />
  290. </li>
  291. <li id="r3.10"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.10">3.10</a>
  292. <strong>Year.</strong>
  293. The year the data was first published.
  294. The publication that announced the data.
  295. <br /><br />
  296. </li>
  297. <li id="r3.11"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.11">3.11</a>
  298. <strong>Platform.</strong>
  299. The class of technology the data comes from.
  300. What sequencing or imaging technique is it the output of?
  301. <a
  302. href="forums.php?action=viewforum&forumid=<?= SUGGESTIONS_FORUM_ID ?>">Please
  303. post in the suggestions forum</a>
  304. if you'd like to request a new platform.
  305. Note that the platforms change for the Imaging category.
  306. <br /><br />
  307. </li>
  308. <li id="r3.12"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.12">3.12</a>
  309. <strong>Format.</strong>
  310. The file format of the data.
  311. What programs do you need to work with the data?
  312. <a
  313. href="forums.php?action=viewforum&forumid=<?= SUGGESTIONS_FORUM_ID ?>">Please
  314. post in the suggestions forum</a>
  315. if you'd like to request a new format.
  316. Note that the formats change for the Imaging category.
  317. A file extension parser is in development.
  318. <br /><br />
  319. </li>
  320. <li id="r3.13"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.13">3.13</a>
  321. <strong>Assembly Level.</strong>
  322. The resolution of the data.
  323. How much information about the organism does it represent?
  324. The options correspond in higher conceptual language to:
  325. a single piece of information, structural information, especially deep or broad information, and an exhaustive
  326. source.
  327. Please use the Other option if you'd like to enter a resolution such as "420 subjects."
  328. <br /><br />
  329. </li>
  330. <li id="r3.14"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.14">3.14</a>
  331. <strong>License.</strong>
  332. <?= SITE_NAME ?> only allows permissive licenses.
  333. If your data is original, please consider licensing it under one of the available options.
  334. The "Unspecified" option is for compatibility with existing releases.
  335. <br /><br />
  336. </li>
  337. <li id="r3.15"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.15">3.15</a>
  338. <strong>Tags.</strong>
  339. Please select at least five appropriate tags.
  340. Don't use irrelevant tags, and consider making new tags as a last resort.
  341. <a
  342. href="forums.php?action=viewforum&forumid=<?= SUGGESTIONS_FORUM_ID ?>">Please
  343. post in the suggestions forum</a>
  344. if you'd like to request a new official tag.
  345. <br /><br />
  346. </li>
  347. <li id="r3.16"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.16">3.16</a>
  348. <strong>Picture.</strong>
  349. Please upload a meaningful picture, especially if you plan to add the torrent to a collection.
  350. A photo of the sequence sample or a representative photo of the organism; an example (preferably not a
  351. thumbnail collection) from an imaging dataset;
  352. a screenshot of a useful table from the publication; or another similarly informative picture.
  353. <br /><br />
  354. </li>
  355. <li id="r3.17"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.17">3.17</a>
  356. <strong>Publications.</strong>
  357. DOI numbers should be well-formed, one per line.
  358. The system currently discards malformed DOI numbers instead of extracting them from arbitrary strings.
  359. An auto-extract feature is in development.
  360. If your research is a URI, please use the Torrent Group Description field.
  361. <br /><br />
  362. </li>
  363. <li id="r3.18"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.18">3.18</a>
  364. <strong>Torrent Group Description.</strong>
  365. General info about the object of study's function or significance.
  366. This is the main body text on a torrent's page.
  367. Please limit the contents of this field to concise and interesting knowledge.
  368. <br /><br />
  369. </li>
  370. <li id="r3.19"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.19">3.19</a>
  371. <strong>Torrent Description.</strong>
  372. Specific info about the protocols and equipment relevant to <em>this</em> data.
  373. This text is hidden by default.
  374. It displays when you click the Torrent Title next to [ DL | ... | PL ].
  375. Please discuss materials and methods here.
  376. <br /><br />
  377. </li>
  378. <li id="r3.20"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.20">3.20</a>
  379. <strong>Aligned Sequence.</strong>
  380. Does the data come with any metadata of an analytical nature, such as alignment data (mandatory)?
  381. If so, does the torrent folder contain the scripts used to generate the metadata (optional)?
  382. <br /><br />
  383. </li>
  384. <li id="r3.21"><a href="#h3"><strong>&uarr;</strong></a> <a href="#r3.21">3.21</a>
  385. <strong>Upload Anonymously.</strong>
  386. You'll still get upload credit even if you hide your username from the torrent details.
  387. I believe it's still visible to sysops.
  388. </li>
  389. </ul>
  390. </div>
  391. <!-- END Other Sections -->
  392. <?php include('jump.php'); ?>
  393. </div>
  394. <?php
  395. View::show_footer();