docs/known_issues.rst

   1 ==============
   2  Known Issues
   3 ==============
   4
   5 Below is a list of known issues in recent releases of Tahoe-LAFS, and how to
   6 manage them.  The current version of this file can be found at
   7 `<http://tahoe-lafs.org/source/tahoe-lafs/trunk/docs/known_issues.rst>`_.
   8
   9 If you've been using Tahoe-LAFS since v1.1 (released 2008-06-11) or if you're
  10 just curious about what sort of mistakes we've made in the past, then you might
  11 want to read `the "historical known issues" document
  12 <historical/historical_known_issues.txt>`_.
  13
  14 Issues in Tahoe-LAFS v1.8.2, released 2011-01-30
  15
  16   *  `Unauthorized deletion of an immutable file by its storage index`_
  17   *  `Potential unauthorized access by JavaScript in unrelated files`_
  18   *  `Potential disclosure of file through embedded hyperlinks or JavaScript in that file`_
  19   *  `Command-line arguments are leaked to other local users`_
  20   *  `Capabilities may be leaked to web browser phishing filter / "safe browsing" servers`_
  21   *  `Known issues in the FTP and SFTP frontends`_
  22   *  `Traffic analysis based on sizes of files/directories, storage indices, and timing`_
  23
  24 Unauthorized deletion of an immutable file by its storage index
  25 ---------------------------------------------------------------
  26
  27 Due to a flaw in the Tahoe-LAFS storage server software in v1.3.0 through
  28 v1.8.2, a person who knows the "storage index" that identifies an immutable
  29 file can cause the server to delete its shares of that file.
  30
  31 If an attacker can cause enough shares to be deleted from enough storage
  32 servers, this deletes the file.
  33
  34 This vulnerability does not enable anyone to read file contents without
  35 authorization (confidentiality), nor to change the contents of a file
  36 (integrity).
  37
  38 A person could learn the storage index of a file in several ways:
  39
  40 1. By being granted the authority to read the immutable file—i.e. by being
  41    granted a read capability to the file. They can determine the file's
  42    storage index from its read capability.
  43
  44 2. By being granted a verify capability to the file. They can determine the
  45    file's storage index from its verify capability. This case probably
  46    doesn't happen often because users typically don't share verify caps.
  47
  48 3. By operating a storage server, and receiving a request from a client that
  49    has a read cap or a verify cap. If the client attempts to upload,
  50    download, or verify the file with their storage server, even if it doesn't
  51    actually have the file, then they can learn the storage index of the file.
  52
  53 4. By gaining read access to an existing storage server's local filesystem,
  54    and inspecting the directory structure that it stores its shares in. They
  55    can thus learn the storage indexes of all files that the server is holding
  56    at least one share of. Normally only the operator of an existing storage
  57    server would be able to inspect its local filesystem, so this requires
  58    either being such an operator of an existing storage server, or somehow
  59    gaining the ability to inspect the local filesystem of an existing storage
  60    server.
  61
  62 how to manage it
  63 ~~~~~~~~~~~~~~~~
  64
  65 Tahoe-LAFS version v1.8.3 or newer (except v1.9a1) no longer has this flaw;
  66 if you upgrade a storage server to a fixed release then that server is no
  67 longer vulnerable to this problem.
  68
  69 Note that the issue is local to each storage server independently of other
  70 storage servers—when you upgrade a storage server then that particular
  71 storage server can no longer be tricked into deleting its shares of the
  72 target file.
  73
  74 If you can't immediately upgrade your storage server to a version of
  75 Tahoe-LAFS that eliminates this vulnerability, then you could temporarily
  76 shut down your storage server. This would of course negatively impact
  77 availability—clients would not be able to upload or download shares to that
  78 particular storage server while it was shut down—but it would protect the
  79 shares already stored on that server from being deleted as long as the server
  80 is shut down.
  81
  82 If the servers that store shares of your file are running a version of
  83 Tahoe-LAFS with this vulnerability, then you should think about whether
  84 someone can learn the storage indexes of your files by one of the methods
  85 described above. A person can not exploit this vulnerability unless they have
  86 received a read cap or verify cap, or they control a storage server that has
  87 been queried about this file by a client that has a read cap or a verify cap.
  88
  89 Tahoe-LAFS does not currently have a mechanism to limit which storage servers
  90 can connect to your grid, but it does have a way to see which storage servers
  91 have been connected to the grid. The Introducer's front page in the Web User
  92 Interface has a list of all storage servers that the Introducer has ever seen
  93 and the first time and the most recent time that it saw them. Each Tahoe-LAFS
  94 gateway maintains a similar list on its front page in its Web User Interface,
  95 showing all of the storage servers that it learned about from the Introducer,
  96 when it first connected to that storage server, and when it most recently
  97 connected to that storage server. These lists are stored in memory and are
  98 reset to empty when the process is restarted.
  99
 100 See ticket `#1528 <http://tahoe-lafs.org/trac/tahoe-lafs/ticket/1528>`_ for
 101 technical details.
 102
 103
 104 Potential unauthorized access by JavaScript in unrelated files
 105 --------------------------------------------------------------
 106
 107 If you view a file stored in Tahoe-LAFS through a web user interface,
 108 JavaScript embedded in that file might be able to access other files or
 109 directories stored in Tahoe-LAFS which you view through the same web
 110 user interface.  Such a script would be able to send the contents of
 111 those other files or directories to the author of the script, and if you
 112 have the ability to modify the contents of those files or directories,
 113 then that script could modify or delete those files or directories.
 114
 115 how to manage it
 116 ~~~~~~~~~~~~~~~~
 117
 118 For future versions of Tahoe-LAFS, we are considering ways to close off
 119 this leakage of authority while preserving ease of use -- the discussion
 120 of this issue is ticket `#615 <http://tahoe-lafs.org/trac/tahoe-lafs/ticket/615>`_.
 121
 122 For the present, either do not view files stored in Tahoe-LAFS through a
 123 web user interface, or turn off JavaScript in your web browser before
 124 doing so, or limit your viewing to files which you know don't contain
 125 malicious JavaScript.
 126
 127
 128 Potential disclosure of file through embedded hyperlinks or JavaScript in that file
 129 -----------------------------------------------------------------------------------
 130
 131 If there is a file stored on a Tahoe-LAFS storage grid, and that file
 132 gets downloaded and displayed in a web browser, then JavaScript or
 133 hyperlinks within that file can leak the capability to that file to a
 134 third party, which means that third party gets access to the file.
 135
 136 If there is JavaScript in the file, then it could deliberately leak
 137 the capability to the file out to some remote listener.
 138
 139 If there are hyperlinks in the file, and they get followed, then
 140 whichever server they point to receives the capability to the
 141 file. Note that IMG tags are typically followed automatically by web
 142 browsers, so being careful which hyperlinks you click on is not
 143 sufficient to prevent this from happening.
 144
 145 how to manage it
 146 ~~~~~~~~~~~~~~~~
 147
 148 For future versions of Tahoe-LAFS, we are considering ways to close off
 149 this leakage of authority while preserving ease of use -- the discussion
 150 of this issue is ticket `#127 <http://tahoe-lafs.org/trac/tahoe-lafs/ticket/127>`_.
 151
 152 For the present, a good work-around is that if you want to store and
 153 view a file on Tahoe-LAFS and you want that file to remain private, then
 154 remove from that file any hyperlinks pointing to other people's servers
 155 and remove any JavaScript unless you are sure that the JavaScript is not
 156 written to maliciously leak access.
 157
 158
 159 Command-line arguments are leaked to other local users
 160 ------------------------------------------------------
 161
 162 Remember that command-line arguments are visible to other users (through
 163 the 'ps' command, or the windows Process Explorer tool), so if you are
 164 using a Tahoe-LAFS node on a shared host, other users on that host will
 165 be able to see (and copy) any caps that you pass as command-line
 166 arguments.  This includes directory caps that you set up with the "tahoe
 167 add-alias" command.
 168
 169 how to manage it
 170 ~~~~~~~~~~~~~~~~
 171
 172 As of Tahoe-LAFS v1.3.0 there is a "tahoe create-alias" command that does
 173 the following technique for you.
 174
 175 Bypass add-alias and edit the NODEDIR/private/aliases file directly, by
 176 adding a line like this:
 177
 178   fun: URI:DIR2:ovjy4yhylqlfoqg2vcze36dhde:4d4f47qko2xm5g7osgo2yyidi5m4muyo2vjjy53q4vjju2u55mfa
 179
 180 By entering the dircap through the editor, the command-line arguments
 181 are bypassed, and other users will not be able to see them. Once you've
 182 added the alias, if you use that alias instead of a cap itself on the
 183 command-line, then no secrets are passed through the command line.  Then
 184 other processes on the system can still see your filenames and other
 185 arguments you type there, but not the caps that Tahoe-LAFS uses to permit
 186 access to your files and directories.
 187
 188
 189 Capabilities may be leaked to web browser phishing filter / "safe browsing" servers
 190 -----------------------------------------------------------------------------------
 191
 192 Firefox, Internet Explorer, and Chrome include a "phishing filter" or
 193 "safe browing" component, which is turned on by default, and which sends
 194 any URLs that it deems suspicious to a central server.
 195
 196 Microsoft gives a brief description of their filter's operation at
 197 `<http://blogs.msdn.com/ie/archive/2005/09/09/463204.aspx>`_. Firefox
 198 and Chrome both use Google's "safe browsing API" which is documented
 199 at `<http://code.google.com/apis/safebrowsing/>`_ and
 200 `<http://code.google.com/p/google-safe-browsing/wiki/Protocolv2Spec>`_.
 201
 202 This of course has implications for the privacy of general web browsing
 203 (especially in the cases of Firefox and Chrome, which send your main
 204 personally identifying Google cookie along with these requests without
 205 your explicit consent, as described in `Firefox bugzilla ticket #368255
 206 <https://bugzilla.mozilla.org/show_bug.cgi?id=368255>`_).
 207
 208 The reason for documenting this issue here, though, is that when using the
 209 Tahoe-LAFS web user interface, it could also affect confidentiality and integrity
 210 by leaking capabilities to the filter server.
 211
 212 Since IE's filter sends URLs by SSL/TLS, the exposure of caps is limited to
 213 the filter server operators (or anyone able to hack the filter server) rather
 214 than to network eavesdroppers. The "safe browsing API" protocol used by
 215 Firefox and Chrome, on the other hand, is *not* encrypted, although the
 216 URL components are normally hashed.
 217
 218 Opera also has a similar facility that is disabled by default. A previous
 219 version of this file stated that Firefox had abandoned their phishing
 220 filter; this was incorrect.
 221
 222 how to manage it
 223 ~~~~~~~~~~~~~~~~
 224
 225 If you use any phishing filter or "safe browsing" feature, consider either
 226 disabling it, or not using the WUI via that browser. Phishing filters have
 227 very limited effectiveness (see
 228 `<http://lorrie.cranor.org/pubs/ndss-phish-tools-final.pdf>`_), and phishing
 229 or malware attackers have learnt how to bypass them.
 230
 231 To disable the filter in IE7 or IE8:
 232 ++++++++++++++++++++++++++++++++++++
 233
 234 - Click Internet Options from the Tools menu.
 235
 236 - Click the Advanced tab.
 237
 238 - If an "Enable SmartScreen Filter" option is present, uncheck it.
 239   If a "Use Phishing Filter" or "Phishing Filter" option is present,
 240   set it to Disable.
 241
 242 - Confirm (click OK or Yes) out of all dialogs.
 243
 244 If you have a version of IE that splits the settings between security
 245 zones, do this for all zones.
 246
 247 To disable the filter in Firefox:
 248 +++++++++++++++++++++++++++++++++
 249
 250 - Click Options from the Tools menu.
 251
 252 - Click the Security tab.
 253
 254 - Uncheck both the "Block reported attack sites" and "Block reported
 255   web forgeries" options.
 256
 257 - Click OK.
 258
 259 To disable the filter in Chrome:
 260 ++++++++++++++++++++++++++++++++
 261
 262 - Click Options from the Tools menu.
 263
 264 - Click the "Under the Hood" tab and find the "Privacy" section.
 265
 266 - Uncheck the "Enable phishing and malware protection" option.
 267
 268 - Click Close.
 269
 270
 271 Known issues in the FTP and SFTP frontends
 272 ------------------------------------------
 273
 274 These are documented in `docs/frontends/FTP-and-SFTP.rst <frontends/FTP-and-SFTP.rst>`_
 275 and at `<http://tahoe-lafs.org/trac/tahoe-lafs/wiki/SftpFrontend>`_.
 276
 277
 278 Traffic analysis based on sizes of files/directories, storage indices, and timing
 279 ---------------------------------------------------------------------------------
 280
 281 Files and directories stored by Tahoe-LAFS are encrypted, but the ciphertext
 282 reveals the exact size of the original file or directory representation.
 283 This information is available to passive eavesdroppers and to server operators.
 284
 285 For example, a large data set with known file sizes could probably be
 286 identified with a high degree of confidence.
 287
 288 Uploads and downloads of the same file or directory can be linked by server
 289 operators, even without making assumptions based on file size. Anyone who
 290 knows the introducer furl for a grid may be able to act as a server operator.
 291 This implies that if such an attacker knows which file/directory is being
 292 accessed in a particular request (by some other form of surveillance, say),
 293 then they can identify later or earlier accesses of the same file/directory.
 294
 295 Observing requests during a directory traversal (such as a deep-check
 296 operation) could reveal information about the directory structure, i.e.
 297 which files and subdirectories are linked from a given directory.
 298
 299 Attackers can combine the above information with inferences based on timing
 300 correlations. For instance, two files that are accessed close together in
 301 time are likely to be related even if they are not linked in the directory
 302 structure. Also, users that access the same files may be related to each other.