  1. Help-Desk
  2. HELP-5439

FIWARE.Request.Tech.Data.BigData-Analysis.Issue on Cosmos Global Instance: cannot transfer file parts

    Details

    • Type: extRequest
    • Status: Closed
    • Priority: Major
    • Resolution: Done
    • Fix Version/s: 2021
    • Component/s: FIWARE-TECH-HELP
    • Labels:
      None

      Description

      Dear FIWARE Staff,

      I would like to report a potential issue on the Cosmos Global Instance.

      We are using Cosmos to store input files for Hadoop jobs and their related
      output. We are transferring them from/to a remote back-end through the
      WebHDFS API.

      During outbound transfers of large files from Cosmos, the need to split the
      file content into parts has arisen. If we use a simple OPEN operation like

      *http://<HOST>:<PORT>/webhdfs/v1/<PATH>?op=OPEN*

      the transfer is interrupted after 2 minutes even if the response has not
      been fully transferred to the client. This is probably due to the timeout
      configuration of the HTTP server deployed on Cosmos.

      As suggested by the WebHDFS API documentation
      <https://hadoop.apache.org/docs/r1.0.4/webhdfs.html#OPEN>, one should
      consider transferring individual parts of the requested file by setting
      the offset and length parameters, so that each request completes
      successfully.
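A minimal sketch of the part-by-part transfer described above, assuming the server honours both offset and length and that the file size is already known (for example from a prior status query). The helper names `part_ranges`, `open_url` and `download_in_parts` are hypothetical and used only for illustration; `base_url` stands for http://<HOST>:<PORT>/webhdfs/v1/<PATH>.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def part_ranges(file_size, part_size):
    """Yield (offset, length) pairs that cover file_size bytes in order."""
    if part_size <= 0:
        raise ValueError("part_size must be positive")
    offset = 0
    while offset < file_size:
        length = min(part_size, file_size - offset)
        yield offset, length
        offset += length


def open_url(base_url, offset, length):
    """Build the OPEN URL for one part of the file."""
    query = urlencode({"op": "OPEN", "offset": offset, "length": length})
    return f"{base_url}?{query}"


def _http_get(url):
    # urlopen follows the redirect that an OPEN request normally triggers.
    with urlopen(url) as response:
        return response.read()


def download_in_parts(base_url, file_size, part_size, fetch=_http_get):
    """Fetch the file one part at a time and return the joined bytes.

    fetch is injectable so the logic can be exercised without a cluster.
    """
    parts = []
    for offset, length in part_ranges(file_size, part_size):
        parts.append(fetch(open_url(base_url, offset, length)))
    return b"".join(parts)
```

Keeping each part small enough to finish well within the 2-minute limit is the point of the split; the part size itself would have to be tuned to the available bandwidth.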

      The issue we found is that the length parameter seems to be ignored by
      the HTTP Server.
      For example, if we execute the following

      http://<HOST>:<PORT>/webhdfs/v1/<PATH>?op=OPEN&offset=100&length=10

      on a 200-byte file, we get the last 100 bytes of the file rather than
      bytes 101 to 110.
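To make the expected and observed behaviour concrete, the sketch below models both on an in-memory 200-byte buffer and shows one possible client-side mitigation: reading at most `length` bytes from the response and then stopping. It does not fix the server; `expected_part` and `read_at_most` are hypothetical helper names, and `response` is assumed to be any file-like object with a `read(n)` method.

```python
import io


def expected_part(content, offset, length=None):
    """Bytes an OPEN request should return for the given offset and length."""
    end = len(content) if length is None else offset + length
    return content[offset:end]


def read_at_most(response, length):
    """Read no more than length bytes from a file-like response body."""
    chunks = []
    remaining = length
    while remaining > 0:
        chunk = response.read(remaining)
        if not chunk:  # body ended before length bytes arrived
            break
        chunks.append(chunk)
        remaining -= len(chunk)
    return b"".join(chunks)


content = bytes(range(200))

# Expected: offset=100, length=10 yields 10 bytes (the 101st to the 110th).
wanted = expected_part(content, 100, 10)

# Observed on Cosmos: length is ignored, so everything from offset onwards arrives.
observed = expected_part(content, 100)

# Mitigation: stop reading after `length` bytes even if more are sent.
trimmed = read_at_most(io.BytesIO(observed), 10)
```

Here `wanted` and `trimmed` hold the same 10 bytes, while `observed` holds 100. Whether closing the connection early is acceptable for the server side is an assumption that would need checking.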

      We checked the version of Hadoop deployed on Cosmos and found that
      release *0.20.2-cdh3u6* is currently running. According to the HDFS
      changelog
      <https://hadoop.apache.org/docs/r0.23.11/hadoop-project-dist/hadoop-hdfs/CHANGES.txt>,
      an improvement (HDFS-3794) committed in release 0.23.3 could be related
      to the behaviour reported here.

      Best Regards,
      Andrea Sassi

      [Created via e-mail received from: Andrea Sassi <brainswitch@gmail.com>]

        Activity

        Transition                 Time In Source Status  Execution Times  Last Executer     Last Execution Date
        Open -> In Progress        19h 58m                1                Francisco Romero  02/Dec/15 11:37 AM
        In Progress -> Impeded     47d 4h 31m             1                Francisco Romero  18/Jan/16 4:09 PM
        Impeded -> In Progress     34d 23h 6m             1                Francisco Romero  22/Feb/16 3:16 PM
        In Progress -> Answered    6d 19h 43m             1                Francisco Romero  29/Feb/16 10:59 AM
        Answered -> Closed         2s                     1                Francisco Romero  29/Feb/16 10:59 AM

          People

          • Assignee: Francisco Romero (frb)
          • Reporter: FW External User (fw.ext.user)
          • Votes: 0
          • Watchers: 3

            Dates

            • Created:
              Updated:
              Resolved: