Class SAMCodec

java.lang.Object
htsjdk.beta.codecs.reads.sam.SAMCodec
All Implemented Interfaces:
HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>, ReadsCodec, Upgradeable
Direct Known Subclasses:
SAMCodecV1_0

@InternalAPI public abstract class SAMCodec extends Object implements ReadsCodec
InternalAPI Base class for BundleResourceType.FMT_READS_SAM codecs.
  • Constructor Details

    • SAMCodec

      public SAMCodec()
  • Method Details

    • getFileFormat

      public String getFileFormat()
      Description copied from interface: HtsCodec
      Get the name of the file format supported by this codec. The format name defines the underlying format handled by this codec, and also corresponds to the format of the primary bundle resource that is required when decoding or encoding (see BundleResourceType and BundleResource.getFileFormat()).
      Specified by:
      getFileFormat in interface HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>
      Returns:
      the name of the underlying file format handled by this codec
    • getDisplayName

      public String getDisplayName()
      Description copied from interface: HtsCodec
      Get a user-friendly display name for this codec.

      It is recommended that the display name minimally include both the name of the supported file format and the supported version.
      Specified by:
      getDisplayName in interface HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>
      Returns:
      a user-friendly display name for this codec
    • ownsURI

      public boolean ownsURI(IOPath ioPath)
      Description copied from interface: HtsCodec
      Determine if this codec "owns" the URI contained in ioPath see (IOPath.getURI()).

      A codec "owns" the URI only if it has specific requirements on the URI protocol scheme, URI format, or query parameters that go beyond a simple file extension, AND it explicitly recognizes the URI as conforming to those requirements. File formats that only require a specific file extension should always return false from HtsCodec.ownsURI(htsjdk.io.IOPath), and should instead use the extension as a filter in HtsCodec.canDecodeURI(IOPath).

      Returning true from this method will cause the framework to bypass the stream-oriented signature probing that is used to resolve inputs to a codec handler. During codec resolution, if any registered codec returns true for this method on ioPath, the signature probing protocol will instead:

      1. immediately prune the list of candidate codecs to only those that return true for this method on ioPath
      2. not attempt to obtain an InputStream on the IOPath containing the URI, on the assumption that special handling is required in order to access the underlying resource (i.e., htsget codec would claim an "http://" URI if the rest of the URI conforms to the expected format for that codec's protocol).

      Any codec that returns true from HtsCodec.ownsURI(IOPath) for a given IOPath must also return true from HtsCodec.canDecodeURI(IOPath) for the same IOPath. For custom URI handlers, codecs should avoid making remote calls to determine the suitability or accessibility of the input resource; the return value for this method should be based only on the format of the URI that is presented. Operations that require remote access that can fail, such as validating server connectivity, authentication, or authorization, should be deferred until data is requested by the caller via the codec's HtsEncoder or HtsDecoder. Since this method is used during codec resolution, implementations should avoid calling methods that may throw exceptions.

      Specified by:
      ownsURI in interface HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>
      Parameters:
      ioPath - the ioPath to inspect
      Returns:
      true if the ioPath's URI represents a custom URI that this codec handles
    • canDecodeURI

      public boolean canDecodeURI(IOPath ioPath)
      Description copied from interface: HtsCodec
      Determine if the URI for ioPath (obtained via IOPath.getURI()) conforms to the expected URI format this codec's file format.

      Most implementations only look at the file extension (see IOPath.hasExtension(java.lang.String)). For codecs that implement formats that use specific, well known file extensions, the codec should reject inputs that do not conform to any of the accepted extensions. If the format does not use a specific extension, or if the codec cannot determine if it can decode the underlying resource without inspecting the underlying stream, it is safe to return true, after which the framework will subsequently call this codec's HtsCodec.canDecodeSignature(SignatureStream, String) method, at which time the codec can inspect the actual underlying stream via the SignatureStream.

      Implementations should generally not inspect the URI's protocol scheme unless the file format supported by the codec requires the use a specific protocol scheme. For codecs that do own a specific scheme or URI format, the return values for HtsCodec.ownsURI(IOPath) and HtsCodec.canDecodeURI(IOPath) must always be the same (both true or both false) for a given IOPath. For codecs that do not use a custom URI (and rely on NIO access), @link #ownsURI(IOPath)} should always return false, with only the value returned from HtsCodec.canDecodeURI(IOPath) varying based on features such as file extension probes.

      It is never safe to attempt to directly inspect the underlying stream for ioPath in this method. If the stream needs to be inspected, it should be done using the signature stream when the HtsCodec.canDecodeSignature(SignatureStream, String) method is called.

      For custom URI handlers (see HtsCodec.ownsURI(IOPath), codecs should avoid making remote calls to determine the suitability of the input resource; the return value for this method should be based only on the format of the URI that is presented. Since this method is used during codec resolution, implementations should avoid calling methods that may throw exceptions.
      Specified by:
      canDecodeURI in interface HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>
      Parameters:
      ioPath - to be decoded
      Returns:
      true if the codec can provide a decoder to provide this URI
    • canDecodeSignature

      public boolean canDecodeSignature(SignatureStream probingInputStream, String sourceName)
      Description copied from interface: HtsCodec
      Determine if the codec can decode an input stream by inspecting a signature embedded within the stream.

      The probingInputStream stream will contain only a fragment of the actual input stream, taken from the start of the stream, the size of which will be the lesser of:

      1. the number of bytes returned by HtsCodec.getSignatureProbeLength()
      2. the entire input stream, for streams that are smaller than HtsCodec.getSignatureProbeLength()

      Codecs that handle custom URIs that reference remote resources (those that return true for HtsCodec.ownsURI(htsjdk.io.IOPath)) should generally not inspect the stream, and should return false from this method, since the method will never be called with any resource for which HtsCodec.ownsURI(htsjdk.io.IOPath) returned true. Since this method is used during codec resolution, implementations should avoid calling methods that may throw exceptions.

      Specified by:
      canDecodeSignature in interface HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>
      Parameters:
      probingInputStream - the stream to be inspect for the resource's embedded signature and version
      sourceName - a display name describing the source of the input stream, for use in error messages
      Returns:
      true if this codec recognizes the stream by it's signature, and can provide a decoder to decode the stream, otherwise false
    • getSignatureProbeLength

      public int getSignatureProbeLength()
      Description copied from interface: HtsCodec
      Get the number of bytes of needed by this codec to probe an input stream for a format/version signature, and determine if it can supply a decoder for the stream.
      Specified by:
      getSignatureProbeLength in interface HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>
      Returns:
      the number of bytes this codec must consume from a stream in order to determine whether it can decode that stream. This number may differ from the actual signature size as returned by HtsCodec.getSignatureLength() for codecs that support compressed or encrypted streams, since they may require a larger and more semantically meaningful input fragment (such as an entire encrypted or compressed block) in order to inspect the plaintext signature.

      Therefore signatureProbeLength should be expressed in "compressed/encrypted" space rather than "plaintext" space. The length returned from this method is used to determine the size of the SignatureStream that is subsequently passed to HtsCodec.canDecodeSignature(SignatureStream, String).

      Note: Codecs that are custom URI handlers (those that return true for HtsCodec.ownsURI(IOPath)), should always return 0 from this method when it is called. Since this method is used during codec resolution, implementations should avoid calling methods that may throw exceptions.

    • getSignatureLength

      public int getSignatureLength()
      Description copied from interface: HtsCodec
      Get the number of bytes in the format and version signature used by the file format supported by this codec.
      Specified by:
      getSignatureLength in interface HtsCodec<ReadsDecoderOptions,ReadsEncoderOptions>
      Returns:
      if the file format supported by this codecs is not remote, and is accessible via a local file or stream, the size of the unique signature/version for this file format. otherwise 0.

      Note: Codecs that are custom URI handlers (those that return true for HtsCodec.ownsURI(htsjdk.io.IOPath)), should always return 0 from this method. Since this method is used during codec resolution, implementations should avoid calling methods that may throw exceptions.