Class DataUtil


  • public final class DataUtil
    extends java.lang.Object
    Internal static utilities for handling data.
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static java.nio.charset.Charset UTF_8  
    • Method Summary

      Modifier and Type Method Description
      static Document load​(java.io.File file, java.lang.String charsetName, java.lang.String baseUri)
      Loads and parses a file to a Document, with the HtmlParser.
      static Document load​(java.io.File file, java.lang.String charsetName, java.lang.String baseUri, Parser parser)
      Loads and parses a file to a Document.
      static Document load​(java.io.InputStream in, java.lang.String charsetName, java.lang.String baseUri)
      Parses a Document from an input stream.
      static Document load​(java.io.InputStream in, java.lang.String charsetName, java.lang.String baseUri, Parser parser)
      Parses a Document from an input stream, using the provided Parser.
      static java.nio.ByteBuffer readToByteBuffer​(java.io.InputStream inStream, int maxSize)
      Read the input stream into a byte buffer.
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Field Detail

      • UTF_8

        public static final java.nio.charset.Charset UTF_8
    • Method Detail

      • load

        public static Document load​(java.io.File file,
                                    @Nullable
                                    java.lang.String charsetName,
                                    java.lang.String baseUri)
                             throws java.io.IOException
        Loads and parses a file to a Document, with the HtmlParser. Files that are compressed with gzip (and end in .gz or .z) are supported in addition to uncompressed files.
        Parameters:
        file - file to load
        charsetName - (optional) character set of input; specify null to attempt to autodetect. A BOM in the file will always override this setting.
        baseUri - base URI of document, to resolve relative links against
        Returns:
        the parsed Document
        Throws:
        java.io.IOException - on IO error
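The gzip support described above can be pictured with a small JDK-only sketch. It is not the library's implementation: the class `GzipAwareOpen` and its methods are hypothetical names, and the case-insensitive extension check is an assumption of this example.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.Locale;
import java.util.zip.GZIPInputStream;

// Hypothetical helper, not part of DataUtil: illustrates extension-based gzip handling.
final class GzipAwareOpen {
    private GzipAwareOpen() {}

    // True when the file name ends in .gz or .z (case-insensitive, an assumption of this sketch).
    static boolean looksGzipped(String fileName) {
        String name = fileName.toLowerCase(Locale.ROOT);
        return name.endsWith(".gz") || name.endsWith(".z");
    }

    // Opens the file, wrapping it in a GZIPInputStream when the name suggests gzip.
    static InputStream open(File file) throws IOException {
        InputStream in = new FileInputStream(file);
        if (!looksGzipped(file.getName())) {
            return in;
        }
        try {
            return new GZIPInputStream(in);
        } catch (IOException e) {
            in.close(); // do not leak the file handle if the gzip header is invalid
            throw e;
        }
    }
}
```

A caller would read from the returned stream exactly as it would from an uncompressed file, which is why the compressed and uncompressed cases can share one parsing path.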
      • load

        public static Document load​(java.io.File file,
                                    @Nullable
                                    java.lang.String charsetName,
                                    java.lang.String baseUri,
                                    Parser parser)
                             throws java.io.IOException
        Loads and parses a file to a Document. Files that are compressed with gzip (and end in .gz or .z) are supported in addition to uncompressed files.
        Parameters:
        file - file to load
        charsetName - (optional) character set of input; specify null to attempt to autodetect. A BOM in the file will always override this setting.
        baseUri - base URI of document, to resolve relative links against
        parser - alternate parser to use.
        Returns:
        the parsed Document
        Throws:
        java.io.IOException - on IO error
        Since:
        1.14.2
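The rule that a BOM overrides `charsetName` can be illustrated with a JDK-only sketch. `BomSniffer` and `charsetFor` are hypothetical names, and the returned charset names are choices made for this example; the library's own detection logic may differ in detail.

```java
// Hypothetical helper, not part of DataUtil: maps a leading byte-order mark to a charset name.
final class BomSniffer {
    private BomSniffer() {}

    // Returns the charset implied by a leading BOM, or the caller's fallback when no BOM is present.
    static String charsetFor(byte[] head, String fallbackCharsetName) {
        // UTF-32 is checked first because FF FE 00 00 also begins with the UTF-16LE BOM.
        if (startsWith(head, 0x00, 0x00, 0xFE, 0xFF)) return "UTF-32BE";
        if (startsWith(head, 0xFF, 0xFE, 0x00, 0x00)) return "UTF-32LE";
        if (startsWith(head, 0xEF, 0xBB, 0xBF)) return "UTF-8";
        if (startsWith(head, 0xFE, 0xFF)) return "UTF-16BE";
        if (startsWith(head, 0xFF, 0xFE)) return "UTF-16LE";
        return fallbackCharsetName; // may be null, meaning "try to autodetect some other way"
    }

    // Compares the first bytes of data with the given unsigned byte values.
    private static boolean startsWith(byte[] data, int... prefix) {
        if (data.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if ((data[i] & 0xFF) != prefix[i]) return false;
        }
        return true;
    }
}
```

In this sketch the caller-supplied name is consulted only when no BOM matches, which mirrors the documented precedence.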
      • load

        public static Document load​(@WillClose
                                    java.io.InputStream in,
                                    @Nullable
                                    java.lang.String charsetName,
                                    java.lang.String baseUri)
                             throws java.io.IOException
        Parses a Document from an input stream.
        Parameters:
        in - input stream to parse. The stream will be closed after reading.
        charsetName - (optional) character set of input; may be null
        baseUri - base URI of document, to resolve relative links against
        Returns:
        the parsed Document
        Throws:
        java.io.IOException - on IO error
      • load

        public static Document load​(@WillClose
                                    java.io.InputStream in,
                                    @Nullable
                                    java.lang.String charsetName,
                                    java.lang.String baseUri,
                                    Parser parser)
                             throws java.io.IOException
        Parses a Document from an input stream, using the provided Parser.
        Parameters:
        in - input stream to parse. The stream will be closed after reading.
        charsetName - (optional) character set of input; may be null
        baseUri - base URI of document, to resolve relative links against
        parser - alternate parser to use.
        Returns:
        the parsed Document
        Throws:
        java.io.IOException - on IO error
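The `@WillClose` contract on `in` means the caller hands over ownership of the stream and should not expect to read from it afterwards. The JDK-only sketch below shows that contract with a stand-in method. `WillCloseSketch`, `CloseTracking`, and `readAndClose` are hypothetical names used only for illustration and do not reproduce how `load` itself is written.

```java
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration of a "will close" parameter, not part of DataUtil.
final class WillCloseSketch {
    private WillCloseSketch() {}

    // Wraps a stream and records whether close() was called on it.
    static final class CloseTracking extends FilterInputStream {
        private boolean closed;

        CloseTracking(InputStream in) {
            super(in);
        }

        @Override
        public void close() throws IOException {
            closed = true;
            super.close();
        }

        boolean isClosed() {
            return closed;
        }
    }

    // Stand-in for a consuming method: reads everything, then closes the stream it was given.
    static String readAndClose(InputStream in) throws IOException {
        try (InputStream stream = in) {
            return new String(stream.readAllBytes(), StandardCharsets.UTF_8);
        }
    }
}
```

Because the consuming method closes the stream even when reading fails, the caller does not need its own try-with-resources around the call.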
      • readToByteBuffer

        public static java.nio.ByteBuffer readToByteBuffer​(java.io.InputStream inStream,
                                                           int maxSize)
                                                    throws java.io.IOException
        Read the input stream into a byte buffer. To cope with slow input streams, you may interrupt the thread this method is running on; the data read before the interruption remains available.
        Parameters:
        inStream - the input stream to read from
        maxSize - the maximum size in bytes to read from the stream. Set to 0 to be unlimited.
        Returns:
        the filled byte buffer
        Throws:
        java.io.IOException - if an exception occurs whilst reading from the input stream.
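The `maxSize` semantics (a positive cap, or 0 for unlimited) can be sketched with plain JDK classes. `CappedRead` and `readCapped` are hypothetical names, the 8 KiB chunk size is an arbitrary choice, and the sketch does not model the interrupt behaviour described above. It is an illustration of the contract, not the library's code.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.ByteBuffer;

// Hypothetical helper, not part of DataUtil: reads at most maxSize bytes, or everything when maxSize is 0.
final class CappedRead {
    private static final int CHUNK = 8192; // arbitrary chunk size for this sketch

    private CappedRead() {}

    static ByteBuffer readCapped(InputStream in, int maxSize) throws IOException {
        if (maxSize < 0) {
            throw new IllegalArgumentException("maxSize must be 0 (unlimited) or greater");
        }
        boolean capped = maxSize > 0;
        int remaining = maxSize;
        byte[] chunk = new byte[CHUNK];
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        while (!capped || remaining > 0) {
            // Never request more than the cap still allows.
            int want = capped ? Math.min(remaining, chunk.length) : chunk.length;
            int read = in.read(chunk, 0, want);
            if (read == -1) {
                break; // end of stream
            }
            out.write(chunk, 0, read);
            if (capped) {
                remaining -= read;
            }
        }
        // The returned buffer is positioned at 0 with its limit at the number of bytes read.
        return ByteBuffer.wrap(out.toByteArray());
    }
}
```

With a 10-byte stream, a cap of 4 yields a buffer with 4 readable bytes, while a cap of 0 or of 100 yields all 10.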