module encoding_detector
EncodingDetector class for managing chardet encoding detection.
Global Variables
- ENCODING
- OFF_WHITE
- CONFIDENCE_SCORE_RANGE
class EncodingDetector
Manager class to ease dealing with the encoding detection library chardet.
Each instance of this class manages a chardet.detect_all() scan on a single set of bytes.
Attributes:
bytes(bytes): The bytes to analyze.assessments(list[EncodingAssessment]): List ofEncodingAssessmentobjects fromchardetresults.force_decode_assessments(list[EncodingAssessment]): Assessments above force decode threshold.force_display_assessments(list[EncodingAssessment]): Assessments above force display threshold.has_any_idea(bool | None):Trueifchardethad any idea what the encoding might be,Falseif not,Noneifchardetwasn't run yet.raw_chardet_assessments(list[dict]): Raw list of dicts returned bychardet.detect_all().table(Table): A richTableobject summarizing the chardet results.unique_assessments(list[EncodingAssessment]): Unique assessments by encoding, highest confidence only.
method __init__
__init__(
_bytes: bytes,
assessments: list[EncodingAssessment] = <factory>,
force_decode_assessments: list[EncodingAssessment] = <factory>,
force_display_assessments: list[EncodingAssessment] = <factory>,
has_any_idea: bool | None = None,
raw_chardet_assessments: list[dict] = <factory>,
table: Table = <factory>,
unique_assessments: list[EncodingAssessment] = <factory>
) → None
property bytes
property bytes_len
method get_encoding_assessment
get_encoding_assessment(encoding: str) → EncodingAssessment
Get the chardet assessment for a specific encoding.
Args:
encoding(str): The encoding to look for.
Returns:
EncodingAssessment: Assessment for the encoding if it exists, otherwise a dummy with 0 confidence.
This file was automatically generated via lazydocs.