NAME
Encode::Detect::Detector - Detects the encoding of data
SYNOPSIS
use Encode::Detect::Detector;
# One-shot detection
my $charset = detect($octets);
# Incremental detection
my $d = Encode::Detect::Detector->new;
$d->handle($octets);
$d->handle($more_octets);
$d->eof;
$charset = $d->getresult;
DESCRIPTION
This module provides an interface to Mozilla's universal charset detector, which
detects the charset used to encode data.
METHODS
$charset = Encode::Detect::Detector->detect($octets)
Detect the charset used to encode the data in $octets and return the charset's
name. Returns undef if the charset cannot be determined with sufficient
confidence.
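A minimal sketch of one-shot detection followed by decoding, using the exported detect() form from the SYNOPSIS. The helper name decode_guessing and the ISO-8859-1 fallback are illustrative assumptions, not part of this module.

```perl
use strict;
use warnings;
use Encode qw(decode find_encoding);
use Encode::Detect::Detector;

# Hypothetical helper: decode octets with the detected charset, falling
# back to a caller-supplied charset when detection is inconclusive.
sub decode_guessing {
    my ($octets, $fallback) = @_;
    my $charset = detect($octets);
    # detect() returns undef when it is not confident enough.
    $charset = $fallback unless defined $charset;
    # The detector may name a charset that Encode does not know.
    $charset = $fallback unless find_encoding($charset);
    return decode($charset, $octets);
}

my $text = decode_guessing("caf\xc3\xa9 cr\xc3\xa8me br\xc3\xbbl\xc3\xa9e", 'ISO-8859-1');
printf "%d characters\n", length $text;
```

Checking for undef before decoding matters because passing an undefined charset name to Encode would fail rather than fall back.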
$d = Encode::Detect::Detector->new()
Creates a new "Encode::Detect::Detector" object and returns it.
$d->handle($octets)
Provides an additional chunk of data to be examined by the detector. May be
called multiple times.
Returns zero on success, nonzero if a memory allocation failed.
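Because handle() may be called repeatedly, a large file can be examined in chunks instead of being read into memory at once. The sketch below assumes a hypothetical helper named detect_file and an arbitrary default chunk size of 4096 bytes.

```perl
use strict;
use warnings;
use Encode::Detect::Detector;

# Hypothetical helper: detect the charset of a file chunk by chunk.
sub detect_file {
    my ($path, $chunk_size) = @_;
    $chunk_size ||= 4096;    # assumed default, not mandated by the module
    open my $fh, '<:raw', $path or die "Cannot open $path: $!";
    my $d = Encode::Detect::Detector->new;
    while (read($fh, my $buf, $chunk_size)) {
        # handle() returns nonzero only if a memory allocation failed.
        $d->handle($buf) == 0 or die "Detector could not allocate memory";
    }
    close $fh;
    $d->eof;                 # tell the detector the input is complete
    return $d->getresult;    # undef if no charset was decided upon
}

if (@ARGV) {
    my $charset = detect_file($ARGV[0]);
    print defined $charset ? "$charset\n" : "unknown\n";
}
```

The file is opened with the :raw layer so that the detector sees the original octets rather than decoded characters.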
$d->eof
Informs the detector that there is no more data to be examined. In many cases,
this is necessary in order for the detector to make a decision on the charset.
$d->reset
Resets the detector to its initial state.
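One plausible use of reset is to examine several unrelated buffers with a single detector object. The helper name detect_all below is an illustrative assumption.

```perl
use strict;
use warnings;
use Encode::Detect::Detector;

# Hypothetical helper: detect the charset of each buffer in turn,
# reusing one detector object.
sub detect_all {
    my (@buffers) = @_;
    my $d = Encode::Detect::Detector->new;
    my @charsets;
    for my $octets (@buffers) {
        $d->reset;                        # discard state from the previous buffer
        $d->handle($octets);
        $d->eof;
        push @charsets, $d->getresult;    # undef when undecided
    }
    return @charsets;
}

my @charsets = detect_all("plain text", "d\xc3\xa9j\xc3\xa0 vu " x 50);
print join(", ", map { defined $_ ? $_ : "unknown" } @charsets), "\n";
```

Without the reset call, data from an earlier buffer could influence the result reported for a later one.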
$d->getresult
Returns the name of the detected charset or "undef" if no charset has
(yet) been decided upon. May be called at any time.
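Since getresult may be called at any time, a caller can poll it after each chunk and stop feeding data once a charset is reported. This is only a sketch: the helper name detect_chunks is an assumption, and whether a result becomes available before eof is called depends on the detector, so the code still calls eof as a last resort.

```perl
use strict;
use warnings;
use Encode::Detect::Detector;

# Hypothetical helper: feed chunks until the detector reports a charset.
sub detect_chunks {
    my (@chunks) = @_;
    my $d = Encode::Detect::Detector->new;
    for my $chunk (@chunks) {
        $d->handle($chunk);
        my $charset = $d->getresult;
        return $charset if defined $charset;    # decided before end of input
    }
    $d->eof;                 # no early decision; force a final one
    return $d->getresult;    # may still be undef
}

my $charset = detect_chunks("first chunk, ", "second chunk");
print defined $charset ? "$charset\n" : "unknown\n";
```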
SEE ALSO
Encode::Detect
AUTHOR
John Gardiner Myers <jgmyers@proofpoint.com>
SUPPORT
For help and thank-you notes, e-mail the author directly. To report a bug,
submit a patch, or add to the wishlist, please visit the CPAN bug manager at:
http://rt.cpan.org