NAME DateTime::Format::JP - Japanese DateTime Parser and Formatter SYNOPSIS use DateTime::Format::JP; my $fmt = DateTime::Format::JP->new( hankaku => 1, pattern => '%c', # default traditional => 0, kanji_number => 0, zenkaku => 0, time_zone => 'local', ); my $dt = DateTime->now; $dt->set_formatter( $fmt ); # set the encoding in and out to utf8 use open ':std' => ':utf8'; print "$dt\n"; # will print something like 令和3年7月12日午後2:30:20 my $dt = $fmt->parse_datetime( "令和3年7月12日午後2時30分" ); my $str = $fmt->format_datetime( $dt ); print "$str\n"; VERSION v0.1.0 DESCRIPTION This module is used to parse and format Japanese date and time. It is lightweight and yet versatile. It implements 2 main methods: "parse_datetime" and "format_datetime" both expects and return decoded utf8 string. You can use Encode to decode and encode from perl internal utf8 representation to real utf8. METHODS new The constructor accepts the following parameters: *hankaku* boolean If true, the digits used will be "half-size" (半角), or roman numbers like 1, 2, 3, etc. The opposite is *zenkaku* (全角) or full-width. This will enable the use of double-byte Japanese numbers that still look like roman numbers, such as: 1, 2, 3, etc. Defaults to true. *pattern* string The pattern to use to format the date and time. See below the available "PATTERN TOKENS" and their meanings. Defaults to %c *traditional* boolean If true, then it will use a more traditional date/time representation. The effect of this parameter on the formatting is documented in "PATTERN TOKENS" *kanji_number* boolean If true, this will have "format_datetime" use numbers in kanji, such as: 一, 二, 三, 四, etc. *zenkaku* boolean If true, this will use full-width, ie double-byte Japanese numbers that still look like roman numbers, such as: 1, 2, 3, etc. *time_zone* string The time zone to use when creating a DateTime object. Defaults to "local" error Returns the latest error set, if any. All method in this module return "undef" upon error and set an error that can be retrieved with this method. format_datetime Takes a DateTime object and returns a formatted date and time based on the pattern specified, which defaults to %c. You can call this method directly, or you can set this formatter object in so that ie will be used for stringification of the DateTime object. See below "PATTERN TOKENS" for the available tokens and their meanings. hankaku Sets or gets the boolean value for *hankaku*. kanji_number Sets or gets the boolean value for *kanji_number*. parse_datetime Takes a string representing a Japanese date, parse it and return a new DateTime. If an error occurred, it will return "undef" and you can get the error using "error" time_zone Sets or gets the string representing the time zone to use when creating DateTime object. This is used by "parse_datetime" traditional Sets or gets the boolean value for *traditional*. zenkaku Sets or gets the boolean value for *zenkaku*. SUPPORT METHODS kanji_to_romaji Takes a number in kanji and returns its equivalent value in roman (regular) numbers. lookup_era Takes an Japanese era in kanji and returns an "DateTime::Format::JP::Era" object lookup_era_by_date Takes a DateTime object and returns a "DateTime::Format::JP::Era" object make_datetime Returns a DateTime based on parameters provided. romaji_to_kanji Takes a number and returns its equivalent representation in Japanese kanji. Thus, for example, 1234 would be returned as "一千二百三十四" Please note that, since this is intended to be used only for dates, it does not format number over 9 thousand. If you think there is such need, please contact the author. romaji_to_zenkaku Takes a number and returns its equivalent representation in double-byte Japanese numbers. Thus, for example, 1234 would be returned as 1234 zenkaku_to_romaji Takes a string representing a number in full width (全角), i.e. double-byte and returns a regular equivalent number. Thus, for example, 1234 would be returned as 1234 PATTERN TOKENS Here are below the available tokens for formatting and the value they represent. In all respect, they are closely aligned with "strftime" in DateTime (see "strftime Patterns" in DateTime), except that the formatter object parameters provided upon instantiation alter the values used. * %% The % character. * %a The weekday name in abbreviated form such as: 月, 火, 水, 木, 金, 土, 日 * %A The weekday name in its long form such as: 月曜日, 火曜日, 水曜日, 木曜日, 金曜日, 土曜日, 日曜日 * %b The month name, such as 1月, 2月, etc... 12月 using regular digits. * %B The month name using full width (全角) digits, such as 1月, 2月, etc... 12月 * %h The month name using kanjis for numbers, such as 一月, 二月, etc... 十二月 * %c The datetime format in the Japanese standard most usual form. For example for "12th July 2021 14:17:30" this would be: 令和3年7月12日午後2:17:30 However, if *traditional* is true, then it would rather be: 令和3年7月12日午後2時17分30秒 And if *zenkaku* is true, it will use double-byte numbers instead: 令和3年7月12日午後2時17分30秒 And if *kanji_number* is true, it will then be: 令和3年七月十二日午後二時十七分三十秒 * %C The century number (year/100) as a 2-digit integer. This is the same as "strftime" in DateTime * %d or %e The day of month (1-31). However, if *zenkaku* is true, then it would rather be with full width (全角) numbers: 1-31 And if *kanji_number* is true, it will then be with numbers in kanji: 一, 二, etc.. 十, 十一, etc.. * %D Equivalent to "%E%y年%m月%d日" This is the Japanese style date including with the leading era name. If *zenkaku* is true, "full-width" (double byte) digits will be used and if *kanji_number* is true, numbers in kanji will be used instead. See %F for an equivalent date using the Gregorian years rather than the Japanese era. * %E This extension is the Japanese era, such as "令和" (i.e. "reiwa": the current era) * %F Equivalent to "%Y年%m月%d日" If *zenkaku* is true, "full-width" (double byte) digits will be used and if *kanji_number* is true, numbers in kanji will be used instead. For the year only the conversion from regular digits to Japanese kanjis will be done simply by interpolating the digits and not using numerals. For example 2021 would become "二〇二一" and not "二千二十一" * %g The year corresponding to the ISO week number, but without the century (0-99). This uses regular digits and is the same as "strftime" in DateTime * %G The ISO 8601 year with century as a decimal number. The 4-digit year corresponding to the ISO week number. This has the same format and value as %Y, except that if the ISO week number belongs to the previous or next year, that year is used instead. Also this returns regular digits. This uses regular digits and is the same as "strftime" in DateTime * %H The hour: 0-23 If *traditional* is enabled, this would rather be "0-23時" However, if *zenkaku* is true, then it would rather use full width (全角) numbers: "0-23時" And if *kanji_number* is true, it will then be something like "十時" * %I The hour on a 12-hour clock (1-12). If *zenkaku* is true, it will use full width numbers and if *kanji_number* is true, it will use numbers in kanji instead. * %j The day number in the year (1-366). This uses regular digits and is the same as "strftime" in DateTime * %m The month number (1-12). If *zenkaku* is true, it will use full width numbers and if *kanji_number* is true, it will use numbers in kanji instead. * %M The minute: 0-59 If *traditional* is enabled, this would rather be "0-59分" However, if *zenkaku* is true, then it would rather use full width (全角) numbers: "0-59分" And if *kanji_number* is true, it will then be something like "十分" * %n Arbitrary whitespace. Same as in "strftime" in DateTime * %N Nanoseconds. For other sub-second values use "%[number]N". This is a pass-through directly to "strftime" in DateTime * %p or %P Either produces the same result. Either AM (午前) or PM (午後) according to the given time value. Noon is treated as pm "午後" and midnight as am "午前". * %r Equivalent to "%p%I:%M:%S" Note that if *zenkaku* is true, the semi-colon used will be double-byte: ":" Also if you use this, do not enable *kanji_number*, because the result would be weird, something like: 午後二:十四:三十 # 2:14:30 in this example * %R Equivalent to "%H:%M" Note that if *zenkaku* is true, the semi-colon used will be double-byte: ":" Juste like for %r, avoid enabling *kanji_number* if you use this token. * %s Number of seconds since the Epoch. If *zenkaku* is enabled, this will return the value as double-byte number. * %S The second: "0-60" If *traditional* is enabled, this would rather be "0-60秒" However, if *zenkaku* is true, then it would rather use full width (全角) numbers: "0-60秒" And if *kanji_number* is true, it will then be something like "六十秒" 60 may occur for leap seconds. See DateTime::LeapSecond). * %t Arbitrary whitespace. Same as in "strftime" in DateTime * %T Equivalent to "%H:%M:%S" However, if *zenkaku* option is enabled, the numbers will be double-byte roman numbers and the separator will also be double-byte. For example: 14:20:30 * %U The week number with Sunday (日曜日) the first day of the week (0-53). The first Sunday of January is the first day of week 1. If *zenkaku* is enabled, it will return a double-byte number instead. * %u The weekday number (1-7) with Monday (月曜日) = 1, 火曜日 = 2, 水曜日 = 3, 木曜日 = 4, 金曜日 = 5, 土曜日 = 6, 日曜日 = 7 If *zenkaku* is enabled, it will return a double-byte number instead. This is the "DateTime" standard. * %w The weekday number (0-6) with Sunday = 0. If *zenkaku* is enabled, it will return a double-byte number instead. * %W The week number with Monday (月曜日) the first day of the week (0-53). The first Monday of January is the first day of week 1. If *zenkaku* is enabled, it will return a double-byte number instead. * %x The date format in the standard most usual form. For example for 12th July 2021 this would be: 令和3年7月12日 However, if *zenkaku* is true, then it would rather be: 令和3年7月12日 And if *kanji_number* is true, it will then be: 令和3年七月十二日 * %X The time format in the standard most usual form. For example for "14:17:30" this would be: 午後2:17:30 And if *zenkaku* is enabled, it would rather use a double-byte numbers and separator: 午後2:17:30 However, if *traditional* is true, then it would rather be: 午後2時17分30秒 And if *kanji_number* is true, it will then be: 午後二時十七分三十秒 * %y The year of the era. For example "2021-07-12" would be "令和3年7月12日" and thus the year value would be 3 If *zenkaku* is true, it will use full width numbers and if *kanji_number* is true, it will use numbers in kanji instead. * %Y A 4-digit year, including century (for example, 1991). If *zenkaku* is true, "full-width" (double byte) digits will be used and if *kanji_number* is true, numbers in kanji will be used instead. Same as in %F, the conversion from regular digits to Japanese kanjis will be done simply by interpolating the digits and not using numerals. For example 2021 would become "二〇二一" and not "二千二十一" * %z An RFC-822/ISO 8601 standard time zone specification. (For example +1100) If *zenkaku* is true, "full-width" (double byte) digits and "+/-" signs will be used and if *kanji_number* is true, numbers in kanji will be used instead. However, no numeral will be used. Thus a time zone offset such as +0900 would be returned as "+〇九〇〇" * %Z The timezone name. (For example EST -- which is ambiguous). This is the same as "strftime" in DateTime HISTORICAL NOTE Japanese eras, also known as 元号 (gengo) or 年号 (nengo) form one of the two parts of a Japanese year in any given date. It was instituted by and under first Emperor Kōtoku in 645 AD. So be warned that requiring an era-based Japanese date before will not yield good results. Era name were adopted for various reasons such as a to commemorate an auspicious or ward off a malign event, and it is only recently that era name changes are tied to a new Emperor. More on this here From 1334 until 1392, there were 2 competing regimes in Japan; the North and South. This period was called "Nanboku-chō" (南北朝). This module uses the official Northern branch. Also there has been two times during the period "Asuka" (飛鳥時代) with no era names, from 654/11/24 until 686/8/14 after Emperor Kōtoku death and from 686/10/1 until 701/5/3 after Emperor Tenmu's death just 2 months after his enthronement. Thus if you want a Japanese date using era during those two periods, you will get and empty era. More on this here AUTHOR Jacques Deguest SEE ALSO perl COPYRIGHT & LICENSE Copyright(c) 2021 DEGUEST Pte. Ltd. DEGUEST Pte. Ltd.