JavaScript Roman Numeral Converter

While looking for something quick to do during a brief internet outage, I wrote some code to convert to and from Roman numerals. Once things were back up I searched for equivalent code, but only found stuff that was multiple pages long, limited the range of what it could convert, or both. I figured I might as well share what I came up with:

function romanize (num) {
  if (!+num) return false;
  var digits = String(+num).split('');
  var key = ['','C','CC','CCC','CD','D','DC','DCC','DCCC','CM',
             '','X','XX','XXX','XL','L','LX','LXX','LXXX','XC',
             '','I','II','III','IV','V','VI','VII','VIII','IX'];
  var roman = '', i = 3;
  while (i--) roman = (key[+digits.pop() + (i * 10)] || '') + roman;
  return Array(+digits.join('') + 1).join('M') + roman;
}

function deromanize (str) {
  var str = str.toUpperCase();
  var validator = /^M*(?:D?C{0,3}|C[MD])(?:L?X{0,3}|X[CL])(?:V?I{0,3}|I[XV])$/;
  var token = /[MDLV]|C[MD]?|X[CL]?|I[XV]?/g;
  var key = {M:1000,CM:900,D:500,CD:400,C:100,XC:90,L:50,XL:40,X:10,IX:9,V:5,IV:4,I:1};
  var num = 0, m;
  if (!(str && validator.test(str))) return false;
  while (m = token.exec(str)) num += key[m[0]];
  return num;
}

How would you rewrite this code? Can you create a shorter version?

41 thoughts on “JavaScript Roman Numeral Converter”

Iván -DrSlump- Montes says:

March 16, 2008 at 5:40 am

This is a pretty common solution to the problem (just to romanize). I haven’t tested if it actually works but overall the algorithm is pretty simple and easy to port to any language.

By the way, your ‘deromanize’ is actually very easy to follow. Congrats.

function romanize(num) { var lookup = {M:1000,CM:900,D:500,CD:400,C:100,XC:90,L:50,XL:40,X:10,IX:9,V:5,IV:4,I:1}, roman = '', i; for ( i in lookup ) { while ( num >= lookup[i] ) { roman += i; num -= lookup[i]; } } return roman; }
Rafael says:

March 16, 2008 at 6:20 am

Another solution for romanize function can be

function rome(N,s,b,a,o,t){ t=N/1e3|0;N%=1e3; for(s=b='',a=5;N;b++,a^=7) for(o=N%a,N=N/a^0;o--;) s='IVXLCDM'.charAt(o>2?b+N-(N&=~1)+(o=1):b)+s; return Array(t+1).join('M')+s; }

The above algorithm is based on one found on polish javascript group ( http://groups.google.com/group/pl.comp.lang.javascript/msg/c8991a94f77c98ce ). In the original function ‘N’ can be 0-3999. I just modified it to handle bigger numbers.
Steve says:

March 16, 2008 at 10:44 am

Both algorithms are great, for opposite reasons… Iván’s for its simplicity, and Rafael’s for its dizzying complexity. (Obviously, both are nice and short.)

When I posted this I assumed romanize was the weaker of the two functions. Anyone have another take on the deromanize function?
Iván -DrSlump- Montes says:

March 16, 2008 at 4:25 pm

Well, without trying to validate the input string I guess the following is quite easy to understand deromanize algorithm. It will traverse the roman literal from right to left, adding by default and subtracting the letter equivalent value if it’s lower than its immediate sibling.

function deromanize( roman ) { var roman = roman.toUpperCase(), lookup = {I:1,V:5,X:10,L:50,C:100,D:500,M:1000}, arabic = 0, i = roman.length; while (i--) { if ( lookup[roman[i]] < lookup[roman[i+1]] ) arabic -= lookup[roman[i]]; else arabic += lookup[roman[i]]; } return arabic; }
Steve says:

March 16, 2008 at 6:09 pm

Nice. It’s always interesting to see alternative approaches.

One thing though… To make it work cross-browser you’ll need to either convert roman to an array (roman = roman.toUpperCase().split("")) or use the charAt method (roman.charAt(i) instead of roman[i]). Treating strings as arrays like that is not part of the ES3 spec (although I believe it’s proposed for inclusion in ES4) and doesn’t work in IE, unfortunately.
Steve says:

March 16, 2008 at 9:20 pm

Kris Kowal emailed me the following two versions of deromanize, both of which use an approach similar to Iván’s but take advantage of Kris’s Chiron JavaScript library and a heavy dose of functional programming.

include('boost.js'); var romans = dict({M:1000, D:500, C:100, L:50, X:10, V:5, I:1}); this.decode = function (roman) { return ( add( eachIter(roman.toUpperCase(), romans.get), [0] ) .to(edgesIter) .eachArgsIter(function (x, y) { return x < y ? -x : x; }) .sum() ) };

A variation on the same thing:

include('boost.js'); var romans = dict({M:1000, D:500, C:100, L:50, X:10, V:5, I:1}); this.decode = function (roman) { roman = each(roman, romans.get); return ( range(getLength(roman)) .eachIter(function (i) { return [roman.get(i), roman.get(i + 1, 0)]; }) .eachArgsIter(function (x, y) { return x < y ? -x : x; }) .sum() ) };

In both cases I edited the inner return to use the ternary ?: operator, because at least here I find that easier to read than if/else. From my perspective, these are probably the most interesting (but least practical, if for no other reason than the added function call overhead) of the implementations so far.

If the idioms are foreign to you, check out the Chiron source to figure out how it works.
Jonathan Snook says:

March 16, 2008 at 11:09 pm

This is a variation on Ivan’s but I think splitting into an array can actually simplify the code:

function deromanize( roman ) { var roman = roman.toUpperCase().split(''), lookup = {I:1,V:5,X:10,L:50,C:100,D:500,M:1000}, num = 0, val = 0; while (roman.length) { val = lookup[roman.shift()]; num += val * (val < lookup[roman[0]] ? -1:1); } return num; }

There’s not the same level of validation as the original but I like it for its simplicity.
Siderite says:

March 17, 2008 at 1:14 am

Very nice post, although I can’t imagine where I would need roman numbers anymore. Anyone wonder how the Romans actually read those numbers? I mean, we read every digit and say something about it. How did they do it?

BTW, I didn’t know about the +num test for numerical values. Always used parseInt. Thanks!
Steve says:

March 17, 2008 at 1:51 am

@Snook, IMO that’s a fairly significant improvement. Since the regex-based validation can easily be added in, it looks like your deromanize is the one to beat.

@Siderite, you might have already realized this, but the unary + operator and parseInt are not equivalent. + can convert strings to numbers, and returns NaN if the element cannot be converted. parseInt (which takes an optional second argument for the radix) does the same thing, but also extracts leading numbers from strings. E.g., parseInt("12x") returns 12, while +"12x" returns NaN. Additionally, parseInt and + make different assumptions about the radix when there’s a leading zero. +"012" returns 12, but parseInt("012") returns 10. The leading zero causes parseInt to treat it as an octal number in probably all browsers, despite octals being summarily deprecated in ES3. Of course, you can use parseInt("012",10) to get around that.

Incidentally, that explains why I’m using String(+num). The plus sign is there to remove leading zeros if the number is received as a string. I probably wouldn’t bother accounting for that case here if it took more than an extra character.
Scott Fitchet says:

March 17, 2008 at 2:14 pm

This blog does in fact appear to be badass.
Rafael says:

March 18, 2008 at 9:19 am

IMO, the while loop can be easily avoided

function derome(N){ if(!/^M*(?:D?C{0,3}|C[MD])(?:L?X{0,3}|X[CL])(?:V?I{0,3}|I[XV])$/.test(N)) throw new Error('Incorrect roman number'); var r=0; N.replace(/[MDLV]|C[MD]?|X[CL]?|I[XV]?/g,function(i){ r+={M:1000,CM:900,D:500,CD:400,C:100,XC:90,L:50,XL:40,X:10,IX:9,V:5,IV:4,I:1}[i]; }); return r; }

PS. When validating the string, IMO, the function should throw an Error, not return false. Making the function return false, makes a PHP-style function out of it, rather than “classic”-JavaScript one.

Or, we can return Number.NaN

if(!/^M*(?:D?C{0,3}|C[MD])(?:L?X{0,3}|X[CL])(?:V?I{0,3}|I[XV])$/.test(N)) return Number.NaN; // return +'a' if we want to make it shorter ;-)
Steve says:

March 18, 2008 at 11:25 am

@Rafael, I agree that returning NaN or throwing an error would be better than returning false.

However, I don’t really see the point of replacing a while loop with a replace loop. You’re essentially replacing this (stripped down) piece of my function:

var r=0,m,k={...},t=/regex/g; while(m=t.exec(N))r+=k[m[0]];

…with this:

var r=0; N.replace(/regex/g,function(i){r+={...}[i];});

That saves four characters. Granted, I specifically asked for shorter versions, but IMO it’s the same thing in less lucid form. Additionally, recreating the lookup object during each iteration is going to add a little bit of overhead. The other changes you made cause the function to return 0 when passed an empty string, and not handle lowercase Roman numeral characters.

If the whole point of the solution was its brevity, never mind all that 🙂 (but then you might as well remove the validation too).

BTW, return NaN is shorter than return +'a'. 😉
Rafael says:

March 18, 2008 at 6:02 pm

However, I don’t really see the point of replacing a while loop with a replace loop […] Additionally, recreating the lookup object during each iteration is going to add a little bit of overhead

I know, that my solution is only a bit shorter, but not very efficient. I also know, that recreating the object isn’t the best solution. But we can define (once again) a local variable in the scope of deromanize function. The main point of my algorithm was to show, that we can avoid using the while-loop. Only this. I just like to “play” with code.

PS. I didn’t know, that NaN is also a global object/value… I thought it’s a property of Number object. Guess, I have to dive more deeply into ECMAScript Spec.

Oh, and the problem while validating, it’s my omission, sorry.
if(!N||!/^M*(?:D?C{0,3}|C[MD])(?:L?X{0,3}|X[CL])(?:V?I{0,3}|I[XV])$/i.test(N=N.toUpperCase())) return NaN;
Steve says:

March 18, 2008 at 7:18 pm

@Rafael:

“I just like to ‘play’ with code.”

Me too. 🙂

Although while/exec() and replace() can often be used interchangeably, I tend to use while/exec() unless I’m actually replacing something in the string. It makes more sense to me and it’s more flexible.

Regarding the empty string handling, that was easy to miss given that I didn’t comment my code or explicitly test for that case. You could also deal with it by simply adding (?=.) at the front of the regex.
Cambrinus says:

March 23, 2008 at 4:01 am

“Very nice post, although I can’t imagine where I would need roman numbers anymore. Anyone wonder how the Romans actually read those numbers? I mean, we read every digit and say something about it. How did they do it?”

Dear Siderite One might need to read them, e.g. on old buildings, in old books. You obviously do not. Er…they used Latin to read their numbers; this, being logical, is the reason they can be easily reduced to code.
StuartMail says:

March 28, 2008 at 9:35 am

Steve
Just to say thank you for the explaination of adding leading + to input strings to stop parseInt from treating numbers (parsed in as strings) with leading zeros as Octal – solved a tricky problem for me with just one character added to my code 🙂
Stuart Mail
nILS says:

October 10, 2008 at 11:19 pm

Thanks you guys, I was looking for just that!
Pingback: How to convert Roman Numerals and Arabic numbers in ActionScript 3 | SWF It Good
John Vivenzio says:

May 1, 2009 at 11:50 pm

Ok – now I’m hooked on these teasers! : )

Quick variation on the 2 line makeChange algorithm to encode:

(does multi million conversions – though I can see how they’d read that fast enough to be of any use – can you picture the bailout being accounted for in roman numerals? How much graft do you think they’d be able to get away with using accounting sheets with:

all numbers in Ms: mdccccvMMDCCCLIX * mmmdccvMDCLIX ! : )

var n = [
// using lowercase to represent Unicode lined Roman Numerals
[1000000, “m”],
[ 500000, “d”],
[ 100000, “c”],
[ 90000, “cm”],
[ 50000, “l”],
[ 10000, “x”],
[ 9000, “xc”],
[ 5000, “v”],
[ 4000, “iv”],
[ 1000, “M”],
[ 900, “CM”],
[ 500, “D”],
[ 100, “C”],
[ 90, “XC”],
[ 50, “L”],
[ 40, “XL”],
[ 10, “X”],
[ 9, “IX”],
[ 5, “V”],
[ 4, “IV”],
[ 1, “I”],
[ 0]
];

var output = [];
var i = 0, x = 0;

function makeRoman(money) {
for (x = Math.floor(money / n[i][0]); x; x–)
output.push(n[i][1]);
return isNaN(n[i + 1]) ? makeRoman(money % n[i++][0]) : output;
}

makeRoman(17996).join(“”);
John Vivenzio says:

May 2, 2009 at 12:19 am

… and using the same table here’s the recursive decode:

var n = [
// using lowercase to represent Unicode lined Roman Numerals
[1000000, “m”],
[ 500000, “d”],
[ 100000, “c”],
[ 90000, “cm”],
[ 50000, “l”],
[ 10000, “x”],
[ 9000, “xc”],
[ 5000, “v”],
[ 4000, “iv”],
[ 1000, “M”],
[ 900, “CM”],
[ 500, “D”],
[ 100, “C”],
[ 90, “XC”],
[ 50, “L”],
[ 40, “XL”],
[ 10, “X”],
[ 9, “IX”],
[ 5, “V”],
[ 4, “IV”],
[ 1, “I”],
[ 0]
];

var output = [];
var i = 0, x = 0;
var arabic = 0;

function makeArabic(numbers) {
for (i = 0; numbers.length && n[i][0]; i++)
if (numbers.substring(0,n[i][1].length) == n[i][1]) {
arabic += n[i][0];
return makeArabic(numbers.substring(n[i][1].length));
}
return arabic;
}

makeArabic(“MCMXLIX”);
Yetty says:

August 4, 2010 at 6:38 am

Adding this functionality for all Numbers.

Number.prototype.romanize = function()
{
var digits = String(this).split(“”),
key = [“”,”C”,”CC”,”CCC”,”CD”,”D”,”DC”,”DCC”,”DCCC”,”CM”,
“”,”X”,”XX”,”XXX”,”XL”,”L”,”LX”,”LXX”,”LXXX”,”XC”,
“”,”I”,”II”,”III”,”IV”,”V”,”VI”,”VII”,”VIII”,”IX”],
roman = “”,
i = 3;
while (i–)
roman = (key[+digits.pop() + (i * 10)] || “”) + roman;
return Array(+digits.join(“”) + 1).join(“M”) + roman;
}
Alex Brollo says:

January 19, 2012 at 2:11 am

I was just searching for a romanize/deromanize script, to be used into http://it.wikisource.org (just to avoid to rediscover the wheel…).

Can I use you base code in the top of the page? Obviously I’ll link author & source. I see that page is covered by a copyright, so asking your permission is needed.

Thanks for your attention!

Alex
Steven Levithan says:

January 19, 2012 at 1:37 pm

@Alex Brollo, yes, that’s fine. Any of my own code on this blog is released under the MIT License, if not explicitly stated otherwise.
william malo says:

March 23, 2012 at 2:57 pm

I got a shorter code here.

romanize = function(a,b,c,d,e){for(b=c=”,d=5;a;c++,d^=7)for(e=a%d,a=a/d^0;e–;)b=’ IVXLCDM'[(e>2?c+a-(a&=~1)+(e=1):c)+1]+b;return b}

document.write(romanize(2524))
pl.c.l.j says:

May 16, 2012 at 5:22 am

this is still “polish algorithm” 😉
intrepidis says:

June 19, 2012 at 5:11 pm

Thanks for all this, guys. It was a joy to read, plus exactly what I was looking for. I’m writing a game (just a home project) and wanted Roman numerals in it… and couldn’t be bothered to write a function to do this. Cheers.
Pingback: Romanize integers and deromanize strings — ??????? ????? ? ???????? ? ????????. |

Functional style, almost even readable… ? ; )

function convert(num) {
  var rom = { M: 1000, CM: 900, D: 500, CD: 400,
              C:  100, XC:  90, L:  50, XL:  40,
              X:   10, IX:   9, V:   5, IV:   4,
              I:    1 };
  
  return Object.keys(rom).reduce(function (acc, ch) {
    acc.str += ch.repeat(acc.num / rom[ch]);
    acc.num %= rom[ch];
    return acc;
  }, { str: '', num: num }).str;
  
}

console.log(convert(3999)); // returns MMMCMXCIX

Pingback: Bonfire 21: Roman Numeral Converter | Coded Malarkey
Pingback: Bonfire: Roman Numeral Converter | piratadelvhs
McDecoil Philips says:

January 30, 2016 at 6:42 am

Please dear author could you kindly work me through your code, am a bit lost as to the logic from the while loop.
Thanks
I look forward to hearing from you.
Pingback: ??????? ????? ? ??????? ??????? ????????? - javascript ???????? ?????????-???? - ??????? ? ?????? ?? ????????????????
Pingback: javascript - Convertir un número en un Número Romano en javaScript
Pingback: javascript - La conversione di un numero in un Numero Romano in javaScript
Pingback: Display date in roman numerals - Tutorial Guruji
Pingback: Convert a number into a Roman Numeral in javaScript - PhotoLens
Pingback: Convert a number into a Roman numeral in JavaScript – w3toppers.com
Pingback: Convert a number into a Roman numeral in JavaScript [SOLVED] – Wanted Solution
Pingback: Convert a number into a Roman numeral in JavaScript
Roman says:

December 6, 2022 at 4:34 am

I think your code is pretty and clean. Making it more shorter will only affect the readability.

As you know, the roman numerals are limited to 3999 but I have seen people going above that, do you think that approach is correct?
Pingback: [Regex] How do you match only valid roman numerals with a regular expression? - Pixorix

41 thoughts on “JavaScript Roman Numeral Converter”

Leave a Reply