Bo Berglund
2018-01-18 16:38:04 UTC
A few weeks ago I migrated our CVS repositories to SVN using the
latest version of cvs2svn. The CVS server was on a Windows 2003 server
and the new VisualSVN server on a Windows 2016 Server. I did the
cvs2svn conversion on a Ubuntu server to which I had copied the CVS
repositories.
Then I imported the dump files on Windows 2016 via the VisualSVN GUI
commands.
It all looked like it was a success except for the handling of file
properties. Files marked binary in CVS are not marked as such in SVN
so I worry that they may become corrupted in the future. But it seems
like it does work OK.
Today I discovered a different issue that is more serious in nature:
In some projects there have been non-US characters used for file and
folder names like "Bygglovsansökan", "HÅL I TAKKASSETT" etc.
When I check out these projects from SVN the Swedish characters in the
names are now replaced by a series of high characters (hex view):
Å = C3 90 C2 9F
Ä = C3 90 C2 9E
Ö = -- did not find this character in svn yet --
å = C3 90 C2 96
ä = C3 90 C2 94
ö = C3 90 C2 A4
I don't know from where this problem originates, either it is a flaw
in the cvs2svn script, the configuration of the conversion or in the
format of the generated dump files.
Otherwise it may be a problem when importing the dump files into the
VisualSVN server....
What could I do to fix this?
(And please note that the new repository is in use so there are a
number of commits done since the migration...)
latest version of cvs2svn. The CVS server was on a Windows 2003 server
and the new VisualSVN server on a Windows 2016 Server. I did the
cvs2svn conversion on a Ubuntu server to which I had copied the CVS
repositories.
Then I imported the dump files on Windows 2016 via the VisualSVN GUI
commands.
It all looked like it was a success except for the handling of file
properties. Files marked binary in CVS are not marked as such in SVN
so I worry that they may become corrupted in the future. But it seems
like it does work OK.
Today I discovered a different issue that is more serious in nature:
In some projects there have been non-US characters used for file and
folder names like "Bygglovsansökan", "HÅL I TAKKASSETT" etc.
When I check out these projects from SVN the Swedish characters in the
names are now replaced by a series of high characters (hex view):
Å = C3 90 C2 9F
Ä = C3 90 C2 9E
Ö = -- did not find this character in svn yet --
å = C3 90 C2 96
ä = C3 90 C2 94
ö = C3 90 C2 A4
I don't know from where this problem originates, either it is a flaw
in the cvs2svn script, the configuration of the conversion or in the
format of the generated dump files.
Otherwise it may be a problem when importing the dump files into the
VisualSVN server....
What could I do to fix this?
(And please note that the new repository is in use so there are a
number of commits done since the migration...)
--
Bo Berglund
Developer in Sweden
Bo Berglund
Developer in Sweden