Discussion:
Using UTF8 in repository name?
Doug Robinson
2017-04-10 12:44:11 UTC
Permalink
Folks:

I understand that UTF8 is the standard inside of Subversion repositories.
My question is whether having the repository name itself be UTF8? For
instance zh_CN.utf8 ? I googled around a bit and don't see an answer.

If it is supported, is there some Apache setting that is required?

Thanks in advance.

Doug
--
*DOUGLAS B ROBINSON* SENIOR PRODUCT MANAGER

T +1 925 396 1125
*E* ***@wandisco.com
--
World Leader in Active Data Replication™
*Find out more wandisco.com <http://wandisco.com/>*

THIS MESSAGE AND ANY ATTACHMENTS ARE CONFIDENTIAL, PROPRIETARY AND MAY BE
PRIVILEGED

If this message was misdirected, WANdisco, Inc. and its subsidiaries,
("WANdisco") does not waive any confidentiality or privilege. If you are
not the intended recipient, please notify us immediately and destroy the
message without disclosing its contents to anyone. Any distribution, use or
copying of this email or the information it contains by other than an
intended recipient is unauthorized. The views and opinions expressed in
this email message are the author's own and may not reflect the views and
opinions of WANdisco, unless the author is authorized by WANdisco to
express such views or opinions on its behalf. All email sent to or from
this address is subject to electronic storage and review by WANdisco.
Although WANdisco operates anti-virus programs, it does not accept
responsibility for any damage whatsoever caused by viruses being passed.
Branko Čibej
2017-04-10 12:53:59 UTC
Permalink
Post by Doug Robinson
I understand that UTF8 is the standard inside of Subversion repositories.
My question is whether having the repository name itself be UTF8? For
instance zh_CN.utf8 ? I googled around a bit and don't see an answer.
If it is supported, is there some Apache setting that is required?
Thanks in advance.
As long as the underlying filesystem supports it, there should be no
problems.

-- Brane
Doug Robinson
2017-04-10 20:17:18 UTC
Permalink
Brane:

Thank you for replying. Based on this post:

https://www.svnforum.org/forum/opensource-subversion-forums/general-setup-and-troubleshooting/79832-why-repository-name-should-not-be-utf-8

I am wondering if the SVN client is properly URL-encoding the repository
name?

Based on:

https://en.wikipedia.org/wiki/Percent-encoding


I'm beginning to wonder if it is even possible (sort of similar to what
Mac/OS does to UTF8 text...)?

Thank you.

Doug
Post by Branko Čibej
Post by Doug Robinson
I understand that UTF8 is the standard inside of Subversion repositories.
My question is whether having the repository name itself be UTF8? For
instance zh_CN.utf8 ? I googled around a bit and don't see an answer.
If it is supported, is there some Apache setting that is required?
Thanks in advance.
As long as the underlying filesystem supports it, there should be no
problems.
-- Brane
--
*DOUGLAS B ROBINSON* SENIOR PRODUCT MANAGER

T +1 925 396 1125
*E* ***@wandisco.com
--
World Leader in Active Data Replication™
*Find out more wandisco.com <http://wandisco.com/>*

THIS MESSAGE AND ANY ATTACHMENTS ARE CONFIDENTIAL, PROPRIETARY AND MAY BE
PRIVILEGED

If this message was misdirected, WANdisco, Inc. and its subsidiaries,
("WANdisco") does not waive any confidentiality or privilege. If you are
not the intended recipient, please notify us immediately and destroy the
message without disclosing its contents to anyone. Any distribution, use or
copying of this email or the information it contains by other than an
intended recipient is unauthorized. The views and opinions expressed in
this email message are the author's own and may not reflect the views and
opinions of WANdisco, unless the author is authorized by WANdisco to
express such views or opinions on its behalf. All email sent to or from
this address is subject to electronic storage and review by WANdisco.
Although WANdisco operates anti-virus programs, it does not accept
responsibility for any damage whatsoever caused by viruses being passed.
Bert Huijben
2017-04-11 09:04:22 UTC
Permalink
This message on the forum is 100% about the server side configuration. The client has no known problems encoding paths in a url, but the url specification itself doesn’t document an explicit encoding and as such Apache Httpd has to do the translation of the first part of the URL to the local path. When this fails you see errors as in this forum post.



The part after the repository location is 100% handled inside the Subversion code and is always UTF-8 safe as it has no relation to real on-disk paths.



On platforms like MAC/OS or Windows the server handles UTF-8 paths directly, but on unix based platforms like Linux where paths are at the core just formed by a bunch of bytes without a special meaning, you have to use a proper configuration. If the configuration mismatches there is no way to convert from the path form to UTF-8 and the other way around.



Bert



From: Doug Robinson [mailto:***@wandisco.com]
Sent: maandag 10 april 2017 22:17
To: Branko Čibej <***@apache.org>
Cc: ***@subversion.apache.org
Subject: Re: Using UTF8 in repository name?



Brane:



Thank you for replying. Based on this post:



https://www.svnforum.org/forum/opensource-subversion-forums/general-setup-and-troubleshooting/79832-why-repository-name-should-not-be-utf-8



I am wondering if the SVN client is properly URL-encoding the repository name?



Based on:



https://en.wikipedia.org/wiki/Percent-encoding



I'm beginning to wonder if it is even possible (sort of similar to what Mac/OS does to UTF8 text...)?



Thank you.



Doug
Post by Doug Robinson
I understand that UTF8 is the standard inside of Subversion repositories.
My question is whether having the repository name itself be UTF8? For
instance zh_CN.utf8 ? I googled around a bit and don't see an answer.
If it is supported, is there some Apache setting that is required?
Thanks in advance.
As long as the underlying filesystem supports it, there should be no
problems.

-- Brane
--
DOUGLAS B ROBINSON SENIOR PRODUCT MANAGER



T +1 925 396 1125

E <mailto:***@wandisco.com> ***@wandisco.com



<Loading Image...>

World Leader in Active Data Replication™

Find out more <http://wandisco.com/> wandisco.com

THIS MESSAGE AND ANY ATTACHMENTS ARE CONFIDENTIAL, PROPRIETARY AND MAY BE PRIVILEGED

If this message was misdirected, WANdisco, Inc. and its subsidiaries, ("WANdisco") does not waive any confidentiality or privilege. If you are not the intended recipient, please notify us immediately and destroy the message without disclosing its contents to anyone. Any distribution, use or copying of this email or the information it contains by other than an intended recipient is unauthorized. The views and opinions expressed in this email message are the author's own and may not reflect the views and opinions of WANdisco, unless the author is authorized by WANdisco to express such views or opinions on its behalf. All email sent to or from this address is subject to electronic storage and review by WANdisco. Although WANdisco operates anti-virus programs, it does not accept responsibility for any damage whatsoever caused by viruses being passed.
Doug Robinson
2017-04-11 12:22:20 UTC
Permalink
Bert / Brane:

Thank you. Accurate as always!

The original poster just shared that setting the following in the Apache
config file did the trick:

SVNUseUTF8 On


So all set. Excellent.

Thank you.

Doug
Post by Bert Huijben
This message on the forum is 100% about the server side configuration. The
client has no known problems encoding paths in a url, but the url
specification itself doesn’t document an explicit encoding and as such
Apache Httpd has to do the translation of the first part of the URL to the
local path. When this fails you see errors as in this forum post.
The part after the repository location is 100% handled inside the
Subversion code and is always UTF-8 safe as it has no relation to real
on-disk paths.
On platforms like MAC/OS or Windows the server handles UTF-8 paths
directly, but on unix based platforms like Linux where paths are at the
core just formed by a bunch of bytes without a special meaning, you have to
use a proper configuration. If the configuration mismatches there is no way
to convert from the path form to UTF-8 and the other way around.
Bert
*Sent:* maandag 10 april 2017 22:17
*Subject:* Re: Using UTF8 in repository name?
https://www.svnforum.org/forum/opensource-subversion-
forums/general-setup-and-troubleshooting/79832-why-
repository-name-should-not-be-utf-8
I am wondering if the SVN client is properly URL-encoding the repository name?
https://en.wikipedia.org/wiki/Percent-encoding
I'm beginning to wonder if it is even possible (sort of similar to what
Mac/OS does to UTF8 text...)?
Thank you.
Doug
Post by Doug Robinson
I understand that UTF8 is the standard inside of Subversion repositories.
My question is whether having the repository name itself be UTF8? For
instance zh_CN.utf8 ? I googled around a bit and don't see an answer.
If it is supported, is there some Apache setting that is required?
Thanks in advance.
As long as the underlying filesystem supports it, there should be no
problems.
-- Brane
--
*DOUGLAS B ROBINSON* SENIOR PRODUCT MANAGER
*T* +1 925 396 1125 <(925)%20396-1125>
World Leader in Active Data Replication™
Find out more *wandisco.com <http://wandisco.com/>*
THIS MESSAGE AND ANY ATTACHMENTS ARE CONFIDENTIAL, PROPRIETARY AND MAY BE PRIVILEGED
If this message was misdirected, WANdisco, Inc. and its subsidiaries,
("WANdisco") does not waive any confidentiality or privilege. If you are
not the intended recipient, please notify us immediately and destroy the
message without disclosing its contents to anyone. Any distribution, use or
copying of this email or the information it contains by other than an
intended recipient is unauthorized. The views and opinions expressed in
this email message are the author's own and may not reflect the views and
opinions of WANdisco, unless the author is authorized by WANdisco to
express such views or opinions on its behalf. All email sent to or from
this address is subject to electronic storage and review by WANdisco.
Although WANdisco operates anti-virus programs, it does not accept
responsibility for any damage whatsoever caused by viruses being passed.
--
*DOUGLAS B ROBINSON* SENIOR PRODUCT MANAGER

T +1 925 396 1125
*E* ***@wandisco.com
--
World Leader in Active Data Replication™
*Find out more wandisco.com <http://wandisco.com/>*

THIS MESSAGE AND ANY ATTACHMENTS ARE CONFIDENTIAL, PROPRIETARY AND MAY BE
PRIVILEGED

If this message was misdirected, WANdisco, Inc. and its subsidiaries,
("WANdisco") does not waive any confidentiality or privilege. If you are
not the intended recipient, please notify us immediately and destroy the
message without disclosing its contents to anyone. Any distribution, use or
copying of this email or the information it contains by other than an
intended recipient is unauthorized. The views and opinions expressed in
this email message are the author's own and may not reflect the views and
opinions of WANdisco, unless the author is authorized by WANdisco to
express such views or opinions on its behalf. All email sent to or from
this address is subject to electronic storage and review by WANdisco.
Although WANdisco operates anti-virus programs, it does not accept
responsibility for any damage whatsoever caused by viruses being passed.
Loading...