Character sets misrepresented
Moderator: SourceGear
Character sets misrepresented
We have some non-standard characters in code (i.e. "ç"), and we've noticed they are represented by two characters when we diff by using the Vault client.
Here's an example:
Now, we are getting the right characters when getting the latest version of the files. Is there any potential problem with this?
Here's an example:
Now, we are getting the right characters when getting the latest version of the files. Is there any potential problem with this?
What version of Vault are you using?
I think what may take care of your issue is the new DiffMerge tool we have available separately: http://www.sourcegear.com/diffmerge/downloads.html.
Then in your Tools - Options - DiffMerge, you only need to change the path to point at the new tool.
I think what may take care of your issue is the new DiffMerge tool we have available separately: http://www.sourcegear.com/diffmerge/downloads.html.
Then in your Tools - Options - DiffMerge, you only need to change the path to point at the new tool.
3.5.1 (4786).Beth wrote:What version of Vault are you using?
I've tried the new DiffMerge tool and it shows the same two characters for the "ç" character.Beth wrote:I think what may take care of your issue is the new DiffMerge tool we have available separately: http://www.sourcegear.com/diffmerge/downloads.html.
Then in your Tools - Options - DiffMerge, you only need to change the path to point at the new tool.
However, we are not having any issues really. We only have concerns this could be a Vault problem handling different character sets, even though we are getting the right characters when we get the files from the Vault repositories.
My question is, since we are getting back the same characters we are storing but in the Vault programs they are showed different, could this be a character representation bug in the Vault programs? Are we safe?
-
- Posts: 534
- Joined: Tue Jun 05, 2007 11:37 am
- Location: SourceGear
- Contact:
this is a UTF8 issue
it looks like DiffMerge loaded the file assuming a latin-n character
encoding when it appears that the file is utf-8. the 2 characters
displayed in the screenshot are a UTF8 multi-byte sequence for an
accented character.
using the new version of DiffMerge, check the Ruleset (via the
Options dialog) used for files with that suffix and change it to
UTF8 (it's probably currently set to ISO-Latin-1). you may also
want to change the default ruleset as well.
then exit and restart DiffMerge and see if the file appears correctly.
to answer your other questions, vault does not need to look at the
character encoding to send/receive/store files. it does not change
files when sending/receiving/storing, so your files are safe.
this problem is limited to DiffMerge and how it interpreted the files
as it read them for display. in this case, the file was in a different
encoding which caused the conversion to unicode for display to be
wrong.
i hope i've answered all your questions and sorry for the confusion.
jeff
encoding when it appears that the file is utf-8. the 2 characters
displayed in the screenshot are a UTF8 multi-byte sequence for an
accented character.
using the new version of DiffMerge, check the Ruleset (via the
Options dialog) used for files with that suffix and change it to
UTF8 (it's probably currently set to ISO-Latin-1). you may also
want to change the default ruleset as well.
then exit and restart DiffMerge and see if the file appears correctly.
to answer your other questions, vault does not need to look at the
character encoding to send/receive/store files. it does not change
files when sending/receiving/storing, so your files are safe.
this problem is limited to DiffMerge and how it interpreted the files
as it read them for display. in this case, the file was in a different
encoding which caused the conversion to unicode for display to be
wrong.
i hope i've answered all your questions and sorry for the confusion.
jeff
-
- Posts: 534
- Joined: Tue Jun 05, 2007 11:37 am
- Location: SourceGear
- Contact:
thanks for the suggestion
thanks for the suggestion.
i'll log an a feature request to add an xml ruleset to the initial
set of rulesets.
j
i'll log an a feature request to add an xml ruleset to the initial
set of rulesets.
j