So I've converted all of the strings in my csv to UTF-8 but Monet was still complaining

!value '\357\322\226\257\240bO\271\23340\344j\231I06Wdn\277fx\334-\320\366\323' from line 43255 field 12 not inserted, expecting type clob

I think these are valid UTF-8 characters ïҖ¯ bO¹›40äj™I06Wdn¿fxÜ-ÐöÓ. No?


On Thu, Jul 23, 2015 at 12:41 PM, Stefan Manegold <Stefan.Manegold@cwi.nl> wrote:

Hi,

internally, MonetDB only handles UTF-8 encoded (textual) data.

Thus, as documented [1],
for bulk data load from a file, MonetDB requires the file to use UTF-8 encoding,
while for bulk data load via STDIN, MonetDB requires the data to use the same
encoding as mclient (see also [2]).

If your data is not in the required encoding, you'd first need to convert it
before loading it into MonetDB.

Best,
Stefan

[1] https://www.monetdb.org/Documentation/Manuals/SQLreference/CopyInto
[2] https://www.monetdb.org/Documentation/mclient-man-page



----- On Jul 23, 2015, at 6:42 PM, Tri Vuong trivektor@gmail.com wrote:

> Hi,
>
>
>
> I'm using "COPY INTO" to insert some data into MonetDB. The csv that I'm copying
> from has some weird characters like \367\251\240 in some string fields and it
> broke the insertion. The error that I got is:
>
> !value from line x field y not inserted, expecting type clob
>
> Does anyone know how to get around this problem? The version I'm using is
> MonetDB Database Server Toolkit v1.1 (Oct2014-SP4). Thanks.
>
> --
> Tri Vuong
>
> _______________________________________________
> users-list mailing list
> users-list@monetdb.org
> https://www.monetdb.org/mailman/listinfo/users-list

--
| Stefan.Manegold@CWI.nl | DB Architectures   (DA) |
| www.CWI.nl/~manegold/  | Science Park 123 (L321) |
| +31 (0)20 592-4212     | 1098 XG Amsterdam  (NL) |
_______________________________________________
users-list mailing list
users-list@monetdb.org
https://www.monetdb.org/mailman/listinfo/users-list



--
Tri Vuong