diff options
author | Max Bolingbroke <batterseapower@hotmail.com> | 2011-05-14 22:50:46 +0100 |
---|---|---|
committer | Max Bolingbroke <batterseapower@hotmail.com> | 2011-05-14 22:50:46 +0100 |
commit | dc58b7398910a433259a6c0f58a0d05a48555191 (patch) | |
tree | a01062281a0cf1dd42329110ff0d0326be407f2b /libraries/base/GHC/Environment.hs | |
parent | cdbce1218d9f9fb4152bdabffe8bbdee09f5ce60 (diff) | |
download | haskell-dc58b7398910a433259a6c0f58a0d05a48555191.tar.gz |
Big patch to improve Unicode support in GHC. Validated on OS X and Windows, this
patch series fixes #5061, #1414, #3309, #3308, #3307, #4006 and #4855.
The major changes are:
1) Make Foreign.C.String.*CString use the locale encoding
This change follows the FFI specification in Haskell 98, which
has never actually been implemented before.
The functions exported from Foreign.C.String are partially-applied
versions of those from GHC.Foreign, which allows the user to supply
their own TextEncoding.
We also introduce foreignEncoding as the name of the text encoding
that follows the FFI appendix in that it transliterates encoding
errors.
2) I also changed the code so that mkTextEncoding always tries the
native-Haskell decoders in preference to those from iconv, even on
non-Windows. The motivation here is simply that it is better for
compatibility if we do this, and those are the ones you get for
the utf* and latin1* predefined TextEncodings anyway.
3) Implement surrogate-byte error handling mode for TextEncoding
This implements PEP383-like behaviour so that we are able to
roundtrip byte strings through Strings without loss of information.
The withFilePath function now uses this encoding to get to/from CStrings,
so any code that uses that will get the right PEP383 behaviour automatically.
4) Implement three other coding failure modes: ignore, throw error, transliterate
These mimic the behaviour of the GNU Iconv extensions.
Diffstat (limited to 'libraries/base/GHC/Environment.hs')
-rw-r--r-- | libraries/base/GHC/Environment.hs | 36 |
1 files changed, 33 insertions, 3 deletions
diff --git a/libraries/base/GHC/Environment.hs b/libraries/base/GHC/Environment.hs index 60325b3611..73f85ed52d 100644 --- a/libraries/base/GHC/Environment.hs +++ b/libraries/base/GHC/Environment.hs @@ -1,12 +1,42 @@ -{-# LANGUAGE ForeignFunctionInterface #-} +{-# LANGUAGE CPP, ForeignFunctionInterface #-} module GHC.Environment (getFullArgs) where import Prelude import Foreign import Foreign.C + +#ifdef mingw32_HOST_OS +import GHC.IO (finally) +import GHC.Windows + +-- Ignore the arguments to hs_init on Windows for the sake of Unicode compat +getFullArgs :: IO [String] +getFullArgs = do + p_arg_string <- c_GetCommandLine + alloca $ \p_argc -> do + p_argv <- c_CommandLineToArgv p_arg_string p_argc + if p_argv == nullPtr + then throwGetLastError "getFullArgs" + else flip finally (c_LocalFree p_argv) $ do + argc <- peek p_argc + p_argvs <- peekArray (fromIntegral argc) p_argv + mapM peekCWString p_argvs + +foreign import stdcall unsafe "windows.h GetCommandLineW" + c_GetCommandLine :: IO (Ptr CWString) + +foreign import stdcall unsafe "windows.h CommandLineToArgvW" + c_CommandLineToArgv :: Ptr CWString -> Ptr CInt -> IO (Ptr CWString) + +foreign import stdcall unsafe "Windows.h LocalFree" + c_LocalFree :: Ptr a -> IO (Ptr a) +#else import Control.Monad +import GHC.IO.Encoding +import qualified GHC.Foreign as GHC + getFullArgs :: IO [String] getFullArgs = alloca $ \ p_argc -> @@ -14,8 +44,8 @@ getFullArgs = getFullProgArgv p_argc p_argv p <- fromIntegral `liftM` peek p_argc argv <- peek p_argv - peekArray (p - 1) (advancePtr argv 1) >>= mapM peekCString + peekArray (p - 1) (advancePtr argv 1) >>= mapM (GHC.peekCString fileSystemEncoding) foreign import ccall unsafe "getFullProgArgv" getFullProgArgv :: Ptr CInt -> Ptr (Ptr CString) -> IO () - +#endif
\ No newline at end of file |