Emscripten provides numerous methods to connect and interact between JavaScript and compiled C or C++:
Call compiled C functions from normal JavaScript:
Using direct function calls (faster but more complicated).
Call compiled C++ classes from JavaScript using bindings created with:
Call JavaScript functions from C/C++:
This article explains each of the methods listed above, and provides links to more detailed information.
备注
For information on how compiled code interacts with the browser environment, see Emscripten运行时环境. For file system related manners, see the 文件系统概述.
备注
Before you can call your code, the runtime environment may need to load a memory initialization file, preload files, or do other asynchronous operations depending on optimization and build settings. See How can I tell when the page is fully loaded and it is safe to call compiled functions? in the FAQ.
The easiest way to call compiled C functions from JavaScript is to use
ccall()
or cwrap()
.
ccall()
calls a compiled C function with specified parameters
and returns the result, while cwrap()
“wraps” a compiled C
function and returns a JavaScript function you can call normally.
cwrap()
is therefore more useful if you plan to call a compiled
function a number of times.
Consider the test/hello_function.cpp file shown below. The
int_sqrt()
function to be compiled is wrapped in extern "C"
to prevent C++ name mangling.
// Copyright 2012 The Emscripten Authors. All rights reserved.
// Emscripten is available under two separate licenses, the MIT license and the
// University of Illinois/NCSA Open Source License. Both these licenses can be
// found in the LICENSE file.
#include <math.h>
extern "C" {
int int_sqrt(int x) {
return sqrt(x);
}
}
To compile this code run the following command in the Emscripten home directory:
emcc test/hello_function.cpp -o function.html -sEXPORTED_FUNCTIONS=_int_sqrt -sEXPORTED_RUNTIME_METHODS=ccall,cwrap
EXPORTED_FUNCTIONS
tells the compiler what we want to be accessible from the
compiled code (everything else might be removed if it is not used), and
EXPORTED_RUNTIME_METHODS
tells the compiler that we want to use the runtime
functions ccall
and cwrap
(otherwise, it will not include them).
备注
EXPORTED_FUNCTIONS affects compilation to JavaScript. If you first compile to an object file, then compile the object to JavaScript, you need that option on the second command. If you do it all together as in the example here (source straight to JavaScript) then this just works, of course.
After compiling, you can call this function with cwrap()
using the
following JavaScript:
int_sqrt = Module.cwrap('int_sqrt', 'number', ['number'])
int_sqrt(12)
int_sqrt(28)
The first parameter is the name of the function to be wrapped, the second is
the return type of the function (or a JavaScript null value if there isn’t one), and the third is an array of parameter
types (which may be omitted if there are no parameters). The types are
“number” (for a JavaScript number corresponding to a C integer, float, or general
pointer), “string” (for a JavaScript string that corresponds to a C char*
that represents a string) or
“array” (for a JavaScript array or typed array that corresponds to a C array; for typed arrays, it must be a Uint8Array or Int8Array).
You can run this yourself by first opening the generated page
function.html in a web browser (nothing will happen on page
load because there is no main()
). Open a JavaScript environment
(Control-Shift-K on Firefox, Control-Shift-J on Chrome),
and enter the above commands as three separate commands, pressing
Enter after each one. You should get the results 3
and
5
— the expected output for these inputs using C++ integer
mathematics.
ccall()
is similar, but receives another parameter with the
parameters to pass to the function:
// Call C from JavaScript
var result = Module.ccall('int_sqrt', // name of C function
'number', // return type
['number'], // argument types
[28]); // arguments
// result is 5
备注
This example illustrates a few other points, which you should remember
when using ccall()
or cwrap()
:
These methods can be used with compiled C functions — name-mangled C++ functions won’t work.
We highly recommended that you export functions that are to be called from JavaScript:
Exporting is done at compile time. For example:
-sEXPORTED_FUNCTIONS=_main,_other_function
exports
main()
and other_function()
.
Note that you need _
at the
beginning of the function names in the EXPORTED_FUNCTIONS
list.
Note that _main
is mentioned in that list. If you don’t have it there,
the compiler will eliminate it as dead code. The list of exported
functions is the entire list that will be kept alive (unless other
code was kept alive in another manner).
Emscripten does dead code elimination to minimize code size — exporting ensures the functions you need aren’t removed.
At higher optimisation levels (-O2
and above), code is minified,
including function names.
Exporting functions allows you to continue to access them using the
original name through the global Module
object.
If you want to export a JS library function (something from a
src/library*.js
file, for example), then in addition to
EXPORTED_FUNCTIONS
, you need to add it to DEFAULT_LIBRARY_FUNCS_TO_INCLUDE
,
as the latter will force the method to actually be included in
the build.
The compiler will remove code it does not see is used, to improve code
size. If you use ccall
in a place it sees, like code in a --pre-js
or --post-js
, it will just work. If you use it in a place the compiler
didn’t see, like another script tag on the HTML or in the JS console like
we did in this tutorial, then because of optimizations
and minification you should export ccall from the runtime, using
EXPORTED_RUNTIME_METHODS
, for example using
-sEXPORTED_RUNTIME_METHODS=ccall,cwrap
,
and call it on Module
(which contains
everything exported, in a safe way that is not influenced by minification
or optimizations).
Say you have a C library that exposes some procedures:
//api_example.c
#include <stdio.h>
#include <emscripten.h>
EMSCRIPTEN_KEEPALIVE
void sayHi() {
printf("Hi!\n");
}
EMSCRIPTEN_KEEPALIVE
int daysInWeek() {
return 7;
}
Compile the library with emcc:
emcc api_example.c -o api_example.js -sMODULARIZE -sEXPORTED_RUNTIME_METHODS=ccall
Require the library and call its procedures from node:
var factory = require('./api_example.js');
factory().then((instance) => {
instance._sayHi(); // direct calling works
instance.ccall("sayHi"); // using ccall etc. also work
console.log(instance._daysInWeek()); // values can be returned, etc.
});
The MODULARIZE
option makes emcc
emit code in a modular format that is
easy to import and use with require()
: require()
of the module returns
a factory function that can instantiate the compiled code, returning a
Promise
to tell us when it is ready, and giving us the instance of the
module as a parameter.
(Note that we use ccall
here, so we need to add it to the exported runtime
methods, as before.)
Functions in the original source become JavaScript functions, so you can
call them directly if you do type translations yourself — this will be
faster than using ccall()
or cwrap()
, but a little
more complicated.
To call the method directly, you will need to use the full name as it
appears in the generated code. This will be the same as the original C
function, but with a leading _
.
备注
If you use ccall()
or cwrap()
, you do not need
to prefix function calls with _
– just use the C name.
The parameters you pass to and receive from functions need to be primitive values:
Integer and floating point numbers can be passed as-is.
Pointers can be passed as-is also, as they are simply integers in the generated code.
JavaScript string
someString
can be converted to achar *
usingptr = stringToNewUTF8(someString)
.备注
The conversion to a pointer allocates memory, which needs to be freed up via a call to
free(ptr)
afterwards (_free
in JavaScript side) -
char *
received from C/C++ can be converted to a JavaScript string usingUTF8ToString()
.There are other convenience functions for converting strings and encodings in preamble.js.
Other values can be passed via
emscripten::val
. Check out examples on as_handle and take_ownership methods.
Emscripten provides two main approaches for calling JavaScript from C/C++:
running the script using emscripten_run_script()
or writing
“inline JavaScript”.
The most direct, but slightly slower, way is to use
emscripten_run_script()
. This effectively runs the specified
JavaScript code from C/C++ using eval()
. For example, to call the
browser’s alert()
function with the text ‘hi’, you would call the
following JavaScript:
emscripten_run_script("alert('hi')");
备注
The function alert
is present in browsers, but not in node
or other JavaScript shells. A more generic alternative is to call
console.log.
A faster way to call JavaScript from C is to write “inline JavaScript”,
using EM_JS()
or EM_ASM()
(and related macros).
EM_JS is used to declare JavaScript functions from inside a C file. The “alert” example might be written using EM_JS like:
#include <emscripten.h>
EM_JS(void, call_alert, (), {
alert('hello world!');
throw 'all done';
});
int main() {
call_alert();
return 0;
}
EM_JS’s implementation is essentially a shorthand for implementing a JavaScript library.
EM_ASM is used in a similar manner to inline assembly code. The “alert” example might be written with inline JavaScript as:
#include <emscripten.h>
int main() {
EM_ASM(
alert('hello world!');
throw 'all done';
);
return 0;
}
When compiled and run, Emscripten will execute the two lines of JavaScript as if they appeared directly in the generated code. The result would be an alert, followed by an exception. (Note, however, that under the hood Emscripten still does a function call even in this case, which has some amount of overhead.)
You can also send values from C into JavaScript inside EM_ASM
,
for example:
EM_ASM({
console.log('I received: ' + $0);
}, 100);
This will show I received: 100
.
You can also receive values back, for example the following will print out I
received: 100
and then 101
:
int x = EM_ASM_INT({
console.log('I received: ' + $0);
return $0 + 1;
}, 100);
printf("%d\n", x);
See the emscripten.h docs
for more details.
备注
You need to specify if the return value is an int
, double
or pointer type using the appropriate macro EM_ASM_INT
,
EM_ASM_DOUBLE
or EM_ASM_PTR
.
(EM_ASM_PTR
is the same as EM_ASM_INT
unless
MEMORY64
is used, so is mostly needed in code that wants to be
compatible with MEMORY64
).
The input values appear as $0
, $1
, etc.
return
is used to provide the value sent from JavaScript back to C.
See how {
and }
are used here to enclose the code. This is
necessary to differentiate the code from the arguments passed later,
which are the input values (this is how C macros work).
When using the EM_ASM
macro, ensure that you only use
single quotes(‘). Double quotes(”) will cause a syntax error that
is not detected by the compiler and is only shown when looking at
a JavaScript console while running the offending code.
clang-format may clobber Javascript constructions, such as =>
turning to = >
. To avoid this, add to your .clang-format
:
WhitespaceSensitiveMacros: ['EM_ASM', 'EM_JS', 'EM_ASM_INT', 'EM_ASM_DOUBLE', 'EM_ASM_PTR', 'MAIN_THREAD_EM_ASM', 'MAIN_THREAD_EM_ASM_INT', 'MAIN_THREAD_EM_ASM_DOUBLE', 'MAIN_THREAD_EM_ASM_DOUBLE', 'MAIN_THREAD_ASYNC_EM_ASM']
.
Or, turn clang-format off by writing // clang-format off
before the EM_ASM
section and // clang-format on
after it.
It is possible to implement a C API in JavaScript! This is the approach used in many of Emscripten’s libraries, like SDL1 and OpenGL.
You can use it to write your own APIs to call from C/C++. To do this
you define the interface, decorating with extern
to mark the methods
in the API as external symbols. You then implement the symbols in
JavaScript by simply adding their definition to library.js (by
default). When compiling the C code, the compiler looks in the JavaScript
libraries for relevant external symbols.
By default, the implementation is added to library.js (and this is
where you’ll find parts of Emscripten’s libc). You can put
the JavaScript implementation in your own library file and add it using
the emcc option --js-library
. See
test_js_libraries in test/test_other.py for a complete working
example, including the syntax you should use inside the JavaScript library
file.
As a simple example, consider the case where you have some C code like this:
extern void my_js(void);
int main() {
my_js();
return 1;
}
备注
When using C++ you should encapsulate extern void my_js();
in an extern "C" {}
block to prevent C++ name mangling:
extern "C" {
extern void my_js();
}
Then you can implement my_js
in JavaScript by simply adding the
implementation to library.js (or your own file). Like our other
examples of calling JavaScript from C, the example below just creates
a dialog box using a simple alert()
function.
my_js: function() {
alert('hi');
},
If you add it to your own file, you should write something like
addToLibrary({
my_js: function() {
alert('hi');
},
});
addToLibrary
copies the properties of the input object into
LibraryManager.library
(the global object where all JavaScript library code
lives). In this case its adds a function called my_js
onto this object.
If you’re not familiar with JavaScript, say if you’re a C/C++ programmer and just using emscripten, then the following issues probably won’t come up, but if you’re an experienced JavaScript programmer you need to be aware some common JavaScript practices can not be used in certain ways in emscripten library files.
To save space, by default, emscripten only includes library properties
referenced from C/C++. It does this by calling toString
on each
used property on the JavaScript libraries that are linked in. That means
that you can’t use a closure directly, for example, as toString
isn’t compatible with that - just like when using a string to create
a Web Worker, where you also can’t pass a closure. (Note that this
limitation is just for the values for the keys of the object
passes to addToLibrary
in the JS library, that is, the toplevel
key-value pairs are special. Interior code inside a function can
have arbitrary JS, of course).
To avoid this limitation of JS libraries, you can put code in another file using
the --pre-js
or --post-js
options, which allow arbitrary normal
JS, and it is included and optimized with the rest of the output. That is
the recommended approach for most cases. Another option is another <script>
tag.
Alternatively, if you prefer to use a JS library file, you can have a function replace itself and have it called during initialization.
addToLibrary({
// Solution for bind or referencing other functions directly
good_02__postset: '_good_02();',
good_02: function() {
_good_02 = document.querySelector.bind(document);
},
// Solution for closures
good_03__postset: '_good_03();',
good_03: function() {
var callCount = 0;
_good_03 = function() {
console.log("times called: ", ++callCount);
};
},
// Solution for curry/transform
good_05__postset: '_good_05();',
good_05: function() {
_good_05 = curry(scrollTo, 0);
},
});
A __postset is a string the compiler will emit directly to the output file. For the example above this code will be emitted.
function _good_02() {
_good_o2 = document.querySelector.bind(document);
}
function _good_03() {
var callCount = 0;
_good_03 = function() {
console.log("times called: ", ++callCount);
};
}
function _good_05() {
_good_05 = curry(scrollTo, 0);
};
// Call each function once so it will replace itself
_good_02();
_good_03();
_good_05();
You can also put most of your code in the xxx__postset
strings.
The example below each method declares a dependency on $method_support
and are otherwise dummy functions. $method_support
itself has a
corresponding __postset
property with all the code to set the
various methods to the functions we actually want.
addToLibrary({
$method_support: {},
$method_support__postset: [
'(function() { ',
' var SomeLib = function() { ',
' this.callCount = 0; ',
' }; ',
' ',
' SomeLib.prototype.getCallCount = function() {',
' return this.callCount; ',
' }; ',
' ',
' SomeLib.prototype.process = function() { ',
' ++this.callCount; ',
' }; ',
' ',
' SomeLib.prototype.reset = function() { ',
' this.callCount = 0; ',
' }; ',
' ',
' var inst = new SomeLib(); ',
' _method_01 = inst.getCallCount.bind(inst); ',
' _method_02 = inst.process.bind(inst); ',
' _method_03 = inst.reset.bind(inst); ',
'}()); ',
].join('\n'),
method_01: function() {},
method_01__deps: ['$method_support'],
method_02: function() {},
method_01__deps: ['$method_support'],
method_03: function() {},
method_01__deps: ['$method_support'],
});
Note: If you are using node 4.1 or newer you can use multi-line strings. They are only used at compile time not runtime so output will still run in ES5 based environments.
Another option is to put most of your code in an object, not a function,
addToLibrary({
$method_support__postset: 'method_support();',
$method_support: function() {
var SomeLib = function() {
this.callCount = 0;
};
SomeLib.prototype.getCallCount = function() {
return this.callCount;
};
SomeLib.prototype.process = function() {
++this.callCount;
};
SomeLib.prototype.reset = function() {
this.callCount = 0;
};
var inst = new SomeLib();
_method_01 = inst.getCallCount.bind(inst);
_method_02 = inst.process.bind(inst);
_method_03 = inst.reset.bind(inst);
},
method_01: function() {},
method_01__deps: ['$method_support'],
method_02: function() {},
method_01__deps: ['$method_support'],
method_03: function() {},
method_01__deps: ['$method_support'],
});
See the library_*.js files for other examples.
备注
JavaScript libraries can declare dependencies (__deps
), however
those are only for other JavaScript libraries. See examples in
/src
with the name format library_*.js
You can add dependencies for all your methods using
autoAddDeps(myLibrary, name)
where myLibrary is the object with
all your methods, and name
is the thing they all depend upon.
This is useful when all the implemented methods use a JavaScript
singleton containing helper methods. See library_webgl.js
for
an example.
The keys passed into addToLibrary generate functions that are prefixed
by _
. In other words my_func: function() {},
becomes
function _my_func() {}
, as all C methods in emscripten have a _
prefix. Keys starting with $
have the $
stripped and no underscore added.
You can use addFunction
to return an integer value that represents a
function pointer. Passing that integer to C code then lets it call that value as
a function pointer, and the JavaScript function you sent to addFunction
will
be called.
See test_add_function in test/test_core.py for an example.
You should build with -sALLOW_TABLE_GROWTH
to allow new functions to be
added to the table. Otherwise by default the table has a fixed size.
备注
When using addFunction
on LLVM Wasm backend, you need to provide
an additional second argument, a Wasm function signature string. Each
character within a signature string represents a type. The first character
represents the return type of a function, and remaining characters are for
parameter types.
'v'
: void type
'i'
: 32-bit integer type
'j'
: 64-bit integer type (currently does not exist in JavaScript)
'f'
: 32-bit float type
'd'
: 64-bit float type
For example, if you add a function that takes an integer and does not return
anything, you can do addFunction(your_function, 'vi');
. See
test/interop/test_add_function_post.js for an example.
You can access memory using getValue(ptr, type)
and
setValue(ptr, value, type)
. The first argument is a
pointer (a number representing a memory address). type
must be an
LLVM IR type, one of i8
, i16
, i32
, i64
, float
,
double
or a pointer type like i8*
(or just *
).
There are examples of these functions being used in the tests — see test/core/test_utf.in and test/test_core.py.
备注
This is a lower-level operation than ccall()
and
cwrap()
— we do need to care what specific type (e.g.
integer) is being used.
You can also access memory ‘directly’ by manipulating the arrays that
represent memory. This is not recommended unless you are sure you know
what you are doing, and need the additional speed over getValue()
and setValue()
.
A case where this might be required is if you want to import a large amount of data from JavaScript to be processed by compiled code. For example, the following code allocates a buffer, copies in some data, calls a C function to process the data, and finally frees the buffer.
var buf = Module._malloc(myTypedArray.length*myTypedArray.BYTES_PER_ELEMENT);
Module.HEAPU8.set(myTypedArray, buf);
Module.ccall('my_function', 'number', ['number'], [buf]);
Module._free(buf);
Here my_function
is a C function that receives a single integer parameter
(or a pointer, they are both just 32-bit integers for us) and returns an
integer. This could be something like int my_function(char *buf)
.
The converse case of exporting allocated memory into JavaScript can be
tricky when Wasm-based memory is allowed to grow, by compiling with
-sALLOW_MEMORY_GROWTH
. Increasing the size of memory changes
to a new buffer and existing array views essentially become invalid,
so you cannot simply do this:
function func() {
let someView = HEAPU8.subarray(x, y);
compute(someView);
// This may grow memory, which would invalidate all views.
maybeGrow();
// If we grew, this use of an invalidated view will fail. Failure in this
// case will return undefined, the same as reading out of bounds from a
// typed array. If the operation were someView.subarray(), however, then it
// would throw an error.
return someView[z];
}
Emscripten will refresh the canonical views like HEAPU8
for you, which you
can use to refresh your own views:
function func() {
let someView = HEAPU8.subarray(x, y);
compute(someView);
// This may grow memory, which would invalidate all views.
maybeGrow();
// Create a new, fresh view after the possible growth.
someView = HEAPU8.subarray(x, y);
return someView[z];
}
Another option to avoid such problems is to copy the data, when that makes sense.
Module
is a global JavaScript object, with attributes that
Emscripten-generated code calls at various points in its execution.
Developers provide an implementation of Module
to control how
notifications from Emscripten are displayed, which files that are
loaded before the main loop is run, and a number of other behaviours.
For more information see 模块对象.
Sometimes compiled code needs to access environment variables (for instance,
in C, by calling the getenv()
function). Emscripten-generated JavaScript
cannot access the computer’s environment variables directly, so a
“virtualised” environment is provided.
The JavaScript object ENV
contains the virtualised environment variables,
and by modifying it you can pass variables to your compiled code. Care must
be taken to ensure that the ENV
variable has been initialised by
Emscripten before it is modified — using Module.preRun
is a
convenient way to do this.
For example, to set an environment variable MY_FILE_ROOT
to be
"/usr/lib/test/"
you could add the following JavaScript to your
Module
setup code:
Module.preRun = () => {ENV.MY_FILE_ROOT = "/usr/lib/test"};
Note that Emscripten will set default values for some environment variables
(e.g. LANG) after you have configured ENV
, if you have not set your own
values. If you want such variables to remain unset, you can explicitly set
their value to undefined. For example:
Module.preRun = () => {ENV.LANG = undefined};
The JavaScript methods for calling compiled C functions are efficient, but cannot be used with name-mangled C++ functions.
WebIDL Binder and Embind create bindings between C++ and JavaScript, allowing C++ code entities to be used in a natural manner from JavaScript. Embind additionally supports calling JavaScript code from C++.
Embind can bind almost any C++ code, including sophisticated C++ constructs
(e.g. shared_ptr
and unique_ptr
). The WebIDL Binder supports C++
types that can be expressed in WebIDL. While this subset is smaller than
supported by Embind, it is more than sufficient for most use cases —
examples of projects that have been ported using the binder include the
Box2D and Bullet physics engines.
Both tools allow mapped items to be used from JavaScript in a similar way. However they operate at different levels, and use very different approaches for defining the binding:
Embind declares bindings within the C/C++ file.
WebIDL-Binder declares the binding in a separate file. This is run through the binder tool to create “glue” code that is then compiled with the project.
备注
There is no strong evidence that one tool is “better” than the other in terms of performance (no comparative benchmarks exist), and both have been used successfully in a number of projects. The selection of one tool over the other will usually be based on which is the most natural fit for the project and its build system.
Emnapi is an unofficial Node-API implementation which can be used on Emscripten. If you would like to port existing Node-API addon to WebAssembly or compile the same binding code to both Node.js native addon and WebAssembly, you can give it a try. See Emnapi documentation for more details.