Gyrokinetic equation suitable for numerical simulation

The gyrokinetic equation (135) contains time derivatives of unknown δΦ and δA on the right-hand side, which is problematic if treated by using explicit ﬁnite diﬀerence in PIC simulations. Next, we discuss some methods that can eliminate these terms, making the gyrokinetic equation more amenable to PIC simulations.

4.1 Eliminate ∂⟨δϕ⟩∕∂t term on the right-hand side of Eq. (135)

The coeﬃcient before ∂F₀∕∂𝜀 in Eq. (135) involves the time derivative of ⟨δΦ⟩, which is problematic if treated by using explicit ﬁnite diﬀerence (I test the algorithm that treats this term by implicit scheme, the result roughly agrees with the standard method discussed in Sec. 5. In GEM’s split-weight scheme, ∂⟨δΦ⟩∕∂t is evaluated by using the vorticity equation (time derivative of the gyrokinetic Poisson equation).). It turns out that ∂⟨δΦ⟩∕∂t can be eliminated by deﬁning another gyro-phase independent function δf by

[ ] -∂ + (ve + V + δV )⋅∇ δf ∂t ∥ ∥ D D q-∂F0-[ ∂ ] − m ∂𝜀 ∂t + (v∥e∥ + VD +δVD ) ⋅∇ ⟨δΦ⟩ q [ ∂ ] ∂F − --⟨δΦ ⟩ --+ (v∥e∥ + VD +δVD ) ⋅∇ --0- m ∂t ∂𝜀 = − δVD ⋅∇F0 − -q∂⟨δL⟩ ∂F0. (142) m ∂t ∂𝜀

Noting that ∂F₀∕∂t = 0, e_∥⋅∇F₀ = 0, ∂F₀∕∂𝜀 ≈−F₀m∕T, we ﬁnd that the third line of the above equation is of order O(λ³) (after both sides being divided by F₀Ω). Therefore the third line can be dropped. Moving the second line to the right-hand side and noting that ⟨δL⟩ = ⟨δΦ−v ⋅δA⟩, the above equation is written as

[ ∂ ] -- + (v∥e∥ +VD + δVD )⋅∇ δf ∂t [ ] = − δVD ⋅∇F0 + -q∂F0- ∂⟨v-⋅δA⟩ + (v∥e∥ + VD + δVD )⋅∇⟨δΦ⟩ , (143) m ∂𝜀 ∂t

where two ∂⟨δΦ⟩∕∂t terms cancel each other. [Note that the right-hand side of Eq. (143) contains a nonlinear term δV_D ⋅∇_X⟨δΦ⟩. This is diﬀerent from the original Frieman-Chen equation, where all nonlinear terms appear on the left-hand side. For the electrostatic limit, this term disappears because δV_D is perpendicular to ∇⟨δΦ⟩.]

[ ∂ ] --+ (v∥e∥ + VD +δVD ) ⋅∇ δf ∂t = − δVD ⋅[∇F0 ( ) ] + q-∂F0- ∂⟨v-⋅δA-⟩+ v∥e∥ + VD + q-∇ ⟨v ⋅δA⟩ × e∥ ⋅∇ ⟨δΦ⟩ . (144) m ∂𝜀 ∂t m Ω

[Equation (144) corresponds to Eqs. (A8-A9) in Yang Chen’s paper[1], where the ﬁrst minus on the right-hand side of Eq. (A8) is wrong and should be replaced with q∕m; one q is missing before ∂⟨v ⋅ δA⟩∕∂t in Eq. (A9).]

4.2 Eliminate ∂⟨v ⋅ δA⟩∕∂t term on the right-hand side of GK equation

Similar to the method of eliminating ∂⟨δϕ⟩∕∂t, we deﬁne another gyro-phase independent function by

Most gyrokinetic simulations approximate the vector potential as δA ≈ δA_∥e_∥. Let us simplify Eq. (143) for this case. Then ⟨v ⋅ δA⟩ is written as

[ ∂ ] ∂t + (v∥e∥ + VD + δVD )⋅∇ δf(p∥) [ ] + q-∂F0- ∂- +(v∥e∥ + VD + δVD )⋅∇ ⟨v∥δA∥⟩ m ∂𝜀 ∂t ( ) q- ∂F0- + m ⟨v∥δA ∥⟩[(VD + δVD )⋅∇ ] ∂𝜀 = − δVD ⋅∇F0 q ∂F [ ∂⟨v δA ⟩ ] − ----0- − --∥---∥-− (v∥e∥ +VD + δVD )⋅∇ ⟨δΦ⟩ , (148) m ∂𝜀 ∂t

where use has been made of that ∂F₀∕∂t = 0 and e_∥⋅∇F₀ = 0. Further noting that v_∥δA_∥∼ δΦ, qδΦ∕T ∼ O(λ¹), ∂F₀∕∂𝜀 ∼−F₀m∕T, V_D∕v_t ∼ O(λ¹), δV_D∕v_t ∼ O(λ¹), and ρ∇F₀ ∼ O(λ¹)F₀, we ﬁnd that the red term of the above equation (after divided by ΩF₀) is of O(λ³), hence can be dropped. Move the second line to the right-hand side, giving

[ ∂ ] -- +(v∥e∥ + VD + δVD )⋅∇ δf (p∥) ∂t = − δVD ⋅∇F0 + q-∂F0[(v∥e∥ + VD + δVD )⋅∇ (⟨δΦ − v∥δA ∥⟩)], (149) m ∂𝜀

where ∂⟨v_∥δA_∥⟩∕∂t terms cancel each other. Further note that δV_D, given by Eq. (138), is perpendicular to ∇_X⟨δΦ − v_∥δA_∥⟩. Therefore the blue term in Eq. (149) is zero, then Eq. (149) simpliﬁes to

[-∂ ] (p∥) ∂t + (v∥e∥ + VD + δVD )⋅∇ δf q∂F = − δVD ⋅∇F0 + -----0(v∥e∥ + VD )⋅∇⟨δΦ⟩ q ∂F m ∂𝜀 − ----0-[(v∥e∥ + VD )⋅∇ ⟨v∥δA∥⟩]. (150) m ∂𝜀

where B₀(x) = B₀(X + ρ) with ρ = ρ(X,𝜀,μ,α). Since the scale length of B₀ is much larger than the thermal Larmor radius, B₀(x) ≈ B₀(X) and hence v_∥ can be approximated as a constant when gyro-angle α changes. Then v_∥ can be taken out of the gyro-averaging in expression (146), yielding

(v∥e∥ + VD ) ⋅∇⟨v∥δA∥⟩ = (v∥e∥ + VD )⋅∇ (v∥⟨δA∥⟩) = ⟨δA∥⟩(v∥e∥ + VD )⋅∇(v∥) + v (ve + VD )⋅∇⟨δA ⟩. (153) ∥ ∥∥ ∥

Using expression (151), ⟨δA_∥⟩(v_∥e_∥ + V_D) ⋅∇(v_∥) is written as

⟨δA∥⟩(v∥e∥ + VD ) ⋅∇(v∥) ≈ ⟨δA∥⟩(v∥e∥)⋅∇ ((v∥) ) = ⟨δA∥⟩(v∥e∥)⋅∇ σ∘2-𝜀-− 2μB0 (∘ ---------) = ⟨δA∥⟩σ(v∥e∥)⋅∇ 2𝜀 − 2μB0 = ⟨δA∥⟩σv∥ −√2μe-∥ ⋅∇B0 2 2𝜀 − 2μB0 = ⟨δA ⟩v − 2μe∥-⋅∇B0 ∥ ∥ 2v∥ = − ⟨δA ∥⟩μe∥ ⋅∇B0. (154)

(We can also obtain ∇v_∥ = −μ(∇B₀)∕v_∥ by using Eq. (378).) Using the above results, equation (150) is written as

[ ∂ ] ∂t + (v∥e∥ + VD + δVD ) ⋅∇ δf (p∥) = − δVD ⋅∇F0 + q-∂F0(v∥e∥ + VD )⋅∇ ⟨δΦ⟩ m ∂𝜀 − q-∂F0-[v∥(v∥e∥ + VD )⋅∇⟨δA∥⟩− ⟨δA ∥⟩μe∥ ⋅∇B0 ], (155) m ∂𝜀

which agrees with the so-called p_∥ formulism given in the GEM manual (the ﬁrst line of Eq. 28).

Besides the derivation given above, the equation can also be derived by using p_∥ = v_∥ + q⟨δA_∥⟩∕m as an independent variable (I did not verify this) and thus the name “p_∥ formulism”. There is another formulism called v_∥ formulism, which uses Eq. (144) as the gyrokinetic equation to be numerically solved. The diﬃculty of using v_∥ formulism is that the time derivative ∂A_∥∕∂t (in the weight evolution equation) needs to be treated by implicit schemes, otherwise it is numerical unstable[8]. On the other hand, the diﬃculty of using p_∥ formulism is that there is cancellation problems in Ampere’s law, as we will discuss in Sec. 4.7.

4.3 Summary of distribution function split

where F₀ is the equilibrium distribution function and δF is the perturbation. Then δF is further split as

where δf^(p_∥) satisﬁes the gyrokinetic equation (155). In PIC simulations, δf^(p_∥) is evolved by using markers and its moment is evaluated via Monte-Carlo integration. The blue and red terms explicitly depends on the unknown perturbed ﬁeld. After being integrated in the velocity space, these two terms give the polarization density and the skin current, respectively. The polarization density is discussed in Sec. 5. The skin current is discussed in Sec. (4.4), and the so-called “cancellation problem” is discussed in Sec. 4.7.

4.4 Skin current

Let us calculate the moments of q-
m

⟨v_∥δA_∥⟩

(the blue term in Eq. (157)). Denote this term by δf^(skin). Neglect the FLR eﬀect, then δf^(skin) is written as

The number density carried by δf^(skin) is zero. The parallel current carried by δf^(skin) is given by

∫ δj(∥skin) = qv∥δf(skin)dv ∫ q2 2 = − T v∥δA∥F0dv. (161)

Working in the spherical coordinates, then v_∥ = v cos𝜃 and dv = v² sin𝜃dvd𝜃dϕ. Then expression (161) is written as

2 ( )3∕2 ∫ ( 2) δj(skin)= − q-n0 -m-- δA∥ v2cos2 𝜃exp − mv- v2sin 𝜃dvd𝜃dϕ ∥ T 2πT ∫ ( 2T) q2 (-m--)3∕2 4π- 4 mv2- = − T n0 2πT δA∥ 3 v exp − 2T dv 2 ( )3∕2 ∫ = − q-n0 1- 2T-δA∥4π- x4 exp(− x2)dx T π m 3 q2 ( 1)3∕2 2T 4π 3√ π- = − T-n0 π- -m-δA∥-3--8-- 2 = − q-n0δA∥. (162) m

Using c = 1∕ √----
μ0𝜀0

and ω_p² = n₀q²∕(m𝜀₀), the above expression can be written as

where the c∕ω_p is called “skin depth” and thus this current is often called “skin current” (some authors call it “adiabatic current”). We note the skin current is inversely proportional to the particle mass. So it is contributed mainly by electrons.

Gyrokinetic simulations indicate that the skin current δj_∥e^(skin) is often much larger than the actual δj_∥e. This means that δj_∥e^(skin) nearly cancels the current carried by δf_e^(p_∥), giving a small net current. This raises the question of whether numerical cancellation error is signiﬁcant. It turns out that this error is indeed signiﬁcant, which gives rise to numerical instabilities if no special treatment is used.

vuquμ n q v = vuqu-1--n q v = nuq2u v2u= 1-v2u Tu 0 u u u Tu c2𝜀0 u u u Tu 𝜀0 c2 λ2c2

4.5 Mixed-variable pullback method

To mitigate the skin current cancellation problem, Mishchenko et al[7] introduced the “mixed-variable pullback” method. In this method, we deﬁne δA_∥^(h) by

with δA_∥^(s) determined by an evolution equation (inspired by the ideal Ohm’s law):

Then, starting from Eq. (143) and following the same procedure as that of Sec. 4.2, we obtain an equation for δf^(mv):

[ ] ∂-+ (v∥e∥ + VD + δVD )⋅∇ δf(mv) ∂t = − δVD ⋅∇F0 + q-∂F0(VD + δVD )⋅∇ ⟨δΦ⟩ m ∂ 𝜀 −-q∂F0-[v (v e + V + δV )⋅∇ ⟨δA(h)⟩− ⟨δA(h)⟩μe ⋅∇B ]. (168) m ∂𝜀 ∥ ∥ ∥ D D ∥ ∥ ∥ 0

where δA_∥^(h) is the unknown to solve for, δJ_||j^(mv) is the parallel current carried by δf_j^(mv), where the subscript j is species index. Note that δA_∥^(s) has been moved to the rhs because its value is already known, by solving Eq. (166), before we solve the Ampere equation for δA_∥^(h).

In the above, a part of δA_∥ is solved from an evolution equation and the remainder is solved from the Ampere’s law. Can this scheme help to reduce numerical error? If A_∥^(s) carries the dominant part of δA_∥, then δA_∥^(h) will be small, then the skin current δA_∥^(h)ω_p²∕c² will be small, implying that the cancellation error will be small.

How do we ensure A_∥^(s) carry the dominant part of δA_∥ over the entire simulation duration? In addition to a careful choice of the evolution equation for δA_∥^(s), we have another leverage that can help δA_∥^(s) to remain dominant: collect the whole δA_∥ into δA_∥^(s) at the end of each time step:

Then, to make δA_∥ untouched (so that electromagnetic ﬁeld remain unchanged), we set δA_∥^(h) to zero:

Here “old” and “new” refers to before and after the re-spliting, respectively. (This re-spliting is made at the end of each time step and does not correspond to any time evolution.) The re-splitting keeps the value of δA_∥ untouched and hence does not inﬂuence the electromagnetic ﬁeld. Meanwhile, we need to keep δf unchanged. The deﬁnition of Eq. (167) indicates that, for a given δf, the re-splitting will make the value of δf^(mv) change to

This is the new initial value for δf^(mv). After this, the physical state of the system remains unchanged. This scheme makes δA_∥^(h) remain small for each time step mainly because δA_∥^(s) are set to carry all the value of δA_∥ at the begining of each time step. It is reasonable to assume that the variation of δA_∥ in a small time interval Δt is small. Denote this variation by Δ. In the best scenario, this small varaition will be captured by δA_∥^(s) if its evlution equation is chosen wisely. In the worst scenario, the time evolution of δA_∥^(s) may requires larger variation (than Δ) to be imposed on δA_∥^(h). Even in this senario, δA_∥^(h) is still of order of Δ, which is the varation of δA_∥ in a small time step and hence small.

which is not an exact solution to the kinetic equation because X is not a constant of motion. We note that the radial coordinate of the guiding-center positon is approximately a constant of motion if the drift orbit width is small. So we restrict F_g0 to depending only on ψ, i.e.,

∂Fg0 ∂Fg0 ∇Fg0 = ∂n--∇n0 + -∂T- ∇T0 0 (0 2 ) = Fg0∇n0-+ Fg0 mv--− 3 ∇T0- n0 2T0( 2 T0) 1-dn0 mv2- 3 1-dT0 = Fg0n0 dψ ∇ψ + Fg0 2T0 − 2 T0 dψ ∇ψ [ ( mv2 3) ] = − Fg0 κn + 2T--− 2 κT ∇ ψ (177) 0

dδf(mv) [ ( mv2 3 ) ] q -------= δVD ⋅∇ψ κn + ----− - κT Fg0 −--(VD + δVD )⋅⟨∇δΦ ⟩Fg0 dt 2T0 2 T + q[v∥(v∥e∥ + VD + δVD )⋅⟨∇ δA(∥h)⟩− ⟨δA (h∥)⟩(μe ∥ ⋅∇B0 )]Fg0, (178) T

where use has been made of ∇⟨δΦ⟩ = ⟨∇δΦ⟩ and ∇⟨δA_∥^(h)⟩ = ⟨∇δA_∥^(h)⟩.

In ﬁeld-aligned coordinates

In ﬁeld-aligned coordinates (x,y,z) (where x = ψ, y = α, z = 𝜃), ∂δΦ∕∂x can be written as

Note that the direction of ∂r∕∂x at 𝜃 = −π is diﬀerent from that at 𝜃 = +π. Therefore the value of ∂δΦ∕∂x at 𝜃 = −π is diﬀerent from that at 𝜃 = +π. I.e., ∂δΦ∕∂x is a non-periodic function of 𝜃. Similarly, ∇y is also a non-perioidic functon of 𝜃. In other words, both ∂δΦ∕∂x and ∇y are discontinuous across the 𝜃 cut (𝜃 = ±π).

where we approximate ∇x, ∇y, and ∇z as constants when performing the gyro-averge since they are determined by the equilibrim magnetic ﬁeld, which is nearly constant on the Larmor radius scale. As is mentioned above, ∇y is not continuous at the 𝜃 cut. Do we need to worry about this? No. This is because we must stick to the same branch when we perform gryo-average on it, and hence ∇y is always continous. Then, do we need to worry about the disconunity of ∂δΦ∕∂x across the 𝜃 cut when performing the gyro-averge on it? We do not either. The reason is the same: we must stick to a single branch. The disconunity is just irrelevant here. The discontinuity only manifest itself when we need to infer value on 𝜃 = +π from that on 𝜃 = −π (vice versa), i.e., when across branch communication is explicitly needed. In TEK, the ﬁeld equations are not solved at 𝜃 = +π and hence the ﬁeld values are not directly obtained. Instead, the ﬁeld values at 𝜃 = +π are infered from the ﬁeld values at 𝜃 = −π. At the 𝜃 cut and for the same (x,ϕ), the continuity of ∇δΦ requires

where the superscript “+” and “−” refer to the location 𝜃 = +π and 𝜃 = −π, respectively. Dotting the above by ∇x, we obtain

[ ( ) ] dδf(mv)= δV ⋅∇x κ + mv2-− 3 κ F dt D n 2T0 2 T 0 q[ ∂δΦ dX′ ∂δΦ dX ′ ∂δΦ dX ′ ] − T- ⟨-∂x-⟩-dt-⋅∇x + ⟨∂y-⟩-dt-⋅∇y + ⟨∂z-⟩-dt-⋅∇z F0 ( (h) (h) (h) ) -q ( ∂δA∥-- dX- ∂δA∥-- dX- ∂δA∥-- dX- ) + T v∥ ⟨ ∂x ⟩dt ⋅∇x + ⟨ ∂y ⟩dt ⋅∇y + ⟨ ∂z ⟩dt ⋅∇z F0 − -q⟨δA(∥h)⟩(μe∥ ⋅∇B0 )F0, (184) T

where q_u,T_u,v_u are units (independent of species), then Eq. (184) is written as

[ ( ) ] dδf(mv)-= δVD ⋅∇x κn + mv2-− 3 κT F0 dt 2T0 2 q∕qu [ ∂δΦ-dX ′ ∂δΦ- dX′ ∂δΦ- dX ′ ] − T∕Tu- ⟨∂x-⟩-dt-⋅∇x + ⟨-∂y-⟩dt--⋅∇y + ⟨-∂z-⟩-dt-⋅∇z F0 ⌊ --(h) -(h) -(h) ⌋ q∕qu-vn- ⌈ ∂-δA-∥- dX- ∂δA∥--dX- ∂δA∥--dX- ⌉ + T∕Tu vuv∥ ⟨ ∂x ⟩ dt ⋅∇x + ⟨ ∂y ⟩ dt ⋅∇y + ⟨ ∂z ⟩ dt ⋅∇z F0 − q∕qu-vn⟨δA(h)⟩(μe ⋅∇B--)F . (186) T∕Tu vu ∥ ∥ 0 0

[ ( ) ] dδf(mv)-= ˙x(1) κ + mv2- − 3 κ F dt n 2T0 2 T 0 q∕qu [ ∂δΦ- ∂δΦ- ∂δΦ- ] − T∕T-- ⟨-∂x-⟩x˙+ ⟨∂y-⟩˙y+ ⟨-∂z-⟩(z˙− ˙z(0)) F0 u ⌊ -- -- -- ⌋ q∕qu-vn- ∂δA(∥h) ∂δA(∥h) ∂-δA(∥h) + T∕Tu vuv∥⌈ ⟨ ∂x ⟩x˙+ ⟨ ∂y ⟩y˙+ ⟨ ∂z ⟩˙z⌉ F0 q∕qu-vn -(h) -- ---- − T∕Tu vu⟨δA∥ ⟩(μe∥ ⋅∇ B0)F0. (187)

_____________________________________________________________________________________________Normalized Ampere’s equation__________________________________________________________________________________________________________________

__________________________________________________________________________________________________________________A_∥^(s) evolution_________________________________________________________________________________________________________________

____________________________________pullback_________________________________________________________

The pullback in Eq. (173) is now written as (for the case of F₀ being Maxwellian):

( ) δf(nmevw)= δf(omldv)+ q-⟨v∥δA (h∥)old⟩ − m- F0 (mv) mq (h) T = δfold − T-⟨v∥δA ∥old⟩F0 q∕q v -(h) = δf(omldv)− ---u--nv∥⟨δA∥old⟩F0. (192) T∕Tu vu

____________________________________________________________________________________________________________Perturbed drﬁt _______________________________________________________________________________________________________________________

∂δΦ (⟨ ⟩ ⟨ ⟩ ⟨ ⟩ ) − ⟨-∂x ⟩×2-B0-= −-12 ∂δΦ- ∇x + ∂δΦ- ∇y + ∂δΦ- ∇z × B0 B0 B⟨0 ⟩ ∂x ∂y⟨ ⟩ ∂z ⟨ ⟩ ∂δΦ- -1- ∂δΦ- -1- ∂δΦ- -1- = − ∂x B20∇x × B0 − ∂y B20∇y × B0 − ∂z B20∇z × B0.

⟨∂∂δΦx-⟩×-B0- ∂δΦ- -1- ∂δΦ- 1-- − B20 ⋅∇x = − ⟨ ∂y ⟩B20∇y ×B0 ⋅∇x − ⟨ ∂z ⟩B20∇z × B0 ⋅∇x ∂δΦ 1 ∂δΦ 1 = − ⟨-∂y-⟩B2∇x × ∇y ⋅B0 − ⟨-∂z-⟩B2∇x × ∇z ⋅B0. (193) 0 0

∂δΦ − ⟨-∂x-⟩×-B0-⋅∇y = − ⟨∂δΦ⟩-1-∇x × B ⋅∇y − ⟨∂-δΦ-⟩-1-∇z ×B ⋅∇y B20 ∂x B20 0 ∂z B20 0 ∂δΦ- 1-- ∂δΦ--1- = ⟨ ∂x ⟩B20∇x × ∇y ⋅B0 − ⟨ ∂z ⟩B20∇y × ∇z ⋅B0. (194)

− ⟨∂δ∂xΦ⟩×-B0-⋅∇z = − ⟨∂δΦ⟩-1-∇x × B ⋅∇z − ⟨∂-δΦ-⟩-1-∇y ×B ⋅∇z B20 ∂x B20 0 ∂y B20 0 ∂δΦ 1 ∂δΦ 1 = ⟨-∂x-⟩B2∇x × ∇z ⋅B0 + ⟨-∂y ⟩B2-∇y × ∇z ⋅B0. (195) 0 0

Then, to get the normalizing factor, consider a typical term of the δE × B₀ drift:

dx ⟨ ∂δΦ ⟩ 1 ---= − tn ---- -2∇x × ∇y ⋅B0 + ⋅⋅⋅ dt ∂y ⟨B0--⟩ = − Tu--tn-- ∂δΦ- 1-∇x × ∇y ⋅B- + ⋅⋅⋅ quBnL2n ∂y B20 0 ⟨ --⟩ -- -- -- = − Tu---1--- ∂δΦ- 12∇x × ∇y ⋅B0 + ⋅⋅⋅ , (197) quBnLnvn ∂y B0

where T_u∕(q_uv_nB_nL_n) is the normlizing factor we need when coding (in drift.f90).

( ) -v∥ -v∥- ∂δA∥- ∂δA-∥ ∂-δA∥ −B20B0 × ⟨∇ δA∥⟩ ≈ −B20B0 × ⟨ ∂x ⟩∇x + ⟨ ∂y ⟩∇y + ⟨ ∂z ⟩∇z .

-v∥ v∥- ( ∂δA∥- ∂δA∥- ) −B20⟨B0 × ∇(δA∥)⟩⋅∇x = − B20 B0 × ⟨ ∂y ⟩∇y + ⟨ ∂z ⟩∇z ⋅∇x v ( ∂δA ∂δA ) = --∥2 ⟨---∥⟩∇x × ∇y + ⟨---∥⟩∇x × ∇z ⋅B0, (198) B 0 ∂y ∂z

v v ( ∂δA ∂δA ) − -∥2⟨B0 × ∇(δA∥)⟩⋅∇y = − -∥2-B0 × ⟨---∥⟩∇x + ⟨---∥⟩∇z ⋅∇y B0 B0( ∂x ∂z ) = -v∥ ⟨∂δA∥⟩∇y × ∇x + ⟨∂δA∥⟩∇y × ∇z ⋅B , (199) B20 ∂x ∂z 0

v∥ v∥ ( ∂δA∥ ∂δA∥ ) − -2⟨B0 × ∇(δA∥)⟩⋅∇z = − -2-B0 × ⟨----⟩∇x + ⟨----⟩∇y ⋅∇z B0 B0( ∂x ∂y ) = -v∥ ⟨∂δA∥⟩∇z × ∇x + ⟨∂δA∥⟩∇z × ∇y ⋅B0. (200) B20 ∂x ∂y

dx-= t v∥⟨∂-δA-∥⟩∇x × ∇y ⋅B +⋅⋅⋅ dt nB20 ∂y 0 Tu vn v∥ ∂δA∥ -- -- -- = q-v-tnL2B----2⟨-∂y-⟩∇x × ∇y ⋅B0 + ⋅⋅⋅ u u n -nB 0 -- ---Tu----v∥- ∂δA∥--- -- -- = quvuLnBn B2 ⟨ ∂y ⟩∇x × ∇y ⋅B0 + ⋅⋅⋅ , (201) 0

where T_u∕(q_uv_uB_nL_n) is the normlizing factor I need when coding (in drift.f90).

-- ( 1 ∂z) -- ( ∂y ∂z ∂y ∂z ) -- ( 1 ∂z) (∇y × ∇z )⋅B0 = BR R-∂Z- + Bϕ ∂Z-∂R- − ∂R-∂Z- + BZ −R-∂R- ≡ bdgycgz(204)

4.6 Discretizing Laplacian operator

which is called “high-n approximation”, in which all derivatives with respect to z are dropped. This approximation reduces the Laplacian diﬀerential operator from 3D to 2D.

We assume that δA_∥ satisﬁes the zero boundary condition in the x direction: A_∥(x = x₀) = 0,A_∥(x = x₀ + L_x) = 0. Then the sine expansion can be used in this direction. In the y direction, full Fourier expansion is needed. I.e., at each value of z, A_∥(x,y,z) is approximated by the following two-dimensional expansion:

N∑y∕2 ( 2π ) Nx∑−1 − ∇2⊥δA∥ = exp in --y δAmn (z) n=−Ny∕2 Ly m=1 {[ ( )2 ( )2 ] ( ) ( ) ( )} × m-π |∇x |2 + n 2π- |∇y |2 sin mπ-(x− x0) − in2π- mπ- 2∇x ⋅∇ycos m-π(x− x0) Lx Ly Lx Ly Lx Lx

At x = x_j, with x_j = x₀ + jΔ and L_x∕Δ = N_x, the above expression is written as

2 N∑y∕2 ( 2π-) Nx∑−1 − ∇ ⊥δA∥ = exp in Lyy δAm,n(z) n{=[−Ny∕2 m=1 ] } (m π)2 2 ( 2π)2 2 ( jmπ ) 2π ( m π) ( jm π) × Lx- |∇x | + n Ly- |∇y | sin Nx-- − inLy Lx- 2∇x ⋅∇y cos -Nx-(207,)

2 N∑y∕2 ( 2π-) Nx∑−1-2- N∑x−1 ( j′m-π) − ∇ ⊥δA∥ = exp in Lyy Nx An(xj′,z)sin Nx n{=[−Ny∕2 m=1 j′=1] } (m π)2 2 ( 2π)2 2 ( jmπ ) 2π ( m π) ( jm π) × Lx- |∇x | + n Ly- |∇y | sin Nx-- − inLy Lx- 2∇x ⋅∇y cos -Nx-

Nx∑−1 -2-N∑x−1 ( j′m-π) Nx An (xj′,z)sin Nx {m=[1 j′=1 ] } ( mπ-)2 2 ( 2π)2 2 ( jm-π) 2π-(m-π) (jm-π) × Lx |∇x| + n Ly |∇y | sin Nx − in Ly Lx 2∇x ⋅∇y cos Nx (209)

Nx−1 ( ) M ′ ≡ ∑ -2-sin j′m-π jj ,n m=1 Nx Nx {[( )2 ( )2 ] ( ) ( ) ( )} × m-π |∇x |2 + n2π- |∇y|2 sin jm-π − in2π- mπ- 2∇x ⋅∇y cos jmπ- Lx Ly Nx Ly Lx Nx

4.7 Parallel Ampere’s Law in GEM code

where δJ_∥i′ and δJ_∥e′ is the parallel current carried by the distribution function δf^(p_∥), which are updated from the value at the nth time step using an explicit scheme and therefore does not depends on the ﬁeld at the (n + 1)th step. The blue terms in Eqs. (212) and (213) are the “skin current”, which explicitly depend on the unknown ﬁeld at the (n + 1)th step. If we want to solve Ampere’s law (211) by direct methods, then the blude terms need to be moved to the left-hand side. In this case, equation (211) is written as

∫ − ∇2 δA(n+1)− μ (v(n+1))2 q2i⟨δA(n+1)⟩ ∂Fi0-dv ⊥ ∥ 0 ∥ mi ∥ α ∂ 𝜀 ∫ (n+1) 2 q2e (n+1) ∂Fe0 − μ0 (v∥ ) me⟨δA ∥ ⟩α ∂𝜀 dv. = μ (δJ ′+ δJ′ ) (214) 0 ∥i ||e

Then we need to put the blue terms into matrix form. If we put the bule terms into martrix form by using numerical grid integration (as we do for the polarization density), then there arises the cancellation propblem (i.e., the two parts of the distribution are evaluated by diﬀerent methods, one is grid-based and the other is MC marker based, there is a risk that the sum of the two terms will be inaccurate when the two terms are of opposite signs and large amplitudes, and the ﬁnal result amplitude is expected to be much smaller than the amplituded of the two terms). If we get the matrix form by evaluating it numerically using MC markers (which can avoid the cancellation problem), the corresponding matrix will depends on markers and thus needs to be re-constructed each time-step, which is computationally expensive.

Therefore we go back to Eq. (211) and try to solve it using iterative methods. However, it is found numerically that directly using Eq. (211) as an iterative scheme is usually divergent. To obtain a convergent iterative scheme, we need to have an approximate form for the blue terms (bigger terms), which is independent of markers and so that it is easy to construct its matrix, and then subtract this approximate form from both sides. After doing this, the iterative scheme has better chance to be convergent (partially due to that the right-hand side becomes smaller). An approximate form is that derived by neglecting the FLR eﬀect given in Sec. 4.4. Using this, the iterative scheme for solving Eq. (211) is written as

( ) 2 (n+1) ω2pi (n+1) ω2pe (n+1) − ∇ ⊥δA∥ − − c2 δA∥ − c2 δA∥ ′ ′ = μ0(∫δJ∥i + δJ||e) (n+1) 2 q2i (n+1) ∂Fi0 + μ0 (v∥ ) mi⟨δA∥ ⟩α ∂𝜀 dv ∫ (n+1) q2 (n+1) ∂Fe0 + μ0 (v∥ )2me⟨δA∥ ⟩α -∂𝜀-dv ( 2 e 2 ) − − ωpiδA(n+1)− ωpeδA(n+1) . (215) c2 ∥ c2 ∥

In the drift-kinetic limit (i.e., neglecting the FLR eﬀect), the blue and red terms on the right-hand side of the above equation cancel each other exactly. Even in this case, it is found numerically that these terms need to be retained and the blue terms are evaluated using markers. Otherwise, numerical inaccuracy can give numerical instabilities, which is the so-called cancellation problem. The explanation for this is as follows. The blue terms are part of the current. The remained part of the current carried by δh is computed by using Monte-Carlo integration over markers. If the blue terms are evaluated analytically, rather than using Monte-Carlo integration over markers, then the cancellation between this analytical part and Monte-Carlo part can have large error (assume that there are two large contribution that have opposite signs in the two parts) because the two parts are evaluated using diﬀerent methods and thus have diﬀerent accuracy, which makes the cancellation less accurate.

Because the ion skin current is smaller than its electron counterpart by a factor of m_e∕m_i, its accuracy is not important. The cancellation error for ions is not a problem and hence can be neglected. In this case, equation (215) is simpliﬁed as

( ) 2 (n+1) ω2pi (n+1) ω2pe (n+1) − ∇ ⊥δA∥ − − c2-δA∥ − c2-δA∥ ′ ′ = μ0(δJ∥i + δJ||e) ∫ (n+1) 2 q2e (n+1) ∂Fe0 + μ0 (v∥ ) me-⟨δA ∥ ⟩α-∂𝜀-dv ( 2 ) − − ωpeδA (n+1) . (216) c2 ∥

4.8 Split-weight scheme for electrons in GEM code

where the term in blue is the so-called adiabatic response, which depends on the gyro-angle in guiding-center coordinates. Recall that the red term ⟨δΦ⟩_α, which is independent of the gyro-angle, is introduced in order to eliminate the time derivative ∂⟨δΦ⟩_α∕∂t term on the right-hand side of the original Frieman-Chen gyrokinetic equation.

The so-called generalized split-weight scheme corresponds to going back to the original Frieman-Chen gyrokinetic equation by introducing another ⟨δΦ⟩_α term with a free small parameter 𝜖_g. Speciﬁcally, δh in the above is split as

(If 𝜀_g = 1, then the two ⟨δΦ⟩_α terms in Eq. (217) and (218) cancel each other.) Substituting this expression into Eq. (), we obtain the following equation for δh_s:

[ ∂ ] ∂t + (v∥e∥ + VD + δVD )⋅∇X δhs [ ] + 𝜖g q-∂F0 ∂-+ (v∥e∥ + VD + δVD )⋅∇X ⟨δΦ ⟩α m ∂𝜀 ∂[t ] q- ∂- ∂F0- + 𝜖gm ⟨δΦ⟩α ∂t + (v∥e∥ + VD + δVD )⋅∇X ∂𝜀 = − δVD ⋅∇XF0 q ∂F0 − m-[(v∥e∥ + VD + δVD ) ⋅∇X (⟨v ⋅δA − δΦ ⟩α )]-∂𝜀-. (219)

Noting that ∂F₀∕∂t = 0, e_∥⋅∇F₀ = 0, ∇F₀ ∼ O(λ¹)F₀, we ﬁnd that the third line of the above equation is of order O(λ³) and thus can be dropped. Moving the second line to the right-hand side, the above equation is written as

[ ] ∂-+ (v e + V + δV )⋅∇ δh ∂t ∥ ∥ D D X s = − δVD ⋅∇XF0 q{ [∂ ⟨δΦ⟩α ]} ∂F0 − m- (v∥e∥ + VD + δVD )⋅∇X [⟨v ⋅δA⟩α − ⟨δΦ ⟩α]+ 𝜖g --∂t-- + VG ⋅∇X ⟨δΦ⟩α -∂(𝜀2 .20)

special case of 𝜖_g = 1

For the special case of 𝜖_g = 1 (the default and most used case in GEM code, Yang Chen said 𝜖_g < 1 cases are sometimes not accurate, so he gave up using it since 2009), equation (220) can be simpliﬁed as:

[ ∂ ] ∂t + (v∥e∥ + VD + δVD )⋅∇X δhs = − δV[D ⋅∇XF0 ] − q- VG ⋅∇X ⟨v ⋅δA ⟩α + ∂-⟨δΦ-⟩α ∂F0-, (221) m ∂t ∂𝜀

where two V_G ⋅⟨δΦ⟩_α terms cancel each other. Because the v_∥E_∥ term is one of the factors that make kinetic electron simulations diﬃcult, eliminating V_G ⋅⟨δΦ⟩_α term may be beneﬁcial for obtaining stable algorithms.

δF = δhs + q-⟨δΦ ⟩α ∂Fg0+ q(δΦ − ⟨δΦ ⟩α)∂F0-+ -q⟨v ⋅δA⟩α∂F0- m ∂𝜀 m ∂𝜀 m ∂𝜀 = δhs + q-δΦ∂Fg0 + q-⟨v⋅δA ⟩α∂F0, (222) m ∂𝜀 m ∂ 𝜀

where the adiabatic term will be moved to the left-hand side of the Poisson’s equation. The descretization of this term is much easier than the polarization density.

Equation (221) actually goes back to the original Frieman-Chen equation. The only diﬀerence is that q-
m

⟨v ⋅ δA⟩_α ∂F0
∂𝜀

is split from the perturbed distribution function. Considering this, equation (221) can also be obtained from the original Frieman-Chen equation (135) by writing δG₀ as

Substituting expression (223) into equation (135), we obtain the following equation for δh_s:

[ ] ∂-+ (v∥e∥ + VD + δVD )⋅∇X δhs ∂t [ ] + -q∂F0- -∂ + (v e + V + δV )⋅∇ ⟨v ⋅δA ⟩ m ∂𝜀 ∂t ∥ ∥ D D X α q [ ∂ ] ∂F0 + m-⟨v⋅δA ⟩α ∂t + (v∥e∥ +VD + δVD )⋅∇X -∂𝜀- = − δVD ⋅∇XF0 − q-∂⟨δΦ-−-v⋅δA-⟩α-∂F0, (225) m ∂t ∂𝜀

[ ∂ ] ∂t + (v∥e∥ + VD + δVD )⋅∇X δhs = − δV[D ⋅∇XF0 ] − q- ∂⟨δΦ⟩α + (v∥e∥ + VD + δVD )⋅∇X ⟨v⋅δA ⟩α ∂F0, (226) m ∂t ∂𝜀

In GEM, the split weight method is used only for electrons. When using this scheme, ∂δΦ∕∂t appears on the right-hand-side of the weight evolution equation. GEM makes use of the vorticity equation (time derivative of the Poissson equation) to evaluate ∂δΦ∕∂t.

4 Gyrokinetic equation suitable for numerical simulation