
Mechanics of moving defects in growing sheets: 3-d, small deformation theory


Growth and other dynamical processes in soft materials can create novel types of mesoscopic defects including discontinuities for the second and higher derivatives of the deformation, and terminating defects for these discontinuities. These higher-order defects move “easily", and can thus confer a great degree of flexibility to the material. We develop a general continuum mechanical framework from which we can derive the dynamics of higher order defects in a thermodynamically consistent manner. We illustrate our framework by obtaining the explicit dynamical equations for the next higher order defects in an elastic body beyond dislocations, phase boundaries, and disclinations, namely, surfaces of inflection and branch lines.


Hyperbolic sheets abound in nature (see Fig. 1). As Margaret Wertheim writes in her delightful essay “Corals, crochet and the cosmos: how hyperbolic geometry pervades the universe" (Wertheim 2016) – We have built a world of largely straight lines – the houses we live in, the skyscrapers we work in and the streets we drive on our daily commutes. Yet outside our boxes, nature teems with frilly, crenellated forms, from the fluted surfaces of lettuces and fungi to the frilled skirts of sea slugs and the gorgeous undulations of corals.

Fig. 1

Examples of naturally occurring non-Euclidean elastic sheets

A natural question is – Why these shapes? One suggestion is that cells in living organisms proliferate to “maximize" their number (area) subject to any applicable constraints (Wertheim 2016) and this naturally results in hyperbolic geometries. This is a “static" argument which relates the mechanisms of growth to the resulting (quasi-2D) intrinsic geometry of living organisms. In this paper, we attempt to go beyond this “static" argument and develop models, based on thermodynamic considerations, to gain a quantitative understanding of the interplay between growth, mechanics and dynamics in soft objects. These models have the potential to describe the dynamical processes that result in the observed intricate three-dimensional (i.e. extrinsic) morphologies in nature.

Particularly striking examples of dynamical behaviors in organisms with differential growth (hyperbolic geometries) occur in sea slugs (Nudibranchia) and marine flatworms (Polycladida). These marine invertebrates are found in many environments, particularly in coral reefs. While most of them crawl on the sea floor, a few are capable of free swimming (Newman 2003). They move/swim by sending waves of undulations from the front to the back along their skirts (for sea slugs) or across their entire body (for flatworms). Figure 2 shows 4 frames from a video of a free-swimming sea slug (Jones 2010). The geometry of the slug is clearly hyperbolic. It has multiple undulations and undergoes significant bending deformations in the course of one swim cycle. While it is hard to quantify the strains within the organism it is not unreasonable to consider them small in comparison to the obvious large rotations/twist of the body.

Fig. 2

A free-swimming sea slug, Hexabranchus sanguineus. The frames are 2 s apart. Images used with permission from the copyright holders of the original video (Jones 2010)

In a different context, the interplay between growth and dynamics is also relevant to the development of leaves, flowers and other plant tissues that can be modeled as thin laminae (Liang and Mahadevan 2009; Boudaoud 2010; Liang and Mahadevan 2011; Goriely 2017; Sharon and Sahaf 2018). Laboratory experiments using hydrogels (Klein et al. 2007; Kim et al. 2012; Levin et al. 2019) have led to a semi-quantitative understanding of time-dependent, dissipative deformations of thin soft materials with a prescribed prestrain. In living organisms, however, the prestrain is not prescribed a priori, and how the prestrain development may be related to mechanics is not clear. Complicated physico-chemical processes are involved that need to be incorporated into mathematical models. It therefore seems reasonable to derive systematic constraints on the mathematical description based on a careful consideration of the non-standard kinematics involved and the general principles of continuum thermomechanics.

Earlier work, reviewed briefly in “Statics and equilibria of non-Euclidean elastic sheets” section, implicates higher-order defects, in contrast to disclinations and dislocations, as playing a key role in the mechanics of intrinsically hyperbolic elastic sheets (Gemmer and Venkataramani 2011; 2013; Gemmer et al. 2016; Shearman and Venkataramani, in preparation). This points to the need for tools to describe the evolution of (terminating) discontinuities of the second-gradient of the displacement field - when viewed at the macroscopic scale - for a proper description of the soft material deformations involved. It turns out that, within a continuum mechanical perspective, this fits in nicely within the question of describing the coupled mechanics of discontinuities and singularities of the elastic displacement field and its higher derivatives up to order three. This is the question that is addressed in this paper.

While, as evident from Fig. 2, it is natural, and necessary, to consider unrestricted finite deformations when dealing with soft materials, we restrict attention to ‘small deformation’ kinematics in this first effort due to the extra subtleties involved with higher order defect kinematics. Thus, we consider deformations of a fixed reference configuration that may or may not be stress-free. This is an adequate assumption when the configurations attained by the system remain in close proximity to this fixed configuration. We consistently invoke Occam’s razor as a guiding principle in our development - for instance, we restrict to the use of only ordinary stresses and couple stresses since forces and moments are the only agents of mechanical stimuli that we have some intuition for. Similarly, if branch point and surface defect velocities are to be the only dissipative mechanisms requiring constitutive specification without involving their spatial derivatives, then it turns out that the appropriate variables for the thermodynamic analysis are the ‘singular parts’ of the first and higher order displacement gradients, instead of the more natural singular parts of the corresponding elastic distortion gradients that arise in the analysis of defect kinematics. This is in sharp contrast to dislocation and g.disclination mechanics (Acharya and Fressengeas 2015) where this distinction does not arise because of the relatively lower order kinematics involved. We develop the relationship between the two types of entities in this paper.

In this article, we develop a framework from which we can derive the dynamics of higher order defects in a thermodynamically consistent manner. In particular, the framework is applicable to the motivating problems in this Introduction, namely the dynamics of Non-Euclidean (i.e. incompatible) sheets. Our framework can accommodate the specific details of the continuum mechanics of various applications. It can describe the evolution of defects in incompatible elasticity and the elastodynamics of growing bodies. It can also describe plastic deformation resulting from moving dislocations and disclinations and analogous behaviors from the motion of higher order defects. We expect that, with the appropriate choices of thermodynamic potentials and kinetic coefficients, our framework will be useful for problems governed by the interplay between growth, defects, thermodynamics and the balance laws of continuum mechanics.

Statics and equilibria of non-Euclidean elastic sheets

One approach to modeling the mechanics of a growing hyperelastic body, borrowed from the literature of finite elastoplasticity, is to assume a reference configuration \(\mathcal {S}\) and a deformation \(y:\mathcal {S} \to \mathbb {R}^{3}\) along with a multiplicative decomposition of the deformation gradient \(F = \nabla y\) as \(F = EG\) (or \(F = F_{e}F_{p}\) in the plasticity literature), where the two-point tensor G models the effect of the growth processes in the material and E is the “residual" elastic deformation (Goriely 2017). The energy of the configuration defined by y is then given by \(\int W(E) = \int W(F G^{-1})\), where W denotes a hyperelastic energy density, vanishing on SO(3) (Goriely 2017; Lewicka et al. 2014). In particular, the material is “stress-free" if \(F G^{-1} \in SO(3)\), although, for a general G, there might not be any deformation of the body \(y:\mathcal {S} \to \mathbb {R}^{3}\) whose gradient is a rotation times G. Such objects, with no stress-free configurations in \(\mathbb {R}^{3}\), lead to incompatible elasticity.
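The stress-free condition above can be checked mechanically for given tensors. The following is a minimal sketch in exact rational arithmetic, assuming homogeneous (constant) tensors; the growth tensor `G`, the rotation `R`, and the helper names are hypothetical illustrations and are not taken from the paper.

```python
from fractions import Fraction as Fr

def matmul(A, B):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def det3(A):
    return (A[0][0] * (A[1][1] * A[2][2] - A[1][2] * A[2][1])
            - A[0][1] * (A[1][0] * A[2][2] - A[1][2] * A[2][0])
            + A[0][2] * (A[1][0] * A[2][1] - A[1][1] * A[2][0]))

IDENTITY = [[Fr(int(i == j)) for j in range(3)] for i in range(3)]

def is_rotation(E):
    """E is in SO(3) iff E^T E = I and det E = 1."""
    return matmul(transpose(E), E) == IDENTITY and det3(E) == 1

# Hypothetical growth: the material doubles its length along e1.
G = [[Fr(2), Fr(0), Fr(0)], [Fr(0), Fr(1), Fr(0)], [Fr(0), Fr(0), Fr(1)]]
G_inv = [[Fr(1, 2), Fr(0), Fr(0)], [Fr(0), Fr(1), Fr(0)], [Fr(0), Fr(0), Fr(1)]]

# Rotation by 90 degrees about e3 (exact integer entries).
R = [[Fr(0), Fr(-1), Fr(0)], [Fr(1), Fr(0), Fr(0)], [Fr(0), Fr(0), Fr(1)]]

# A deformation gradient that accommodates the growth: F = R G, so E = R.
E_accommodated = matmul(matmul(R, G), G_inv)
# A body held undeformed (F = I) while growing: E = G^{-1}, not a rotation.
E_constrained = matmul(IDENTITY, G_inv)

stress_free_accommodated = is_rotation(E_accommodated)
stress_free_constrained = is_rotation(E_constrained)
```

In the incompatible case discussed in the text, G varies in space and no single deformation makes this check succeed at every point; the sketch only tests the pointwise algebraic condition.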

The non-Euclidean formalism of thin sheet elasticity (Efrati et al. 2009) is a reduced dimensional description of thin elastically incompatible objects. The reference manifold \(\mathcal {S} = \Omega \times [-\frac {t}{2},\frac {t}{2}]\), where t, the thickness, is “small" compared to the “in-plane” dimensions of the center surface \(\Omega \subset \mathbb {R}^{2}\). In this setting, the effect of the growth has a reduced dimensional description as a 2-manifold (Ω,g,b) where g,b are symmetric (0,2) tensors. These tensors denote, respectively, the ‘target’ 1st and 2nd fundamental forms of the stress-free state of the sheet, pulled back to the reference manifold (Efrati et al. 2009). This framework also applies to incompatible elasticity (Ben Amar and Goriely 2005; Lewicka et al. 2014; Bhattacharya et al. 2016) where, in general, there exists no deformation \(f: \Omega \to \mathbb {R}^{3}\) realizing a surface in ambient three-dimensional space whose first and second fundamental forms match (the push-forward of) the targets g,b (by f), i.e., incompatible sheets have no stress-free configurations in our three dimensional space.

We assume the Kirchhoff-Love hypothesis (Fung 1965), so that the (3D) deformation of a thin sheet is determined by the (2D) mapping \(y:\Omega \to \mathbb {R}^{3}\) on the center surface. This allows for an asymptotic expansion of the elastic energy as a sum of stretching and bending contributions (Efrati et al. 2009; Efrati et al. 2013):

$$\begin{array}{*{20}l} E^{t}[y] =\int_{\Omega} \left[t \,Q_{3}(\nabla y^{T}\cdot \nabla y-g)+\frac{t^{3}}{12} \,Q_{3}(\nabla y^{T}\cdot \nabla N -b)\right]\,dA, \end{array} $$

where the oriented normal field \(N:\Omega \to S^{2}\), also called the Gauss Normal map (Stoker 1989), is obtained from \(\nabla y^{T}\cdot N=0\). \(Q_{3}\) is a non-degenerate quadratic form, on symmetric 2 × 2 matrices, that depends on the Poisson’s ratio ν of the material (Efrati et al. 2009), and dA is the area element on (Ω,g).
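To make the stretching term concrete, the sketch below evaluates its argument, the metric mismatch \(\nabla y^{T}\cdot \nabla y-g\), at a point using central differences. It does not evaluate the quadratic form or the bending term. The test maps, the flat target metric, and the function names are assumptions chosen for illustration only.

```python
def jacobian(y, x1, x2, h=1e-5):
    """3x2 matrix of partial derivatives of y at (x1, x2), by central differences."""
    d1 = [(a - b) / (2 * h) for a, b in zip(y(x1 + h, x2), y(x1 - h, x2))]
    d2 = [(a - b) / (2 * h) for a, b in zip(y(x1, x2 + h), y(x1, x2 - h))]
    return [[d1[i], d2[i]] for i in range(3)]

def metric_mismatch(y, g, x1, x2):
    """Components of (grad y)^T (grad y) - g, a symmetric 2x2 matrix."""
    J = jacobian(y, x1, x2)
    return [[sum(J[i][a] * J[i][b] for i in range(3)) - g[a][b]
             for b in range(2)] for a in range(2)]

# Hypothetical target metric: the flat (Euclidean) metric.
g_flat = [[1.0, 0.0], [0.0, 1.0]]

def flat_sheet(x1, x2):
    return [x1, x2, 0.0]

def saddle_graph(x1, x2):
    return [x1, x2, x1 * x2]

# The flat sheet realizes g_flat exactly, so the mismatch vanishes.
mismatch_flat = metric_mismatch(flat_sheet, g_flat, 0.5, 0.25)
# The saddle graph stretches the sheet: the mismatch is
# [[x2^2, x1*x2], [x1*x2, x1^2]] at (x1, x2).
mismatch_saddle = metric_mismatch(saddle_graph, g_flat, 0.5, 0.25)
```

A nonzero mismatch is penalized at order t in the energy, whereas bending is penalized at order t³, which is why isometric immersions are favored in the thin limit.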

For various choices of g and b and boundary conditions, the energy functional (1) describes a variety of phenomena in thin sheets, including multiple-scale buckling in free sheets with ‘excess length’ near an edge, e.g. torn plastic or flat leaves treated with an Auxin near the edge (Sharon et al. 2002; Sharon et al. 2007; Sharon et al. 2004). The excess length near the edge is modeled by a metric g with negative intrinsic curvature (Efrati et al. 2013).

Starting with a fully 3D elastic energy, Lewicka and Pakzad (Lewicka and Reza Pakzad 2011) have obtained a reduced dimensional model for the limit t→0 using Γ–convergence. In particular, they showed that

$$\underset{t \to 0}{\Gamma - \lim} \, \frac{E^{t}[y]}{t^{3}} = \mathcal{E}^{*}[y] = \frac{1}{12} \left\{ \begin{array}{ll} \int_{\Omega} Q_{2}(\nabla y^{T}\cdot \nabla N -b)\, dA & \text{if } \nabla y^{T}\cdot \nabla y-g = 0 \text{ a.e.} \\ + \infty & \text{ otherwise,} \end{array}\right. $$

for an appropriate quadratic form \(Q_{2}\). This energy has clear similarities with the energy in (1), although the details are somewhat different. Nonetheless, in either framework, the elastic energy scales like \(t^{3}\) in the thin limit \(t \to 0\) if and only if there exist finite bending energy (mathematically, \(y \in W^{2,2}\)) isometric immersions \(y:(\Omega,g) \to \mathbb {R}^{3}\).

What is the physical import of this theorem? The Gagliardo-Nirenberg-Sobolev inequality (cf. Evans (1998)[§5.8.1]) implies that \(W^{2,2}\) surfaces with finite bending content have a “nearly continuous" tangent plane and normal, in the sense they cannot oscillate “much" on small sets. More precisely, the normal map \(N:\Omega \to S^{2}\) is in BMO(Ω), and the John-Nirenberg inequality (John and Nirenberg 1961) for BMO functions rules out singularities that correspond to O(1) oscillations in the normal on arbitrarily small sets, including sharp creases (folds), cone points (disclinations) or dislocations. Indeed the energy of elastic ridges (Lobkovsky 1996; Venkataramani 2004; Conti and Maggi 2008), \(E^{t} \sim t^{8/3}\), and d-cones (Ben Amar and Pomeau 1997; Cerda et al. 1999; Olbermann 2016), \(E^{t} \sim t^{3} \log (1/t)\), diverge on the scale \(t^{3}\), although the limiting shapes are ‘asymptotic’ isometries (Vella et al. 2015; Davidovitch et al. 2019) and arguably unstretched.
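The divergence on the scale \(t^{3}\) can be illustrated with the two scalings quoted above. The sketch below tabulates \(E^{t}/t^{3}\) for decreasing thickness, assuming unit prefactors (an assumption made only for illustration; the actual constants depend on the geometry and material).

```python
import math

def ridge_energy_over_t3(t):
    """Elastic ridge: E^t ~ t^(8/3), so E^t / t^3 ~ t^(-1/3). Unit prefactor assumed."""
    return t ** (8.0 / 3.0) / t ** 3

def dcone_energy_over_t3(t):
    """d-cone: E^t ~ t^3 log(1/t), so E^t / t^3 ~ log(1/t). Unit prefactor assumed."""
    return math.log(1.0 / t)

thicknesses = [1e-1, 1e-2, 1e-3, 1e-4]
ridge_ratios = [ridge_energy_over_t3(t) for t in thicknesses]
dcone_ratios = [dcone_energy_over_t3(t) for t in thicknesses]

for t, r, d in zip(thicknesses, ridge_ratios, dcone_ratios):
    print(f"t = {t:.0e}   ridge: {r:8.3f}   d-cone: {d:8.3f}")
```

Both ratios grow without bound as t decreases, the ridge algebraically and the d-cone only logarithmically, whereas a \(W^{2,2}\) isometric immersion keeps the ratio bounded.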

Branch points and lines of inflection

The preceding remark highlights the role of the regularity of isometries. Beyond the existence/non-existence of isometries, it is crucial whether a candidate isometry is in \(W^{2,2}\). This motivates the problem:

$$ \text{Find } y: \Omega \to \mathbb{R}^{3} \text{ such that } \left\{ \begin{array}{ll} \nabla y^{T}\cdot \nabla y=g \text{ and} & \\ \mathcal{B} = \int_{\Omega} Q(\nabla y^{T}\cdot \nabla N -b)\, dA < \infty, & \end{array}\right. $$

We have rigorous results showing that the problem (2) is flexible and solutions are plentiful (Gemmer et al. 2016; Shearman and Venkataramani, in preparation) (with prescribed zero-traction and moment boundary conditions, i.e. for free sheets). The proof is constructive, and uses ideas from Discrete Differential Geometry (DDG) (Bobenko et al. 2008; Gemmer et al. 2016). This lack of uniqueness in admissible static configurations with prescribed boundary conditions underscores the necessity of a dynamical model to ‘choose’ between acceptable configurations and/or describe the transitions between multiple admissible states (Gemmer and Venkataramani 2013).

If \(y: \Omega \to \mathbb {R}^{3}\) is \(C^{1}\), the Gauss Normal map is given by \({N = \frac {\partial _{1} y \times \partial _{2} y}{\|\partial _{1} y \times \partial _{2} y\|}}\), where \({\partial _{i} = \frac {\partial }{\partial x^{i}}}\) for (arbitrary) coordinates \((x^{1},x^{2})\) on Ω. Further, if y and g are \(C^{2}\), Gauss’ Theorema Egregium implies that (2) is equivalent to the Monge-Ampère Exterior differential system (EDS) (Ivey and Landsberg 2003, §6.4):

$$ N \cdot dy = 0, \qquad N^{*}(d\Omega) = \kappa \,dA, \qquad \kappa \equiv \kappa[g] \text{ is determined by } g, $$

where dΩ is the area form on the sphere \(S^{2}\) and κ is the Gauss curvature.

Classical results in differential geometry imply that smooth solutions of (3) with κ<0 are hyperbolic surfaces and locally saddle shaped. In contrast, the curly mustard leaf in Fig. 3b is “frilly", i.e. buckled on multiple scales with a wavelength that refines (“sub-wrinkles") near the edge (Sharon et al. 2004). This “looks" very unlike the smooth saddle in Fig. 3a. If \(\Omega \subset \mathbb {R}^{2}\) is a bounded domain with a smooth boundary, and g is a smooth metric on Ω with negative curvature, g can be extended to a smooth metric \(\bar {g}\) on \(\mathbb {R}^{2}\) with Gauss curvature \(\kappa [\bar {g}] < 0\) decaying (as rapidly as desired) at infinity. The existence of isometric immersions into \(\mathbb {R}^{3}\), of smooth metrics with decaying negative curvature (Hong 1993), therefore implies that bounded smooth hyperbolic surfaces can be smoothly and isometrically embedded in \(\mathbb {R}^{3}\). A smooth (\(C^{2}\) is sufficient) hyperbolic surface cannot refine its buckling pattern and is thus “non-frilly" (Gemmer et al. 2016; Shearman and Venkataramani, in preparation). Why do we see frilly shapes in natural surfaces, as in Fig. 3b, rather than the smooth saddles of Fig. 3a?

Fig. 3

Hyperbolic surfaces in \(\mathbb {R}^{3}\). The inscribed (geodesic) triangle in the smooth saddle has angles that sum up to less than π, illustrating the connection between the extrinsic and intrinsic geometries – Gauss’ Theorema Egregium

We have addressed this puzzle in recent work (Gemmer and Venkataramani 2011; Gemmer and Venkataramani 2012; Gemmer and Venkataramani 2013; Gemmer et al. 2016; Shearman and Venkataramani, in preparation) and the short answer is that, for a given metric g, the frilly surfaces, somewhat counterintuitively, can have smaller bending energy than the smooth saddle. It is true that \(C^{2}\) (twice continuously differentiable) hyperbolic surfaces are saddle-like near every point. We find a topological invariant (Shearman and Venkataramani, in preparation), the index of a branch point - intimately related to the quantity \(\int _{\Sigma } \widehat {\alpha }^{(3)} n\, da\) that emerges in “Kinematics” section and the quantity Γ of “The discontinuity of the deformation of a non simply connected domain with prescribed third ‘deformation gradient’” section - that distinguishes sub-wrinkled surfaces from saddles locally. With branch points, the surfaces are only \(C^{1,1}\), but gain the additional flexibility to refine their buckling pattern, while lowering their energy (Gemmer et al. 2016). This flexibility is not available to smooth saddles, and constitutes a key property of branched (sub-wrinkled) surfaces (Gemmer et al. 2016; Shearman and Venkataramani, in preparation).

Figure 4 shows the construction of a non-\(C^{2}\) monkey saddle (Gemmer and Venkataramani 2011), starting from the quadratic surface \(w=x^{2}-3y^{2}\). Cutting out the sector \(|x| \leq \sqrt {3}|y|\) and then patching congruent copies of this sector by odd reflections gives a \(W^{2,2}\) surface with a continuous normal vector and bounded curvature. The “defects" in this surface include the point in the middle – a branch point – and the 6 rays through this point – lines of inflection, which together constitute the asymptotic skeleton of the surface (Shearman and Venkataramani, in preparation). This construction can be extended to generate \(C^{1,1}\) hyperbolic surfaces with multiple distinct branch points, and an interesting question is how these defects interact with and influence each other (Gemmer et al. 2016).
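Two properties of the quadratic building block used in this construction can be checked directly. The sketch below uses the standard formula for the Gauss curvature of a graph, \(\kappa = (w_{xx}w_{yy}-w_{xy}^{2})/(1+w_{x}^{2}+w_{y}^{2})^{2}\), to confirm that the surface is hyperbolic everywhere, and verifies that w vanishes along the lines \(x = \pm \sqrt{3}\,y\) that bound the sector. The function names are illustrative and not taken from the cited works.

```python
import math
from fractions import Fraction as Fr

def w(x, y):
    """Height of the quadratic surface w = x^2 - 3 y^2."""
    return x * x - 3 * y * y

def gauss_curvature(x, y):
    """Gauss curvature of the graph of w, from the standard graph formula."""
    wx, wy = 2 * x, -6 * y          # first derivatives
    wxx, wyy, wxy = 2, -6, 0        # second derivatives (constant)
    return Fr(wxx * wyy - wxy ** 2) / (1 + wx * wx + wy * wy) ** 2

# The curvature is -12 / (1 + 4 x^2 + 36 y^2)^2, negative at every point.
kappa_origin = gauss_curvature(Fr(0), Fr(0))
kappa_sample = gauss_curvature(Fr(1), Fr(0))

# w vanishes on the sector boundaries x = +/- sqrt(3) y (checked in floating point).
boundary_heights = [w(sign * math.sqrt(3.0) * s, s)
                    for sign in (1.0, -1.0) for s in (0.25, 0.5, 1.0)]
```

Because the height vanishes on the bounding lines, reflecting with a sign change (an odd reflection) produces a surface that is continuous across them; the text's claims about the continuity of the normal and the jump in curvature are not tested by this sketch.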

Fig. 4

A \(W^{2,2}\) hyperbolic surface in \(\mathbb {R}^{3}\) that is built by patching together (smooth) quadratic maps on the 6 sectors shown in (a). The straight lines in the surface (b) are the images of the straight lines in the unit disk (a). The existence of such straight lines is a consequence of hyperboloids being doubly ruled surfaces

Defects are of course ubiquitous in condensed matter systems. A key feature of defects in systems driven by a free energy is that the energy density typically diverges in the vicinity of a “bare" defect (and in some cases even the total energy diverges), and as a consequence, defects are always regularized, i.e. “cored" in physical systems. This is true for dislocations and disclinations in elastic objects, for creases in crumpled sheets, for defects in liquid crystals and many other types of defects. Uniquely, branch points and lines of inflection do not carry a singular energy density (Gemmer and Venkataramani 2011; Gemmer et al. 2016), and thus do not “need" a core for energetic reasons. Nonetheless, force and moment balance implies that these defects are indeed regularized into boundary layers, of width \(t^{1/3}\), mediating jumps in the normal curvature (Gemmer and Venkataramani 2012).

Branch points and lines of inflection are thus mesoscopic defects. They contain large numbers of atoms (microscopic units) and are amenable to a continuum description, but are yet much smaller than the typical size of the sheet. Arguments from energy minimization, while implying the existence of these higher order defects, do not address the question of their evolution. One has to necessarily go beyond the elastic energy (1) and incorporate dissipative effects that are crucial in determining a thermo-mechanically consistent description of the coupled evolution of the shape \(y:\Omega \to \mathbb {R}^{3}\) and the internal geometry, given by the tensors g and b.


We define the notation employed in the paper in one place for convenience.

When a function on a domain is discontinuous across a (non-planar) surface S, we assume that its values along any sequence of points from either side of the surface approaching any fixed point on the surface take on a unique pair of limiting values, each element in the pair corresponding to the limit from one side. The difference of these limiting values, one for each point on the surface, is defined as the jump (denoted by \(\llbracket \cdot \rrbracket\)) of the function on the surface. If ν(x) is the unit normal to S at \(x \in S\), we say that \(x^{\pm}\) is a point on the ± side of S at x depending on \((x^{\pm } - x) \cdot \nu (x) \gtrless 0\), respectively.

We think of an nth order tensor as a linear transformation between the space of vectors (in the translation space of three-dimensional Euclidean space, also 1st-order tensors) to the space of (n−1)th-order tensors, with its transpose defined in the natural way as being a linear transformation from the space of (n−1)th-order tensors to the space of vectors. All tensor components will be written w.r.t. the basis \((e_{1},e_{2},e_{3})\) of a fixed rectangular Cartesian coordinate system and all partial derivatives, denoted often by a subscript comma, will be w.r.t. coordinates of this system. The Einstein summation convention will be used unless otherwise stated. Superposed dots will represent partial derivatives w.r.t. time. If A is a pth-order tensor then the operators \(\nabla\), div, curl may be defined as

$$\begin{array}{*{20}l} \nabla A & = A_{i_{1} \dots i_{p},k}\ e_{i_{1}} \otimes \ldots \otimes e_{i_{p}} \otimes e_{k} \notag\\ div \, A & = A_{i_{1} \dots i_{p-1}k,k}\ e_{i_{1}} \otimes \ldots \otimes e_{i_{p-1}} \notag\\ curl \, A & = e_{kr i_{p}} A_{i_{1} \dots i_{p},r}\ e_{i_{1}} \otimes \ldots \otimes e_{i_{p-1}} \otimes e_{k}, \notag \end{array} $$

(with invariant meaning independent of the choice of coordinate system and its basis, of course). The range of all indices above is 1 to 3 and \(e_{ijk}\) represents a component of the third-order alternating tensor.
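The index conventions above can be made concrete for the simplest case p = 1 (a vector field), where the definitions read \((curl\, v)_{k} = e_{kri} v_{i,r}\) and \(div\, v = v_{k,k}\). The sketch below evaluates both with central differences; the sample fields and helper names are hypothetical, and indices run over 0 to 2 in the code rather than 1 to 3.

```python
def alternator(i, j, k):
    """Component e_{ijk} of the alternating tensor for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

def partial(field, x, r, h=1e-5):
    """Central-difference derivative of a vector field w.r.t. coordinate r at x."""
    xp, xm = list(x), list(x)
    xp[r] += h
    xm[r] -= h
    return [(a - b) / (2 * h) for a, b in zip(field(xp), field(xm))]

def curl(field, x):
    """(curl v)_k = e_{k r i} v_{i,r}, matching the definition with p = 1."""
    dv = [partial(field, x, r) for r in range(3)]   # dv[r][i] = v_{i,r}
    return [sum(alternator(k, r, i) * dv[r][i]
                for r in range(3) for i in range(3)) for k in range(3)]

def div(field, x):
    """div v = v_{k,k}."""
    return sum(partial(field, x, k)[k] for k in range(3))

def rigid_rotation(x):
    return [-x[1], x[0], 0.0]                        # curl = (0, 0, 2), div = 0

def gradient_field(x):
    return [x[1] * x[2], x[0] * x[2], x[0] * x[1]]   # grad(x1 x2 x3): curl = 0

point = [0.3, -0.7, 1.1]
curl_rotation = curl(rigid_rotation, point)
curl_gradient = curl(gradient_field, point)
div_rotation = div(rigid_rotation, point)
```

For a higher-order tensor the same contraction acts on the last index only, with the leading indices carried along unchanged, which is the row-wise interpretation of the curl used later in the text.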

The symbol ·i represents a contraction on i indices between two tensors. For any tensor A, we define the tensor obtained by symmetrizing in the first two indices as A(s) and the one obtained by antisymmetrizing in the first two indices from the left as A(a). We denote the deviatoric part of a second-order tensor by the superscript dev.

Motivation for kinematics of the theory

In this section we provide some intuition on the defect kinematics we adopt for our theory of branch point singularities. This is first done by explicitly constructing a continuously differentiable deformation of a non-simply connected domain whose second derivative has a prescribed, constant jump across a planar surface in the body.

With reference to Fig. 5, we think of Ω occupying a simply connected d = 2 or 3-dimensional domain of ambient Euclidean space. Here, it may be viewed either as a right-cylinder (d=3) or a cross-section perpendicular to its axis (d=2). We choose a rectangular Cartesian coordinate system with the z-axis as the axis of the cylinder; \(e_{i}, i=1,2,3\) are the unit vectors along the x,y,z directions, respectively. \(\Omega_{c}\) is a cylindrical subset of Ω with rectangular cross-section centered on the z-axis. The region \(\Omega_{h} := \Omega \backslash \Omega_{c}\) is not simply connected. S is a surface in \(\Omega_{h}\) such that \(D := \Omega_{h} \backslash S\) is simply connected. The layer L is defined as \(L = \left \{ (x,y,z) \in \Omega _{h} \, \vert \, x < 0, - \frac {l}{2} \leq y \leq \frac {l}{2} \right \}\) and the surface \(S = \left \{ (x,y,z) \in \Omega _{h} \, \vert \, x < 0, y = 0 \right \}\). We will refer to \(\Omega_{c}\) as a core.

Fig. 5

Schematic of set up

Our goal in this section is to construct a vector field \(\tilde {y}^{(l)} : \Omega _{h} \rightarrow \mathbb {R}^{n}\), \(n \in \mathbb {N}\), \(n \geq d\), \(0 < l \in \mathbb {R}\), with \(\tilde {y}^{(l)} \in C^{1}(\Omega _{h})\), \(\nabla \tilde {y}^{(l)}\) piecewise-smooth, and the jump in \(\nabla ^{2} \tilde {y}^{(l)}\) across S a specified constant, with the jump blowing up as \(l \rightarrow 0\) maintaining \({\lim }_{l \rightarrow 0} \nabla \tilde {y}^{(l)} \in C^{0}(\Omega _{h})\).

A necessary condition for \(\tilde {y}^{(l)} \in C^{1}(\Omega _{h})\) with \(\nabla \tilde {y}^{(l)}\) piecewise-smooth is that the jump in its second derivative across S be of the form \(\llbracket \nabla ^{2} \tilde {y}^{(l)} \rrbracket = A \otimes \nu \), where ν is the unit normal field on S (with arbitrarily chosen orientation) and A is a \(\mathbb {R}^{n \times d}\) valued matrix field on S. Noting that \(l^{-1} \llbracket \nabla ^{2} \tilde {y}^{(l)} \rrbracket \) may be formally considered an approximate discrete directional derivative of \(\nabla ^{2} \tilde {y}^{(l)}\) in the direction ν (if the discontinuity were ignored), we define the field

$$ Z := \left\{ \begin{array}{ll} \frac{1}{l} A \otimes \nu \otimes \nu \qquad & \text{in} \ L\\ 0 & \text{in} \ \Omega_{h} \backslash L \end{array}\right. $$

with A and \(\nu = e_{2}\) constants, and seek to construct solutions to the equations

$$ \left.\begin{array}{ll} \nabla W & = Y\\ \nabla Y & = \left. Z \right\vert_{D} \end{array}\right\} \qquad \text{in} \ D. $$

The restriction of Z to D is used since, even though Z is (distributionally) curl-free in Ωh (we interpret the curl of a matrix field as row-wise curls), Ωh is not simply connected but D is and hence we are guaranteed a solution Y in D, unique up to a constant.

For any such Y field, we note that curl Y = 0 in D by the symmetry in the last two entries of Z, i.e. \((\nabla Y e_{l}) e_{k} - (\nabla Y e_{k}) e_{l} = (Z e_{l}) e_{k} - (Z e_{k}) e_{l} = 0\). Thus W satisfying (5) can be constructed, and W is also unique in D up to a constant for a given Y.

Arbitrarily fix one of the available Y fields. Such a Y has the explicit representation

$$Y(x; x_{0}) = {\lim}_{x_{0}^{-} \to x_{0}} \left(Y \left(x_{0}^{-}\right) + \int_{x_{0}^{-}}^{x} Z \, dx \right), \qquad x \in D, $$

for x0 being any point on the surface S, and the line integral is along any path from \(x_{0}^{-}\) to x contained in D. Now choose any path going from \(x_{0}^{-}\) to \(x_{0}^{+}\) (see Fig. 5) contained in D with the stipulation that it go through the points \(x_{0} \pm \frac {l}{2} e_{2}\) and the segments between x0 and \(x_{0} \pm \frac {l}{2} e_{2}\), respectively, are parallel to e2. We will assume that \(x_{0}^{\pm } = x_{0} \pm s^{\pm } e_{2}\). Then, along the segment \(x(s) = x_{0}^{-} - (s-s^{-}) e_{2}, 0 < s^{-} \leq s \leq \frac {l}{2}\),

$$ Y\left(x_{0} - \frac{l}{2} e_{2} \right) = Y(x_{0}^{-}) + \int_{s^{-}}^{\frac{l}{2}} \frac{A \otimes \nu}{l} (e_{2} \cdot -e_{2})\, ds = Y(x_{0}^{-}) - \frac{A \otimes \nu}{l} \left(\frac{l}{2} - s^{-} \right). $$

Y(x(s)) remains constant along the path between \(x_{0} - \frac {l}{2} e_{2}\) and \(x_{0} + \frac {l}{2} e_{2}\). Therefore, using the segment \(x(s) = x_{0} + \frac {l}{2} e_{2} - s e_{2}, 0 \leq s \leq \left (\frac {l}{2} - s^{+} \right)\) we have

$$\begin{array}{ll} Y(x_{0}^{+}) & = Y\left(x_{0} + \frac{l}{2} e_{2} \right) + \int_{x_{0} + \frac{l}{2} e_{2}}^{x_{0}^{+}} Z \, dx\\ & = Y\left(x_{0} - \frac{l}{2} e_{2} \right) - \frac{A \otimes \nu}{l} \left(\frac{l}{2} - s^{+} \right)\\ & = Y(x_{0}^{-}) - \frac{A \otimes \nu}{l} \left(\frac{l}{2} - s^{-} \right) - \frac{A \otimes \nu}{l} \left(\frac{l}{2} - s^{+} \right). \end{array} $$

Hence, the jump in Y at x0 is given by

$$ \llbracket Y \rrbracket (x_{0}) = {\lim}_{x_{0}^{\pm} \to x_{0}} Y (x_{0}^{+}) - Y(x_{0}^{-}) = - A \otimes \nu. $$

Since \(x_{0} \in S\) and Y such that \(\nabla Y = Z\) in D were chosen arbitrarily, (7) holds for all \(x_{0} \in S\) and admissible Y in the specified class. Thus \(\llbracket Y \rrbracket\) is unique in that class, independent of position on S, and given by the constant \(-A \otimes \nu\).
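The bookkeeping behind (7) can be replayed in exact arithmetic. The sketch below treats a single scalar component, so the scalar `A` stands in for one component of \(A \otimes \nu\); this reduction and the function name are assumptions for illustration. It adds the increments of Y over the two legs of the path that lie inside the layer, the leg around the core contributing nothing because Z vanishes outside L.

```python
from fractions import Fraction as Fr

def jump_in_Y(A, l, s_minus, s_plus):
    """Y(x0^+) - Y(x0^-) for one scalar component, following the chosen path."""
    half = l / 2
    # Leg 1: from x0^- down to x0 - (l/2) e2, traversed along -e2 inside the layer.
    leg_minus = -(A / l) * (half - s_minus)
    # Leg 2: around the core, outside the layer, where Z = 0.
    leg_core = Fr(0)
    # Leg 3: from x0 + (l/2) e2 down to x0^+, again along -e2 inside the layer.
    leg_plus = -(A / l) * (half - s_plus)
    return leg_minus + leg_core + leg_plus

A = Fr(3)
# In the limit s^- = s^+ = 0 the jump equals -A for every layer width l.
jumps_at_surface = [jump_in_Y(A, l, Fr(0), Fr(0))
                    for l in (Fr(1), Fr(1, 10), Fr(1, 1000))]
# Away from the limit the difference is -A (1 - (s^- + s^+) / l).
jump_nearby = jump_in_Y(A, Fr(1, 10), Fr(1, 100), Fr(1, 100))
```

The jump is independent of the layer width l, which is consistent with the statement that it is a constant determined by A and ν alone.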

We note that \(Y^{*}:\Omega _{h} \rightarrow \mathbb {R}^{n \times d \times d}\) may be viewed as a discontinuous function with the specification

$$Y^{*}(x) = \left\{ \begin{array}{ll} \lim\limits_{\,x^{-} \to\, x} Y(x^{-}) - \frac{1}{2} A \otimes \nu, &\qquad x \in S\\ Y(x), & \qquad x \in D. \end{array}\right. $$

where the points \(x^{-} \in D\) belong to the − side of S at x.

We now evaluate the jump in the field W on S.

As already observed, for any Y satisfying (5) a W field in D can also be constructed and this will have the representation

$$W(x;y) = W(y) + \int_{y}^{x} Y \, dx, \qquad x,y \in D, $$

for any path linking y to x in D. We now arbitrarily fix an admissible field Y and choose the same path from \(x_{0}^{-}\) to \(x_{0}^{+}\) used in deducing its jump on S.

Along \(x(s) = x_{0}^{-} - (s - s^{-}) e_{2}\), \(s^{-} \leq s \leq \frac {l}{2}\), \(Y(s) = Y(x_{0}^{-}) - \frac {A \otimes \nu }{l} (s - s^{-})\) and

$$ \begin{array}{ll} W \left(x_{0} - \frac{l}{2} e_{2} \right) & = W(x_{0}^{-}) + \int_{s^{-}}^{\frac{l}{2}} \left[ \frac{-A \otimes \nu}{l} (s - s^{-}) \right] (- e_{2})\, ds + Y(x_{0}^{-}) \int_{s^{-}}^{\frac{l}{2}} (-e_{2}) \, ds\\ & = W (x_{0}^{-}) + \int_{0}^{\frac{l}{2} - s^{-}} \frac{A \otimes e_{2}}{l} (e_{2}) s' \, ds' - \left(\frac{l}{2} - s^{-} \right) Y(x_{0}^{-}) e_{2}\\ & = W (x_{0}^{-}) + \frac{A}{2l} \left(\frac{l}{2} - s^{-} \right)^{2} - \left(\frac{l}{2} - s^{-} \right) Y(x_{0}^{-}) e_{2}. \end{array} $$

Since Y remains constant at the value given by (6) along the chosen path from \(x_{0} - \frac {l}{2} e_{2}\) to \(x_{0} + \frac {l}{2} e_{2}\),

$$ \begin{array}{ll} {}W \left(x_{0} + \frac{l}{2} e_{2} \right) & = W \left(x_{0} - \frac{l}{2} e_{2} \right) + \int_{x_{0} - \frac{l}{2} e_{2}}^{x_{0} + \frac{l}{2} e_{2}} Y \, dx\\ & = W \left(x_{0} - \frac{l}{2} e_{2} \right) + l\, Y\left(x_{0} - \frac{l}{2} e_{2} \right)\, e_{2}\\ & = W(x_{0}^{-}) + \frac{A}{2l} \left(\frac{l}{2} - s^{-} \right)^{2} \,-\, \left(\frac{l}{2} - s^{-} \right) Y(x_{0}^{-}) e_{2} + l\, Y(x_{0}^{-}) e_{2} - \left(\frac{l}{2} - s^{-} \right) A. \end{array} $$

using (6) and (8). Now

$$ W (x_{0}^{+}) = W \left(x_{0} + \frac{l}{2} e_{2} \right) + \int_{x_{0} + \frac{l}{2} e_{2}}^{x_{0}^{+}} Y \, dx $$

and Y(s) along the segment \(x(s) = x_{0} + \frac {l}{2} e_{2} - s e_{2}, 0 \leq s \leq \frac {l}{2} - s^{+} \) is given by

$$Y(s) = Y \left(x_{0} + \frac{l}{2} e_{2} \right) + \int_{0}^{s} Z (s) (- e_{2}) \, ds = Y\left(x_{0} + \frac{l}{2} e_{2} \right) - s\frac{A \otimes \nu}{l}, $$

so that

$$\begin{array}{ll} \int_{x_{0} + \frac{l}{2} e_{2}}^{x_{0}^{+}} Y \, dx & = \int_{0}^{\frac{l}{2} - s^{+}} \left[ Y \left(x_{0} + \frac{l}{2} e_{2} \right) - s\frac{A \otimes \nu}{l} \right] (- e_{2}) \,ds\\ & = \left[- Y \left(x_{0} + \frac{l}{2} e_{2} \right) e_{2} \right] \left(\frac{l}{2} - s^{+} \right) + \frac{A}{2l} \left(\frac{l}{2} - s^{+} \right)^{2}, \end{array} $$

and therefore (9), (10), and (6), noting \(Y \left (x_{0} - \frac {l}{2} e_{2} \right) = Y \left (x_{0} + \frac {l}{2} e_{2} \right)\), imply

$$\begin{array}{ll} W (x_{0}^{+}) - W(x_{0}^{-}) =\ & \frac{A}{2l} \left(\frac{l}{2} - s^{-} \right)^{2} - \left(\frac{l}{2} - s^{-} \right) Y(x_{0}^{-}) e_{2} + l\, Y(x_{0}^{-}) e_{2} - \left(\frac{l}{2} - s^{-} \right) A \\ & + \left[- \left\{ Y(x_{0}^{-}) - \frac{A \otimes \nu}{l} \left(\frac{l}{2} - s^{-} \right) \right\} e_{2} \right] \left(\frac{l}{2} - s^{+} \right) + \frac{A}{2l} \left(\frac{l}{2} - s^{+} \right)^{2}. \end{array} $$


$$ \llbracket W \rrbracket (x_{0}) = {\lim}_{\substack{x_{0}^{\pm} \to\, x_{0}\\ s^{\pm} \to \,0}} W (x_{0}^{+}) - W(x_{0}^{-}) = \frac{A}{2l} \left(\frac{l}{2} \right)^{2} - \frac{l}{2} A + \frac{A}{l} \left(\frac{l}{2} \right)^{2} + \frac{A}{2l} \left(\frac{l}{2} \right)^{2} = 0. $$
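The cancellation in the last display can be confirmed in exact arithmetic. The sketch below evaluates the expression for \(W (x_{0}^{+}) - W(x_{0}^{-})\) obtained just before it, for a single scalar component: the scalar `A` stands in for a component of A and `y0` for the corresponding component of \(Y(x_{0}^{-}) e_{2}\). This scalar reduction and the names are assumptions for illustration.

```python
from fractions import Fraction as Fr

def jump_in_W(A, y0, l, s_minus, s_plus):
    """W(x0^+) - W(x0^-) for one scalar component, leg by leg along the path."""
    p = l / 2 - s_minus   # length of the leg from x0^- to x0 - (l/2) e2
    q = l / 2 - s_plus    # length of the leg from x0 + (l/2) e2 to x0^+
    leg_minus = A / (2 * l) * p ** 2 - p * y0
    leg_core = l * y0 - p * A            # Y is constant on this leg
    leg_plus = -(y0 - (A / l) * p) * q + A / (2 * l) * q ** 2
    return leg_minus + leg_core + leg_plus

A, y0 = Fr(3), Fr(7)
# In the limit s^- = s^+ = 0 the jump vanishes for every layer width l.
jumps_at_surface = [jump_in_W(A, y0, l, Fr(0), Fr(0))
                    for l in (Fr(1), Fr(1, 10), Fr(1, 1000))]
# For s^-, s^+ > 0 the difference is small but nonzero.
jump_nearby = jump_in_W(A, y0, Fr(1, 10), Fr(1, 100), Fr(1, 100))
```

The vanishing holds for arbitrary values of `A` and `y0`, so the continuity of W across S does not depend on the constant of integration chosen for Y.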

We now define the function \(W^{*}: \Omega _{h} \rightarrow \mathbb {R}^{n \times d}\) as

$$ W^{*}(x) = \left\{ \begin{array}{ll} \lim\limits_{x^{-} \to \, x} W(x^{-}) = \lim\limits_{x^{+} \to \, x} W(x^{+}), &\qquad x^{\pm} \in D, x \in S\\ W(x), & \qquad x \in D, \end{array}\right. $$

where the points \(x^{\pm}\) belong to the ± sides of S at x, respectively. \(W^{*}\) is a continuous function on \(\Omega_{h}\).

We now assume that the constant A is of the form \(A = a \otimes \nu\) for \(a \in \mathbb{R}^{n}\). Then, in D, \(\nabla^{2} W = Z = a \otimes \nu \otimes \nu \otimes \nu\) so that \(\nabla W = (\nu \cdot x)\, a \otimes \nu \otimes \nu + C\) where \(C \in \mathbb{R}^{n \times d \times d}\) is a constant. This constant can be chosen freely, without loss of generality (it is related to the choice of \(Y(x_{0}^{-})\), for instance), and we assume that it satisfies \((C e_{i}) e_{j} = (C e_{j}) e_{i}\) for \(i,j = 1, \dots, d\). Then \(curl\, W^{*} = curl\, W = 0\) in D. This further implies that the line integral \(\int W^{*} \, dx = b \in \mathbb{R}^{n}\) is a constant for any closed contour encircling \(\Omega_{c}\).

If b=0, then we define

$$\widetilde{W} = W^{*} \ \text{in} \ \Omega_{h}. $$

If not, we explicitly solve the system

$$ curl \, \widehat{W} = - b \otimes e_{3}\, \delta_{z-axis} =: \widehat{\alpha} \qquad \text{in} \ \Omega. $$

Solutions to this system that belong to \(C^{1}(\Omega_{h})\) exist (e.g. an explicit solution on star-shaped domains can be written down by using the Riemann-Graves integral operator (Edelen 1985)). Forcing by the Dirac distribution is not necessary; functions of (x,y) with support in a cylinder contained in \(\Omega_{c}\) satisfying \(\int_{A} \hat{\alpha}\, e_{3} \, da = -b\) for any area patch A threaded by the cylinder also suffice for generating such solutions (Acharya 2001). Then defining

$$\widetilde{W} = \left. \widehat{W}\right|_{\Omega_{h}} + W^{*} \ \text{in} \ \Omega_{h}, $$

we note that \(\int \widetilde {W} \, dx = 0\) for any closed contour encircling Ωc and that \(\widetilde {W} \in C^{0}(\Omega _{h})\). Then we define \(\tilde {y}: \Omega _{h} \rightarrow \mathbb {R}^{n}\) by

$$ \tilde{y} (x; z) = p + \int_{z}^{x} \widetilde{W} \, dx, \qquad x,z \in \Omega_{h} $$

for arbitrarily fixed \(z \in \Omega_{h}\) and a constant \(p \in \mathbb{R}^{n}\).

Clearly, \(\tilde {y}\) satisfies \(\nabla \tilde {y} = \widetilde {W}\) on Ωh and \(\tilde {y} \in C^{1}(\Omega _{h})\).

Consider the constant vector \(a \in \mathbb {R}^{n}\) to be parametrized by the layer width l as

$$a^{(l)} = \gamma \, l^{\beta - 1}, \qquad \gamma \in \mathbb{R}^{n}, 0 < \beta \in \mathbb{R}. $$

All fields constructed with the use of \(A = a^{(l)} \otimes \nu\) are denoted by a superscript (l). We have the following properties:

  • For 0≤β<1, \(\tilde {y}^{(l)} \in C^{1}(\Omega _{h})\) for l>0, \({\lim }_{l \to 0} \left \vert \nabla ^{2} \tilde {y}^{(l)} \right \vert \to \infty \) in Ωh, \({\lim }_{l \to 0} \left \vert \nabla \tilde {y}^{(l)} \right \vert \to \infty \) in ΩhL.

  • For β=1, \(\tilde{y}^{(l)} \in C^{1}(\Omega_{h})\) for l>0, \({\lim}_{l \to 0} \tilde{y}^{(l)} \in C^{1}(\Omega_{h})\), \({\lim}_{l \to 0} \nabla^{2} \tilde{y}^{(l)} \in C^{0}(D)\) and \({\lim}_{l \to 0} \llbracket \nabla^{2} \tilde{y}^{(l)} \rrbracket\) is bounded on S. This conclusion also holds for any value of β≥0 when l>0 is held fixed.

  • For l→0, β>1, \(\tilde {y}^{(l)} \in C^{2}(\Omega _{h})\).
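
The three regimes above are controlled by how the magnitude of \(a^{(l)} = \gamma\, l^{\beta - 1}\) behaves as the layer width shrinks. The following minimal sketch tabulates that magnitude for a few values of l; the function name `wall_amplitude`, the choice \(|\gamma| = 1\), and the sample values of β and l are illustrative assumptions, not part of the construction.

```python
def wall_amplitude(gamma_norm, l, beta):
    """Return |a^(l)| = |gamma| * l**(beta - 1) for a layer width l > 0."""
    if l <= 0:
        raise ValueError("layer width l must be positive")
    return gamma_norm * l ** (beta - 1)

# Illustrative sweep: one exponent from each regime, shrinking layer width.
for beta in (0.5, 1.0, 2.0):
    values = [wall_amplitude(1.0, l, beta) for l in (1e-1, 1e-2, 1e-3)]
    print(beta, values)
```

The amplitude grows without bound for β<1, stays fixed for β=1, and decays to zero for β>1, which is the scaling behind the three cases listed above.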

Remark 0.1.

While the above considerations have dealt with one singular surface, the linear dependence of the construction on the prescribed field Z makes it clear that exactly similar arguments hold for the superposition of a set of deformations, each element of which contains a single planar surface of discontinuity of arbitrary orientation in \(\Omega_{h}\) terminating on \(\Omega_{c}\). Considering \(y^{i}\), \(i = 1 \, \textit{to} \, n \in \mathbb{Z}^{+}\), each corresponding to a specified \(Z^{i}\) field, the composite, superposed deformation \(\sum_{i=1}^{n} y^{i}\) is \(C^{1}(\Omega_{h})\), with generally discontinuous second derivatives on each of the \(S^{i}\) corresponding to the specified \(Z^{i}\) field. This corresponds to situations with a single branch point (Gemmer and Venkataramani 2011; Gemmer and Venkataramani 2012; Gemmer and Venkataramani 2013) as exemplified by the piecewise quadratic monkey-saddle that we discussed in “Branch points and lines of inflection” section.

Furthermore, given a fixed, simply connected domain Ω, let \(\Omega_{c}^{i} \subset \Omega\), \(i = 1 \, \textit{to} \, n\), be a set of non-intersecting cores with \(\Omega_{h}^{i} := \Omega \backslash \Omega_{c}^{i}\). Let each \(Z^{i}\) now be specified on the domain \(\Omega^{i}_{h}\). Then each \(y^{i}\) is \(C^{1}(\Omega_{h}^{i})\). Thus, \(\sum_{i=1}^{n} y^{i} \in C^{1} \!\left(\cap_{i=1}^{n} \Omega_{h}^{i} \right)\). This corresponds to configurations with multiple branch-points (Gemmer et al. 2016; Shearman and Venkataramani, in preparation).

Remark 0.2.

For thin objects modeled by d=2, the construction above is a representation of folds without ridges. In “Kinematics” section and “Thermodynamics” section we develop a continuum mechanical theory that encompasses the mechanics of such folds in simply connected domains within a setting that allows for deformations with less smoothness.

Remark 0.3.

Consider d=2, n=2 and b≠0, and assume that \(W(x), x \in \Omega_{h}\), is invertible. A field \(y^{*}:D \to \mathbb{R}^{2}\) satisfying \(\nabla y^{*} = W\) in D can be constructed that may be interpreted as a discontinuous deformation of \(\Omega_{h}\). Now consider the metric \(g := W^{T} W\) on \(\Omega_{h}\). By the Nash \(C^{1}\) embedding theorem, there exists a \(C^{1}\) deformation \(z:\Omega_{h} \to \mathbb{R}^{3}\) with \((\nabla z)^{T} \nabla z = g = (\nabla y^{*})^{T} \nabla y^{*}\).

For a mechanistic interpretation, consider the configuration in \(\mathbb{R}^{3}\) defined by \(z(\Omega_{h})\) as the stress-free, global reference configuration in a higher dimensional space \((\mathbb{R}^{3})\) corresponding to a stressed body with a dislocation (with excluded core) in \(\mathbb{R}^{2}\) represented by \(\Omega_{h}\). The stress-free reference cannot be represented by a compatible mapping of \(\Omega_{h}\) in the lower-dimensional space \(\mathbb{R}^{2}\); instead, one of its stress-free representations in \(\mathbb{R}^{2}\) is defined by the configuration \(y^{*}(\Omega_{h})\). The stress-producing elastic Right Cauchy-Green tensor field is given by \((W^{-1})^{T} W^{-1}\) on \(\Omega_{h}\).

The discontinuity of the deformation of a non simply connected domain with prescribed third ‘deformation gradient’

Consider the domain \(\Omega_{h}\) of Fig. 5 which is rendered simply connected by a single cut-surface S which is not necessarily planar. As before, we refer to \(\Omega_{h} \backslash S =: D\). We consider \(Z: \Omega_{h} \to \mathbb{R}^{n \times d \times d \times d}\) as a given field for which \(((Z(x) e_{l}) e_{k}) e_{j}\) is invariant w.r.t. interchanges of \(e_{j}, e_{k}, e_{l}\) for any values of \(j,k,l \in \{1, \dots, d\}\). Furthermore, we assume that \(Z \in C^{0}(\Omega_{h})\), and \(curl\, Z = 0\) in \(\Omega_{h}\). We are now interested in the construction of a field \(y: D \to \mathbb{R}^{n}\) that satisfies

$$\nabla^{3} y = Z $$

and characterizing the jump field \(\llbracket y \rrbracket\) on S.

Define, for \(x \in \Omega_{h}\), \([((Z(x) e_{l}) e_{k}) e_{j}] \cdot E_{I} =: Z_{Ijkl}(x)\), \(I = 1,\dots, n\) and \(j,k,l = 1, \dots, d\), where \(E_{I}\) represents an element of an orthonormal basis in \(\mathbb{R}^{n}\). \(Z_{Ijkl}\) is symmetric in the indices j,k,l. Now construct \(Y:D \to \mathbb{R}^{n \times d \times d}\) satisfying

$$ \frac{\partial Y_{Ijk}}{\partial x_{l}} = Z_{Ijkl}, $$

which is possible since \(curl\, Z = 0\) and D is simply connected. We note that \(Y_{Ijk}(x) - Y_{Ikj}(x) = Y_{Ijk}(y) - Y_{Ikj}(y)\) for \(x, y \in D\), due to the symmetry of \(Z_{Ijkl}\) in j,k and the connectedness of D. Since the construction of Y allows the free specification of its value at one point of D, it can be assumed without loss of generality that \(Y_{Ijk} = Y_{Ikj}\) in D.

Eq. 15 and the symmetry of Z in the last two indices imply \(curl\, Y = 0\) in D. Thus it is also possible to construct \(W : D \to \mathbb{R}^{n \times d}\) satisfying

$$ \frac{\partial W_{Ij}}{\partial x_{k}} = Y_{Ijk}. $$

Furthermore, (16) and the symmetry of Y in its last two indices imply that a function \(y: D \to \mathbb {R}^{n}\) can be constructed satisfying

$$ \frac{\partial y_{I}}{\partial x_{j}} = W_{Ij}. $$

Now, because Z is curl-free in Ωh, we have by Stokes’ theorem that

$$ {}\int \!Z\, dx \!=:\! \Gamma \in \mathbb{R}^{n \times d \times d} \ \text{a constant, for the line integral over } {any} \text{ closed loop encircling} \ \Omega_{c}. $$

By (15) and (16), this further implies that

$$ \Gamma = \llbracket Y \rrbracket (x) = \llbracket \nabla W \rrbracket (x), \qquad x \in S. $$

Let \(x_{0}, x \in S\) be connected by a curve c contained in S. Consider curves \(c^{+}\) and \(c^{-}\) on the ± sides of S connecting \(x_{0}^{\pm}\) to \(x^{\pm}\). Then

$$ W(x^{\pm}) = W(x_{0}^{\pm}) + \int_{x_{0}^{\pm}}^{x^{\pm}} \nabla W (c^{\pm})\, dc^{\pm} \implies \llbracket W \rrbracket (x) = \llbracket W \rrbracket (x_{0}) + \Gamma (x - x_{0}) $$

as \(c^{\pm} \to c\). Similarly, (17) implies

$$ \begin{array}{ll} y(x^{\pm}) & = y(x_{0}^{\pm}) + \int_{x_{0}^{\pm}}^{x^{\pm}} \nabla y (c^{\pm})\, dc^{\pm}\\ \implies \llbracket y \rrbracket (x) & = \llbracket y\rrbracket (x_{0}) + \int_{x_{0}}^{x} \llbracket W \rrbracket (c) \, dc = \llbracket y\rrbracket (x_{0}) + \int_{x_{0}}^{x} \Big\{ \llbracket W \rrbracket (x_{0}) + \Gamma (c - x_{0}) \Big\} \, dc\\ & = \llbracket y\rrbracket (x_{0}) + \Big(\llbracket W \rrbracket (x_{0})\Big) (x - x_{0}) + \int_{0}^{x - x_{0}} \Gamma c' \, dc' \end{array} $$

Now, due to the symmetry of Γ in its last two indices, \(\Gamma _{Ijk} c'_{k} \frac {dc'_{j}}{ds} = \frac {1}{2} \frac {d}{ds} (\Gamma _{Ijk} c'_{k} c'_{j})\) and the last line integral in (21) evaluates to \(\frac {1}{2} \left (\Gamma (x - x_{0}) \right) (x - x_{0})\) so that (21) implies

$$ \llbracket y \rrbracket (x) = \llbracket y\rrbracket (x_{0}) + \Big(\llbracket W \rrbracket (x_{0})\Big) \cdot_{1} (x - x_{0}) + \frac{1}{2} \, \Gamma \cdot_{2} \left[ (x - x_{0}) \otimes (x - x_{0}) \right], \qquad \forall x,x_{0} \in S. $$
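
The quadratic jump formula can be checked for internal consistency: transporting the jump data from \(x_{0}\) to an intermediate point and then on to x should agree with the direct evaluation from \(x_{0}\). The sketch below does this with plain nested lists. The helper names `jump_W_at` and `jump_y_at`, the choice n = d = 2, and all numerical values are illustrative assumptions; the three points are simply assumed to lie on S, and Γ is taken symmetric in its last two indices as in the text.

```python
def jump_W_at(jW0, Gamma, x, x0):
    """[[W]](x) = [[W]](x0) + Gamma (x - x0)."""
    r = [a - b for a, b in zip(x, x0)]
    d = len(r)
    return [[jW0[I][j] + sum(Gamma[I][j][k] * r[k] for k in range(d))
             for j in range(d)] for I in range(len(jW0))]

def jump_y_at(jy0, jW0, Gamma, x, x0):
    """[[y]](x) = [[y]](x0) + [[W]](x0)(x - x0) + (1/2) Gamma : (x - x0) (x) (x - x0)."""
    r = [a - b for a, b in zip(x, x0)]
    d = len(r)
    return [jy0[I]
            + sum(jW0[I][j] * r[j] for j in range(d))
            + sum(Gamma[I][j][k] * r[j] * r[k]
                  for j in range(d) for k in range(d)) / 2
            for I in range(len(jy0))]

# Illustrative data (arbitrary values); Gamma[I][j][k] == Gamma[I][k][j].
Gamma = [[[1, 2], [2, 3]], [[0, -1], [-1, 4]]]
jW0 = [[1, 0], [2, -1]]
jy0 = [0, 1]
x0, x1, x2 = [0, 0], [1, 2], [3, -1]

direct = jump_y_at(jy0, jW0, Gamma, x2, x0)
two_step = jump_y_at(jump_y_at(jy0, jW0, Gamma, x1, x0),
                     jump_W_at(jW0, Gamma, x1, x0), Gamma, x2, x1)
print(direct, two_step)
```

The two evaluations coincide only because Γ is symmetric in its last two indices, which is the same property used above to integrate the last line integral in closed form.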

Remark 0.4.

The jump in the deformation y across the cut-surface S is not arbitrary, being characterized by a finite set of parameters. One choice for this parameter set is the jump of the deformation at an arbitrarily fixed point on S, the jump of W at the same point, and Γ, the latter being a constant decided by the given field Z.

Remark 0.5.

\(\llbracket W \rrbracket\) is not constant on S, even though \(curl\, Y = 0\) in D, unless the vector joining any two points on S lies in the null-space of Γ, by (20). For S a planar surface with unit normal ν and Γ of the form \(a \otimes \nu \otimes \nu\), \(a \in \mathbb{R}^{n}, \nu \in \mathbb{R}^{d}\) constants, (20) implies that \(\llbracket W \rrbracket\) is constant on S. If, moreover, \(\llbracket W \rrbracket - (\llbracket W \rrbracket \nu) \otimes \nu = 0\), then \(\llbracket y \rrbracket\) is also a constant on S. These are all conditions satisfied by the example worked out in the preamble of this section.

Remark 0.6.

The argument remains unchanged for the case Ωh is just a punctured domain, i.e. Ωc shrinks to a point (a curve).

Remark 0.7.

The result (22) is an extension of Weingarten’s theorem (Weingarten 1901; Delphenich a; Volterra 1907; Delphenich b) and the Weingarten theorem for g.disclinations (Zhang and Acharya 2018).

Kinematics


In this section we propose the kinematics for a model of the type of discontinuities treated in “Motivation for kinematics of the theory” section, to be broadly applied to the mechanics of materials. For that purpose, it is essential to deal with simply connected, compact domains containing the said discontinuities. The excluded core regions are now included in the domain, as are the excluded surfaces of discontinuity. Roughly speaking, we consider an additive split of fields into ‘regular’ and ‘singular’ parts whenever the field in question contains high magnitudes concentrated in ‘thin’ regions approximating smooth lower-dimensional (<d) sets; the support of the singular part of the field contains these regions of high concentration and that of the regular part contains the support of the rest of the field, including regions supporting approximate discontinuities. Importantly, both the singular and regular parts are assumed to be at least integrable functions as we want to write governing equations for these fields in the form of pde that can at least be made sense of in some weak manner. Thus, we take a somewhat microscopic point of view, assuming that discontinuities and singularities of certain fields when viewed from a macroscopic scale have a smoother definition at a microscopic scale that we describe by additional ‘eigenwall’ fields. We also adopt the point of view that once macroscopic theories generate discontinuities and singularities, in most circumstances additional physical insight beyond the constraints placed by the governing equations of the macroscopic theory is required to define evolution with a modicum of uniqueness. We develop such a model in the rest of the paper.

We refer to a fixed reference configuration, a simply connected compact region, as B. In terms of the displacement field u and the i-eigenwall fields \(S^{(i)}, i \in \{1,2,3\}\), we define the i-elastic distortions \(Y^{(i)}, i \in \{0,\dots,4\}\), as

$$ \begin{array}{ll} Y^{(4)} &:= \nabla Y^{(3)}\\ Y^{(i)} & := \nabla Y^{(i-1)} - S^{(i)} \qquad i \in \{1,2,3\}\\ Y^{(0)} & := u. \end{array} $$

(Y(0) is analogous to the field y of “Motivation for kinematics of the theory” section, Y(1) to W, Y(2) to Y, and Y(3) to Z). Thus Y(0)=u and Y(4), the gradient of the regular part of the gradient of the 3-elastic distortion, are assumed to have no ‘singular’ parts. We now define the ‘composite’ eigenwall fields \(\widehat {S}^{(i)}, i = 1,2,3\), as

$$ \begin{array}{ll} Y^{(3)} & = \nabla Y^{(2)} - S^{(3)} = \nabla^{3} u - \widehat{S}^{(3)}; \qquad \widehat{S}^{(3)} := \nabla^{2} S^{(1)} + \nabla S^{(2)} + S^{(3)}\\ Y^{(2)} & = \nabla Y^{(1)} - S^{(2)} = \nabla^{2} u - \widehat{S}^{(2)}; \qquad \widehat{S}^{(2)} := \nabla S^{(1)} + S^{(2)}\\ Y^{(1)} & = \nabla Y^{(0)} - S^{(1)} = \nabla u - \widehat{S}^{(1)};\ \qquad \widehat{S}^{(1)} := S^{(1)}, \end{array} $$

and we note that

$$ S^{(i)} = \widehat{S}^{(i)} - \nabla \widehat{S}^{(i-1)} \qquad i \in \{1,2,3\}. $$
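
The relation between the eigenwall fields and their composite counterparts can be verified in a simple setting. The sketch below is a one-dimensional, scalar reduction (an assumption made only for illustration) in which each \(S^{(i)}\) is a polynomial in x stored as a coefficient list, ∇ becomes d/dx, and \(\widehat{S}^{(0)}\) is taken to be zero. The helper names and the test polynomials are hypothetical.

```python
def deriv(p):
    """Derivative of a polynomial given as coefficients [c0, c1, c2, ...]."""
    return [k * c for k, c in enumerate(p)][1:] or [0]

def add(*polys):
    """Coefficient-wise sum of polynomials of possibly different degree."""
    n = max(len(p) for p in polys)
    return [sum(p[k] if k < len(p) else 0 for p in polys) for k in range(n)]

def sub(p, q):
    return add(p, [-c for c in q])

def trim(p):
    """Drop trailing zero coefficients so that lists compare cleanly."""
    p = list(p)
    while len(p) > 1 and p[-1] == 0:
        p.pop()
    return p

# Arbitrary test polynomials standing in for S^(1), S^(2), S^(3).
S1, S2, S3 = [1, 2, 0, 4], [0, 3, 5], [7, 0, 1]

# Composite fields, following the definitions above with grad -> d/dx.
S1_hat = S1
S2_hat = add(deriv(S1), S2)
S3_hat = add(deriv(deriv(S1)), deriv(S2), S3)

# Recover S^(i) = S_hat^(i) - grad S_hat^(i-1), with S_hat^(0) = 0.
recovered = [
    trim(S1_hat),
    trim(sub(S2_hat, deriv(S1_hat))),
    trim(sub(S3_hat, deriv(S2_hat))),
]
print(recovered == [trim(S1), trim(S2), trim(S3)])
```

The same cancellation of the lower-order gradient terms carries over to the tensor-valued fields, since only the linearity of ∇ is used.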

Remark 0.8.

While motivated as non-singular fields representing concentrations along 2-d surfaces, the (composite) eigenwall fields admit a completely diffuse description, when necessary, in the theory developed below. In this sense the theory developed in this paper is capable of dealing with some simple aspects of ‘homogenization’ of eigenwall fields to descriptions at a coarser scale.

Physical considerations related to predicting stress fields of terminating twin boundaries and the stress-free, compatible, elastic, twinning shear distortions of through-twin boundaries (Zhang et al. 2018) motivate the introduction of the following Stokes-Helmholtz (SH) decompositions:

$$ \begin{aligned} \left.\begin{array}{ll} S^{(i)} = \nabla H^{(i)} - \chi^{(i)}& \\ curl \, \chi^{(i)} = -curl \, S^{(i)} &\\ div \, \chi^{(i)} = 0 &\\ div \, \nabla H^{(i)} = div \, S^{(i)} &\\ \end{array}\right\}&\qquad x\in B, \qquad i \in \{1,2,3\},\\ \left.\begin{array}{ll} \chi^{(i)} n = 0 & \\ \nabla H^{(i)} n = S^{(i)} n &\\ \end{array}\right\}&\qquad x\in \partial B, \qquad i \in \{1,2,3\}. \end{aligned} $$

We will also consider exactly analogous SH decompositions for the fields

$$ \widehat{S}^{(i)} = \nabla \widehat{H}^{(i)} - \widehat{\chi}^{(i)}, \qquad i \in \{1,2,3\}. $$

Combining (25) and (27) and noting the uniqueness of the SH decomposition we have

$$ H^{(i)} = \widehat{H}^{(i)} - \widehat{S}^{(i-1)}, \qquad i \in \{1,2,3\}, $$

up to at most a spatially constant function of time which we will assume to be a time-independent constant. Defining

$$ \widehat{Y}^{(i)} := Y^{(i)} - H^{(i+1)}, \qquad i \in \{1,2,3\} $$

(noting that \(H^{(4)} = 0\)), we define the i-defect density tensors for \(i \in \{1,2,3\}\) from (23) and (29) as

$$ \begin{array}{ll} & \alpha^{(i)} := - Y^{(i+1)} \cdot_{2} X = curl \, Y^{(i)} + S^{(i+1)} \cdot_{2} X = curl \, \widehat{Y}^{(i)} - \chi^{(i+1)} \cdot_{2} X\\ & \widehat{\alpha}^{(i)} := \alpha^{(i)} - S^{(i+1)} \cdot_{2} X = curl \, Y^{(i)} = - curl \, S^{(i)} = - curl \, \widehat{S}^{(i)} \end{array} $$

using (25) and \(S^{(4)} = \chi^{(4)} = 0\).

Since \(\widehat{\alpha}^{(i)}\) are defined locally as a curl, the local forms of the conservation laws for the topological charge content, \(\int_{\Sigma} \widehat{\alpha}^{(i)} n\, da\), of an arbitrary area patch Σ are given by

$$ \dot{\overline{{\widehat{\alpha}}^{(i)}}} = -curl \left(\widehat{\alpha}^{(i)} \times V^{\parallel(i)} \right), \qquad i \in \{1,2,3\} $$

where \(V^{\parallel(i)}\), for each i, is a vector field representing the velocity of the i-defect density field. Combining (30) and (31), we have that

$$ curl\, \left(\dot{\overline{ S^{(i)}} } - \widehat{\alpha}^{(i)} \times V^{\parallel(i)} \right) = 0 \Longleftrightarrow \dot{\overline{ S^{(i)}} } \,=\, \left(-curl\, S^{(i)}\right) \times V^{\parallel(i)} + \nabla F^{(i)}, \qquad i \in \{1,2,3\} $$

for some \(F^{(i)}\) that can be prescribed. Eqs. 24 and (32) imply

$$ \dot{ \overline{ \widehat{S}^{(i)}}} = \left(-curl\, S^{(i)}\right) \times V^{\parallel(i)} + \nabla F^{(i)} + \sum_{k=1}^{i-1} \nabla^{i-k} \,\dot{ \overline{S^{(k)}}}, \qquad i \in \{1,2,3\} $$

with the last sum vanishing for i=1.

By kinematical arguments related to allowing for transverse motion of walls characterized by localized \(S^{(i)}\) fields on surfaces, a part of \(F^{(i)}\) is of the form \(F^{(i)} = S^{(i)} V^{\perp(i)}\), where \(V^{\perp(i)}\) is the velocity of the i-eigenwall field. Guided by simplicity in thermodynamic arguments that precludes the appearance of (unremovable) gradients of dislocation and eigenwall velocity fields in the expression for dissipation of the body (see “Thermodynamics” section), we make the following choice

$$ \nabla F^{(i)} := \nabla \left(S^{(i)} V^{\perp(i)} \right) - \sum_{k=1}^{i-1} \nabla^{i-k} \,\dot{ \overline{S^{(k)}}}, \qquad i \in \{1,2,3\}. $$

In (32) and (33), incorporating (34), \(V^{\parallel(i)}\) and \(V^{\perp(i)}\) are to be constitutively specified, minimally consistent with the second law of thermodynamics to be globally satisfied for all processes of any body modeled by this theory.

Surfaces of displacement discontinuity (e.g. stacking faults) are not known to move transverse to themselves; moreover, such discontinuities are often not identifiable based on knowledge of only the current state (and not of the distinguished coherent reference from which displacements are measured). Hence, we will assume \(V^{\perp(1)} \equiv 0\). Elastic phase boundaries, i.e. localizations of the \(S^{(2)}\) field along surfaces, are known to move transverse to themselves, and not much is known about transverse motions of surfaces of discontinuity of the second gradient of elastic distortion, i.e. surfaces of inflection. Thus, we allow \(V^{\perp(i)}, i = 2,3\) to be nonvanishing fields in general. Hence, we have the following evolution equations for the eigenwall fields:

$$ \begin{array}{ll} \dot{ \overline{{S}^{(1)}}} & = \left(-curl\, S^{(1)}\right) \times V^{\parallel(1)} = \left(-curl\, \widehat{S}^{(1)}\right) \times V^{\parallel(1)} = \dot{ \overline{ \widehat{S}^{(1)}}}\\ \dot{ \overline{{S}^{(2)}}} + \nabla \, \dot{ \overline{S^{(1)}}} & = \left(-curl\, S^{(2)}\right) \times V^{\parallel(2)} + \nabla \left(S^{(2)} V^{\perp(2)} \right) \\ & = \left(-curl\, \widehat{S}^{(2)}\right) \times V^{\parallel(2)} + \nabla \left(\left(\widehat{S}^{(2)} - \nabla \widehat{S}^{(1)} \right) V^{\perp(2)} \right) = \dot{ \overline{ \widehat{S}^{(2)}}}\\ \dot{ \overline{{S}^{(3)}}} + \nabla^{2} \,\dot{ \overline{S^{(1)}}} + \nabla\, \dot{ \overline{S^{(2)}}} & = \left(-curl\, S^{(3)}\right) \times V^{\parallel(3)} + \nabla \left(S^{(3)} V^{\perp(3)} \right) \\ & = \left(-curl\, \widehat{S}^{(3)} \right) \times V^{\parallel(3)} + \nabla \left(\left(\widehat{S}^{(3)} - \nabla \widehat{S}^{(2)} \right) V^{\perp(3)} \right)= \dot{ \overline{ \widehat{S}^{(3)}}}\\ \end{array} $$

Thermodynamics


We assume a free-energy density function of the body with the following dependencies:

$$ \begin{array}{ll} \psi &= \psi^{*} \left(\widehat{Y}^{(1)},\widehat{Y}^{(2)},\widehat{Y}^{(3)},\widehat{S}^{(1)},\widehat{S}^{(2)},\widehat{S}^{(3)},\widehat{\alpha}^{(1)},\widehat{\alpha}^{(2)},\widehat{\alpha}^{(3)}, \chi^{(2)}, \chi^{(3)} \right)\\ & = \psi^{**} \left(Y^{(1)},Y^{(2)},Y^{(3)}, H^{(2)}, H^{(3)}, \widehat{S}^{(1)},\widehat{S}^{(2)},\widehat{S}^{(3)},\widehat{\alpha}^{(1)},\widehat{\alpha}^{(2)},\widehat{\alpha}^{(3)}, \chi^{(2)}, \chi^{(3)} \right)\\ &= \psi\left(\nabla u, \nabla^{2} u, \nabla^{3} u, \widehat{H}^{(2)}, \widehat{H}^{(3)}, \widehat{S}^{(1)},\widehat{S}^{(2)},\widehat{S}^{(3)},\widehat{\alpha}^{(1)},\widehat{\alpha}^{(2)},\widehat{\alpha}^{(3)}, \chi^{(2)}, \chi^{(3)} \right), \end{array} $$

using (29), (24), (28), and noting that H(4)=0 (where the argument fields of each of the functions are evaluated at (x,t) to give the value of ψ(x,t)). Roughly speaking, the dependencies of ψ on \(\widehat {Y}^{(i)}, \widehat {\alpha }^{(i)}, i = 1,2,3\) are expected to be convex and those on \(\widehat {S}^{(i)}, i = 1,2,3\) to be multi-well, nonconvex.

The balances of linear and angular momentum are given by

$$ \begin{array}{ll} \rho \dot{v} & = div \, T + b_{f} = \rho \ddot{u}\\ 0 & = div \, \Lambda - X \cdot_{2} T + K \end{array} $$

where ρ is the mass density, v is the material velocity vector, T is the stress, Λ is the couple stress, and bf,K are the body force and body-couple densities per unit volume, respectively. As usual in solid mechanics, we assume balance of mass is satisfied once the deformation map at any instant is determined by evaluating the density field on the deforming body from the formula \(\rho = \frac {\rho _{0}}{det(I + \nabla u)}\), where ρ0 is the density field on the reference configuration.
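
The density formula quoted above is straightforward to evaluate pointwise once the displacement gradient is known. The following minimal sketch does so for a 3-d body; the function names `det3` and `current_density` and the sample displacement gradients are illustrative assumptions.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def current_density(rho0, grad_u):
    """rho = rho0 / det(I + grad u) at a single material point."""
    F = [[(1.0 if i == j else 0.0) + grad_u[i][j] for j in range(3)]
         for i in range(3)]
    J = det3(F)
    if J <= 0.0:
        raise ValueError("det(I + grad u) must be positive")
    return rho0 / J

# Illustrative cases: a small uniform dilatation and a simple shear.
dilatation = [[0.01, 0.0, 0.0], [0.0, 0.01, 0.0], [0.0, 0.0, 0.01]]
shear = [[0.0, 0.3, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
print(current_density(1000.0, dilatation))
print(current_density(1000.0, shear))
```

A simple shear leaves the determinant equal to one, so the density is unchanged, whereas the dilatation lowers it.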

The mechanical power supplied to the body is defined as (Mindlin and Tiersten 1962)

$$\begin{array}{ll} {\sf P} & := \int_{B} b_{f} \cdot_{1} v \,dv + \int_{\partial B} (T n) \cdot_{1} v \, da + \int_{\partial B} (\Lambda n) \cdot_{1} \omega \, da + \int_{B} K \cdot_{1} \omega \, dv\\ & = \int_{B} \rho \dot{v} \cdot_{1} v \, dv + \int_{B} \left[ T \cdot_{2} D + \Lambda \cdot_{2} M \right] \, dv, \end{array} $$

using the balances of linear and angular momentum, where n is the outward unit normal to the boundary of the body, \(\omega := \frac{1}{2} curl\, v = -\frac{1}{2} X \cdot_{2} \Omega\) is the rotation vector where \(\Omega := \frac{1}{2} \left(\nabla v - (\nabla v)^{T} \right)\) is the rotation-rate tensor, \(D := \frac{1}{2} \left(\nabla v + (\nabla v)^{T} \right)\) is the strain-rate tensor, and \(M := \nabla \omega\). Denoting

$${\sf F} = \int_{B} \psi \, dv; \qquad \qquad {\sf K} = \int_{B} \frac{1}{2} \rho v \cdot_{1} v \, dv $$

the mechanical dissipation, \({\sf D}\), or the difference between the power supplied to the body and that stored in it, is given by

$$ {\sf D} := {\sf P} - \dot{\overline{\sf K + \sf F}} = \int_{B} \left(T \cdot_{2} D + \Lambda \cdot_{2} M - \dot{\psi} \right) \, dv. $$

In the following, we deduce guidelines for constitutive specification in our model that ensure that the mechanical dissipation vanishes in the absence of eigenwall and defect field evolution in any process and is positive otherwise, a minimal necessary condition for the mathematical model to be well-posed.

To facilitate the derivation of the thermodynamic driving forces for the various defect density and eigenwall fields, we will need the following auxiliary fields \(P^{(i)}, i \in \{2,3\}\), defined by the solutions of the following Poisson equations:

$$ \left.\begin{array}{rr} div \, \nabla P^{(i)}& = \partial_{\widehat{H}^{(i)}} \psi \ \qquad x \in B \\ \nabla P^{(i)}\,n & = 0 \qquad \qquad x \in \partial B \end{array}\right\} \qquad i \in \{2,3\}, $$

which requires that the free-energy density function should satisfy the constraint

$$\int_{B} \partial_{\widehat{H}^{(i)}} \psi \, dv = 0, \qquad i \in \{2,3\}. $$

(This is formally easily arranged by taking any arbitrary \(\tilde{\psi}\) with the dependencies of (36)\(_{3}\), and defining \(\psi = \tilde{\psi} - \sum_{i = 2}^{3} \left(|\Omega|^{-1} \int_{\Omega} \partial_{\widehat{H}^{(i)}} \tilde{\psi} \, dv \right) \cdot_{i} \widehat{H}^{(i)}\), but its physical and rigorous mathematical implications need to be understood).

Defining \(R^{(i)} := \partial_{\chi^{(i)}} \psi\), the fields \(W_{R^{(i)}}\) satisfying

$$ \left.\begin{array}{rr} curl \, curl \, W_{R^{(i)}} =- div \, \nabla \, W_{R^{(i)}} & = curl \, R^{(i)} \qquad x \in B \\ div \, W_{R^{(i)}} & = 0 \quad \qquad \qquad x \in B \\ W_{R^{(i)}} \times n &= 0 \quad\qquad \qquad x \in \partial B \end{array}\right\} \qquad i \in \{2,3 \} $$

(that exist by a unique Stokes-Helmholtz resolution of \(R^{(i)}\)), will also be required in the sequel for deriving the thermodynamic driving forces.

A long computation involving (36)\(_{3}\) and the kinematics of the model defined in “Kinematics” section reveals that the mechanical dissipation may be expressed in the suggestive form

$$\begin{array}{*{20}l} {\sf D} = & \int_{B} \left[ T - \partial_{\,\nabla u} \psi + div\, \partial_{\, \nabla^{2} u} \psi - div\, div\, \partial_{\, \nabla^{3} u} \psi\right]^{(s)} \cdot_{2} D \, dv \end{array} $$
$$\begin{array}{*{20}l} & + \int_{B} \left[ -\frac{1}{2} X \cdot_{1} \Lambda^{dev} - \partial_{\, \nabla^{2} u} \psi + div\, \partial_{\, \nabla^{3} u} \psi \right]^{(a)} \cdot_{3} \nabla \Omega \, dv \end{array} $$
$$\begin{array}{*{20}l} & + \int_{\partial B} \left[ - \partial_{\, \nabla^{2} u} \psi \,n + (div \,\partial_{\, \nabla^{3} u} \psi)\, n \right]^{(s)} \cdot_{2} D \, da \ + \ \int_{\partial B} \left[ - \partial_{\, \nabla^{3} u} \psi \, n \right] \cdot_{3} \nabla^{2} v \, da \end{array} $$
$$\begin{array}{*{20}l} & + \int_{B} \sum_{i = 1}^{3} \left[ X \left(\left(- \partial_{\widehat{S}^{(i)}} \psi + curl\, \partial_{\widehat{\alpha}^{(i)}} \psi \right)^{T} \cdot_{i} \widehat{\alpha}^{(i)} \right) \right] \cdot_{1} V^{\parallel (i)} \, dv \end{array} $$
$$\begin{array}{*{20}l} & + \int_{B} \sum_{i = 1}^{3} \left[ \left(div \, \partial_{\widehat{S}^{(i)}} \psi \right) \cdot_{i} \widehat{S}^{(i)} \right] \cdot_{1} V^{\perp (i)} \, dv \end{array} $$
$$\begin{array}{*{20}l} & + \int_{\partial B} \sum_{i = 1}^{3} \left[ X \left(\left(\partial_{\widehat{\alpha}^{(i)}} \psi \times n \right)^{T} \cdot_{i} \widehat{\alpha}^{(i)} \right) \right] \cdot_{1} V^{\parallel (i)} \, da \end{array} $$
$$\begin{array}{*{20}l} & + \int_{\partial B} \sum_{i = 1}^{3} \left[ - \left(\partial_{\widehat{S}^{(i)}} \psi \, n \right) \cdot_{i} \widehat{S}^{(i)} \right] \cdot_{1} V^{\perp (i)} \, da \end{array} $$
$$\begin{array}{*{20}l} & + \int_{B} \sum_{i=2}^{3} \left[ X \left(\left(\nabla P^{(i)} \right)^{T} \cdot_{i} \widehat{\alpha}^{(i)} \right) \right] \cdot_{1} V^{\parallel (i)} \, dv \end{array} $$
$$\begin{array}{*{20}l} & + \int_{B} \sum_{i=2}^{3} \left[ \left(- \partial_{\widehat{H}^{(i)}} \psi \right) \cdot_{i} S^{(i)} \right] \cdot_{1} V^{\perp (i)} \, dv \end{array} $$
$$\begin{array}{*{20}l} & + \int_{B} \sum_{i=2}^{3} \left[ X \left(\bigg(curl \, W_{R^{(i)}} \bigg)^{T} \cdot_{i} \widehat{\alpha}^{(i)} \right) \right] \cdot_{1} V^{\parallel (i)} \, dv. \end{array} $$

Thus, a set of constitutive equations, driving forces for dissipative mechanisms (denoted below by the symbol \(\leadsto\)), and some boundary conditions for the model are

$$ T^{(s)} = \left[ \partial_{\,\nabla u} \psi - div\, \partial_{\, \nabla^{2} u} \psi + div\, div\, \partial_{\, \nabla^{3} u} \psi\right]^{(s)} $$
$$ \Lambda^{dev} = - X \cdot_{2} \left[ \partial_{\, \nabla^{2} u} \psi - div\, \partial_{\, \nabla^{3} u} \psi \right]^{(a)} $$
$$ \left. \left[ - \partial_{\, \nabla^{2} u} \psi \,n + (div \,\partial_{\, \nabla^{3} u} \psi)\, n \right]^{(s)} \right\vert_{\partial B} = 0 $$
$$ \left. \left(\partial_{\, \nabla^{3} u} \psi \right) \, n \right\vert_{\partial B} = 0 $$
$$ \begin{aligned} & \left.\begin{array}{ll} V^{\parallel (i)} & \leadsto X \left(\left(- \partial_{\widehat{S}^{(i)}} \psi + curl\, \partial_{\widehat{\alpha}^{(i)}} \psi \right)^{T} \cdot_{i} \widehat{\alpha}^{(i)} \right)\\ V^{\perp (i)} & \leadsto \left(div \, \partial_{\widehat{S}^{(i)}} \psi \right) \cdot_{i} \widehat{S}^{(i)}\\ \end{array}\right\}, \qquad i = 1 \\ & \left.\begin{array}{ll} V^{\parallel (i)} & \leadsto X \left(\left(- \partial_{\widehat{S}^{(i)}} \psi + curl\, \partial_{\widehat{\alpha}^{(i)}} \psi + \nabla P^{(i)} + curl \, W_{R^{(i)}} \right)^{T} \cdot_{i} \widehat{\alpha}^{(i)} \right)\\ V^{\perp (i)} & \leadsto \left(div \, \partial_{\widehat{S}^{(i)}} \psi - \partial_{\widehat{H}^{(i)}} \psi \right) \cdot_{i} \widehat{S}^{(i)} \end{array}\right\}, \qquad i = 2,3\\ \end{aligned} $$
$$ \left.\begin{array}{ll} \left. V^{\parallel (i)} \right\vert_{\partial B} & \leadsto X \left(\left(\partial_{\widehat{\alpha}^{(i)}} \psi \times n \right)^{T} \cdot_{i} \widehat{\alpha}^{(i)} \right)\\ \left. V^{\perp (i)} \right\vert_{\partial B} & \leadsto - \left(\partial_{\widehat{S}^{(i)}} \psi \, n \right) \cdot_{i} \widehat{S}^{(i)} \end{array}\right\}, \qquad i = 1,2,3 $$

(it can be checked that the rhs of (52) is deviatoric). Eqs. 51-(54), along with the constitutive choices for the defect and eigenwall velocities to be in the direction of their respective driving forces, mediated by a positive mobility/drag scalar required on dimensional grounds, ensure non-negative dissipation. Of course, other choices consistent with positive dissipation are possible as well. The boundary conditions (53)-(54) are not the most general, but a compromise between including higher order stress tensors with dubious physical meaning beyond couple stresses and simplicity in an already involved higher order theory of defects.
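
The simplest constitutive choice mentioned above, a velocity taken parallel to its driving force through a positive scalar mobility, can be illustrated pointwise. In the sketch below the driving force is just a given vector; the function names, the mobility value, and the sample force are hypothetical and stand in for the expressions marked by \(\leadsto\).

```python
def velocity_from_force(force, mobility):
    """Velocity taken in the direction of its driving force: V = mobility * force."""
    if mobility <= 0:
        raise ValueError("mobility must be positive")
    return [mobility * f for f in force]

def dissipation_rate(force, velocity):
    """Pointwise contribution force . V to the dissipation integrand."""
    return sum(f * v for f, v in zip(force, velocity))

# Illustrative driving force at one point and a positive mobility.
force = [3, -4, 0]
V = velocity_from_force(force, mobility=2)
print(V, dissipation_rate(force, V))
```

With this choice each contribution equals the mobility times the squared magnitude of the driving force, so it cannot be negative and vanishes only when the driving force does.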

It is clear from (51)-(52) and (37) that the governing equations lead to sixth-order pde in the displacement field u (see “Example: a model of branch-point defects in an elastic body” section below).

Remark 0.9.

A minimal set of field variables to be evolved in the model is \((u, \widehat{S}^{(i)}, i = 1,2,3)\) governed by (37) and (35), with \(\widehat{H}^{(1)}, \widehat{H}^{(2)}\) determined from (27), \(W_{R^{(i)}}, i = 2,3\) determined from (40), and \(\widehat{\alpha}^{(i)}, i = 1,2,3\) determined from (30).

Remark 0.10.

The composite eigenwall fields are coupled to each other through (35) and through the displacement field, appearing in the driving forces for the defect and eigenwall velocity fields, governed by (37). The results of “Motivation for kinematics of the theory” section show how the presence of a higher order defect (characterized by Γ≠0 in a non-simply connected domain) induces a lower order defect (\(\llbracket y \rrbracket \neq 0\)) that, in general, induces stress in the body (Remark 0.3).

Remark 0.11.

A theory of only surfaces of inflection and singularities arises by assuming \(\widehat{S}^{(1)} = 0\) and \(\widehat{S}^{(2)} = 0\). A theory of only dislocations arises by setting \(\widehat{S}^{(3)} = 0\) and \(\widehat{S}^{(2)} = 0\) along with \(V^{\perp(2)} = 0\). A theory of only g.disclinations arises by setting \(\widehat{S}^{(1)} = 0\) and \(\widehat{S}^{(3)} = 0\) along with \(V^{\perp(3)} = 0\). Pair-wise coupled defect theories (dislocations + g.disclinations, dislocations + branch/inflection defects, g.disclinations + branch/inflection defects) can be obtained by similar means.

Example: a model of branch-point defects in an elastic body

We assume the as-received body as the reference configuration with all displacements measured from it; in particular, we assume that u(x,0)=0. We specialize the general formalism to a specific case by making the simplest possible choice for the free energy density (36) that shows the generalization of incompatible elasticity achieved by our work:

$$ {\begin{aligned} \psi &= \frac{1}{2} \left(\nabla u - \widehat{S}^{(1)} \right) C \left(\nabla u - \widehat{S}^{(1)} \right) + \frac{1}{2} c_{2} \left| \nabla^{2} u - \widehat{S}^{(2)}\right|^{2} \\&\quad+ \frac{1}{2} c_{3} \left| \nabla^{3} u - S \right|^{2} + d_{3} f \left(l^{2} |S| \right) + \frac{1}{2} \epsilon_{3} \left| curl S \right|^{2}, \end{aligned}} $$

with the ansatz that V(i)=0, i=1,2,3 and V(1)=V(2)=0, assumptions that are consistent with non-negative dissipation. Under these conditions \(\widehat{S}^{(i)}, i = 1,2\) do not evolve and remain fixed at their values specified through initial conditions. Let \(\widehat{S}^{(1)}(x, t) = \widehat{S}^{(1)}(x, 0) = \widetilde{g}(x)\) and \(\widehat{S}^{(2)}(x, t) = \widehat{S}^{(2)}(x, 0) = \widetilde{b}(x)\), and we note that \(\widetilde{g}_{sym}\) and \(\widetilde{b}\) are the analogs, for a 3-d body, of the freely specified, non-evolving, g and b tensors of incompatible elasticity described in “Statics and equilibria of non-Euclidean elastic sheets” section; we note that with \(\widetilde{g}\) specified, \(\widehat{S}^{(2)}(x, 0)\) can be arbitrarily specified by making a suitable choice of the field \(S^{(2)}(x,0)\). We then have \(\widehat{S}^{(3)} = S^{(3)} + \nabla \widehat{S}^{(2)} =: S\). Here, C is the 4th-order tensor of elastic moduli with major and minor symmetries, \(c_{2}, c_{3}\) are non-negative scalars (in place of sixth and eighth order tensors!), \(d_{3}\) is a positive scalar (that could also be a positive scalar-valued function of \(|curl\, S|\)), and \(l, \epsilon_{3}\) are positive scalars. The physical dimensions of \(c_{2}, c_{3}, d_{3}, l, \epsilon_{3}\) are stress.(length)\(^{2}\), stress.(length)\(^{4}\), stress, length, and stress.(length)\(^{6}\), respectively. Since the equilibria we envisage are of nominally elastic bodies that show non-trivial shapes under no applied loads, f is generally expected to be a multi-well nonconvex function with the bottom of one well at the argument 0.
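
The stated physical dimensions of the material constants can be checked with a little bookkeeping. The sketch below tracks dimensions as pairs (power of stress, power of length), under the assumptions that u is a length, each spatial gradient contributes one inverse length, S carries the dimensions of the third gradient of u, and f is dimensionless. All names in the code are illustrative.

```python
STRESS = (1, 0)          # (power of stress, power of length)
LENGTH = (0, 1)
DIMENSIONLESS = (0, 0)

def times(*dims):
    """Dimension of a product: exponents add."""
    return (sum(d[0] for d in dims), sum(d[1] for d in dims))

def power(dim, n):
    """Dimension of a quantity raised to the n-th power."""
    return (dim[0] * n, dim[1] * n)

def gradient_of_u(k):
    """k-th spatial gradient of a displacement: length**(1 - k)."""
    return (0, 1 - k)

# Dimensions of the constants as stated in the text.
c2, c3, d3, l, eps3 = (1, 2), (1, 4), STRESS, LENGTH, (1, 6)
S = gradient_of_u(3)             # S is subtracted from grad^3 u
curl_S = times(S, (0, -1))       # one further spatial derivative

terms = {
    "c2 |grad^2 u - S2_hat|^2": times(c2, power(gradient_of_u(2), 2)),
    "c3 |grad^3 u - S|^2": times(c3, power(S, 2)),
    "d3 f(l^2 |S|)": d3,
    "eps3 |curl S|^2": times(eps3, power(curl_S, 2)),
}
f_argument = times(power(l, 2), S)

for name, dim in terms.items():
    print(name, dim == STRESS)
print("l^2 |S| dimensionless:", f_argument == DIMENSIONLESS)
```

Every term carries the dimensions of stress, i.e. energy per unit volume, and the argument of f is dimensionless, in agreement with the list of dimensions given above.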

Thus we are looking for the mechanics of surfaces of inflection and branch line defects in bodies with an evolving stress-free reference characterized by the choices \(\widehat {S}^{(1)}_{sym} = \widetilde {g}_{sym}\), \(\widehat {S}^{(2)} = \widetilde {b}\), \(\widehat {S}^{(3)} = S\). It is interesting to note that even when \(\widetilde {g} = 0\) and \(\widetilde {b} = 0\), the energy/stress-free reference for our body is never immersible in three-dimensional Euclidean space whenever \(S \neq 0\), i.e. the stress-free state is necessarily incompatible or non-realizable, since it is impossible to construct a displacement field of a 3-d body with vanishing strain, i.e., \((\nabla u)^{(s)} = 0\), whose third gradient is non-vanishing.
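
The impossibility asserted above can be checked with a standard identity of linearized kinematics. The following is a sketch in index notation, where the symbol \(\varepsilon := (\nabla u)^{(s)}\) is introduced only for this check and is not part of the notation of the paper. The second gradient of a displacement field is determined by the gradient of its strain:

$$ u_{i,jk} = \varepsilon_{ij,k} + \varepsilon_{ik,j} - \varepsilon_{jk,i}, \qquad \varepsilon_{ij} := \frac{1}{2} \left(u_{i,j} + u_{j,i} \right). $$

Hence \(\varepsilon = 0\) implies \(\nabla^{2} u = 0\) and therefore \(\nabla^{3} u = 0\), so that a non-vanishing S cannot be matched by the third gradient of any strain-free displacement field.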

The balances of linear and angular momentum (37) are solved by taking a curl of (37)2 to obtain

$$div \, T^{(a)} = \frac{1}{2}\, curl \left(div \, \Lambda^{dev} \right) + \frac{1}{2} curl \, K, $$

that on substitution in (37)1 leads to

$$ \rho \ddot{u} = div\, T^{(s)} + \frac{1}{2} \, curl \left(div \, \Lambda^{dev} \right) + \frac{1}{2} curl \, K + b_{f}. $$

Constitutive Eqs. (51)-(52) are used to solve for a displacement field from (58) (when the defect fields are assumed given), thus satisfying (37)1, and (37)2 is then satisfied, in terms of this displacement field, by simply evaluating \(T^{(a)}\) from the equation

$$ X \cdot_{2} T^{(a)} - \frac{1}{3} \nabla (tr\Lambda) = div \, \Lambda^{dev} + K, $$

making the assumption that the constitutively undetermined \(tr\, \Lambda = 0\), without loss of generality.

For the constitutive choice (57)

$$ {{}\begin{aligned} \Lambda^{dev} & = -c_{2} \, X \cdot_{2} \left(\nabla^{2} u - \widetilde{b} \right)^{(a)} + c_{3}\, X\cdot_{2} \left(div (\nabla^{3} u) \right)^{(a)} - c_{3}\, X \cdot_{2} (div \, S)^{(a)}; \\ \Lambda^{dev}_{il} & = e_{ijk} \left(-c_{2} (u_{[j,k]l} - \widetilde{b}_{[jk]l}) + c_{3} \, u_{[j,k]lmm} - c_{3} \,S_{[jk]lm,m} \right); \\ \frac{1}{2} \left(curl \, \left(div \, \Lambda^{dev}\right) \right)_{i} & = - c_{2} (u_{[i,m]llm} - \widetilde{b}_{[im]l,ml})+ c_{3} \, u_{[i,m]llppm} - c_{3} \, S_{[im]lp,plm} \end{aligned}} $$


$$ {{}\begin{aligned} T^{(s)} & = C (\nabla u - \widetilde{g}) - c_{2} \left(div \, \left(\nabla^{2} u - \widetilde{b} \right) \right)^{(s)} + c_{3} \left(div \, div \, \nabla^{3} u \right)^{(s)} - c_{3} \left(div \, div \, S \right)^{(s)}; \\ T^{(s)}_{im} & = C_{imkl} (u_{k,l} - \widetilde{g}_{kl}) - c_{2} (u_{(i,m)ll} - \widetilde{b}_{(im)l,l}) + c_{3} \, u_{(i,m)lppl} - c_{3}\, S_{(im)lp,pl};\\ \left(div \, T^{(s)} \right)_{i} & = C_{imkl} \left(u_{k,lm} - \widetilde{g}_{kl,m} \right) \,-\, c_{2} \left(u_{(i,m)llm} - \widetilde{b}_{(im)l,lm} \right) + c_{3} \, u_{(i,m)lpplm} - c_{3}\, S_{(im)lp,plm} \end{aligned}} $$

so that the governing equation for the displacement field (58) may be written as

$$ \rho \ddot{u} \,=\, c_{3} \, \Delta^{3} u - c_{2} \, \Delta^{2} u + div \, (C \nabla u) - div \, C \, \widetilde{g} + c_{2} \, div \, div \, \widetilde{b} - c_{3}\, div \, div \, div \, S + \frac{1}{2} curl \, K \,+\, b_{f}, $$

where \(\Delta^{3}\) (\(\Delta^{3}(\cdot) = (\cdot)_{,iijjkk}\)) and \(\Delta^{2}\) (\(\Delta^{2}(\cdot) = (\cdot)_{,iijj}\)) are the triharmonic and the biharmonic operators, respectively.

To develop the evolution equation for the field S we assume V(3)=0 for simplicity. Since ψ in (57) does not depend on H(3) and χ(3), we have P(3)=0 and \(\phantom {\dot {i}\!}W_{R^{(3)}} = 0\) in (55)3. The governing equation for the evolution of S therefore is given by

$$ \dot{S} = \frac{1}{B} \, curl \, S \times \left(X \left(\left(c_{3}\, (\nabla^{3} u - S) - d_{3} \,l^{2} f' \!\left(l^{2}|S| \right) \frac{S}{|S|} \,-\, \epsilon_{3} \,curl \, curl \, S \right)^{T} \cdot_{3} curl \, S \right) \right), $$

where B is a drag coefficient with physical dimensions of stress.(length)−2.time.

A detail needs to be attended to in the above considerations. The thermodynamic relation (52) requires that

$$ \left(X \cdot_{2} \left[ \partial_{\, \nabla^{2} u} \psi - div\, \partial_{\, \nabla^{3} u} \psi \right]^{(a)} \right)^{dev} = 0. $$

For the constitutive choice (57), (64) is equivalent to

$$-c_{2} \, e_{ljk} \widehat{S}^{(2)}_{[jk]l} + c_{3} \, e_{ljk} S_{[jk]lm,m} = 0. $$

Since the evolution Eq. 63 for S is of the form \(\dot {S} = - curl\, S \times V\) for a vector field V, it is true that \(X \cdot _{3} \dot {S}^{(a)} = - curl \left (X \cdot _{3} S^{(a)} \right) \times V\) and hence \(X \cdot_{3} S^{(a)}(x,t) = 0\) is consistent with the evolution for initial data \(X \cdot_{3} S^{(a)}(x,0) = 0\), and we adopt this solution. Furthermore, we assume \( X \cdot _{3} \widetilde {b}^{(a)} = 0\) by specification so that \( X \cdot _{3} \widehat {S}^{(2)(a)}(x,t) = 0\) and therefore (64) is satisfied.

Remark 0.12.

Spatial derivatives of the 3-eigenwall field serve as a source term in (62); for instance, if \(S(x) = g(\nu \cdot x)\, b \otimes \nu \otimes \nu \otimes \nu\), where ν is the unit normal to a planar surface, g is a scalar-valued function of the spatial coordinate along ν given by \(\zeta = \nu \cdot x\) (say a Gaussian centered at \(\zeta = 0\)), and b is a constant vector, this forcing is of the form \(\frac {d^{3} g}{d\zeta ^{3}} b\).

Eq. 63 implies that there is no evolution of the eigenwall field at locations where \(curl\, S = 0\), regardless of the energetic driving force there. For example, the field \(S(x) = g(\nu \cdot x)\, b \otimes \nu \otimes \nu \otimes \nu\) has no ‘longitudinal’ variation and does not evolve according to (63). However, \(S(x) = g(t \cdot x)\, g(\nu \cdot x)\, b \otimes \nu \otimes \nu \otimes \nu\), where t is orthogonal to ν, does evolve. Physically, the eigenwall field is ‘dragged’ by the evolution of its core.
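
Both statements of this remark follow from a short index computation, sketched here for \(S = g(\zeta)\, b \otimes \nu \otimes \nu \otimes \nu\) with \(\zeta = \nu \cdot x\) and \(|\nu| = 1\). The component form \((curl\, S)_{ijkm} = e_{mrl} S_{ijkl,r}\) used below (curl acting on the last index) is an assumption about the convention; a different ordering or sign would not change the conclusion.

$$ \begin{aligned} (div \, div \, div \, S)_{i} & = S_{ijkl,jkl} = \frac{d^{3} g}{d\zeta^{3}} \, b_{i} \, (\nu_{j} \nu_{j}) (\nu_{k} \nu_{k}) (\nu_{l} \nu_{l}) = \frac{d^{3} g}{d\zeta^{3}} \, b_{i}, \\ (curl \, S)_{ijkm} & = e_{mrl} \, S_{ijkl,r} = \frac{d g}{d\zeta} \, b_{i} \, \nu_{j} \nu_{k} \left(e_{mrl} \, \nu_{r} \nu_{l} \right) = 0. \end{aligned} $$

The first line gives the forcing in (62) up to the factor \(-c_{3}\); the second vanishes because the alternator is contracted with the symmetric product \(\nu_{r} \nu_{l}\), which is why this field does not evolve under (63).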

Remark 0.13.

The as-received body (where \(u = 0\)) need not be in equilibrium for generally specified eigenwall initial data \( \left (\widetilde {g}, \widetilde {b}, S(\cdot, 0) \right)\) for which the initial acceleration field can be evaluated. The class of initial data that leads to a self-equilibrated, generally stressed reference configuration may be obtained by writing \(\widetilde {g} := \nabla z + g^{*}\), and solving for the vector field z from (62) (and boundary conditions) with \(\rho := 0\) for each choice of \(\left (g^{*}, \widetilde {b}, S(\cdot, 0) \right)\).

Remark 0.14.

The model with the ansatz \(\widehat {S}^{(1)} = 0, \widehat {S}^{(2)}=0\), is worthy of study on its own merits.

Remark 0.15.

Configurations rendering local minima of a body with the energy density (57) may be studied by an (\(L^{2}\)) gradient flow dynamics in the fields \(\left (u, \widehat {S}^{(i)}; i = 1,2,3 \right)\) starting from arbitrarily specified initial states for these variables.
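
The idea of reaching different local minima from different initial states can be illustrated with a deliberately crude scalar caricature; it is not the model of the paper. A single number `s` stands in for one component of S, the displacement is frozen at zero, the curl term is dropped, and the double-well `f(s) = s**2 * (1 - s)**2` as well as all parameter values are hypothetical choices made only for this sketch.

```python
def energy_gradient(s, c3, d3):
    """dE/ds for the toy energy E(s) = 0.5*c3*s**2 + d3*f(s),
    with the hypothetical double-well f(s) = s**2 * (1 - s)**2."""
    return c3 * s + 2.0 * d3 * s * (1.0 - s) * (1.0 - 2.0 * s)


def gradient_flow(s0, c3=0.01, d3=1.0, dt=0.01, steps=20000):
    """Explicit-Euler integration of ds/dt = -dE/ds starting from s0."""
    s = s0
    for _ in range(steps):
        s -= dt * energy_gradient(s, c3, d3)
    return s


# Two initial states on either side of the energy barrier near s = 0.5.
s_low = gradient_flow(0.2)   # relaxes towards the well at s = 0
s_high = gradient_flow(0.9)  # relaxes towards the well close to s = 1
print(s_low, s_high)
```

The quadratic term shifts the second well slightly below 1, so the two runs end in distinct local minima of the same energy, which is the qualitative behavior the remark refers to.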

Remark 0.16.

The governing Eq. 62 implies that, when the elastic modulus C is homogeneous and isotropic, given by \(C_{ijkl} = \lambda \, \delta_{ij} \delta_{kl} + \mu \left(\delta_{ik} \delta_{jl} + \delta_{il} \delta_{jk} \right)\), plane waves of \(curl\, u\) and \(div\, u\) are dispersive in nature, with propagation possible in any direction in space. The dilatational waves (i.e., waves of \(div\, u\)) with wave number |k| and direction \(\frac {k}{|k|}\) propagate with velocity

$$c_{d} := \pm \sqrt{\frac{c_{3} |k|^{4} + c_{2} |k|^{2} + (\lambda + 2\mu)}{\rho}} $$

while the equivoluminal waves or ‘shear waves’ (i.e., vectorial waves of curlu) propagate with velocity

$$c_{s} := \pm \sqrt{\frac{c_{3} |k|^{4} + c_{2} |k|^{2} + \mu}{\rho}}. $$

Continuous dependence w.r.t. initial data of the Cauchy problem for the evolution of displacement requires \(c_{3} \geq 0\). When \(c_{3} = 0\), \(c_{2}\) must be non-negative, with the requirement that \(\mu \geq 0\) and \(\lambda + 2\mu \geq 0\) if \(c_{2} = 0\). Within these parameter regimes, linear instabilities can arise for wavenumber and parameter combinations resulting in \(c_{d}\) or \(c_{s}\) taking complex values.
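
The two velocity formulas of this remark are easy to tabulate. The sketch below evaluates them for a few wavenumbers and reports `None` when the radicand is negative, i.e. when the velocity would be complex. The function name and all numerical values are hypothetical and non-dimensional, chosen only for illustration.

```python
import math


def phase_speed(k, modulus, c2, c3, rho):
    """Magnitude of the phase velocity for wavenumber magnitude k.

    modulus is lambda + 2*mu for dilatational waves and mu for shear waves.
    Returns None when the radicand is negative (complex velocity).
    """
    radicand = (c3 * k**4 + c2 * k**2 + modulus) / rho
    if radicand < 0.0:
        return None  # complex velocity: linearly unstable combination
    return math.sqrt(radicand)


# Hypothetical parameter values, for illustration only.
lam, mu, rho = 1.0, 1.0, 1.0
c2, c3 = 0.5, 0.1

for k in (0.0, 1.0, 2.0, 4.0):
    cd = phase_speed(k, lam + 2.0 * mu, c2, c3, rho)
    cs = phase_speed(k, mu, c2, c3, rho)
    print(f"|k| = {k:3.1f}   c_d = {cd:.4f}   c_s = {cs:.4f}")
```

At \(|k| = 0\) the classical elastic wave speeds are recovered; for \(c_{2}, c_{3} > 0\) both speeds grow with \(|k|\), which is the dispersion described above.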

Uniqueness of the displacement field and boundary conditions

Our model encompasses a model of third-order elasticity in the absence of dissipative defect evolution, and involves the thermodynamically motivated higher-order boundary conditions (53)-(54). Here, we use a uniqueness argument (in a putative smooth class of solutions) to deduce a full set of boundary conditions for the problem (62) when the S field is assumed specified. We abstract the results of the exercise in this special case related to the ‘quadratic’ energy (57) to identify a likely set of sufficiently general boundary conditions for the determination of the displacement field for processes consistent with the general constitutive statement (36).

Consider two solutions \(u^{(1)}\) and \(u^{(2)}\) of (62) corresponding to identical \(S, K, b_{f}, \widetilde {g}, \widetilde {b}\) fields. Denote the difference displacement as \(u := u^{(1)} - u^{(2)}\) and its velocity \(v = \dot {u}\). Then u satisfies

$$\rho \ddot{u} = c_{3} \, \Delta^{3} u - c_{2} \, \Delta^{2} u + div \, (C \nabla u), $$

and taking the inner-product of the difference velocity with the equation and integrating in space, we have

$$\frac{1}{2} \frac{d}{dt} \int_{B} \rho \, v_{i} v_{i} \, dv = \int_{B} C_{imkl} \, u_{k,lm} v_{i} \, dv - \int_{B} c_{2} u_{i,mmll} \, v_{i} \,dv + \int_{B} c_{3} u_{i,mmllpp} \, v_{i} \, dv, $$

which implies

$$\begin{array}{*{20}l} & \frac{1}{2} \frac{d}{dt} \int_{B} \rho \, v_{i} v_{i} \, dv + \int_{B} C_{imkl} \, u_{k,l} v_{i,m} \, dv + \int_{B} c_{2} u_{i,ml} \, v_{i,ml} \,dv + \int_{B} c_{3} u_{i,mlp} \, v_{i,mlp} \, dv \notag\\ & = \quad \! \int_{\partial B} \big(C_{ilkm}\, u_{k,m} - c_{2} \, u_{i,mml} + c_{3} \, u_{i,mmppl} \big) v_{i} \,n_{l} \, da \notag\\ & \quad + \int_{\partial B} \big(c_{2} \, u_{i,lm} - c_{3} u_{i,lppm} \big) v_{i,l} \, n_{m} \, da \notag\\ & \quad + \int_{\partial B} \big(c_{3} \, u_{i,plm} \big) \, v_{i,lp} \, n_{m} \, da. \end{array} $$
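
As an illustration of how the boundary terms arise, the \(c_{2}\) contribution is obtained by applying the divergence theorem twice. This is a sketch that assumes a constant \(c_{2}\) and fields smooth enough for the manipulations:

$$ \begin{aligned} - \int_{B} c_{2} \, u_{i,mmll} \, v_{i} \, dv & = - \int_{\partial B} c_{2} \, u_{i,mml} \, v_{i} \, n_{l} \, da + \int_{B} c_{2} \, u_{i,mml} \, v_{i,l} \, dv \\ & = - \int_{\partial B} c_{2} \, u_{i,mml} \, v_{i} \, n_{l} \, da + \int_{\partial B} c_{2} \, u_{i,lm} \, v_{i,l} \, n_{m} \, da - \int_{B} c_{2} \, u_{i,ml} \, v_{i,ml} \, dv. \end{aligned} $$

The terms involving C and \(c_{3}\) follow in the same way with one and three applications of the divergence theorem, respectively.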

Let us now assume that both \(u^{(1)}\) and \(u^{(2)}\) satisfy (53)-(54) consistent with (57). Then the last line of (65) vanishes due to the boundary condition (54) and the line before that due to (53).

Let the stress field arising from \((u^{(i)}, S)\), \(i = 1, 2\), be \(T^{(i)} = T^{(s)(i)} + T^{(a)(i)}\), in accord with (59), (60), and (61). Then the third line from the bottom of (65) may be interpreted as

$$\int_{\partial B} \Big(T^{(1)}_{il} - T^{(2)}_{il} \Big) v_{i} \, n_{l} \, da $$

and if we now additionally require that solutions satisfy specified tractions and velocities (or displacements) on mutually complementary parts of the boundary of the body for all times, then this term vanishes.

Consequently, we are left with

$${\begin{aligned} &\frac{d}{dt} \left(\frac{1}{2} \int_{B} \rho \, v_{i} v_{i} \, dv + \frac{1}{2} \int_{B} C_{imkl} \, u_{k,l} u_{i,m} \, dv \right. \\&\qquad\qquad\qquad\qquad \left. + \frac{1}{2} \int_{B} c_{2} u_{i,ml} \, u_{i,ml} \,dv + \frac{1}{2} \int_{B} c_{3} u_{i,mlp} \, u_{i,mlp}\, dv \right) = 0 \end{aligned}} $$

and if \(u^{(1)}\) and \(u^{(2)}\) both satisfy specified initial conditions on the displacement and velocity fields, then the bracketed quantity, an integral of sums of squares (in fact, the potential and kinetic energies of the body subjected to the difference displacement), vanishes at all times. This proves that the difference velocity vanishes point-wise, and the initial condition on the difference displacement implies that \(u^{(1)} = u^{(2)}\) for all (x,t). Obviously, the dynamic problem allows the prediction of unique rigid motions. In statics, i.e., when the inertia term is absent, one takes the inner product of the governing equation for the difference displacement with the difference displacement, and obtains, for the same boundary conditions (except only the displacement can now be specified on the part of the boundary complementary to where tractions are specified),

$$\int_{B} C_{imkl} \, u_{k,l} u_{i,m} \, dv + \int_{B} c_{2} u_{i,ml} \, u_{i,ml} \,dv + \int_{B} c_{3} u_{i,mlp} \, u_{i,mlp} \, dv = 0. $$

All integrands are non-negative implying that the strain, or the symmetrized displacement gradient, vanishes (recall the minor symmetries of C) which, by compatibility, further implies that the displacement field is unique if a displacement boundary condition is specified and otherwise it is unique up to an infinitesimally rigid deformation.

Thus, the higher order boundary conditions (53)-(54), along with classical displacement and traction boundary conditions may be expected to define a well-set problem (for the displacement field) in the case of the general constitutive Eq. 36. Of course, the traction now involves a stress tensor that has an antisymmetric part, and is constitutively dependent on higher order displacement gradients.

A ‘plate’ idealization

For simplicity we consider \(\widetilde {g} = 0\) and \(\widetilde {b} = 0\). Let the reference B be a plate of thickness 2t, i.e., \(B = \left\{ (x_{1},x_{2},x_{3}) \, | \, (x_{1},x_{2},0) \in B_{2}, \ x_{3} \in [-t,+t] \right\}\), where \(B_{2}\) is a flat 2-dimensional simply connected domain. Defining the through-the-thickness average of a function as

$$\overline{f}(x_{1},x_{2}) := \frac{1}{2t} \int_{-t}^{+t} f(x_{1},x_{2},x_{3})\, dx_{3} $$

and the notation

$$\left[ f \right]^{+t}_{-t} (x_{1}, x_{2}) := f(x_{1}, x_{2}, +t) - f(x_{1}, x_{2}, -t), $$

we now seek the governing equations for \(\overline {u}\) and \(\overline {S}\), under the ansatz that \(\overline {S} = S\) and \(\overline {\rho } = \rho \), i.e., S and ρ do not vary through the thickness of the plate, and K=b=0. It is also assumed that a component of S vanishes if any of its last three indices takes the value 3. We use the notation that all lowercase Greek indices vary from 1 to 2 while lowercase Latin indices span from 1 to 3.

While not essential, the assumptions \(l = 2t\), \(c_{2} = E t^{2}\) and \(c_{3} = E t^{4}\), where E is the Young’s modulus of the material, can be made to draw an analogy with classical plate theory (the curvature-related elastic energy term in the thickness-integrated expression of (57) would then be proportional to \(t^{3}\)). For \(0 < t \ll 1\), whenever \(S \neq 0\), there is energy and stress in the body, possibly small, and the corresponding thickness-integrated ‘elastic’ energy of the plate (arising from the first three terms in (57)), alternatively the ‘plate elastic energy’, scales as \(t^{5}\), assuming energy is minimized, there are no external forcing or constraints, and \(\epsilon_{3} > 0\) to rule out any possibility of a singular energy. Our governing Eqs. 62 or (66) do not require that energy be minimized, so that the scaling of the thickness-integrated elastic energy w.r.t. t as \(t \to 0\) in the model can well contain lower order bending \(\left (O(t^{3})\right)\), and even stretching \(\left (O(t)\right)\), contributions.
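
The powers of t quoted above can be read off by multiplying each energy coefficient by the plate thickness 2t. The following sketch does this bookkeeping numerically. The function names are hypothetical, and using the scalar E as a stand-in for the stretching modulus C is an assumption made only to expose the scaling.

```python
def plate_constants(E, t):
    """Illustrative choices quoted in the text: l = 2t, c2 = E t^2, c3 = E t^4."""
    return {"l": 2.0 * t, "c2": E * t**2, "c3": E * t**4}


def thickness_integrated_prefactors(E, t):
    """Energy coefficients multiplied by the plate thickness 2t.

    E is used as a scalar stand-in for the moduli C (an assumption).
    """
    c = plate_constants(E, t)
    thickness = 2.0 * t
    return {
        "stretching": thickness * E,         # scales as t
        "bending": thickness * c["c2"],      # scales as t^3
        "third_order": thickness * c["c3"],  # scales as t^5
    }


E = 1.0  # hypothetical Young's modulus in arbitrary units
thick = thickness_integrated_prefactors(E, 0.10)
thin = thickness_integrated_prefactors(E, 0.05)

# Halving t should divide the prefactors by roughly 2, 8 and 32.
for name in thick:
    print(name, thick[name] / thin[name])
```

Only the prefactors are compared here; the actual energies also depend on how the fields themselves vary with t.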

Applying the averaging operator to (62) and noting that

$${\begin{aligned} \begin{array}{ll} u_{i,llppmm} &= u_{i,\alpha \alpha \beta \beta \gamma \gamma} + 3 u_{i,\beta \beta \gamma \gamma 33} + 3 u_{i, \gamma \gamma 3333} + u_{i,333333}\\ u_{i,llpp} &= u_{i,\alpha \alpha \beta \beta} + 2 u_{i,\beta \beta 33} + u_{i,3333}\\ C_{ijkl}u_{k,lj} &= C_{i\beta k \alpha} u_{k,\alpha \beta} + C_{i \beta k 3} u_{k, \beta 3} + C_{i3k \alpha} u_{k,\alpha 3} + C_{i3 k 3} u_{k,33}\\ S_{ijkl,jkl} &= S_{i \alpha \beta \gamma, \alpha \beta \gamma} + \left(S_{i \alpha \beta 3} + S_{i3 \alpha \beta} + S_{i \alpha 3 \beta} \right)_{,\alpha \beta 3} \\&\quad+ \left(S_{i33 \gamma} + S_{i3 \gamma 3} + S_{i \gamma 33} \right)_{,33 \gamma} + S_{i333,333}, \end{array} \end{aligned}} $$

we obtain

$$ {\begin{aligned} \begin{array}{ll} \rho \ddot{\overline{u_{i}}} &= c_{3} \, \overline{u_{i}}_{,\alpha \alpha \beta \beta \gamma \gamma} - c_{2} \, \overline{u_{i}}_{,\alpha \alpha \beta \beta} + C_{i \beta k \alpha}\overline{u_{k}}_{,\alpha \beta} - c_{3} \overline{S_{i \alpha \beta \gamma}}_{,\alpha \beta \gamma}\\ & \quad + \left[ 3 c_{3} u_{i, \gamma \gamma 333} + c_{3} u_{i,33333} - 2 c_{2} u_{i,\beta \beta 3} - c_{2} u_{i,333}\right.\\ &\quad \left.+ C_{i \beta k 3} u_{k,\beta} + C_{i3k \beta} u_{k,\beta} + C_{i3 k 3} u_{k,3} \right]^{+t}_{-t}. \end{array} \end{aligned}} $$


$$ {\begin{aligned} \begin{array}{ll} B \, \dot{\overline{S_{i \pi \sigma \lambda}}} & = e_{3 \mu \rho} \,\overline{S_{i \pi \sigma \rho}}_{,\mu} \, e_{\lambda 3 \chi}\left(e_{\chi \xi 3} \Bigg\{ c_{3} \Big(\overline{u_{w}}_{,\alpha \beta \xi} - \overline{S_{w \alpha \beta \xi}} \Big) \right.\\ & \qquad - \epsilon_{3} \, e_{\xi \nu 3} \, e_{3 \gamma \phi} \overline{S_{w \alpha \beta \phi}}_{,\gamma \nu} - d_{3} \, l^{2} f'(l^{2} |\overline{S}|) \frac{\overline{S_{w \alpha \beta \xi}}}{\left|\overline{S} \right|} \Bigg\} e_{3 \epsilon \zeta}\, \overline{S_{w \alpha \beta \zeta}}_{,\epsilon} \Bigg)\\ & \quad \ + e_{3 \mu \rho} \,\overline{S_{i \pi \sigma \rho}}_{,\mu} \, e_{\lambda 3 \chi} \, e_{\chi \xi 3} \, e_{3 \epsilon \zeta} \left(\overline{S_{w \alpha 3 \zeta}}_{,\epsilon} \left[ u_{w, \alpha \xi} \right]^{+t}_{-t} + \overline{S_{w 3 \beta \zeta}}_{,\epsilon} \left[ u_{w, \beta \xi} \right]^{+t}_{-t}\right. \\&\quad \left. + \overline{S_{w 3 3 \zeta}}_{,\epsilon} \left[ u_{w, 3 \xi} \right]^{+t}_{-t} \right). \end{array} \end{aligned}} $$

In Eq. 66, the terms beyond the first line represent forcings in the transverse direction to the plate and need to be specified (it would be physically legitimate to assume many of these terms to vanish); the third line of (67) has similar meaning and needs specification.

The functions \(\overline {u}, \overline {S}\) represent the fundamental fields of the plate theory, governed by (66)-(67). Evaluating \(\overline {T^{(a)}}\) from (59) in terms of \((\overline {u}, \overline {S})\) solving (66)-(67) and \(\overline {K}\) would imply the satisfaction of balance of angular momentum (i.e., moment balance) in the through-the-thickness averaged sense.

Remark 0.17.

We note that non-evolving and non-vanishing \(\widehat {S}^{(1)} = \widetilde {g}, \widehat {S}^{(2)} = \widetilde {b}\) ‘target’ composite eigenwall fields can be included in the considerations of this plate idealization with only slight increase of tedium in bookkeeping.

Within the context of energy minimization and for \(t > 0\), if \(curl \, \left (curl\, \left (\widehat {S}^{(1)(s)}\right)\right)^{T} = 0\), i.e. \(\widehat {S}^{(1)(s)}\) satisfies the St.-Venant compatibility condition, then an infinitesimal isometry exists (the reference configuration is assumed to be simply-connected) and the plate elastic energy scales as \(t^{3}\) or of smaller magnitude; if \(\widehat {S}^{(1)(s)}\) is not compatible, then the energy has to scale as t. We note that when \(\widehat {S}^{(1)(s)}\) is compatible, unless \(\widehat {S}^{(2)} = \nabla ^{2} v\), where v is s.t. \((\nabla v)^{(s)} = \widehat {S}^{(1)(s)}\) so that \(\nabla^{2} v\) is unique, the plate elastic energy is going to scale as \(t^{3}\). The requirement \(\widehat {S}^{(2)} = \nabla ^{2} v\) is non-generic for a freely-specifiable \(\widehat {S}^{(2)}\) field that, however, is satisfied by the choice \(\widehat {S}^{(1)} = 0, \widehat {S}^{(2)} = 0\). Thus, in most circumstances the plate energy is expected to scale as \(t^{3}\), if the plate energy is minimized.


Starting from the work of the brothers Cosserat (1909) (as presented in Truesdell and Toupin (1960)), through those of Toupin (1964), Green and Rivlin (1964), Mindlin (1962; 1964), on to that of Fleck et al. (1994), Fleck and Hutchinson (2001), Hutchinson (2012) and of Gurtin (2002), Gurtin and Anand (2009), higher order theories of continuum mechanics have made an appearance off and on and have been noted for their intricacy and elegance, but always, arguably, with the nagging question of the physical justification (in their details, see footnote 1) in view of their added complexity. Our work aims to provide a concrete, tangible, and compelling justification - that the precise treatment of defects in the deformation and its higher order gradients is the raison d’être for higher order theory in continuum mechanics.

Our work is in the context of non-Euclidean elastic sheets with negative in-plane Gauss curvature. These objects are ubiquitous in nature and they display varied and intricate multi-scale behaviors (Sharon et al. 2002; Audoly and Boudaoud 2003; Klein et al. 2007; Kim et al. 2012; Gemmer and Venkataramani 2013). Their elastic behavior is significantly different from that of elastic plates or spherical shells (Gemmer et al. 2016; Shearman and Venkataramani, in preparation). In particular, they have “large" continuous families of low-energy states obtained from piecewise isometries, with each piece possessing additional “bending" degrees of freedom. Thin hyperbolic free sheets are thus easily deformed by weak stresses and their morphology is strongly dependent on the dynamics of the growth/swelling processes, material imperfections, or other weak external forces. This naturally motivates the need for tools to describe singularities/defects in these sheets, their interactions and the resulting dynamics.

Mesoscopic defects in hyperbolic sheets, associated with their “soft" modes of deformation, include lines of inflection that terminate at branch points (Gemmer and Venkataramani 2012; Shearman and Venkataramani, in preparation). These are higher-order defects (termination of jumps in curvature) unlike the more common types of defects, disclinations and dislocations. Irreversible effects in the dynamics of disclinations and dislocations are associated with (macroscopic) plastic behaviors - stress-free large deformations, internal stresses, and microstructure - in solids. A natural question therefore is – what are the macroscopic manifestations of moving lines/surfaces of inflection and branch points/lines?

In this work we have begun to address this question in the context of ‘small deformations’ from a (potentially stressed, when occupied) reference configuration. A detailed analysis and characterization of the kinematics of branch point defects and the discontinuities in the deformation that they induce is achieved. This analysis, in its essence, is a non-trivial adaptation and extension of the ideas of Weingarten (1901) and Volterra (1907), from the dawn of elastic defect theory, to a context not restricted within the kinematics of only strain (the symmetrized gradient of the displacement, as well as its nonlinear analog) and its incompatibilities, and shows the natural way forward for deducing the constraints on possible jumps in deformation, i.e. global constraints, for locally compatible higher order deformation gradients, albeit on domains with the simplest non-trivial topology (see footnote 2). We then develop a thermodynamically consistent theory for the dissipative dynamics of such defects in a nominally elastic solid, allowing for their interaction with dislocation, g.disclination, grain, and phase boundary defects. The constitutive guidance provided by this thermodynamic argument ensures that the model is equipped with an energy (in)equality, a crucial necessary condition for its physical and mathematical well-posedness. The analysis uncovers the non-Newtonian, energetic driving forces on these defects that couple their dynamics and mutual interactions to applied loads and the deformation of the body (see footnote 3). Evolution of the defect fields subject to such driving forces necessarily reduces the system free-energy by design, within an overall dynamics that accounts for material inertia and is not restricted to its free-energy decreasing with time (depending on the external driving).
As an example, we explicitly demonstrate the full set of governing equations for the case of branch point defects in an elastic material and develop a ‘plate’ theory idealization for it. The development of the finite deformation version of the model poses no conceptual or technical barriers (see footnote 4) based on our prior work in g.disclination mechanics (Acharya and Fressengeas 2015), but this same work makes it clear that the bookkeeping tasks in pushing through the analysis are going to be formidable.

We observe in passing that while we have been interested in developing a theory for branch point/line defects and lines/surfaces of inflection, i.e. a theory for the discontinuities and singularities of the deformation and its gradients up to order three, the analysis makes it clear that the mathematical/continuum mechanical formalism extends to describing the discontinuities and singularities of any finite integer order gradient of the deformation, while including only stresses and couple stresses. As already observed in Acharya and Fressengeas (2015), using the Second Law in global form is crucial for this, albeit at the expense of the application of limited (but adequate, as we show in “Uniqueness of the displacement field and boundary conditions” section) higher-order boundary conditions about which not much is physically known anyway.

As a final comment, we note that a geometric model of growth mechanics, based on Riemannian geometry and including evolution, has been proposed in Yavari (2010). The viewpoint is different from ours and, in particular, the mechanics of incompatibility based on a Riemannian metric cannot describe (without non-trivial extension) the ‘softer’ branch point defects we focus on. We expect that one can recast our continuum mechanical kinematic constructs within a differential geometric structure involving the specification of a moving frame, and higher-order constructs based on such a field, thereby making connections with the “geometric" viewpoint of growth mechanics.

Availability of data and material

All data required to interpret the results in this paper are contained therein.


  1. For example, none of the plasticity-related works in the above, while apparently motivated from modeling plasticity arising from dislocations, recover all of the ingredients of the classical Peach-Koehler force in the driving force for their dislocation-related inelastic deformation mechanisms.

  2. It should be noted that the question of conditions for global compatibility on domains with non-trivial topology is different from the question addressed by Weingarten’s theorem and its extensions to higher order kinematics, which deduce constraints on the discontinuous deformations arising from the absence of global compatibility.

  3. The fact that similar models, for lower-order defect kinematics, can indeed represent the complex nonlinear statics, dynamics, and interaction of defects is demonstrated in Zhang et al. (2015); Zhang et al. (2016); Zhang et al. (2018); Arora and Acharya (2020).

  4. For the worker proficient in general continuum mechanics.




SCV is supported by the Simons Foundation through awards 524875 and 560103. Portions of this work were carried out when SCV visited the Center for Nonlinear Analysis at Carnegie Mellon University, and their hospitality is gratefully acknowledged.


SCV is supported by the Simons Foundation through awards 524875 and 560103. AA did not receive any funding for this work.

Author information


AA contributed to development of theory and writing of the paper. SCV contributed to development of theory and writing of the paper. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Amit Acharya.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit


Cite this article

Acharya, A., Venkataramani, S.C. Mechanics of moving defects in growing sheets: 3-d, small deformation theory. Mater Theory 4, 2 (2020).
