1	Human Motion Synthesis By Example Michael Gleicher Associate Professor Lucas Kovar Post-Doctoral Associate University of Wisconsin- Madison www.cs.wisc.edu/~gleicher www.cs.wisc.edu/graphics
2	Outline Part 0 – Intro and Overview Part 1 – Basics of Motion Part 2 – Concatenative Synthesis Part 3 – Parametric Synthesis Part 4 – Motion Databases (the outline will be explained as part of part 0)
3	The Topic:
4	Disclaimers We are interested in body motion Skelletal motion NOT faces, soft tissue, … We are considering motion capture Key-framed motion should work We aren’t game developers
5	Problem Need realism and expressiveness Need performance (interactive) Need controllability (interactive)
6	Why is human motion hard? People are good at watching people! Human appearance is very complex People do many things In many ways Subtlety matters Hard to describe movement “Normal” movements aren’t interesting
7	Aspects of the Problem “Gross” Body movement NOT: Appearance Models Facial animation Cloth, clothing, secondary movement Hands
8	“Three” ways to make motion Create it by hand Expensive, but high quality Compute it (simulation or other algorithms) Not for complex human motions Not directable Capture it from a performer Animate by example Re-use existing motions Editing Synthesis by Example
9	Motion Capture and Performance Animation Use sensors to record a real person Get high-degree of realism Which may not be what you want... Actors are directable Motion capture provides recording Reliable and efficient if its done well…
10	Motion Capture Technology: Optical Tracking Use markers and special cameras Tracking + Math Processing is important
11	Is MoCap the answer? Motion Capture can record and replay human motions If done well Just a record of what happened Not controllable Can’t capture everything No continuous streams of motion Fine for a movie, not for a game
12	Our answer: Animation by Example! MoCap (or keyframing) provide clips Examples of what we want Combine examples into new motions Sequence to make longer streams Blend/Edit to get needed variations Create new motions as needed Make useful motions at author time Create in response to action at run time
13	The Challenge: Human motion is rich, complex, varied, … Difficult to characterize mathematically or programatically The mathematic/algorithmic building blocks for working with motion are low level How do we assemble these simple building blocks to do complex things?
14	Main points Basic building blocks are simple No excuse not to do them right! Basic building blocks combine to do useful things Basic building blocks can be extended to provide more versatility Automation saves more than time- it increases what you can actually do
15	Outline Part 0 – Intro and Overview Part 1 – Basics of Motion Part 2 – Concatenative Synthesis Part 3 – Parametric Synthesis Part 4 – Motion Databases
16	Part 1 - Basics Representing characters Rotations, hierarchies, ... Working with motions Signal processing intuitions and tools Applied signal processing Constraint-based approaches
17	Part 2 – Concatenative Synthesis A direct application of the basic blocks Basic blend/transition methods are limited – what can we do with them? Answer: Use them to assemble fragments of motion into longer streams of motion. Motion Graphs Search-based techniques Snap-Together Motion
18	Part 3 – Parametric Approaches Enhancing Blending How do we enhance basic blending so its more general/useful? Registering motions so they can be blended – even if different Blending multiple motions Determining the blend amounts
19	Part 4 – Motion Database Search Determining what to blend If we have a lot of motion, how do we find what we need? Database search via Match Webs
20	Basics Goal of this section is to review the basics of representing human motion and working with motion Main insight: There are a small set of mathematical operations we can perform on motions. We have to build everything else from these. (bear with me – Lucas has all the cool demos)
21	Where to begin… Some preliminaries Human Representation Rigid bodies Kinematics Motion Signal Processing Frequency Intuitions Applied Signal Processing Tools Constraints
22	Modeling Humans Humans are complex!
23	Representation of Humans Need concise description of pose Goal: Summarize pose as a vector Motion is vector valued function Compact, yet flexible Make constraints implicit
24	Abstractions
25	Abstractions vs. Reality (skeletons vs. humans)
26	How Realistic do you need? It depends! Generally, small numbers of degrees of freedom (50-60) Easier to animate/specify Don’t see the details from far away Better to use a few d.o.f.s well
27	Standard simplified models of humans Small numbers of degrees of freedom for gross motion Articulated figures Rigid pieces Sometimes stretching allowed Kinematic joints Rotations between pieces Sometimes called a skeletal model
28	Articulated figure representation
29	Rigid Body A set of points that undergoes a rigid transformation Describe configuration by the rigid transformation P’ = f (q, P)
30	Rigid Transforms Any rigid transformation can be de-composed into: A translation (all points same) A rotation (the interesting part)
31	Rotations Mapping f : Rⁿ->Rⁿ Defined by properties: Has a zero Preserves distances Preserves handedness Is a linear mapping
32	Parameterizing Rotations Goal: encode rotations in a vector Rⁿ - > “set of rotations” Give “names” to members of the set of possible rotations Many ways to do this, all flawed No perfect method Use the best one for the job
33	Goals for Parameterization Compact (as few variables as possible) Complete Every rotation can be represented 1-to-1 Every rotation has one value Every value has one rotation (stable) Singularity free “close” rotations are “close” in value Can compute with Compose, interpolate, apply, …
34	Parameterizations of Rotations Rotation Matrices Euler Angles Axis Angle formulation Unit Quaternions Exponential Coordinates Local linearizations
35	Parameterization 1: The Rotation Matrix We know the rotation is a linear function (e.g. Matrix) Use the matrix as the parameterization! Any rotation is represented by 1 matrix Must preserve distance Must preserve handedness Must preserve angles Positive, Orthonormal matrices Exactly the set of rotations
36	Problems with Matrix as Parameterization Not compact 9 numbers (but 3 d.o.f.) Not all matrices are orthonormal Change 1 number, its not orthonormal Sensitive to numerical issues Can’t tell quickly Given a matrix, determine if orthonormal Can’t project quickly Given a matrix, find the “closest” orthonormal one
37	More problems… Given two rotation matrices, M₁ and M₂ Can you measure how different they are? Can you interpolate them? (e.g. find halfway) Fortunately, they are closed under multiplication Modulo numerical issues
38	Problems are worse in 3D 3x3 matrices – 9 parameters No intuitive meaning to parameters Only supports a few operations Apply to point Multiply (compose) – beware drift Use rotation matrices to apply rotations Use other methods to parameterize and manipulate them
39	Two theorems of Euler Any rotation can be represented by a single rotation about an arbitrary axis Axis / Angle Representation Any rotation can be represented by a sequence of 3 rotations around fixed axes Euler Angles
40	Axis / Angle Not compact (4 numbers, not 3) Each rotation represented by many groups of 4 numbers Can’t compute with Hard to compose Hard to compare Hard to interpolate Inefficient
41	Euler Angles Pick 3 axes (XYZ, ZXZ, ZXY, …) Compact, Stable - Any 3 numbers is a rotation Every rotation has many values Singularities Not metric (close rotations->different numbers) Interpolations can be weird Can’t compose OK when 1 axis at a time Stability gives false sense of security in math
42	What else? Other parameterizations more recent in Computer Graphics Quaternions introduced to graphics 1985, popular recently Exponential co-ordinates introduced to graphics 1995, not yet popular Both methods are old Graphics just took a while to discover them
43	Easy case: 2D Rotations in 2D aren’t too hard Examine them to see what happens in 3D (where it is much harder) Basic problems still occur
44	2D Rotations Consider 1 point in 2D, center is the origin A rotation maps the point somewhere on the circle
45	Each rotation is a point on the circle Not exactly… There’s the handedness thing
46	So how to name points on a circle? No good mapping to the real line Real line goes on forever Circle wraps around Same problems as rotation! Note: circle (in 2D) is a 1D set
47	Method 1: use a 2D coord Name point by x,y on circle Could be a complex number
48	Extra coordinates Good points Every point can be named Every point has a unique name Close points have similar names (no singularity) Bad points Not all points are on the circle Can’t manipuate vectors How to add? Takes you off the circle
49	Quaternions Extension of this idea to 3D rotation 4 dimensional complex number Real part, 3 imaginary axes (vector) Represent 3D rotation as a point on the unit 4-sphere Need to stay on sphere E.g. UNIT Quaternions
50	Good points about Quaternions Multiplication is defined Easy composition Interpolation is defined Special methods worked out Linear (1985), Cubic (1995) Relatively compact Singularity free “Nearly” 1-to-1
51	Bad point about Quaternions Can’t add (but can compose) Can’t take linear combinations Can’t average Can’t linear filter (but SLERP works between 2) (but the hack works in practice) (and the “correct” methods aren’t that hard) Distance metric is trickier
52	A “hack” Its easy to get “back on the circle” via reprojection Pretend points are in 2D, then project back Example: averaging
53	Warning on the hack… Gets the right answer for averaging Not for other linear combinations Works well when difference is small Small angle approximation Fails when opposite Useful since we can renormalize if computations have problems
54	In practice… This hack works really well Better than any Euler Angle method In motion processing, often deal with smaller differences “Correct” methods exist SLERP is worthwhile Exponential maps worth knowing about
55	Quaternions not Euler Angles (for 3 d.o.f. joints) Euler Angles Singularities (not really a big deal) Can’t interpolate Can’t compose Can’t compare Stable Quaternions No singularities Easy to interpolate Easy to compose Easy to compare Stability by re-normalization
56	Method 2: distance How far around circle? (unit radius makes things easier) Basically an angle
57	Method 3: velocity Suppose the particle starts “at zero” and has a constant velocity ω Where does it end up at the end of a unit of time?
58	Method 4: velocity Velocity is tangent to circle – therefore it is initially upwards If circle is in the complex plane, the velocity is purely imaginary
59	Velocity (cntd) Velocity as “up” only works if we start at origin so always measure from origin shift the start around
60	Initial velocity is good… It’s linear! Linearizes the circle around the origin Can operate on it Add Scalar multiply Not perfect… Many different ways to get to any place
61	Local linearization Logarithmic map / Exponential map Good for describing the differences between orientations Good basis for performing linear operations on orientations Filtering Averaging
62	In general… Use quaternions to represent orientation Use tangent space (log map) to perform linearized computations Hack works, just as well in practice SLERP if differences are big Don’t tell anyone I said that!
63	Back to our real question… Abstraction of Human Motion Humans too complex Need tractable models Some number of connected, rigid pieces (usually)
64	Representations of Pose Angle vs. positional data Global vs. relative Hierarchical vs. non-hierarchical Skeletal vs. Non-Skeletal
65	Representations of 2 bodies
66	Good Points of Hierarchical Skeletons Enforce key constraints Connected segments Rigid limbs Fewer Dof’s Only store angles between segments Easy for skinning Local coordinate systems defined
67	Bad Points of Hierarchical Need 3D rotations Coupled parameters End effector controls require IK Forces rigidity Problems with reference Different ways of defining things
68	Are Hierachies a Given? There are systems based on points Diva (House of Moves) – one example Math on points is easy! Conversion to skeleton is hard Interpolation gives weird results Can’t blend dissimilar things Not compact, hard to draw, …
69	Making Hierarchies Work Custom character setup (have right DOFs) Well chosen joint sets (placement and type) and controls (IK / FK) Good: make characters that animator can control Bad: no uniformity/standardization Important if motion from outside source Important if want to reuse motion Everybody has a different skeleton
70	How do skeletons differ? Obvious ways? Topology number of bones Connectivity of bones Joint Types Bone lengths Anatomical / skin relations Is spine in middle of body, or up the back?
71	Subtle Skeletal Differences What to measure angles with respect to Doesn't matter, as long as we agree Poses (design of a skeleton) Zero Pose / Base Pose Dress or Binding pose Frankenstein Pose Da Vinci Pose Rest Pose (real pose of actor) Need to figure out how to get between these
72	Target Poses
73	Reference Poses
74	Why do we care? Motion data is relative to base pose Tells us how to interpret data Need binding pose to skin character Need reference poses for calibration Try to unify poses Base pose = Frankenstein? Base pose = Bind pose? Base pose = Rest pose? Animator’s T-Pose vs. Anatomical T-Pose?
75	Recap: Representation Represent human as hierarchical skeleton Vector with 1 position, 1 absolute orientation, many relative orientations Vector really isn’t in R^N Many different ways to do this Many things to be careful of
76	Doing Math on Poses What do operations really mean? Assume we get the basics right Can’t add or scalar multiply quaternions Just a notation thing A Å B “add” (really compose) aA Å (1- a) B (interpolate) aA Å βB Å γC Å … (“linear” combine)
77	Does math on poses mean anything? With Quaternions “halfway” is halfway – and operation make sense For the orientations! Not for the end effectors (more on that later) Not for the meaning of the pose!
78	Does interpolation on poses make sense? “Halfway” in math is not “halfway” in terms of meaning
79	½ Å ½ =
80	The fundamental problem Math doesn’t know about the meaning of poses
81	Two approaches Develop smarter pose operators REALLY hard Need to be general, directable, fast, … Consistent across similar poses Others are thinking about it Data centric methods Use simple operations Combine simple operations Use lots of data (examples!)
82	Now… on to motion! Motion is a function of time Given time, provide a pose Represented as samples Sparse samples + interpolation Dense samples (at frames) How to manipulate sets of samples?
83	Motion is a series of poses Can’t just change 1 pose Introduces a discontinuity No teleportation! Need a vocabulary to discuss how things can/can’t change
84	Signal Processing The mathematics of varying things Need math to talk about how things change over time Not a cure all! No real connection between signal processing and high-level meaning Just like with poses Vocabulary for talking about motion And some mathematical tools
85	Signal Processing 101: Frequency Domain Analysis Signals can be broken into frequencies Low frequencies = smooth parts High frequencies = abrupt changes Real signals contain lots of different frequencies
86	Signal Processing Example: Noise Removal Noise comes from errors in process Sensor errors Fitting errors Bad movements Noise is “data” that we don’t want
87	Mocap Noise Misconception Things in the world don't change that fast (have high freq) If there are high freqs, must be noise Get rid of high freqs (quick changes) Low-Pass Filter (LPF) easy (weighted average, FIR, ...)
88	Low-Pass Filters vs. Noise Getting Rid of High Frequencies does not eliminate noise Leaves a “soggy” look
89	Low-Pass Filters vs. Noise We want to remove the noise, to get back a signal that looks like
90	Where’s the Noise? Sometimes identification is easy: Clearly wrong (foot through floor) Marked wrong (missing data - gaps) More often, need to guess Might be a subtle twitch… Might be person shaking… Might be sensor errors…
91	Noise Detection Use heuristics and rules of thumb to identify noise Use info about which body part as a discriminator Extremities are more likely to have sharp movement “Speed” of the movement affects how prevalent noise is Visual signal/noise ratio decreases as movement gets slower
92	High Frequencies PROBLEM: High frequencies can be important! Getting rid of them makes motion look soggy ANSWER: Do not over-apply LPF How much is enough? Use a little LPF
93	Treating Mocap Noise Small amounts of Low-Pass Filtering Noise modeling Adaptive filters Non-linear filters Hybrid solutions
94	Important Intuition High Frequencies are Important! Don’t occur often Always significant Impact Rapid, sudden movement Emphasis Sensitivity of perception
95	Changing Motions High Frequencies are important Can’t remove Don’t want to take away Can’t add Don’t want to put something in
96	Motion Processing Methods Can’t introduce pops (high-freqs) Removing them is a nightmare! Must change poses together Do things gradually Two tools: Motion Displacement Maps Motion Blending
97	How to change 1 pose? Motion Displacement Maps A.K.A. Motion warping Spread changes over time
98	Motion displacement maps Think of the change as a motion (signal) It can’t be discontinuous In fact, we want it to be smooth Make a spline that interpolates goals
99	Motion Displacement Maps “Add” in another motion m(t) = m₀(t) Å d(t) Pick other motion so that it doesn’t stick out (no high frequencies) Changes are low frequency
100	Band-limited adaptation High frequencies are important Eye is sensitive to them Always signifies important events Avoid high frequency changes Preserve existing high-frequencies Avoid adding new ones Band limit the changes Not the resulting motions
101	Why does this work (if it does) Intuition: ease in, ease out Sneak in the changes Signal Processing – Superposition Adding signals does not create frequency content MD Maps don’t add bad high frequencies That doesn’t mean they work all the time!
102	Superposition at work again: Motion Blending “Add” two motions together Really interpolate m(t) = a m₀(t) Å (1- a) m₁(t) Note: this is a per-frame operation Interpolate corresponding poses! No new frequencies generated
103	Does pose interpolation (motion blending) make sense? No! Doesn’t create HF, but doesn’t mean anything Yes – but only if Individual pose blends work out ok New sequence of poses makes sense Or… it happens so fast that no one sees
104	When does blending work? Blending only works when corresponding poses are similar
105	If blending is unreliable: How can we use blending? Interpolate similar motions Need to know when is similar (Part 2) Need to make motions similar (Part 3) Transition between motions Time varying blend (a=0 -> 1) Over a short period of time A bad pose isn’t such a big deal Avoids discontinuities m(t) = a(t) m₀(t) + (1-a(t)) m₁(t)
106	Transistion Very useful! Often get small pieces of motion Need to connect Easy if motions are similar Hard if motions are not similar
107	Types of “simple” Transitions Cut transitions Gap-Filling Motion Displacements + Cut Transitions C(1) MD + Cuts Blend Transitions
108	Cut Transitions Just put one motion after the next Works if the end of one is the same as the begining of the next This is unlikely to happen unless you your motions are made special Or, if you don’t care about the pop
109	Splined Transitions Make a new motion that starts at the end of M0 and ends at M1 Can use interpolation Only works if interpolation works Only works if timing isn’t important (generally, it doesn’t work – it’s a bad idea)
110	Displacement + Cut Change the end of M0 to be the same as the beginning of M1 Change both to meet halfway
111	Blend Transitions Motions must overlap
112	Is there a difference? Blends require overlap Blends fade the changes in
113	Blends vs. MD+Cut
114	How long should a transition be? Faster is better (get it over with!) Better control Less likely to see a bad pose Slower is better (don’t add HF) Less inducement of rapid changes Depends on how similar motions are Exact match? Zero length is OK.
115	Signal Processing Signal Processing gives tools Blending Displacement Maps Tools can do useful things Motion Warps Transitions Only work when poses are similar
116	Constraints Signal Processing considers temporal aspects of parameters How about detailed requirements of end-effectors? For one frame this is the IK problem
117	Why do we care? Blending poses may not blend end-effector positions See problems in blended motions Feet slide on floor Hands slip away from goals Even evident in transitions
118	Inverse Kinematics Find joint angles to meet goals f(q) = p (but given p) Variety of methods Numerical General but Slow, Ill-conditioned Geometric (closed-form) Fast, simple – but only for special cases
119	Can’t just change 1 frame Can’t change each frame independently Special IK Solvers for continuity Can do a Motion-Displacement Map Special variants of constraint solvers to solve special motion problems Footskate (but it can be any IK goal over time)
120	Uses of a Footskate Solver Known constraints Need annotated motion Clean footskate Follow an IK goal
121	Footskate
122	Footskate Cleanup
123	Our method Special case: human limbs Documented in a paper Open source implementation www.cs.wisc.edu/graphics/Gallery/FootskateSolver Really important for blended motion Used in everything Lucas will show
124	Special Case: Just the lower body
125	What the solver does Assume specific form There are closed form solutions Simple, fast, guaranteed continuity
126	Algorithm Overview
127	Step 1: Constraint Positions
128	Step 2: Target Ankle Configurations
129	Step 3: Root Position
130	Step 4: Single-Limb IK
131	Step 4: Single-Limb IK (cont.)
132	Step 5: Blend off changes Changes to any frame are blended into the neighboring frames as well Requires buffering if on-line
133	Arms Yes, it works for arms No, I haven’t given you details Read the paper Use the Open Source implementation! www.cs.wisc.edu/graphics/Gallery/FootskateSolver
134	Summary Review human animation basics Skeletons, rotations Signal Processing Building Blocks Blends, Transitions Constraints Next stop: Animation by Example!