UFCase - Deep Dive into "Servicing Stack"

Important

This tool is experimental and requires full control of your PC. Please DO NOT use it in a production environment, especially the mutation functionality, which has not been properly tested yet.

The Windows Servicing Stack is the main infrastructure that handles updates to Windows internal components (generally).

UFCase (Utility Functions Case) provides overall enumeration (and possibly deployment in the future) of multi-level abstractions of these Windows components.

The article Understanding Component-Based Servicing provides an overview of the Servicing Stack and roughly how it works when an update is installed or removed.

Releases

I'll publish breaking changes for UFCase in the GitHub releases. The CI also uploads nightly build artifacts for the newest commit. Go to GitHub Actions, click the commit you prefer, and download UFCase_portable.zip from the Artifacts row.

Glossary

All the descriptions below are based on my own understanding and are for your information only.

First, let's take a look at the underlying mechanisms, which interact with the familiar filesystem and registry.

  • Assembly is a core concept from .NET that is also used by the Windows Componentization Platform. See Assemblies in .NET.
    • The concept of assembly connects the different layers of the servicing stack.
    • Its manifest uses an XML format with the schema denoted as urn:schemas-microsoft-com:asm.v[1~3]. The article Manifest File Schema describes the public parts, but most of the schema for Windows native components is left out. There's also a binary representation in addition to the plain XML format.
  • Component is a native assembly that ultimately composes Windows. Almost everything in Windows can be traced back to a component, including the NT kernel, the NTFS driver, the servicing stack itself, the C runtime framework, system apps and localization resources. The main contents of a real component are a group of files and registry values, with other metadata such as
    • the source and target concerning installation
    • security descriptors specifying how they are protected by the system (Windows Resource Protection)
    • hash values validating the integrity of the component
    • other registration mechanisms like COM interfaces and Win32 window classes
    • dependencies
  • Deployment is a special component that defines a set of deployable contents through its assembly dependencies.
  • WinSxS, or Component Store, is a strongly named, dependency-aware, version-controlled, digitally signed, corruption-detectable, transactional and hard-linking component store organized by assemblies. Put simply, it is a private package manager for Windows itself.
    • SxS expands to Side by Side, meaning that multiple versions of a component can live within your system side by side.
    • It is built directly on the filesystem and registry. Therefore, compared with a typical database, it behaves like a turtle.
    • The directory %WINDIR%\WinSxS
    • The registry hive HKEY_LOCAL_MACHINE\COMPONENTS from %WINDIR%\System32\config
    • The registry key HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Windows\CurrentVersion\SideBySide
  • Windows Componentization Platform, a.k.a. wcp.dll from the servicing stack, is a native implementation of the assembly store, or technically the "Isolation Interface". Correspondingly, .NET Framework used to have its own isolation implementation. Many DLLs probably had a version that implemented isolation, including sxs.dll, isowin32.dll, isoman.dll, clr.dll, coreclr.dll, and even ntdll.dll.
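
To make the manifest part of the glossary more concrete, here is a minimal Python sketch that reads a component-style manifest and pulls out its identity, files and dependencies. It is not how UFCase works internally. The sample manifest is hypothetical: all names, versions and the public key token are invented, and the `destinationPath` attribute is an assumption based on what component manifests commonly look like, since the native parts of the schema are not publicly documented.

```python
import xml.etree.ElementTree as ET

# Namespace used by asm.v3 manifests (from the glossary above).
NS = {"asm": "urn:schemas-microsoft-com:asm.v3"}

# Hypothetical sample manifest; every identity value here is invented.
SAMPLE_MANIFEST = """
<assembly xmlns="urn:schemas-microsoft-com:asm.v3" manifestVersion="1.0">
  <assemblyIdentity name="Example-Component" version="10.0.0.1"
                    processorArchitecture="amd64" language="neutral"
                    publicKeyToken="0123456789abcdef" />
  <dependency>
    <dependentAssembly>
      <assemblyIdentity name="Example-Dependency" version="10.0.0.1"
                        processorArchitecture="amd64" language="neutral"
                        publicKeyToken="0123456789abcdef" />
    </dependentAssembly>
  </dependency>
  <file name="example.dll" destinationPath="$(runtime.system32)" />
</assembly>
"""


def summarize_component(manifest_xml):
    """Return the identity attributes, file names and dependency names."""
    root = ET.fromstring(manifest_xml)
    identity = root.find("asm:assemblyIdentity", NS)
    files = [f.get("name") for f in root.findall("asm:file", NS)]
    deps = [
        d.get("name")
        for d in root.findall(
            "asm:dependency/asm:dependentAssembly/asm:assemblyIdentity", NS
        )
    ]
    return {
        "identity": dict(identity.attrib),
        "files": files,
        "dependencies": deps,
    }


summary = summarize_component(SAMPLE_MANIFEST)
print(summary["identity"]["name"], summary["files"], summary["dependencies"])
```

Real manifests carry many more elements (registry keys, security descriptors, hashes), and the sketch ignores all of them.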

Now let's change our point of view to the upper layers. After the Updates (Windows Update) are downloaded into SoftwareDistribution, what information do they carry and what contents do they ship?

  • Update (Windows Update), also known as *.msu. The .msu file archives other .cab files recursively: some contain metadata (CompDB or OfflineSyncPackage), and some are the payloads, which are in fact called "Package".
  • Package (CBS) is a high-level assembly. Packages can be queried using Dism /Online /Get-Packages. A package has many available formats, and some well-known technologies, such as the msdelta delta package format, are used with it. But in every case a package has two parts: manifest and payload. The manifest of a package ends with .mum and describes:
    • which updates (CBS) are included; this is the payload descriptor
    • which packages are its parents; some packages have language packs as their children
  • Capability (CBS), also known as Feature on Demand. Capabilities can be queried with Dism /Online /Get-Capabilities. A capability is generally a bunch of packages. They are not present on your disk initially and can be downloaded from the Windows Update server.
  • Update (CBS) is a logical concept that only exists in the manifest of packages. It references an assembly, wrapped by
    • a component element, which is an ordinary WinSxS assembly component
    • a package element, which refers to another package
    • a driver element, which is a driver
  • Feature (CBS) should actually be called Optional Feature, as distinct from Feature on Demand. Features are the updates of the special package Microsoft-Windows-Foundation-Package and can be queried with Dism /Online /Get-Features. These features are staged in WinSxS but not directly usable. If you need one, you can enable it without a network connection.
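
The package, parent and update relationships described above can be sketched in Python as follows. This is a hedged illustration under assumptions, not an official parser: the sample .mum content is hypothetical (every name and attribute value is invented), and the element layout (`package`, `parent`, `update`, and the `component` / `package` / `driver` wrappers) follows the description in this glossary rather than any published schema.

```python
import xml.etree.ElementTree as ET

NS = {"asm": "urn:schemas-microsoft-com:asm.v3"}

# Hypothetical package manifest (.mum); all identities are invented.
SAMPLE_MUM = """
<assembly xmlns="urn:schemas-microsoft-com:asm.v3" manifestVersion="1.0">
  <assemblyIdentity name="Example-Package" version="10.0.0.1"
                    processorArchitecture="amd64" language="neutral"
                    publicKeyToken="0123456789abcdef" />
  <package identifier="Example" releaseType="Update">
    <parent>
      <assemblyIdentity name="Example-Parent-Package" version="10.0.0.1" />
    </parent>
    <update name="Example-Update-1">
      <component>
        <assemblyIdentity name="Example-Component" version="10.0.0.1" />
      </component>
    </update>
    <update name="Example-Update-2">
      <package>
        <assemblyIdentity name="Example-Child-Package" version="10.0.0.1" />
      </package>
    </update>
  </package>
</assembly>
"""

# Wrapper elements an update may use to reference an assembly.
WRAPPERS = ("component", "package", "driver")


def list_parents(mum_xml):
    """Names of the parent packages declared by the manifest."""
    root = ET.fromstring(mum_xml)
    path = "asm:package/asm:parent/asm:assemblyIdentity"
    return [p.get("name") for p in root.findall(path, NS)]


def list_updates(mum_xml):
    """(update name, wrapper kind, referenced assembly name) triples."""
    root = ET.fromstring(mum_xml)
    result = []
    for update in root.findall("asm:package/asm:update", NS):
        for wrapper in update:
            kind = wrapper.tag.split("}", 1)[-1]  # strip the "{namespace}" prefix
            if kind not in WRAPPERS:
                continue
            ident = wrapper.find("asm:assemblyIdentity", NS)
            target = ident.get("name") if ident is not None else None
            result.append((update.get("name"), kind, target))
    return result


parents = list_parents(SAMPLE_MUM)
updates = list_updates(SAMPLE_MUM)
print(parents)
print(updates)
```

Following the `package` wrappers recursively would give the package tree, while the `component` wrappers lead down to WinSxS assemblies.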

Manifest Schema

I'm writing unofficial schema documentation for general isolation manifests. The current progress is in the ./docs directory, and the main page is here.

If you are interested, contributions are welcome. Just use UFCase to inspect the manifests of packages and components, and fill in the unknown elements with your inferences. It would be even better if you could attach the full-text XML manifests.

Screenshots

Component Manifest Viewer

Package Manifest Viewer

Package Manifest Viewer (Raw)

Future Todo

Priority undetermined.

  • Host isolation
    • Run highly elevated operations, such as calling CBS APIs, in a separate process for each image
    • See more benefits in the section "Known Issues" below.
  • Manifest Viewer
    • View XML in new window
    • Long term: Jump to some references like package identity or component identity
  • Component reference querying
    • The isolation interface supports default keyword querying; we should expose it through a search box
    • Bonus: a proper IReferenceIdentity input box
  • Non-searching filters for "packages" and "components" tabs
  • For faster querying
    • Export the data from "components" tab to SQLite
    • Or construct an index at runtime manually
  • Xref for Files -> Components, Registry -> Components, Component -> Packages

Long term:

  • Optimize object memory management
    • Partially done by not fetching the manifest for every component. But the memory consumption of the UI process is still 5x that of the backend process (Win11 components: 80MB vs 400MB).

Known Issues

  • For online images, the payload path of components cannot be fetched. I believe this is caused by an OS bug. See this repro for details.
    • Workaround 1: convert UFCase to an unpackaged app and build it from source
    • Workaround 2: boot from another OS and access the image as an offline image
    • Update: after some tests, I found that
      • Payload path works on
        • Win11 Canary Build 25947
        • Win10 Stable Build 19045
      • Payload path errors on
        • Win11 Dev Build 23545
        • Win11 Stable Build 22621
      • I think a kernel regression is causing the NtRegOpenKey issue, but I don't have time to investigate the kernel internals.
  • Host isolation means running highly elevated operations, such as calling CBS APIs, in a separate process for each image.
    • This brings the following benefits:
      • Reconstruct the threading model: out-of-proc COM objects can be called from any thread context, and we no longer need the arena-based hierarchical memory allocation brought by the naive implementation of GITObject.
      • Flexible and layered design: the UI can be rewritten freely, and another layer can be added for caching.
      • Highly controllable image resource lifetime.
      • Run multiple versions of the offline servicing stack in multiple processes. We cannot load wcp.dll for Win10 and Win7 at the same time in a single process.
    • But for now it also has some limitations:
      • The isolation host and the UI process must both be unpackaged or both be packaged, because even for full-trust packaged apps, out-of-proc COM registration is isolated in a sandbox. I'm not sure, but I think this is still the mechanism called PackagedCOM. COM objects registered by a packaged UI process are NOT visible to elevated processes. See microsoft/WindowsAppSDK #567 for details. I don't know whether DynamicDependency can help; I'll give it a try in the future.

So I decided to keep UFCase an unpackaged app for now. I certainly prefer MSIX packaging, and UFCase can still be packaged, apart from hitting the first known issue (the payload path bug). If you need an MSIX package of the newest build, please feel free to contact me.