model_signing.manifest
An in-memory serialized representation of an ML model.
A manifest pairs objects from the model (e.g., files, shards of files), with
their hashes. When signing the model we first generate a manifest from
serializing the model using a configured serialization method (see
model_signing.signing).
When verifying the integrity of the model, after checking the authenticity of the signature, we extract a manifest from it. Then, we serialize the local model (the model being tested) and compare the two manifests.
The serialization method used during signing must match the one used during verification. We can auto detect the method to use during verification from the signature, but it is recommended to be explicit when possible.
Comparing the manifests can be done by checking that every item matches, both in name and in associated hash. In the future we will support partial object matching. This is useful, for example, for the cases where the original model contained files for multiple ML frameworks, but the user only uses the model with one framework. This way, the user can verify the integrity only for the files that are actually used.
This API should not be used directly, we don't guarantee that it is fully stable at the moment.
1# Copyright 2024 The Sigstore Authors 2# 3# Licensed under the Apache License, Version 2.0 (the "License"); 4# you may not use this file except in compliance with the License. 5# You may obtain a copy of the License at 6# 7# http://www.apache.org/licenses/LICENSE-2.0 8# 9# Unless required by applicable law or agreed to in writing, software 10# distributed under the License is distributed on an "AS IS" BASIS, 11# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. 12# See the License for the specific language governing permissions and 13# limitations under the License. 14 15"""An in-memory serialized representation of an ML model. 16 17A manifest pairs objects from the model (e.g., files, shards of files), with 18their hashes. When signing the model we first generate a manifest from 19serializing the model using a configured serialization method (see 20`model_signing.signing`). 21 22When verifying the integrity of the model, after checking the authenticity of 23the signature, we extract a manifest from it. Then, we serialize the local model 24(the model being tested) and compare the two manifests. 25 26The serialization method used during signing must match the one used during 27verification. We can auto detect the method to use during verification from the 28signature, but it is recommended to be explicit when possible. 29 30Comparing the manifests can be done by checking that every item matches, both in 31name and in associated hash. In the future we will support partial object 32matching. This is useful, for example, for the cases where the original model 33contained files for multiple ML frameworks, but the user only uses the model 34with one framework. This way, the user can verify the integrity only for the 35files that are actually used. 36 37This API should not be used directly, we don't guarantee that it is fully stable 38at the moment. 39""" 40 41import abc 42from collections.abc import Iterable, Iterator 43import dataclasses 44import pathlib 45import sys 46from typing import Any, Final 47 48from typing_extensions import override 49 50from model_signing._hashing import hashing 51 52 53if sys.version_info >= (3, 11): 54 from typing import Self 55else: 56 from typing_extensions import Self 57 58 59@dataclasses.dataclass(frozen=True, order=True) 60class _ResourceDescriptor: 61 """A description of any content from any `Manifest`. 62 63 We aim this to be similar to in-toto's `ResourceDescriptor`. To support 64 cases where in-toto cannot be directly used, we make this a dataclass that 65 can be mapped to in-toto when needed, and used as its own otherwise. 66 67 Not all fields from in-toto are specified at this moment. All fields here 68 must be present, unlike in-toto, where all are optional. 69 70 See github.com/in-toto/attestation/blob/main/spec/v1/resource_descriptor.md 71 for the in-toto specification. 72 73 Attributes: 74 identifier: A string that uniquely identifies this object within the 75 manifest. Depending on serialized format, users might require the 76 identifier to be unique across all manifests stored in a system. 77 Producers and consumers can agree on additional requirements (e.g., 78 several descriptors must have a common pattern for the identifier and 79 the integrity of the model implies integrity of all these items, 80 ignoring any other descriptor). Corresponds to `name`, `uri`, or 81 `content` in in-toto specification. 82 digest: One digest for the item. Note that unlike in-toto, we only have 83 one digest for the item and it is always required. 84 """ 85 86 identifier: str 87 digest: hashing.Digest 88 89 90class _ManifestKey(metaclass=abc.ABCMeta): 91 """An object that can be a key for an item in a manifest. 92 93 We need to be able to convert the key to string when we serialize the 94 manifest and to rebuild the object from the serialized representation when 95 we deserialize the manifest from the signature. 96 """ 97 98 @classmethod 99 @abc.abstractmethod 100 def from_str(cls, s: str) -> Self: 101 """Builds a manifest key from the string representation. 102 103 It is guaranteed that for any `key` of a derived class `C` and a (valid) 104 string `s`, the following two round-trip properties hold: 105 106 ``` 107 str(C.from_str(s)) == s 108 C.from_str(str(key)) == key 109 ``` 110 111 Raises: 112 ValueError: if the string argument cannot be decoded correctly. 113 """ 114 115 116@dataclasses.dataclass(frozen=True, order=True) 117class _File(_ManifestKey): 118 """A dataclass to hold information about a file as a manifest key. 119 120 Attributes: 121 path: The path to the file, relative to the model root. 122 """ 123 124 path: pathlib.PurePath 125 126 def __str__(self) -> str: 127 return str(self.path) 128 129 @classmethod 130 @override 131 def from_str(cls, s: str) -> Self: 132 # Note that we always decode the string to a pure POSIX path 133 return cls(pathlib.PurePosixPath(s)) 134 135 136@dataclasses.dataclass(frozen=True, order=True) 137class _Shard(_ManifestKey): 138 """A dataclass to hold information about a file shard as a manifest key. 139 140 Attributes: 141 path: The path to the file, relative to the model root. 142 start: The start offset of the shard (included). 143 end: The end offset of the shard (not included). 144 """ 145 146 path: pathlib.PurePath 147 start: int 148 end: int 149 150 def __str__(self) -> str: 151 return f"{str(self.path)}:{self.start}:{self.end}" 152 153 @classmethod 154 @override 155 def from_str(cls, s: str) -> Self: 156 parts = s.split(":") 157 if len(parts) != 3: 158 raise ValueError(f"Expected 3 components separated by `:`, got {s}") 159 160 path = pathlib.PurePosixPath(parts[0]) 161 start = int(parts[1]) 162 end = int(parts[2]) 163 164 return cls(path, start, end) 165 166 167class ManifestItem(metaclass=abc.ABCMeta): 168 """An individual object of a model, stored as an item in a manifest. 169 170 For example, this could be a file, or a file shard. All file paths are 171 relative to the model root, to allow moving or renaming the model, without 172 invalidating the signature. 173 174 The integrity of each `ManifestItem` can be verified individually and in 175 parallel. If the item is backed by a file, we recompute the hash of the 176 portion of the file that represents this item. 177 178 Attributes: 179 digest: The digest of the item. Use the `key` property to obtain a 180 canonical unique representation for the item. 181 """ 182 183 digest: hashing.Digest 184 185 @property 186 @abc.abstractmethod 187 def key(self) -> _ManifestKey: 188 """A unique representation for the manifest item. 189 190 Two items in the same manifest must not share the same `key`. The 191 information contained in `key` should be sufficient to determine how to 192 compute the item's digest. 193 """ 194 195 196class FileManifestItem(ManifestItem): 197 """A manifest item that records a filename path together with its digest. 198 199 Note that the path component is a `pathlib.PurePath`, relative to the model. 200 To ensure that the manifest is consistent across operating systems, we 201 convert the path to a POSIX path. 202 """ 203 204 def __init__(self, *, path: pathlib.PurePath, digest: hashing.Digest): 205 """Builds a manifest item pairing a file with its digest. 206 207 Args: 208 path: The path to the file, relative to the model root. 209 digest: The digest of the file. 210 """ 211 # Note: we need to force a PurePosixPath to canonicalize the manifest. 212 self._path = pathlib.PurePosixPath(path) 213 self.digest = digest 214 215 @property 216 @override 217 def key(self) -> _ManifestKey: 218 return _File(self._path) 219 220 221class ShardedFileManifestItem(ManifestItem): 222 """A manifest item that records a file shard together with its digest. 223 224 Note that the path component is a `pathlib.PurePath`, relative to the model. 225 To ensure that the manifest is consistent across operating systems, we 226 convert the path to a POSIX path. 227 """ 228 229 def __init__( 230 self, 231 *, 232 path: pathlib.PurePath, 233 start: int, 234 end: int, 235 digest: hashing.Digest, 236 ): 237 """Builds a manifest item pairing a file shard with its digest. 238 239 Args: 240 path: The path to the file, relative to the model root. 241 start: The start offset of the shard (included). 242 end: The end offset of the shard (not included). 243 digest: The digest of the file shard. 244 """ 245 # Note: we need to force a PurePosixPath to canonicalize the manifest. 246 self._path = pathlib.PurePosixPath(path) 247 self._start = start 248 self._end = end 249 self.digest = digest 250 251 @property 252 @override 253 def key(self) -> _ManifestKey: 254 return _Shard(self._path, self._start, self._end) 255 256 257class SerializationType(metaclass=abc.ABCMeta): 258 """A description of the serialization process that generated the manifest. 259 260 These should record all the parameters needed to ensure a reproducible 261 serialization. These are used to build a manifest back from the signature in 262 a backwards compatible way. We use these to determine what serialization to 263 use when verifying a signature. 264 """ 265 266 @property 267 @abc.abstractmethod 268 def serialization_parameters(self) -> dict[str, Any]: 269 """The arguments of the serialization method.""" 270 271 @classmethod 272 def from_args(cls, args: dict[str, Any]) -> Self: 273 """Builds an instance of this class based on the dict representation. 274 275 This is the reverse of `serialization_parameters`. 276 277 Args: 278 args: The arguments as a dictionary (equivalent to `**kwargs`). 279 """ 280 serialization_type = args["method"] 281 for subclass in [_FileSerialization, _ShardSerialization]: 282 if serialization_type == subclass.method: 283 return subclass._from_args(args) 284 raise ValueError(f"Unknown serialization type {serialization_type}") 285 286 @classmethod 287 @abc.abstractmethod 288 def _from_args(cls, args: dict[str, Any]) -> Self: 289 """Performs the actual build from `from_dict`.""" 290 291 @abc.abstractmethod 292 def new_item(self, name: str, digest: hashing.Digest) -> ManifestItem: 293 """Builds a `ManifestItem` of the correct type. 294 295 Each serialization type results in different types for the items in the 296 manifest. This method parses the `name` of the item according to the 297 serialization type to create the proper manifest item. 298 299 Args: 300 name: The name of the item, as shown in the manifest. 301 digest: The digest of the item. 302 """ 303 304 305class _FileSerialization(SerializationType): 306 method: Final[str] = "files" 307 308 def __init__( 309 self, 310 hash_type: str, 311 allow_symlinks: bool = False, 312 ignore_paths: Iterable[pathlib.Path] = frozenset(), 313 ): 314 """Records the manifest serialization type for serialization by files. 315 316 We only need to record the hashing engine used and whether symlinks are 317 hashed or ignored. 318 319 Args: 320 hash_type: A string representation of the hash algorithm. 321 allow_symlinks: Controls whether symbolic links are included. 322 ignore_paths: Paths of files to ignore. 323 """ 324 self._hash_type = hash_type 325 self._allow_symlinks = allow_symlinks 326 self._ignore_paths = [str(p) for p in ignore_paths] 327 328 @property 329 @override 330 def serialization_parameters(self) -> dict[str, Any]: 331 res = { 332 "method": self.method, 333 "hash_type": self._hash_type, 334 "allow_symlinks": self._allow_symlinks, 335 } 336 if self._ignore_paths: 337 res["ignore_paths"] = self._ignore_paths 338 return res 339 340 @classmethod 341 @override 342 def _from_args(cls, args: dict[str, Any]) -> Self: 343 return cls( 344 args["hash_type"], 345 args["allow_symlinks"], 346 args.get("ignore_paths", frozenset()), 347 ) 348 349 @override 350 def new_item(self, name: str, digest: hashing.Digest) -> ManifestItem: 351 path = pathlib.PurePosixPath(name) 352 return FileManifestItem(path=path, digest=digest) 353 354 355class _ShardSerialization(SerializationType): 356 method: Final[str] = "shards" 357 358 def __init__( 359 self, 360 hash_type: str, 361 shard_size: int, 362 allow_symlinks: bool = False, 363 ignore_paths: Iterable[pathlib.Path] = frozenset(), 364 ): 365 """Records the manifest serialization type for serialization by shards. 366 367 We need to record the hashing engine used and whether symlinks are 368 hashed or ignored, just like for file serialization. We also need to 369 record the shard size used to split the files, since different shard 370 sizes results in different resources. 371 372 Args: 373 hash_type: A string representation of the hash algorithm. 374 shard_size: The size of each file shard. 375 allow_symlinks: Controls whether symbolic links are included. 376 ignore_paths: Paths of files to ignore. 377 """ 378 self._hash_type = hash_type 379 self._allow_symlinks = allow_symlinks 380 self._shard_size = shard_size 381 self._ignore_paths = [str(p) for p in ignore_paths] 382 383 @property 384 @override 385 def serialization_parameters(self) -> dict[str, Any]: 386 res = { 387 "method": self.method, 388 "hash_type": self._hash_type, 389 "shard_size": self._shard_size, 390 "allow_symlinks": self._allow_symlinks, 391 } 392 if self._ignore_paths: 393 res["ignore_paths"] = self._ignore_paths 394 return res 395 396 @classmethod 397 @override 398 def _from_args(cls, args: dict[str, Any]) -> Self: 399 return cls( 400 args["hash_type"], 401 args["shard_size"], 402 args["allow_symlinks"], 403 args.get("ignore_paths", frozenset()), 404 ) 405 406 @override 407 def new_item(self, name: str, digest: hashing.Digest) -> ManifestItem: 408 parts = name.split(":") 409 if len(parts) != 3: 410 raise ValueError( 411 "Invalid resource name: expected 3 components separated by " 412 f"`:`, got {name}" 413 ) 414 415 path = pathlib.PurePosixPath(parts[0]) 416 start = int(parts[1]) 417 end = int(parts[2]) 418 return ShardedFileManifestItem( 419 path=path, start=start, end=end, digest=digest 420 ) 421 422 423class Manifest: 424 """Generic manifest file to represent a model.""" 425 426 def __init__( 427 self, 428 model_name: str, 429 items: Iterable[ManifestItem], 430 serialization_type: SerializationType, 431 ): 432 """Builds a manifest from a collection of already hashed objects. 433 434 Args: 435 model_name: A name for the model that generated the manifest. This 436 is the final component of the model path, and is only informative. 437 See `model_name` property. 438 items: An iterable sequence of objects and their hashes. 439 serialization_type: The serialization method used to create items. 440 """ 441 self._name = model_name 442 self._item_to_digest = {item.key: item.digest for item in items} 443 self._serialization_type = serialization_type 444 445 def __eq__(self, other: Self): 446 """Check equality based on item digests, ignoring model name.""" 447 return self._item_to_digest == other._item_to_digest 448 449 def resource_descriptors(self) -> Iterator[_ResourceDescriptor]: 450 """Yields each resource from the manifest, one by one.""" 451 for item, digest in sorted(self._item_to_digest.items()): 452 yield _ResourceDescriptor(identifier=str(item), digest=digest) 453 454 @property 455 def model_name(self) -> str: 456 """The name of the model when serialized (final component of the path). 457 458 This is only informative. Changing the name of the model should still 459 result in the same digests after serialization, it must not invalidate 460 signatures. As a result, two manifests with different model names but 461 with the same resource descriptors will compare equal. 462 """ 463 return self._name 464 465 @property 466 def serialization_type(self) -> dict[str, Any]: 467 """The serialization (and arguments) used to build the manifest. 468 469 This is needed to record the serialization method used to generate the 470 manifest so that signature verification can use the same method. 471 """ 472 return self._serialization_type.serialization_parameters
168class ManifestItem(metaclass=abc.ABCMeta): 169 """An individual object of a model, stored as an item in a manifest. 170 171 For example, this could be a file, or a file shard. All file paths are 172 relative to the model root, to allow moving or renaming the model, without 173 invalidating the signature. 174 175 The integrity of each `ManifestItem` can be verified individually and in 176 parallel. If the item is backed by a file, we recompute the hash of the 177 portion of the file that represents this item. 178 179 Attributes: 180 digest: The digest of the item. Use the `key` property to obtain a 181 canonical unique representation for the item. 182 """ 183 184 digest: hashing.Digest 185 186 @property 187 @abc.abstractmethod 188 def key(self) -> _ManifestKey: 189 """A unique representation for the manifest item. 190 191 Two items in the same manifest must not share the same `key`. The 192 information contained in `key` should be sufficient to determine how to 193 compute the item's digest. 194 """
An individual object of a model, stored as an item in a manifest.
For example, this could be a file, or a file shard. All file paths are relative to the model root, to allow moving or renaming the model, without invalidating the signature.
The integrity of each ManifestItem can be verified individually and in
parallel. If the item is backed by a file, we recompute the hash of the
portion of the file that represents this item.
Attributes:
- digest: The digest of the item. Use the
keyproperty to obtain a canonical unique representation for the item.
186 @property 187 @abc.abstractmethod 188 def key(self) -> _ManifestKey: 189 """A unique representation for the manifest item. 190 191 Two items in the same manifest must not share the same `key`. The 192 information contained in `key` should be sufficient to determine how to 193 compute the item's digest. 194 """
197class FileManifestItem(ManifestItem): 198 """A manifest item that records a filename path together with its digest. 199 200 Note that the path component is a `pathlib.PurePath`, relative to the model. 201 To ensure that the manifest is consistent across operating systems, we 202 convert the path to a POSIX path. 203 """ 204 205 def __init__(self, *, path: pathlib.PurePath, digest: hashing.Digest): 206 """Builds a manifest item pairing a file with its digest. 207 208 Args: 209 path: The path to the file, relative to the model root. 210 digest: The digest of the file. 211 """ 212 # Note: we need to force a PurePosixPath to canonicalize the manifest. 213 self._path = pathlib.PurePosixPath(path) 214 self.digest = digest 215 216 @property 217 @override 218 def key(self) -> _ManifestKey: 219 return _File(self._path)
A manifest item that records a filename path together with its digest.
Note that the path component is a pathlib.PurePath, relative to the model.
To ensure that the manifest is consistent across operating systems, we
convert the path to a POSIX path.
205 def __init__(self, *, path: pathlib.PurePath, digest: hashing.Digest): 206 """Builds a manifest item pairing a file with its digest. 207 208 Args: 209 path: The path to the file, relative to the model root. 210 digest: The digest of the file. 211 """ 212 # Note: we need to force a PurePosixPath to canonicalize the manifest. 213 self._path = pathlib.PurePosixPath(path) 214 self.digest = digest
Builds a manifest item pairing a file with its digest.
Arguments:
- path: The path to the file, relative to the model root.
- digest: The digest of the file.
222class ShardedFileManifestItem(ManifestItem): 223 """A manifest item that records a file shard together with its digest. 224 225 Note that the path component is a `pathlib.PurePath`, relative to the model. 226 To ensure that the manifest is consistent across operating systems, we 227 convert the path to a POSIX path. 228 """ 229 230 def __init__( 231 self, 232 *, 233 path: pathlib.PurePath, 234 start: int, 235 end: int, 236 digest: hashing.Digest, 237 ): 238 """Builds a manifest item pairing a file shard with its digest. 239 240 Args: 241 path: The path to the file, relative to the model root. 242 start: The start offset of the shard (included). 243 end: The end offset of the shard (not included). 244 digest: The digest of the file shard. 245 """ 246 # Note: we need to force a PurePosixPath to canonicalize the manifest. 247 self._path = pathlib.PurePosixPath(path) 248 self._start = start 249 self._end = end 250 self.digest = digest 251 252 @property 253 @override 254 def key(self) -> _ManifestKey: 255 return _Shard(self._path, self._start, self._end)
A manifest item that records a file shard together with its digest.
Note that the path component is a pathlib.PurePath, relative to the model.
To ensure that the manifest is consistent across operating systems, we
convert the path to a POSIX path.
230 def __init__( 231 self, 232 *, 233 path: pathlib.PurePath, 234 start: int, 235 end: int, 236 digest: hashing.Digest, 237 ): 238 """Builds a manifest item pairing a file shard with its digest. 239 240 Args: 241 path: The path to the file, relative to the model root. 242 start: The start offset of the shard (included). 243 end: The end offset of the shard (not included). 244 digest: The digest of the file shard. 245 """ 246 # Note: we need to force a PurePosixPath to canonicalize the manifest. 247 self._path = pathlib.PurePosixPath(path) 248 self._start = start 249 self._end = end 250 self.digest = digest
Builds a manifest item pairing a file shard with its digest.
Arguments:
- path: The path to the file, relative to the model root.
- start: The start offset of the shard (included).
- end: The end offset of the shard (not included).
- digest: The digest of the file shard.
258class SerializationType(metaclass=abc.ABCMeta): 259 """A description of the serialization process that generated the manifest. 260 261 These should record all the parameters needed to ensure a reproducible 262 serialization. These are used to build a manifest back from the signature in 263 a backwards compatible way. We use these to determine what serialization to 264 use when verifying a signature. 265 """ 266 267 @property 268 @abc.abstractmethod 269 def serialization_parameters(self) -> dict[str, Any]: 270 """The arguments of the serialization method.""" 271 272 @classmethod 273 def from_args(cls, args: dict[str, Any]) -> Self: 274 """Builds an instance of this class based on the dict representation. 275 276 This is the reverse of `serialization_parameters`. 277 278 Args: 279 args: The arguments as a dictionary (equivalent to `**kwargs`). 280 """ 281 serialization_type = args["method"] 282 for subclass in [_FileSerialization, _ShardSerialization]: 283 if serialization_type == subclass.method: 284 return subclass._from_args(args) 285 raise ValueError(f"Unknown serialization type {serialization_type}") 286 287 @classmethod 288 @abc.abstractmethod 289 def _from_args(cls, args: dict[str, Any]) -> Self: 290 """Performs the actual build from `from_dict`.""" 291 292 @abc.abstractmethod 293 def new_item(self, name: str, digest: hashing.Digest) -> ManifestItem: 294 """Builds a `ManifestItem` of the correct type. 295 296 Each serialization type results in different types for the items in the 297 manifest. This method parses the `name` of the item according to the 298 serialization type to create the proper manifest item. 299 300 Args: 301 name: The name of the item, as shown in the manifest. 302 digest: The digest of the item. 303 """
A description of the serialization process that generated the manifest.
These should record all the parameters needed to ensure a reproducible serialization. These are used to build a manifest back from the signature in a backwards compatible way. We use these to determine what serialization to use when verifying a signature.
267 @property 268 @abc.abstractmethod 269 def serialization_parameters(self) -> dict[str, Any]: 270 """The arguments of the serialization method."""
The arguments of the serialization method.
272 @classmethod 273 def from_args(cls, args: dict[str, Any]) -> Self: 274 """Builds an instance of this class based on the dict representation. 275 276 This is the reverse of `serialization_parameters`. 277 278 Args: 279 args: The arguments as a dictionary (equivalent to `**kwargs`). 280 """ 281 serialization_type = args["method"] 282 for subclass in [_FileSerialization, _ShardSerialization]: 283 if serialization_type == subclass.method: 284 return subclass._from_args(args) 285 raise ValueError(f"Unknown serialization type {serialization_type}")
Builds an instance of this class based on the dict representation.
This is the reverse of serialization_parameters.
Arguments:
- args: The arguments as a dictionary (equivalent to
**kwargs).
292 @abc.abstractmethod 293 def new_item(self, name: str, digest: hashing.Digest) -> ManifestItem: 294 """Builds a `ManifestItem` of the correct type. 295 296 Each serialization type results in different types for the items in the 297 manifest. This method parses the `name` of the item according to the 298 serialization type to create the proper manifest item. 299 300 Args: 301 name: The name of the item, as shown in the manifest. 302 digest: The digest of the item. 303 """
Builds a ManifestItem of the correct type.
Each serialization type results in different types for the items in the
manifest. This method parses the name of the item according to the
serialization type to create the proper manifest item.
Arguments:
- name: The name of the item, as shown in the manifest.
- digest: The digest of the item.
424class Manifest: 425 """Generic manifest file to represent a model.""" 426 427 def __init__( 428 self, 429 model_name: str, 430 items: Iterable[ManifestItem], 431 serialization_type: SerializationType, 432 ): 433 """Builds a manifest from a collection of already hashed objects. 434 435 Args: 436 model_name: A name for the model that generated the manifest. This 437 is the final component of the model path, and is only informative. 438 See `model_name` property. 439 items: An iterable sequence of objects and their hashes. 440 serialization_type: The serialization method used to create items. 441 """ 442 self._name = model_name 443 self._item_to_digest = {item.key: item.digest for item in items} 444 self._serialization_type = serialization_type 445 446 def __eq__(self, other: Self): 447 """Check equality based on item digests, ignoring model name.""" 448 return self._item_to_digest == other._item_to_digest 449 450 def resource_descriptors(self) -> Iterator[_ResourceDescriptor]: 451 """Yields each resource from the manifest, one by one.""" 452 for item, digest in sorted(self._item_to_digest.items()): 453 yield _ResourceDescriptor(identifier=str(item), digest=digest) 454 455 @property 456 def model_name(self) -> str: 457 """The name of the model when serialized (final component of the path). 458 459 This is only informative. Changing the name of the model should still 460 result in the same digests after serialization, it must not invalidate 461 signatures. As a result, two manifests with different model names but 462 with the same resource descriptors will compare equal. 463 """ 464 return self._name 465 466 @property 467 def serialization_type(self) -> dict[str, Any]: 468 """The serialization (and arguments) used to build the manifest. 469 470 This is needed to record the serialization method used to generate the 471 manifest so that signature verification can use the same method. 472 """ 473 return self._serialization_type.serialization_parameters
Generic manifest file to represent a model.
427 def __init__( 428 self, 429 model_name: str, 430 items: Iterable[ManifestItem], 431 serialization_type: SerializationType, 432 ): 433 """Builds a manifest from a collection of already hashed objects. 434 435 Args: 436 model_name: A name for the model that generated the manifest. This 437 is the final component of the model path, and is only informative. 438 See `model_name` property. 439 items: An iterable sequence of objects and their hashes. 440 serialization_type: The serialization method used to create items. 441 """ 442 self._name = model_name 443 self._item_to_digest = {item.key: item.digest for item in items} 444 self._serialization_type = serialization_type
Builds a manifest from a collection of already hashed objects.
Arguments:
- model_name: A name for the model that generated the manifest. This
is the final component of the model path, and is only informative.
See
model_nameproperty. - items: An iterable sequence of objects and their hashes.
- serialization_type: The serialization method used to create items.
450 def resource_descriptors(self) -> Iterator[_ResourceDescriptor]: 451 """Yields each resource from the manifest, one by one.""" 452 for item, digest in sorted(self._item_to_digest.items()): 453 yield _ResourceDescriptor(identifier=str(item), digest=digest)
Yields each resource from the manifest, one by one.
455 @property 456 def model_name(self) -> str: 457 """The name of the model when serialized (final component of the path). 458 459 This is only informative. Changing the name of the model should still 460 result in the same digests after serialization, it must not invalidate 461 signatures. As a result, two manifests with different model names but 462 with the same resource descriptors will compare equal. 463 """ 464 return self._name
The name of the model when serialized (final component of the path).
This is only informative. Changing the name of the model should still result in the same digests after serialization, it must not invalidate signatures. As a result, two manifests with different model names but with the same resource descriptors will compare equal.
466 @property 467 def serialization_type(self) -> dict[str, Any]: 468 """The serialization (and arguments) used to build the manifest. 469 470 This is needed to record the serialization method used to generate the 471 manifest so that signature verification can use the same method. 472 """ 473 return self._serialization_type.serialization_parameters
The serialization (and arguments) used to build the manifest.
This is needed to record the serialization method used to generate the manifest so that signature verification can use the same method.