Large-file support

Large-file support (LFS) is the term frequently applied to the ability to create files larger than either 2 or 4 GiB on 32-bit filesystems.

Details

Traditionally, many operating systems and their underlying file system implementations used 32-bit integers to represent file sizes and positions. Consequently, no file could be larger than 2^32 − 1 bytes (4 GiB − 1). In many implementations, the problem was exacerbated by treating the sizes as signed numbers, which further lowered the limit to 2^31 − 1 bytes (2 GiB − 1). Files that were too large for 32-bit operating systems to handle came to be known as large files.
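
Both limits follow directly from the width and signedness of the size field. The following is a minimal C sketch of that arithmetic; the function names are illustrative only and do not belong to any standard API.

```c
#include <assert.h>
#include <stdint.h>

/* Largest size an unsigned 32-bit field can hold: 2^32 - 1 bytes. */
uint64_t max_size_unsigned32(void) {
    return (uint64_t)UINT32_MAX;
}

/* Largest size a signed 32-bit field can hold: 2^31 - 1 bytes. */
uint64_t max_size_signed32(void) {
    return (uint64_t)INT32_MAX;
}

/* Nonzero if a byte offset can be expressed as a signed 32-bit value. */
int fits_in_offset32(int64_t offset) {
    return offset >= 0 && offset <= INT32_MAX;
}

/* Narrow a 64-bit offset to 32 bits. The assert fires for an offset that
   lies inside a "large file" and cannot be represented. */
int32_t to_offset32(int64_t offset) {
    assert(fits_in_offset32(offset));
    return (int32_t)offset;
}
```

An offset of exactly 2 GiB (2147483648) is the first one that `fits_in_offset32` rejects, which is why a signed 32-bit interface stops one byte short of 2 GiB.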

While the limit was quite acceptable at a time when hard disks were smaller, the general increase in storage capacity, combined with increased server and desktop file usage, especially for database and multimedia files, put intense pressure on OS vendors to overcome the limitation.

In 1996, multiple vendors responded by forming an industry initiative known as the Large File Summit (an obvious backronym of "LFS") to support large files on POSIX; at the time, Windows NT already supported large files on NTFS. The summit was tasked with defining a standardized way to switch to 64-bit numbers to represent file sizes.[1]

This switch caused deployment issues and required design modifications, the consequences of which can still be seen:

  • The change to 64-bit file sizes frequently required incompatible changes to file system layout, which meant that large-file support sometimes necessitated a file system change. For example, Microsoft Windows' FAT32 file system does not support files larger than 4 GiB−1; one has to use NTFS or exFAT instead.
  • To support binary compatibility with old applications, operating system interfaces had to retain their use of 32-bit file sizes and new interfaces had to be designed specifically for large-file support.
  • To support writing portable code that makes use of LFS where possible, C standard library authors devised mechanisms that, depending on preprocessor constants, transparently redefined the functions to the 64-bit large-file aware ones.
  • Many old interfaces, especially C-based ones, explicitly specified argument types in a way that did not allow straightforward or transparent transition to 64-bit types. For example, the C functions fseek and ftell operate on file positions of type long int, which is typically 32 bits wide on 32-bit platforms, and cannot be made larger without sacrificing backward compatibility. (This was resolved by introducing new functions fseeko and ftello in POSIX.[2] On Windows machines, under Visual C++, functions _fseeki64 and _ftelli64 are used.)
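
The last two points can be sketched in C as follows. This is a hedged example, not the method of any particular library: the macro `_FILE_OFFSET_BITS` is not named in this article but is the preprocessor constant that glibc and other LFS-conforming C libraries check, and the helper names `file_size_bytes` and `off_t_is_64bit` are made up for illustration.

```c
/* Request the 64-bit variants of the file API on 32-bit platforms.
   Must be defined before any system header is included. */
#define _FILE_OFFSET_BITS 64

#include <assert.h>
#include <stdio.h>
#include <sys/types.h>

/* Return the size of the file at `path` in bytes, or -1 on error.
   fseeko/ftello take and return off_t instead of long int, so sizes
   above 2 GiB are reported correctly when off_t is 64 bits wide. */
long long file_size_bytes(const char *path) {
    assert(path != NULL);
    FILE *fp = fopen(path, "rb");
    if (fp == NULL) {
        return -1;
    }
    off_t size = -1;
    if (fseeko(fp, 0, SEEK_END) == 0) {
        size = ftello(fp);   /* -1 if the position cannot be determined */
    }
    fclose(fp);
    return (long long)size;
}

/* Nonzero if off_t can hold 64-bit file positions in this build. */
int off_t_is_64bit(void) {
    return sizeof(off_t) * 8 >= 64;
}
```

The same source compiles unchanged on 64-bit platforms, where `off_t` is already 64 bits wide and the macro has no effect. The equivalent code with `fseek` and `ftell` would be limited by the width of `long int`.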

Adoption

For a long time, use of the large-file API in 32-bit programs remained incomplete. An analysis in 2002 showed that many base libraries of operating systems were still shipped without large-file support, thereby limiting the applications that used them.[3] The widely used zlib library did not support 64-bit large files on 32-bit platforms until 2006.[4]

The problem disappeared slowly as PCs and workstations moved completely to 64-bit computing. Microsoft Windows Server 2008 was the last server version to be shipped in a 32-bit variant.[5] Red Hat Enterprise Linux 7 was published in 2014 only as a 64-bit operating system.[6] Ubuntu Linux stopped delivering a 32-bit variant in 2019.[7] Nvidia stopped developing 32-bit drivers in 2018 and stopped delivering updates after January 2019.[8] Apple stopped developing 32-bit Mac OS versions in 2018, delivering macOS Mojave only as a 64-bit operating system.[9] No end-of-life date is known for 32-bit support in Windows 10 on the desktop; this is related to the last upgrades from older systems such as Windows 7 and Windows 8 in January 2020, as some of those systems ran on old computers built on the i386 architecture.[10]

A similar development can be seen in the mobile area. Google required applications in its app store to provide 64-bit versions by August 2019,[11] which allows 32-bit support for Android to be discontinued later.[12] The shift towards 64-bit started in 2014, when all new processors were designed for a 64-bit architecture and Android 5 ("Lollipop") was published with a matching 64-bit variant of the operating system.[13][12] Apple had made the shift the year before, starting production of the 64-bit Apple A7 in 2013. Google started to deliver the development environment for Linux only in 64-bit in 2015.[14] In May 2019, the share of Android versions below 5 had fallen to ten percent.[15] As app developers concentrate on a single compilation variant, many manufacturers started to require Android 5 as the minimum version by mid-2019, for example Niantic.[16] Subsequently, the 32-bit versions became hard to get.[17]

Except in embedded systems with their specialized programs, program code no longer needs to account for varying large-file support after 2020.

The year 2038 problem is a well-known case of another problem caused by a 32-bit "long" on 32-bit platforms. Like the large-file limitation, it will become obsolete when systems move to 64-bit only. In the meantime, a 64-bit timestamp was introduced. In the Win32 API it is visible in functions that have a "64" suffix alongside the earlier "32" suffix. When large-file support was added to the Win32 API, it led to functions with an additional "i64" suffix, which sometimes makes for four combinations (findfirst32, findfirst64, findfirst32i64, findfirst64i32).[18] By comparison, the UNIX98 API introduces functions with a "64" suffix when "_LARGEFILE64_SOURCE" is used.
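
The date behind the year 2038 problem comes from the same 2^31 − 1 limit, applied to seconds since 1970-01-01 00:00:00 UTC instead of bytes. A minimal C sketch, with illustrative function names that are not part of any standard API:

```c
#include <assert.h>
#include <stdint.h>
#include <time.h>

/* Break down the last second a signed 32-bit timestamp can represent
   (2^31 - 1 seconds after the Unix epoch) into UTC calendar fields.
   Returns 0 on success, -1 on failure. */
int last_32bit_second_utc(struct tm *out) {
    assert(out != NULL);
    time_t t = (time_t)INT32_MAX;
    struct tm *utc = gmtime(&t);   /* not thread-safe; fine for a sketch */
    if (utc == NULL) {
        return -1;
    }
    *out = *utc;
    return 0;
}

/* One second later needs a wider counter, such as a 64-bit timestamp. */
int64_t first_second_beyond_32bit(void) {
    return (int64_t)INT32_MAX + 1;
}
```

The fields filled in by `last_32bit_second_utc` correspond to 2038-01-19 03:14:07 UTC. Note that `tm_year` counts from 1900 and `tm_mon` from 0.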

Related to the large-file API is a limitation on block numbers for mass storage media. With a common size of 512 bytes per data block, the barrier resulting from 32-bit numbers was reached later. When hard disk drives reached a size of 2 terabytes (around 2010), the master boot record had to be replaced by the GUID Partition Table, which uses 64-bit LBA numbers (logical block addresses). On Unix-like operating systems this also required enlarging the inode numbers, which are used in some functions (stat64, setrlimit64). The Linux kernel introduced that in 2001, leading to version 2.4, and glibc picked it up in the same year.[19] As large-file support and large-disk support were introduced at the same time, the GNU C Library exports 64-bit inode structures on 32-bit architectures whenever the Unix LFS API is activated in program code.[20]

When the kernel moved to 64-bit inodes, the ext3 file system driver used them internally by 2001. However, the inode format on the storage media itself was stuck at 32-bit numbers.[19] As mass storage devices moved to the Advanced Format of 4 kilobytes per block, the actual limit of that file system format is 8 or 16 terabytes.[19] Handling larger disk partitions requires a different file system such as XFS, which was designed with 64-bit inodes from the start, allowing for exabyte-sized files and partitions.[21][22] The first 16-terabyte magnetic disk drives were delivered by mid-2019. Solid-state drives with 32 TiB for data centers were available as early as 2016, with some manufacturers forecasting 100 TiB SSDs by 2020.[23]
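
The capacity barriers for block-addressed media come from one multiplication: the number of distinct block numbers times the block size. The C sketch below uses an illustrative helper name, `addressable_bytes`. It assumes that the 8-terabyte figure corresponds to 31 usable bits (a signed block number) and the 16-terabyte figure to 32 bits; the text above does not state that reason.

```c
#include <assert.h>
#include <stdint.h>

/* Bytes addressable with `bits`-wide block numbers and blocks of
   `block_size` bytes: 2^bits * block_size. The asserts reject inputs
   whose result would not fit into 64 bits. */
uint64_t addressable_bytes(unsigned bits, uint64_t block_size) {
    assert(bits < 64);
    uint64_t blocks = UINT64_C(1) << bits;
    assert(block_size <= UINT64_MAX / blocks);   /* overflow guard */
    return blocks * block_size;
}
```

With these assumptions, `addressable_bytes(32, 512)` gives 2^41 bytes (2 TiB, the master boot record barrier), `addressable_bytes(31, 4096)` gives 2^43 bytes (8 TiB) and `addressable_bytes(32, 4096)` gives 2^44 bytes (16 TiB).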

References

  1. Solaris OS group (March 1996). "Large Files in Solaris: A White Paper" (PDF). Sun Microsystems. Archived from the original (PDF) on 2007-02-28.
  2. "Adding Large File Support to the Single UNIX Specification". X/Open Base Working Group. 1996-08-14. Retrieved 2006-09-10.
  3. http://ac-archive.sourceforge.net/largefile/distros.html
  4. https://www.zlib.net/ChangeLog.txt
  5. Kolokythas, Panagiotis (2007-05-28). "Windows Server 2008: Microsofts letztes 32-Bit-Betriebssystem für Server" (in German). PC Welt.
  6. "Are 32-bit applications supported in RHEL 7 or later releases?". Red Hat. February 2014.
  7. Cooke, Will (2019-06-02). "Intel 32bit packages on Ubuntu from 19.10 onwards". Canonical.
  8. Addams, Matthew (2018-04-12). "Nvidia discontinues support for 32-bit Windows platforms". Windows Report.
  9. Silver, Steven (2018-06-05). "Mojave is Apple's last version of macOS to support 32-bit apps". Apple Insider.
  10. "Der Support für Windows 7 endet am 14. Januar 2020" (in German). Microsoft. Retrieved 2020-02-09.
  11. Sebayang, Andreas (2019-01-17). "Auf dem Weg zu reinen 64-Bit-Android-Apps" (in German). Golem.
  12. mw (2019-01-17). "Google kündigt Ende von 32-Bit-Android-Apps per 2021 an" (in German). IT Magazin.
  13. "64-Bit-Android: Diese Prozessoren gibt es, diese Veränderungen kommen" (in German). Android User. 2014-08-26.
  14. "Platform-tools 23.1.0 Linux changed to 64-bit without notice". Android Public Tracker. 2015-12-11. It turns out the android-sdk-linux/platform-tools content is 32-bit ELF in 23.0.1 but 64-bit ELF in 23.1_rc1 and 23.1.0. […] I set ANDROID_EMULATOR_FORCE_32BIT=true […] 23.0.1 is the last 32-bit Linux build.
  15. Tenzer, F. (2019-11-14). "Anteile der verschiedenen Android-Versionen an allen Geräten mit Android OS weltweit im Zeitraum 01. bis 07. Mai 2019" (in German). Statista.
  16. Del Favero, Elia (2019-06-10). "Ingress und Pokémon Go brauchen bald mindestens Android 5".
  17. "Why is 32bit 0.159.0 version apk still not available?". TheSilphRoad/. Reddit. December 2019.
  18. "C Run-time library (CRT) reference: findfirst". Microsoft. Retrieved 2020-02-17.
  19. Jaeger, Andreas (2015-02-15). "Large File Support in Linux". SuSE GmbH.
  20. linux/bits/stat.h: /* Note stat64 has the same shape as stat for x86-64. */
  21. Rutter, M. J. "The 64 bit inode problem". Retrieved 2020-02-10.
  22. "Ext4 Howto". kernel.org. 2019-02-11. Although very large fileystems are on ext4's feature list, current e2fsprogs currently still limits the filesystem size to 2^32 blocks (16TiB for a 4KiB block filesystem). Allowing filesystems larger than 16T is one of the very next high-priority features to complete for ext4.
  23. Scherer, Thomas (2016-08-15). "Samsungs 32-TB-SSD: Der Anfang vom Ende der Festplatte" (in German). Elektor.
  24. Kuhnt, Udo; Georgiev, Luchezar I.; Davis, Jeremy (2007). "FAT+ draft revision 2" (FATPLUS.TXT) (2 ed.). Retrieved 2015-08-05.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.