The NetBSD Project

CVS log for src/sys/ufs/ufs/ufs_readwrite.c

[BACK] Up to [cvs.NetBSD.org] / src / sys / ufs / ufs

Request diff between arbitrary revisions


Default branch: MAIN
Current tag: MAIN


Revision 1.105 / (download) - annotate - [select for diffs], Tue Jan 22 09:39:18 2013 UTC (4 months ago) by dholland
Branch: MAIN
CVS Tags: yamt-pagecache-base8, tls-maxphys-nbase, tls-maxphys-base, khorben-n900, agc-symver-base, agc-symver, HEAD
Changes since 1.104: +6 -6 lines
Diff to previous 1.104 (colored)

Stuff UFS_ in front of a few of ufs's symbols to reduce namespace
pollution. Specifically:
   ROOTINO -> UFS_ROOTINO
   WINO -> UFS_WINO
   NXADDR -> UFS_NXADDR
   NDADDR -> UFS_NDADDR
   NIADDR -> UFS_NIADDR
   MAXSYMLINKLEN -> UFS_MAXSYMLINKLEN
   MAXSYMLINKLEN_UFS[12] -> UFS[12]_MAXSYMLINKLEN (for consistency)

Sort out ext2fs's misuse of NDADDR and NIADDR; fortunately, these have
the same values in ext2fs and ffs.

No functional change intended.

Revision 1.104 / (download) - annotate - [select for diffs], Sun Apr 29 22:54:01 2012 UTC (12 months, 3 weeks ago) by chs
Branch: MAIN
CVS Tags: yamt-pagecache-base7, yamt-pagecache-base6, yamt-pagecache-base5, jmcneill-usbmp-base10
Branch point for: tls-maxphys
Changes since 1.103: +3 -3 lines
Diff to previous 1.103 (colored)

change vflushbuf() to take the full FSYNC_* flags.
translate FSYNC_LAZY into PGO_LAZY for VOP_PUTPAGES() so that
genfs_do_io() can set the appropriate io priority for the I/O.
this is the first part of addressing PR 46325.

Revision 1.103 / (download) - annotate - [select for diffs], Tue Apr 17 19:15:16 2012 UTC (13 months ago) by christos
Branch: MAIN
CVS Tags: jmcneill-usbmp-base9
Changes since 1.102: +6 -8 lines
Diff to previous 1.102 (colored)

it is not an error if the kernel needs to clear the setuid/
setgid bit on write/chown/chgrp

Revision 1.102 / (download) - annotate - [select for diffs], Tue Mar 13 18:41:14 2012 UTC (14 months, 1 week ago) by elad
Branch: MAIN
CVS Tags: yamt-pagecache-base4, jmcneill-usbmp-base8
Changes since 1.101: +20 -6 lines
Diff to previous 1.101 (colored)

Replace the remaining KAUTH_GENERIC_ISSUSER authorization calls with
something meaningful. All relevant documentation has been updated or
written.

Most of these changes were brought up in the following messages:

    http://mail-index.netbsd.org/tech-kern/2012/01/18/msg012490.html
    http://mail-index.netbsd.org/tech-kern/2012/01/19/msg012502.html
    http://mail-index.netbsd.org/tech-kern/2012/02/17/msg012728.html

Thanks to christos, manu, njoly, and jmmv for input.

Huge thanks to pgoyette for spinning these changes through some build
cycles and ATF.

Revision 1.101 / (download) - annotate - [select for diffs], Mon Jan 2 22:10:45 2012 UTC (16 months, 2 weeks ago) by perseant
Branch: MAIN
CVS Tags: netbsd-6-base, jmcneill-usbmp-base7, jmcneill-usbmp-base6, jmcneill-usbmp-base5, jmcneill-usbmp-base4, jmcneill-usbmp-base3, jmcneill-usbmp-base2
Branch point for: netbsd-6
Changes since 1.100: +3 -2 lines
Diff to previous 1.100 (colored)


* Remove PGO_RECLAIM during lfs_putpages()' call to genfs_putpages(),
  to avoid a live lock in the latter when reclaiming a vnode with
  dirty pages.

* Add a new segment flag, SEGM_RECLAIM, to note when a segment is
  being written for vnode reclamation, and record which inode is being
  reclaimed, to aid in forensic debugging.

* Add a new segment flag, SEGM_SINGLE, so that opportunistic writes
  can write a single segment's worth of blocks and then stop, rather
  than writing all the way up to the cleaner's reserved number of
  segments.

* Add assert statements to check mutex ownership is the way it ought
  to be, mostly in lfs_putpages; fix problems uncovered by this.

* Don't clear VU_DIROP until the inode actually makes its way to disk,
  avoiding a problem where dirop inodes could become separated
  (uncovered by a modified version of the "ckckp" forensic regression
  test).

* Move the vfs_getopsbyname() call into lfs_writerd.  Prepare code to
  make lfs_writerd notice when there are no more LFSs, and exit losing
  the reference, so that, in theory, the module can be unloaded.  This
  code is not enabled, since it causes a crash on exit.

* Set IN_MODIFIED on inodes flushed by lfs_flush_dirops.  Really we
  only need to set IN_MODIFIED if we are going to write them again
  (e.g., to write pages); need to think about this more.

Finally, several changes to help avoid "no clean segments" panics:

* In lfs_bmapv, note when a vnode is loaded only to discover whether
  its blocks are live, so it can immediately be recycled.  Since the
  cleaner will try to choose ~empty segments over full ones, this
  prevents the cleaner from (1) filling the vnode cache with junk, and
  (2) squeezing any unwritten writes to disk and running the fs out of
  segments.

* Overestimate by half the amount of metadata that will be required
  to fill the clean segments.  This will make the disk appear smaller,
  but should help avoid a "no clean segments" panic.

* Rearrange lfs_writerd.  In particular, lfs_writerd now pays
  attention to the number of clean segments available, and holds off
  writing until there is room.

Revision 1.100 / (download) - annotate - [select for diffs], Fri Nov 18 21:18:52 2011 UTC (18 months ago) by christos
Branch: MAIN
CVS Tags: jmcneill-usbmp-pre-base2, jmcneill-usbmp-base, jmcneill-audiomp3-base, jmcneill-audiomp3
Branch point for: jmcneill-usbmp
Changes since 1.99: +4 -2 lines
Diff to previous 1.99 (colored)

Obey MNT_RELATIME, the only addition is that mkdir in ufs sets IN_ACCESS too.

Revision 1.99 / (download) - annotate - [select for diffs], Mon Jul 11 08:27:41 2011 UTC (22 months, 2 weeks ago) by hannken
Branch: MAIN
CVS Tags: yamt-pagecache-base3, yamt-pagecache-base2, yamt-pagecache-base
Branch point for: yamt-pagecache
Changes since 1.98: +3 -3 lines
Diff to previous 1.98 (colored)

Change VOP_BWRITE() to take a vnode as its first argument like all other
VOPs do.  Layered file systems no longer have to modify bp->b_vp and run
into trouble when an async VOP_BWRITE() uses the wrong vnode.

- change all occurences of VOP_BWRITE(bp) to VOP_BWRITE(bp->b_vp, bp).
- remove layer_bwrite().
- welcome to 5.99.55

Adresses PR kern/38762 panic: vwakeup: neg numoutput

No objections from tech-kern@.

Revision 1.98 / (download) - annotate - [select for diffs], Sun Jun 19 02:42:53 2011 UTC (23 months ago) by rmind
Branch: MAIN
Changes since 1.97: +4 -6 lines
Diff to previous 1.97 (colored)

- Fix a silly bug: remove umap from uobj in ubc_release() UBC_UNMAP case.
- Use UBC_WANT_UNMAP() consistently.

ARM (PMAP_CACHE_VIVT case) works again.

Revision 1.97 / (download) - annotate - [select for diffs], Sun Jun 12 03:36:02 2011 UTC (23 months, 1 week ago) by rmind
Branch: MAIN
Changes since 1.96: +4 -4 lines
Diff to previous 1.96 (colored)

Welcome to 5.99.53!  Merge rmind-uvmplock branch:

- Reorganize locking in UVM and provide extra serialisation for pmap(9).
  New lock order: [vmpage-owner-lock] -> pmap-lock.

- Simplify locking in some pmap(9) modules by removing P->V locking.

- Use lock object on vmobjlock (and thus vnode_t::v_interlock) to share
  the locks amongst UVM objects where necessary (tmpfs, layerfs, unionfs).

- Rewrite and optimise x86 TLB shootdown code, make it simpler and cleaner.
  Add TLBSTATS option for x86 to collect statistics about TLB shootdowns.

- Unify /dev/mem et al in MI code and provide required locking (removes
  kernel-lock on some ports).  Also, avoid cache-aliasing issues.

Thanks to Andrew Doran and Joerg Sonnenberger, as their initial patches
formed the core changes of this branch.

Revision 1.96 / (download) - annotate - [select for diffs], Sun Mar 6 17:08:39 2011 UTC (2 years, 2 months ago) by bouyer
Branch: MAIN
CVS Tags: rmind-uvmplock-nbase, rmind-uvmplock-base, cherry-xenmp-base
Branch point for: cherry-xenmp
Changes since 1.95: +3 -3 lines
Diff to previous 1.95 (colored)

merge the bouyer-quota2 branch. This adds a new on-disk format
to store disk quota usage and limits, integrated with ffs
metadata. Usage is checked by fsck_ffs (no more quotacheck)
and is covered by the WAPBL journal. Enabled with kernel
option QUOTA2 (added where QUOTA was enabled in kernel config files),
turned on with tunefs(8) on a per-filesystem
basis. mount_mfs(8) can also turn quotas on.

See http://mail-index.netbsd.org/tech-kern/2011/02/19/msg010025.html
for details.

Revision 1.95 / (download) - annotate - [select for diffs], Fri Apr 23 15:38:46 2010 UTC (3 years, 1 month ago) by pooka
Branch: MAIN
CVS Tags: yamt-nfs-mp-base11, yamt-nfs-mp-base10, uebayasi-xip-base4, uebayasi-xip-base3, uebayasi-xip-base2, uebayasi-xip-base1, matt-mips64-premerge-20101231, jruoho-x86intr-base, bouyer-quota2-nbase, bouyer-quota2-base
Branch point for: jruoho-x86intr, bouyer-quota2
Changes since 1.94: +2 -16 lines
Diff to previous 1.94 (colored)

Enforce RLIMIT_FSIZE before VOP_WRITE.  This adds support to file
system drivers where it was missing from and fixes one buggy
implementation.  The arguably weird semantics of the check are
maintained (v_size vs. va_bytes, overwrite).

Revision 1.94 / (download) - annotate - [select for diffs], Sun Feb 22 20:28:07 2009 UTC (4 years, 3 months ago) by ad
Branch: MAIN
CVS Tags: yamt-nfs-mp-base9, yamt-nfs-mp-base8, yamt-nfs-mp-base7, yamt-nfs-mp-base6, yamt-nfs-mp-base5, yamt-nfs-mp-base4, yamt-nfs-mp-base3, uebayasi-xip-base, nick-hppapmap-base4, nick-hppapmap-base3, nick-hppapmap-base2, nick-hppapmap-base, matt-premerge-20091211, jymxensuspend-base, jym-xensuspend-nbase, jym-xensuspend-base
Branch point for: uebayasi-xip, rmind-uvmplock
Changes since 1.93: +2 -3 lines
Diff to previous 1.93 (colored)

PR kern/26878 FFSv2 + softdep = livelock (no free ram)
PR kern/16942 panic with softdep and quotas
PR kern/19565 panic: softdep_write_inodeblock: indirect pointer #1 mismatch
PR kern/26274 softdep panic: allocdirect_merge: ...
PR kern/26374 Long delay before non-root users can write to softdep partitions
PR kern/28621 1.6.x "vp != NULL" panic in ffs_softdep.c:4653 while unmounting a softdep (+quota) filesystem
PR kern/29513 FFS+Softdep panic with unfsck-able file-corruption
PR kern/31544 The ffs softdep code appears to fail to write dirty bits to disk
PR kern/31981 stopping scsi disk can cause panic (softdep)
PR kern/32116 kernel panic in softdep (assertion failure)
PR kern/32532 softdep_trackbufs deadlock
PR kern/37191 softdep: locking against myself
PR kern/40474 Kernel panic after remounting raid root with softdep

Retire softdep, pass 2. As discussed and later formally announced on the
mailing lists.

Revision 1.93 / (download) - annotate - [select for diffs], Mon Dec 8 11:48:03 2008 UTC (4 years, 5 months ago) by pooka
Branch: MAIN
CVS Tags: mjf-devfs2-base, haad-nbase2, haad-dm-base2, haad-dm-base
Branch point for: jym-xensuspend
Changes since 1.92: +4 -4 lines
Diff to previous 1.92 (colored)

Decode write access advice and pass to uvm (not that it's handled
there, but ...).

Revision 1.92 / (download) - annotate - [select for diffs], Sun Oct 19 18:17:14 2008 UTC (4 years, 7 months ago) by hannken
Branch: MAIN
CVS Tags: netbsd-5-base, netbsd-5-2-RELEASE, netbsd-5-2-RC1, netbsd-5-2, netbsd-5-1-RELEASE, netbsd-5-1-RC4, netbsd-5-1-RC3, netbsd-5-1-RC2, netbsd-5-1-RC1, netbsd-5-1-2-RELEASE, netbsd-5-1-1-RELEASE, netbsd-5-1, netbsd-5-0-RELEASE, netbsd-5-0-RC4, netbsd-5-0-RC3, netbsd-5-0-RC2, netbsd-5-0-RC1, netbsd-5-0-2-RELEASE, netbsd-5-0-1-RELEASE, netbsd-5-0, netbsd-5, matt-nb5-pq3-base, matt-nb5-pq3, matt-nb5-mips64-u2-k2-k4-k7-k8-k9, matt-nb5-mips64-u1-k1-k5, matt-nb5-mips64-premerge-20101231, matt-nb5-mips64-premerge-20091211, matt-nb5-mips64-k15, matt-nb5-mips64, matt-nb4-mips64-k7-u2a-k9b, matt-mips64-base2, ad-audiomp2-base, ad-audiomp2
Branch point for: nick-hppapmap
Changes since 1.91: +3 -3 lines
Diff to previous 1.91 (colored)

Make genfs_directio() IO_JOURNALLOCKED aware.  DirectIO no longer triggers
"locking against myself" panic in wapbl_begin().

Observed and tested by: Frank Kardel <kardel@netbsd.org>

Revision 1.91 / (download) - annotate - [select for diffs], Fri Aug 22 10:48:22 2008 UTC (4 years, 9 months ago) by hannken
Branch: MAIN
CVS Tags: wrstuden-revivesa-base-4, wrstuden-revivesa-base-3, wrstuden-revivesa-base-2, haad-dm-base1
Changes since 1.90: +8 -6 lines
Diff to previous 1.90 (colored)

Add snapshot support for logging ffs file systems.

- Add UFS_WAPBL_BEGIN() / UFS_WAPBL_END() where needed.

- Expunge WAPBL log inodes from snapshots.

- Ffs_copyonwrite() and ffs_snapblkfree() must run inside a WAPBL transaction.

- Add ffs_gop_write() as a wrapper around genfs_gop_write() that makes sure
  genfs_gop_write() gets always called inside a WAPBL transaction.

- Add VOP_PUTPAGES() flag PGO_JOURNALLOCKED to tag calls to VOP_PUTPAGES()
  inside a WAPBL transaction.

Reviewed by: Simon Burge <simonb@netbsd.org>,  Greg Oster <oster@netbsd.org>

PGO_JOURNALLOCKED / ffs_gop_write() part presented on tech-kern@.

Revision 1.90 / (download) - annotate - [select for diffs], Tue Aug 12 10:14:38 2008 UTC (4 years, 9 months ago) by hannken
Branch: MAIN
Changes since 1.89: +7 -2 lines
Diff to previous 1.89 (colored)

Deny read/write access to snapshot vnodes.  We use fss(4) to read from
snapshots.  With this policy in place:

- Separate the snapshot vnode lock from the snapshot common lock.
  Snapshots no longer need recursive vnode locks.

- Use a mutex (si_snaplock) to serialize creation, deletion, reading and
  writing of snapshots.

- Move ffs_read() for snapshots into ffs_snapshot.c.

Reviewed by: Jason Thorpe <thorpej@netbsd.org>

While here change ffs_copyonwrite() to fail requests from pagedaemon that need
to copy-on-write.

Revision 1.89 / (download) - annotate - [select for diffs], Thu Jul 31 05:38:06 2008 UTC (4 years, 9 months ago) by simonb
Branch: MAIN
Changes since 1.88: +26 -4 lines
Diff to previous 1.88 (colored)

Merge the simonb-wapbl branch.  From the original branch commit:

   Add Wasabi System's WAPBL (Write Ahead Physical Block Logging)
   journaling code.  Originally written by Darrin B. Jewell while
   at Wasabi and updated to -current by Antti Kantee, Andy Doran,
   Greg Oster and Simon Burge.

OK'd by core@, releng@.

Revision 1.88 / (download) - annotate - [select for diffs], Fri May 16 09:22:01 2008 UTC (5 years ago) by hannken
Branch: MAIN
CVS Tags: yamt-pf42-base4, yamt-pf42-base3, yamt-pf42-base2, wrstuden-revivesa-base-1, wrstuden-revivesa-base, simonb-wapbl-nbase, simonb-wapbl-base, hpcarm-cleanup-nbase
Branch point for: simonb-wapbl, haad-dm
Changes since 1.87: +4 -4 lines
Diff to previous 1.87 (colored)

Make sure all cached buffers with valid, not yet written data have been
run through copy-on-write.  Call fscow_run() with valid data where possible.

The LP_UFSCOW hack is no longer needed to protect ffs_copyonwrite() against
endless recursion.

- Add a flag B_MODIFY to bread(), breada() and breadn().  If set the caller
  intends to modify the buffer returned.

- Always run copy-on-write on buffers returned from ffs_balloc().

- Add new function ffs_getblk() that gets a buffer, assigns a new blkno,
  may clear the buffer and runs copy-on-write.  Process possible errors
  from getblk() or fscow_run().  Part of PR kern/38664.

Welcome to 4.99.63

Reviewed by: YAMAMOTO Takashi <yamt@netbsd.org>

Revision 1.87 / (download) - annotate - [select for diffs], Thu Apr 24 15:35:31 2008 UTC (5 years, 1 month ago) by ad
Branch: MAIN
CVS Tags: yamt-nfs-mp-base2, yamt-nfs-mp-base
Branch point for: yamt-nfs-mp, wrstuden-revivesa
Changes since 1.86: +4 -4 lines
Diff to previous 1.86 (colored)

Network protocol interrupts can now block on locks, so merge the globals
proclist_mutex and proclist_lock into a single adaptive mutex (proc_lock).
Implications:

- Inspecting process state requires thread context, so signals can no longer
  be sent from a hardware interrupt handler. Signal activity must be
  deferred to a soft interrupt or kthread.

- As the proc state locking is simplified, it's now safe to take exit()
  and wait() out from under kernel_lock.

- The system spends less time at IPL_SCHED, and there is less lock activity.

Revision 1.86 / (download) - annotate - [select for diffs], Wed Jan 2 11:49:14 2008 UTC (5 years, 4 months ago) by ad
Branch: MAIN
CVS Tags: yamt-pf42-baseX, yamt-pf42-base, yamt-lazymbuf-base15, yamt-lazymbuf-base14, nick-net80211-sync-base, nick-net80211-sync, mjf-devfs-base, matt-armv6-nbase, matt-armv6-base, keiichi-mipv6-nbase, keiichi-mipv6-base, keiichi-mipv6, hpcarm-cleanup-base, bouyer-xeni386-nbase, bouyer-xeni386-base, ad-socklock-base1
Branch point for: yamt-pf42, mjf-devfs2
Changes since 1.85: +6 -6 lines
Diff to previous 1.85 (colored)

Merge vmlocking2 to head.

Revision 1.85 / (download) - annotate - [select for diffs], Sat Dec 8 19:29:57 2007 UTC (5 years, 5 months ago) by pooka
Branch: MAIN
CVS Tags: yamt-kmem-base3, yamt-kmem-base2, yamt-kmem-base, yamt-kmem, vmlocking2-base3, jmcneill-pm-base, cube-autoconf-base, cube-autoconf
Branch point for: bouyer-xeni386
Changes since 1.84: +3 -4 lines
Diff to previous 1.84 (colored)

Remove cn_lwp from struct componentname.  curlwp should be used
from on.  The NDINIT() macro no longer takes the lwp parameter and
associates the credentials of the calling thread with the namei
structure.

Revision 1.84 / (download) - annotate - [select for diffs], Wed Oct 10 20:42:40 2007 UTC (5 years, 7 months ago) by ad
Branch: MAIN
CVS Tags: yamt-x86pmap-base4, yamt-x86pmap-base3, vmlocking2-base2, vmlocking2-base1, vmlocking-nbase, vmlocking-base, reinoud-bufcleanup-nbase, reinoud-bufcleanup-base, jmcneill-base, bouyer-xenamd64-base2, bouyer-xenamd64-base, bouyer-xenamd64
Branch point for: vmlocking2, mjf-devfs
Changes since 1.83: +3 -3 lines
Diff to previous 1.83 (colored)

Merge from vmlocking:

- Split vnode::v_flag into three fields, depending on field locking.
- simple_lock -> kmutex in a few places.
- Fix some simple locking problems.

Revision 1.83 / (download) - annotate - [select for diffs], Mon Oct 8 18:01:31 2007 UTC (5 years, 7 months ago) by ad
Branch: MAIN
Changes since 1.82: +5 -6 lines
Diff to previous 1.82 (colored)

Merge ffs locking & brelse changes from the vmlocking branch.

Revision 1.82 / (download) - annotate - [select for diffs], Mon Sep 24 16:50:58 2007 UTC (5 years, 8 months ago) by pooka
Branch: MAIN
CVS Tags: yamt-x86pmap-base2
Changes since 1.81: +5 -7 lines
Diff to previous 1.81 (colored)

BLKSIZE is always the same as blksize these days, so get rid of it.

Revision 1.81 / (download) - annotate - [select for diffs], Fri Jul 27 10:00:42 2007 UTC (5 years, 9 months ago) by yamt
Branch: MAIN
CVS Tags: yamt-x86pmap-base, nick-csl-alignment-base5, matt-mips64-base, hpcarm-cleanup
Branch point for: yamt-x86pmap, matt-mips64, matt-armv6, jmcneill-pm
Changes since 1.80: +6 -10 lines
Diff to previous 1.80 (colored)

use ubc_uiomove for read as well.

Revision 1.80 / (download) - annotate - [select for diffs], Fri Jul 27 09:50:36 2007 UTC (5 years, 9 months ago) by yamt
Branch: MAIN
Changes since 1.79: +4 -3 lines
Diff to previous 1.79 (colored)

ubc_uiomove: add an "advice" argument rather than using UVM_ADV_RANDOM blindly.

Revision 1.79 / (download) - annotate - [select for diffs], Tue Jun 5 12:31:35 2007 UTC (5 years, 11 months ago) by yamt
Branch: MAIN
CVS Tags: nick-csl-alignment-base, mjf-ufs-trans-base
Branch point for: nick-csl-alignment
Changes since 1.78: +27 -25 lines
Diff to previous 1.78 (colored)

improve post-ubc file overwrite performance in common cases.
ie. when it's safe, actually overwrite blocks rather than doing
read-modify-write.

also fixes PR/33152 and PR/36303.

Revision 1.78 / (download) - annotate - [select for diffs], Thu May 17 07:26:23 2007 UTC (6 years ago) by hannken
Branch: MAIN
CVS Tags: yamt-idlelwp-base8
Changes since 1.77: +4 -6 lines
Diff to previous 1.77 (colored)

Fstrans_start() always returns zero, so change its type to void.

Revision 1.77 / (download) - annotate - [select for diffs], Thu Apr 19 11:05:14 2007 UTC (6 years, 1 month ago) by yamt
Branch: MAIN
Changes since 1.76: +4 -2 lines
Diff to previous 1.76 (colored)

hold proclist_mutex when calling psignal().

Revision 1.76 / (download) - annotate - [select for diffs], Thu Feb 22 06:10:49 2007 UTC (6 years, 3 months ago) by thorpej
Branch: MAIN
CVS Tags: thorpej-atomic-base, thorpej-atomic, reinoud-bufcleanup, ad-audiomp-base, ad-audiomp
Branch point for: vmlocking, mjf-ufs-trans
Changes since 1.75: +8 -8 lines
Diff to previous 1.75 (colored)

TRUE -> true, FALSE -> false

Revision 1.75 / (download) - annotate - [select for diffs], Wed Feb 21 23:00:11 2007 UTC (6 years, 3 months ago) by thorpej
Branch: MAIN
Changes since 1.74: +7 -7 lines
Diff to previous 1.74 (colored)

Replace the Mach-derived boolean_t type with the C99 bool type.  A
future commit will replace use of TRUE and FALSE with true and false.

Revision 1.74 / (download) - annotate - [select for diffs], Mon Jan 29 15:42:50 2007 UTC (6 years, 3 months ago) by hannken
Branch: MAIN
CVS Tags: post-newlock2-merge, newlock2-nbase, newlock2-base
Branch point for: yamt-idlelwp
Changes since 1.73: +4 -4 lines
Diff to previous 1.73 (colored)

Change fstrans enum types to upper case.
No functional change.

From Antti Kantee <pooka@netbsd.org>

Revision 1.73 / (download) - annotate - [select for diffs], Fri Jan 19 14:49:13 2007 UTC (6 years, 4 months ago) by hannken
Branch: MAIN
Changes since 1.72: +14 -2 lines
Diff to previous 1.72 (colored)

New file system suspension API to replace vn_start_write and vn_finished_write.
The suspension helpers are now put into file system specific operations.
This means every file system not supporting these helpers cannot be suspended
and therefore snapshots are no longer possible.

Implemented for file systems of type ffs.

The new API is enabled on a kernel option NEWVNGATE.  This option is
not enabled by default in any kernel config.

Presented and discussed on tech-kern with much input from
Bill Studenmund <wrstuden@netbsd.org> and YAMAMOTO Takashi <yamt@netbsd.org>.

Welcome to 4.99.9 (new vfs op vfs_suspendctl).

Revision 1.72 / (download) - annotate - [select for diffs], Thu Jan 4 16:55:30 2007 UTC (6 years, 4 months ago) by elad
Branch: MAIN
Changes since 1.71: +3 -3 lines
Diff to previous 1.71 (colored)

Consistent usage of KAUTH_GENERIC_ISSUSER.

Revision 1.71 / (download) - annotate - [select for diffs], Sat Oct 14 09:17:26 2006 UTC (6 years, 7 months ago) by yamt
Branch: MAIN
CVS Tags: yamt-splraiseipl-base5, yamt-splraiseipl-base4, yamt-splraiseipl-base3, yamt-splraiseipl-base2, wrstuden-fixsa-newbase, wrstuden-fixsa-base-1, wrstuden-fixsa-base, wrstuden-fixsa, netbsd-4-base, netbsd-4-0-RELEASE, netbsd-4-0-RC5, netbsd-4-0-RC4, netbsd-4-0-RC3, netbsd-4-0-RC2, netbsd-4-0-RC1, netbsd-4-0-1-RELEASE, netbsd-4-0, netbsd-4, matt-nb4-arm-base, matt-nb4-arm
Changes since 1.70: +4 -6 lines
Diff to previous 1.70 (colored)

don't use g_glock directly.

Revision 1.70 / (download) - annotate - [select for diffs], Thu Oct 5 14:48:33 2006 UTC (6 years, 7 months ago) by chs
Branch: MAIN
Changes since 1.69: +14 -3 lines
Diff to previous 1.69 (colored)

add support for O_DIRECT (I/O directly to application memory,
bypassing any kernel caching for file data).

Revision 1.69 / (download) - annotate - [select for diffs], Tue Oct 3 18:24:48 2006 UTC (6 years, 7 months ago) by christos
Branch: MAIN
Changes since 1.68: +4 -2 lines
Diff to previous 1.68 (colored)

Coverity CID 3156: async = TRUE when LFS_READWRITE is defined, leading to
dead code. Ifdef the dead code appropriately (from Arnaud Lacombe)

Revision 1.68 / (download) - annotate - [select for diffs], Sun May 14 21:33:39 2006 UTC (7 years ago) by elad
Branch: MAIN
CVS Tags: yamt-splraiseipl-base, yamt-pdpolicy-base9, yamt-pdpolicy-base8, yamt-pdpolicy-base7, yamt-pdpolicy-base6, yamt-pdpolicy-base5, simonb-timecounters-base, rpaulo-netinet-merge-pcb-base, gdamore-uart-base, gdamore-uart, chap-midi-nbase, chap-midi-base, chap-midi, abandoned-netbsd-4-base, abandoned-netbsd-4
Branch point for: yamt-splraiseipl, newlock2
Changes since 1.67: +7 -6 lines
Diff to previous 1.67 (colored)

integrate kauth.

Revision 1.67 / (download) - annotate - [select for diffs], Wed Mar 1 12:38:33 2006 UTC (7 years, 2 months ago) by yamt
Branch: MAIN
CVS Tags: yamt-pdpolicy-base4, yamt-pdpolicy-base3, yamt-pdpolicy-base2, yamt-pdpolicy-base, peter-altq-base, elad-kernelauth-base
Branch point for: yamt-pdpolicy, peter-altq, elad-kernelauth
Changes since 1.66: +4 -4 lines
Diff to previous 1.66 (colored)

merge yamt-uio_vmspace branch.

- use vmspace rather than proc or lwp where appropriate.
  the latter is more natural to specify an address space.
  (and less likely to be abused for random purposes.)
- fix a swdmover race.

Revision 1.66 / (download) - annotate - [select for diffs], Sun Dec 11 12:25:28 2005 UTC (7 years, 5 months ago) by christos
Branch: MAIN
CVS Tags: yamt-uio_vmspace-base5
Branch point for: yamt-uio_vmspace, simonb-timecounters, rpaulo-netinet-merge-pcb
Changes since 1.65: +8 -8 lines
Diff to previous 1.65 (colored)

merge ktrace-lwp.

Revision 1.65 / (download) - annotate - [select for diffs], Tue Nov 29 22:52:03 2005 UTC (7 years, 5 months ago) by yamt
Branch: MAIN
CVS Tags: ktrace-lwp-base
Changes since 1.64: +6 -4 lines
Diff to previous 1.64 (colored)

merge yamt-readahead branch.

Revision 1.64 / (download) - annotate - [select for diffs], Wed Nov 2 12:39:14 2005 UTC (7 years, 6 months ago) by yamt
Branch: MAIN
CVS Tags: yamt-readahead-base3, yamt-readahead-base2, yamt-readahead-base
Branch point for: yamt-readahead
Changes since 1.63: +6 -6 lines
Diff to previous 1.63 (colored)

merge yamt-vop branch.  remove following VOPs.

	VOP_BLKATOFF
	VOP_VALLOC
	VOP_BALLOC
	VOP_REALLOCBLKS
	VOP_VFREE
	VOP_TRUNCATE
	VOP_UPDATE

Revision 1.63 / (download) - annotate - [select for diffs], Tue Apr 19 20:59:05 2005 UTC (8 years, 1 month ago) by perseant
Branch: MAIN
CVS Tags: yamt-vop-base3, yamt-vop-base2, yamt-vop-base, thorpej-vnode-attr-base, thorpej-vnode-attr, kent-audio2-base
Branch point for: yamt-vop, yamt-lazymbuf
Changes since 1.62: +2 -12 lines
Diff to previous 1.62 (colored)

Keep per-inode, per-fs, and subsystem-wide counts of blocks allocated through
lfs_balloc(), and use that to estimate the number of dirty pages belonging
to LFS (subsystem or filesystem).  This is almost certainly wrong for
the case of a large mmap()ed region, but the accounting is tighter than
what we had before, and performs much better in the typical case of pages
dirtied through write().

Revision 1.62 / (download) - annotate - [select for diffs], Fri Apr 1 21:59:47 2005 UTC (8 years, 1 month ago) by perseant
Branch: MAIN
Changes since 1.61: +4 -2 lines
Diff to previous 1.61 (colored)

Protect various per-fs structures with fs->lfs_interlock simple_lock, to
improve behavior in the multiprocessor case.  Add debugging segment-lock
assertion statements.

Revision 1.61 / (download) - annotate - [select for diffs], Sat Feb 26 05:40:42 2005 UTC (8 years, 2 months ago) by perseant
Branch: MAIN
CVS Tags: yamt-km-base4, yamt-km-base3, netbsd-3-base
Branch point for: netbsd-3
Changes since 1.60: +3 -2 lines
Diff to previous 1.60 (colored)

Various minor LFS improvements:

* Note when lfs_putpages(9) thinks it is not going to be writing any
  pages before calling genfs_putpages(9).  This prevents a situation in
  which blocks can be queued for writing without a segment header.
* Correct computation of NRESERVE(), though it is still a gross
  overestimate in most cases.  Note that if NRESERVE() is too high, it
  may be impossible to create files on the filesystem.  We catch this
  case on filesystem mount and refuse to mount r/w.
* Allow filesystems to be mounted whose block size is == MAXBSIZE.
* Somewhere along the line, ufs_bmaparray(9) started mangling UNWRITTEN
  entries in indirect blocks again, triggering a failed assertion "daddr
  <= LFS_MAX_DADDR".  Explicitly convert to and from int32_t to correct
  this.
* Add a high-water mark for the number of dirty pages any given LFS can
  hold before triggering a flush.  This is settable by sysctl, but off
  (zero) by default.
* Be more careful about the MAX_BYTES and MAX_BUFS computations so we
  shouldn't see "please increase to at least zero" messages.
* Note that VBLK and VCHR vnodes can have nonzero values in di_db[0]
  even though their v_size == 0.  Don't panic when we see this.
* Change lfs_bfree to a signed quantity.  The manner in which it is
  processed before being passed to the cleaner means that sometimes it
  may drop below zero, and the cleaner must be aware of this.
* Never report bfree < 0 (or higher than lfs_dsize) through
  lfs_statvfs(9).  This prevents df(1) from ever telling us that our full
  filesystems have 16TB free.
* Account space allocated through lfs_balloc(9) that does not have
  associated buffer headers, so that the pagedaemon doesn't run us out
  of segments.
* Return ENOSPC from lfs_balloc(9) when bfree drops to zero.
* Address a deadlock in lfs_bmapv/lfs_markv when the filesystem is being
  unmounted.  Because vfs_busy() is a shared lock, and
  lfs_bmapv/lfs_markv mark the filesystem vfs_busy(), the cleaner can be
  holding the lock that umount() is blocking on, then try to vfs_busy()
  again in getnewvnode().

Revision 1.60 / (download) - annotate - [select for diffs], Sun Jan 9 16:42:44 2005 UTC (8 years, 4 months ago) by chs
Branch: MAIN
CVS Tags: yamt-km-base2, yamt-km-base, matt-timespec, kent-audio1-beforemerge
Branch point for: yamt-km, kent-audio2
Changes since 1.59: +10 -8 lines
Diff to previous 1.59 (colored)

adjust the UBC mapping code to support non-vnode uvm_objects.
this means we can no longer look at the vnode size to determine how many
pages to request in a fault, which is good since for NFS the size can change
out from under us on the server anyway.  there's also a new flag UBC_UNMAP
for ubc_release(), so that the file system code can make the decision about
whether to cache mappings for files being used as executables.

Revision 1.59 / (download) - annotate - [select for diffs], Fri Sep 10 09:36:05 2004 UTC (8 years, 8 months ago) by yamt
Branch: MAIN
CVS Tags: kent-audio1-base, kent-audio1
Changes since 1.58: +2 -3 lines
Diff to previous 1.58 (colored)

g/c no longer used definition of fs_maxfilesize.

Revision 1.58 / (download) - annotate - [select for diffs], Sun Aug 15 16:46:18 2004 UTC (8 years, 9 months ago) by mycroft
Branch: MAIN
Changes since 1.57: +4 -6 lines
Diff to previous 1.57 (colored)

Fix some formatting glitches.

Revision 1.57 / (download) - annotate - [select for diffs], Sun Aug 15 16:24:41 2004 UTC (8 years, 9 months ago) by mycroft
Branch: MAIN
Changes since 1.56: +13 -12 lines
Diff to previous 1.56 (colored)

Minor simplification to some arithmetic.

Revision 1.56 / (download) - annotate - [select for diffs], Sun Aug 15 07:19:58 2004 UTC (8 years, 9 months ago) by mycroft
Branch: MAIN
Changes since 1.55: +15 -17 lines
Diff to previous 1.55 (colored)

Fixing age old cruft:
* Rather than using mnt_maxsymlinklen to indicate that a file systems returns
  d_type fields(!), add a new internal flag, IMNT_DTYPE.

Add 3 new elements to ufsmount:
* um_maxsymlinklen, replaces mnt_maxsymlinklen (which never should have existed
  in the first place).
* um_dirblksiz, which tracks the current directory block size, eliminating the
  FS-specific checks littered throughout the code.  This may be used later to
  make the block size variable.
* um_maxfilesize, which is the maximum file size, possibly adjusted lower due
  to implementation issues.

Sync some bug fixes from FFS into ext2fs, particularly:
* ffs_lookup.c 1.21, 1.28, 1.33, 1.48
* ffs_inode.c 1.43, 1.44, 1.45, 1.66, 1.67
* ffs_vnops.c 1.84, 1.85, 1.86

Clean up some crappy pointer frobnication.

Revision 1.55 / (download) - annotate - [select for diffs], Thu Aug 7 16:34:46 2003 UTC (9 years, 9 months ago) by agc
Branch: MAIN
CVS Tags: netbsd-2-base, netbsd-2-0-base, netbsd-2-0-RELEASE, netbsd-2-0-RC5, netbsd-2-0-RC4, netbsd-2-0-RC3, netbsd-2-0-RC2, netbsd-2-0-RC1, netbsd-2-0-3-RELEASE, netbsd-2-0-2-RELEASE, netbsd-2-0-1-RELEASE, netbsd-2-0
Branch point for: netbsd-2
Changes since 1.54: +3 -7 lines
Diff to previous 1.54 (colored)

Move UCB-licensed code from 4-clause to 3-clause licence.

Patches provided by Joel Baker in PR 22364, verified by myself.

Revision 1.54 / (download) - annotate - [select for diffs], Sun Jun 29 22:32:48 2003 UTC (9 years, 10 months ago) by fvdl
Branch: MAIN
Branch point for: ktrace-lwp
Changes since 1.53: +2 -2 lines
Diff to previous 1.53 (colored)

Back out the lwp/ktrace changes. They contained a lot of colateral damage,
and need to be examined and discussed more.

Revision 1.53 / (download) - annotate - [select for diffs], Sat Jun 28 14:22:29 2003 UTC (9 years, 10 months ago) by darrenr
Branch: MAIN
Changes since 1.52: +4 -4 lines
Diff to previous 1.52 (colored)

Pass lwp pointers throughtout the kernel, as required, so that the lwpid can
be inserted into ktrace records.  The general change has been to replace
"struct proc *" with "struct lwp *" in various function prototypes, pass
the lwp through and use l_proc to get the process pointer when needed.

Bump the kernel rev up to 1.6V

Revision 1.52 / (download) - annotate - [select for diffs], Thu May 15 20:25:33 2003 UTC (10 years ago) by kristerw
Branch: MAIN
Changes since 1.51: +4 -4 lines
Diff to previous 1.51 (colored)

The C language does not permit statements of the form
   (X ? Y : Z) = 0;
even though gcc handles this by a stupid extension.

Transform these to correct C.

Approved by fvdl.

Revision 1.51 / (download) - annotate - [select for diffs], Wed Apr 2 10:39:44 2003 UTC (10 years, 1 month ago) by fvdl
Branch: MAIN
Changes since 1.50: +21 -18 lines
Diff to previous 1.50 (colored)

Add support for UFS2. UFS2 is an enhanced FFS, adding support for
64 bit block pointers, extended attribute storage, and a few
other things.

This commit does not yet include the code to manipulate the extended
storage (for e.g. ACLs), this will be done later.

Originally written by Kirk McKusick and Network Associates Laboratories for
FreeBSD.

Revision 1.50 / (download) - annotate - [select for diffs], Sat Mar 15 07:24:37 2003 UTC (10 years, 2 months ago) by perseant
Branch: MAIN
Changes since 1.49: +11 -3 lines
Diff to previous 1.49 (colored)

Add write-behind to lfs_write().

Revision 1.49 / (download) - annotate - [select for diffs], Sat Mar 8 21:52:57 2003 UTC (10 years, 2 months ago) by perseant
Branch: MAIN
Changes since 1.48: +3 -9 lines
Diff to previous 1.48 (colored)

Take away "#ifdef LFS_UBC".

Revision 1.48 / (download) - annotate - [select for diffs], Mon Feb 17 23:48:23 2003 UTC (10 years, 3 months ago) by perseant
Branch: MAIN
Changes since 1.47: +15 -6 lines
Diff to previous 1.47 (colored)

Add code to UBCify LFS.  This is still behind "#ifdef LFS_UBC" for now
(there are still some details to work out) but expect that to go
away soon.  To support these basic changes (creation of lfs_putpages,
lfs_gop_write, mods to lfs_balloc) several other changes were made, to
wit:

* Create a writer daemon kernel thread whose purpose is to handle page
  writes for the pagedaemon, but which also takes over some of the
  functions of lfs_check().  This thread is started the first time an
  LFS is mounted.

* Add a "flags" parameter to GOP_SIZE.  Current values are
  GOP_SIZE_READ, meaning that the call should return the size of the
  in-core version of the file, and GOP_SIZE_WRITE, meaning that it
  should return the on-disk size.  One of GOP_SIZE_READ or
  GOP_SIZE_WRITE must be specified.

* Instead of using malloc(...M_WAITOK) for everything, reserve enough
  resources to get by and use malloc(...M_NOWAIT), using the reserves if
  necessary.  Use the pool subsystem for structures small enough that
  this is feasible.  This also obsoletes LFS_THROTTLE.

And a few that are not strictly necessary:

* Moves the LFS inode extensions off onto a separately allocated
  structure; getting closer to LFS as an LKM.  "Welcome to 1.6O."

* Unified GOP_ALLOC between FFS and LFS.

* Update LFS copyright headers to correct values.

* Actually cast to unsigned in lfs_shellsort, like the comment says.

* Keep track of which segments were empty before the previous
  checkpoint; any segments that pass two checkpoints both dirty and
  empty can be summarily cleaned.  Do this.  Right now lfs_segclean
  still works, but this should be turned into an effectless
  compatibility syscall.

Revision 1.47 / (download) - annotate - [select for diffs], Fri Jan 24 21:55:30 2003 UTC (10 years, 4 months ago) by fvdl
Branch: MAIN
Changes since 1.46: +4 -4 lines
Diff to previous 1.46 (colored)

Bump daddr_t to 64 bits. Replace it with int32_t in all places where
it was used on-disk, so that on-disk formats remain the same.
Remove ufs_daddr_t and ufs_lbn_t for the time being.

Revision 1.46 / (download) - annotate - [select for diffs], Sat Dec 28 14:39:10 2002 UTC (10 years, 4 months ago) by yamt
Branch: MAIN
CVS Tags: nathanw_sa_before_merge, nathanw_sa_base, fvdl_fs64_base
Changes since 1.45: +5 -5 lines
Diff to previous 1.45 (colored)

- in lfs_reserve, vref vnodes that we're locking so that cleaner doesn't
  try to reclaim them.
  (workaround for deadlock noted in the comment in lfs_reserveavail)
- in lfs_rename, mark vnodes which are being moved as well as directry vnodes.

Revision 1.45 / (download) - annotate - [select for diffs], Thu Dec 26 13:37:21 2002 UTC (10 years, 4 months ago) by yamt
Branch: MAIN
Changes since 1.44: +22 -6 lines
Diff to previous 1.44 (colored)

- in lfs_reserve, reserve locked buffer count as well.
- don't wait for locking buf in lfs_bwrite_ext to avoid deadlocks.
- skip lfs_reserve when we're doing dirop.
  reserve more (for lfs_truncate) in set_dirop instead.

this mostly solves PR 18972. (and hopefully PR 19196)

Revision 1.44 / (download) - annotate - [select for diffs], Wed Oct 23 09:15:08 2002 UTC (10 years, 7 months ago) by jdolecek
Branch: MAIN
CVS Tags: kqueue-aftermerge, gmcgarry_ucred_base, gmcgarry_ucred, gmcgarry_ctxsw_base, gmcgarry_ctxsw
Changes since 1.43: +7 -2 lines
Diff to previous 1.43 (colored)

merge kqueue branch into -current

kqueue provides a stateful and efficient event notification framework
currently supported events include socket, file, directory, fifo,
pipe, tty and device changes, and monitoring of processes and signals

kqueue is supported by all writable filesystems in NetBSD tree
(with exception of Coda) and all device drivers supporting poll(2)

based on work done by Jonathan Lemon for FreeBSD
initial NetBSD port done by Luke Mewburn and Jason Thorpe

Revision 1.43 / (download) - annotate - [select for diffs], Fri Oct 18 01:05:52 2002 UTC (10 years, 7 months ago) by yamt
Branch: MAIN
CVS Tags: kqueue-beforemerge
Changes since 1.42: +26 -9 lines
Diff to previous 1.42 (colored)

make sure to update the vnode's size even if uiomove failed.
otherwise, softdep states can't be flushed later.

ok'ed by Chuck Silvers. fix PR/16670.

Revision 1.42 / (download) - annotate - [select for diffs], Mon Mar 25 02:23:56 2002 UTC (11 years, 2 months ago) by chs
Branch: MAIN
CVS Tags: netbsd-1-6-base, netbsd-1-6-RELEASE, netbsd-1-6-RC3, netbsd-1-6-RC2, netbsd-1-6-RC1, kqueue-base, gehenna-devsw-base, gehenna-devsw
Branch point for: netbsd-1-6
Changes since 1.41: +4 -2 lines
Diff to previous 1.41 (colored)

if the size argument to write(2) is 0, do not modify the file in any way,
including updating timestamps.  required for standards conformance.

Revision 1.41 / (download) - annotate - [select for diffs], Fri Mar 22 03:57:35 2002 UTC (11 years, 2 months ago) by chs
Branch: MAIN
Changes since 1.40: +6 -3 lines
Diff to previous 1.40 (colored)

in lfs_write(), flush and invalidate any page cache pages in the range
that we're about to modify.  this weak attempt at coherency is enough
to make some applications (eg. "tail -f") happy, so it's worth having.

Revision 1.40 / (download) - annotate - [select for diffs], Fri Nov 30 07:05:55 2001 UTC (11 years, 5 months ago) by chs
Branch: MAIN
CVS Tags: newlock-base, newlock, ifpoll-base, ifpoll, eeh-devprop-base, eeh-devprop
Changes since 1.39: +6 -5 lines
Diff to previous 1.39 (colored)

VOP_PUTPAGES() requires page-aligned offsets, so be sure to provide such.
fixes PR 14759.

(while I'm here, call VOP_PUTPAGES() directly instead of indirecting through
the UVM pager op vector.)

Revision 1.39 / (download) - annotate - [select for diffs], Sat Nov 17 07:22:34 2001 UTC (11 years, 6 months ago) by simonb
Branch: MAIN
Changes since 1.38: +3 -3 lines
Diff to previous 1.38 (colored)

Set `flags' before being used in the WRITE() function.

Revision 1.38 / (download) - annotate - [select for diffs], Thu Nov 8 05:24:52 2001 UTC (11 years, 6 months ago) by chs
Branch: MAIN
CVS Tags: thorpej-mips-cache-base
Changes since 1.37: +14 -10 lines
Diff to previous 1.37 (colored)

in both paths that can cause fragments to be expanded (write and truncate-up),
deal with the fragment expansion separately before the rest of the operation.
this allows us to simplify ufs_balloc_range() by not worrying about implicit
fragment expansion.

call VOP_PUTPAGES() directly for vnodes instead of
going through the UVM pager "put" vector.

Revision 1.37 / (download) - annotate - [select for diffs], Thu Nov 8 02:42:31 2001 UTC (11 years, 6 months ago) by lukem
Branch: MAIN
Changes since 1.36: +4 -1 lines
Diff to previous 1.36 (colored)

add RCSID. (note; this file gets #included)

Revision 1.36 / (download) - annotate - [select for diffs], Wed Oct 3 05:02:46 2001 UTC (11 years, 7 months ago) by chs
Branch: MAIN
CVS Tags: thorpej-devvp-base3
Branch point for: thorpej-mips-cache
Changes since 1.35: +4 -2 lines
Diff to previous 1.35 (colored)

don't do any flush-behind for async mounts.
this matches the traditional behaviour.

Revision 1.35 / (download) - annotate - [select for diffs], Sun Sep 30 02:50:35 2001 UTC (11 years, 7 months ago) by chs
Branch: MAIN
CVS Tags: thorpej-devvp-base2
Changes since 1.34: +3 -9 lines
Diff to previous 1.34 (colored)

process one block at a time even when we're using the write fast path
that avoids zeroing pages.  this avoids a mess when we get ENOSPC and
softdeps are enabled.

Revision 1.34 / (download) - annotate - [select for diffs], Sun Sep 16 13:57:56 2001 UTC (11 years, 8 months ago) by chs
Branch: MAIN
Changes since 1.33: +11 -3 lines
Diff to previous 1.33 (colored)

make LFS work again.

Revision 1.33 / (download) - annotate - [select for diffs], Sat Sep 15 20:36:44 2001 UTC (11 years, 8 months ago) by chs
Branch: MAIN
CVS Tags: post-chs-ubcperf
Changes since 1.32: +102 -76 lines
Diff to previous 1.32 (colored)

a whole bunch of changes to improve performance and robustness under load:

 - remove special treatment of pager_map mappings in pmaps.  this is
   required now, since I've removed the globals that expose the address range.
   pager_map now uses pmap_kenter_pa() instead of pmap_enter(), so there's
   no longer any need to special-case it.
 - eliminate struct uvm_vnode by moving its fields into struct vnode.
 - rewrite the pageout path.  the pager is now responsible for handling the
   high-level requests instead of only getting control after a bunch of work
   has already been done on its behalf.  this will allow us to UBCify LFS,
   which needs tighter control over its pages than other filesystems do.
   writing a page to disk no longer requires making it read-only, which
   allows us to write wired pages without causing all kinds of havoc.
 - use a new PG_PAGEOUT flag to indicate that a page should be freed
   on behalf of the pagedaemon when it's unlocked.  this flag is very similar
   to PG_RELEASED, but unlike PG_RELEASED, PG_PAGEOUT can be cleared if the
   pageout fails due to eg. an indirect-block buffer being locked.
   this allows us to remove the "version" field from struct vm_page,
   and together with shrinking "loan_count" from 32 bits to 16,
   struct vm_page is now 4 bytes smaller.
 - no longer use PG_RELEASED for swap-backed pages.  if the page is busy
   because it's being paged out, we can't release the swap slot to be
   reallocated until that write is complete, but unlike with vnodes we
   don't keep a count of in-progress writes so there's no good way to
   know when the write is done.  instead, when we need to free a busy
   swap-backed page, just sleep until we can get it busy ourselves.
 - implement a fast-path for extending writes which allows us to avoid
   zeroing new pages.  this substantially reduces cpu usage.
 - encapsulate the data used by the genfs code in a struct genfs_node,
   which must be the first element of the filesystem-specific vnode data
   for filesystems which use genfs_{get,put}pages().
 - eliminate many of the UVM pagerops, since they aren't needed anymore
   now that the pager "put" operation is a higher-level operation.
 - enhance the genfs code to allow NFS to use the genfs_{get,put}pages
   instead of a modified copy.
 - clean up struct vnode by removing all the fields that used to be used by
   the vfs_cluster.c code (which we don't use anymore with UBC).
 - remove kmem_object and mb_object since they were useless.
   instead of allocating pages to these objects, we now just allocate
   pages with no object.  such pages are mapped in the kernel until they
   are freed, so we can use the mapping to find the page to free it.
   this allows us to remove splvm() protection in several places.

The sum of all these changes improves write throughput on my
decstation 5000/200 to within 1% of the rate of NetBSD 1.5
and reduces the elapsed time for "make release" of a NetBSD 1.5
source tree on my 128MB pc to 10% less than a 1.5 kernel took.

Revision 1.32 / (download) - annotate - [select for diffs], Fri Jul 13 20:30:26 2001 UTC (11 years, 10 months ago) by perseant
Branch: MAIN
CVS Tags: thorpej-devvp-base, pre-chs-ubcperf
Branch point for: thorpej-devvp
Changes since 1.31: +3 -3 lines
Diff to previous 1.31 (colored)

Merge the short-lived perseant-lfsv2 branch into the trunk.

Kernels and tools understand both v1 and v2 filesystems; newfs_lfs
generates v2 by default.  Changes for the v2 layout include:

- Segments of non-PO2 size and arbitrary block offset, so these can be
  matched to convenient physical characteristics of the partition (e.g.,
  stripe or track size and offset).

- Address by fragment instead of by disk sector, paving the way for
  non-512-byte-sector devices.  In theory fragments can be as large
  as you like, though in reality they must be smaller than MAXBSIZE in size.

- Use serial number and filesystem identifier to ensure that roll-forward
  doesn't get old data and think it's new.  Roll-forward is enabled for
  v2 filesystems, though not for v1 filesystems by default.

- The inode free list is now a tailq, paving the way for undelete (undelete
  is not yet implemented, but can be without further non-backwards-compatible
  changes to disk structures).

- Inode atime information is kept in the Ifile, instead of on the inode;
  that is, the inode is never written *just* because atime was changed.
  Because of this the inodes remain near the file data on the disk, rather
  than wandering all over as the disk is read repeatedly.  This speeds up
  repeated reads by a small but noticeable amount.

Other changes of note include:

- The ifile written by newfs_lfs can now be of arbitrary length, it is no
  longer restricted to a single indirect block.

- Fixed an old bug where ctime was changed every time a vnode was created.
  I need to look more closely to make sure that the times are only updated
  during write(2) and friends, not after-the-fact during a segment write,
  and certainly not by the cleaner.

Revision 1.31 / (download) - annotate - [select for diffs], Mon Mar 26 06:47:34 2001 UTC (12 years, 2 months ago) by chs
Branch: MAIN
CVS Tags: thorpej_scsipi_nbase, thorpej_scsipi_beforemerge, thorpej_scsipi_base, perseant-lfsv2-base
Branch point for: perseant-lfsv2, kqueue
Changes since 1.30: +19 -1 lines
Diff to previous 1.30 (colored)

work around a problem with sync writes vs. softdeps.

Revision 1.30 / (download) - annotate - [select for diffs], Tue Feb 27 04:37:47 2001 UTC (12 years, 2 months ago) by chs
Branch: MAIN
Branch point for: nathanw_sa
Changes since 1.29: +5 -5 lines
Diff to previous 1.29 (colored)

min() -> MIN(), max() -> MAX().
fixes more problems with file offsets > 4GB.

Revision 1.29 / (download) - annotate - [select for diffs], Mon Feb 26 17:12:08 2001 UTC (12 years, 2 months ago) by lukem
Branch: MAIN
Changes since 1.28: +6 -8 lines
Diff to previous 1.28 (colored)

some KNF

Revision 1.28 / (download) - annotate - [select for diffs], Mon Nov 27 08:39:57 2000 UTC (12 years, 5 months ago) by chs
Branch: MAIN
Changes since 1.27: +114 -25 lines
Diff to previous 1.27 (colored)

Initial integration of the Unified Buffer Cache project.

Revision 1.27 / (download) - annotate - [select for diffs], Sat Sep 9 04:49:55 2000 UTC (12 years, 8 months ago) by perseant
Branch: MAIN
Changes since 1.26: +5 -1 lines
Diff to previous 1.26 (colored)

Various bug-fixes to LFS, to wit:


Kernel:

* Add runtime quantity lfs_ravail, the number of disk-blocks reserved
  for writing.  Writes to the filesystem first reserve a maximum amount
  of blocks before their write is allowed to proceed; after the blocks
  are allocated the reserved total is reduced by a corresponding amount.

  If the lfs_reserve function cannot immediately reserve the requested
  number of blocks, the inode is unlocked, and the thread sleeps until
  the cleaner has made enough space available for the blocks to be
  reserved.  In this way large files can be written to the filesystem
  (or, smaller files can be written to a nearly-full but thoroughly
  clean filesystem) and the cleaner can still function properly.

* Remove explicit switching on dlfs_minfreeseg from the kernel code; it
  is now merely a fs-creation parameter used to compute dlfs_avail and
  dlfs_bfree (and used by fsck_lfs(8) to check their accuracy).  Its
  former role is better assumed by a properly computed dlfs_avail.

* Bounds-check inode numbers submitted through lfs_bmapv and lfs_markv.
  This prevents a panic, but, if the cleaner is feeding the filesystem
  the wrong data, you are still in a world of hurt.

* Cleanup: remove explicit references of DEV_BSIZE in favor of
  btodb()/dbtob().

lfs_cleanerd:

* Make -n mean "send N segments' blocks through a single call to
  lfs_markv".  Previously it had meant "clean N segments though N calls
  to lfs_markv, before looking again to see if more need to be cleaned".
  The new behavior gives better packing of direct data on disk with as
  little metadata as possible, largely alleviating the problem that the
  cleaner can consume more disk through inefficient use of metadata than
  it frees by moving dirty data away from clean "holes" to produce
  entirely clean segments.

* Make -b mean "read as many segments as necessary to write N segments
  of dirty data back to disk", rather than its former meaning of "read
  as many segments as necessary to free N segments worth of space".  The
  new meaning, combined with the new -n behavior described above,
  further aids in cleaning storage efficiency as entire segments can be
  written at once, using as few blocks as possible for segment summaries
  and inode blocks.

* Make the cleaner take note of segments which could not be cleaned due
  to error, and not attempt to clean them until they are entirely free
  of dirty blocks.  This prevents the case in which a cleanerd running
  with -n 1 and without -b (formerly the default) would spin trying
  repeatedly to clean a corrupt segment, while the remaining space
  filled and deadlocked the filesystem.

* Update the lfs_cleanerd manual page to describe all the options,
  including the changes mentioned here (in particular, the -b and -n
  flags were previously undocumented).

fsck_lfs:

* Check, and optionally fix, lfs_avail (to an exact figure) and
  lfs_bfree (within a margin of error) in pass 5.

newfs_lfs:

* Reduce the default dlfs_minfreeseg to 1/20 of the total segments.

* Add a warning if the sgs disklabel field is 16 (the default for FFS'
  cpg, but not usually desirable for LFS' sgs: 5--8 is a better range).

* Change the calculation of lfs_avail and lfs_bfree, corresponding to
  the kernel changes mentioned above.

mount_lfs:

* Add -N and -b options to pass corresponding -n and -b options to
  lfs_cleanerd.

* Default to calling lfs_cleanerd with "-b -n 4".


[All of these changes were largely tested in the 1.5 branch, with the
idea that they (along with previous un-pulled-up work) could be applied
to the branch while it was still in ALPHA2; however my test system has
experienced corruption on another filesystem (/dev/console has gone
missing :^), and, while I believe this unrelated to the LFS changes, I
cannot with good conscience request that the changes be pulled up.]

Revision 1.26 / (download) - annotate - [select for diffs], Sat May 27 00:19:55 2000 UTC (13 years ago) by perseant
Branch: MAIN
CVS Tags: netbsd-1-5-base, netbsd-1-5-ALPHA2, minoura-xpg4dl-base, minoura-xpg4dl
Branch point for: netbsd-1-5
Changes since 1.25: +8 -1 lines
Diff to previous 1.25 (colored)

Prevent dirops from getting around lfs_check and wedging the buffer cache.
All the dirop vnops now mark the inodes with a new flag, IN_ADIROP, which
is removed as soon as the dirop is done (as opposed to VDIROP which stays
until the file is written).  To address one issue raised in PR#9357.

Revision 1.25 / (download) - annotate - [select for diffs], Sat May 13 23:43:16 2000 UTC (13 years ago) by perseant
Branch: MAIN
Changes since 1.24: +3 -3 lines
Diff to previous 1.24 (colored)

Change the sementics of the last parameter from a boolean ("waitfor") to
a set of flags ("flags").  Two flags are defined, UPDATE_WAIT and
UPDATE_DIROP.

Under the old semantics, VOP_UPDATE would block if waitfor were set,
under the assumption that directory operations should be done
synchronously.  At least LFS and FFS+softdep do not make this
assumption; FFS+softdep got around the problem by enclosing all relevant
calls to VOP_UPDATE in a "if(!DOINGSOFTDEP(vp))", while LFS simply
ignored waitfor, one of the reasons why NFS-serving an LFS filesystem
did not work properly.

Under the new semantics, the UPDATE_DIROP flag is a hint to the
fs-specific update routine that the call comes from a dirop routine, and
should be wait for, or not, accordingly.

Closes PR#8996.

Revision 1.24 / (download) - annotate - [select for diffs], Thu Mar 30 12:41:15 2000 UTC (13 years, 1 month ago) by augustss
Branch: MAIN
Changes since 1.23: +9 -9 lines
Diff to previous 1.23 (colored)

Remove register declarations.

Revision 1.23 / (download) - annotate - [select for diffs], Mon Nov 15 18:49:15 1999 UTC (13 years, 6 months ago) by fvdl
Branch: MAIN
CVS Tags: wrstuden-devbsize-base, wrstuden-devbsize-19991221, chs-ubc2-newbase
Changes since 1.22: +4 -8 lines
Diff to previous 1.22 (colored)

Add Kirk McKusick's soft updates code to the trunk. Not enabled by
default, as the copyright on the main file (ffs_softdep.c) is such
that is has been put into gnusrc. options SOFTDEP will pull this
in. This code also contains the trickle syncer.

Bump version number to 1.4O

Revision 1.22 / (download) - annotate - [select for diffs], Wed Mar 24 05:51:31 1999 UTC (14 years, 2 months ago) by mrg
Branch: MAIN
CVS Tags: netbsd-1-4-base, netbsd-1-4-RELEASE, netbsd-1-4-PATCH003, netbsd-1-4-PATCH002, netbsd-1-4-PATCH001, netbsd-1-4, kame_14_19990705, kame_14_19990628, kame_141_19991130, kame, fvdl-softdep-base, comdex-fall-1999-base, comdex-fall-1999, chs-ubc2-base
Branch point for: wrstuden-devbsize, thorpej_scsipi, fvdl-softdep, chs-ubc2
Changes since 1.21: +1 -13 lines
Diff to previous 1.21 (colored)

completely remove Mach VM support.  all that is left is the all the
header files as UVM still uses (most of) these.

Revision 1.21 / (download) - annotate - [select for diffs], Wed Mar 10 00:00:32 1999 UTC (14 years, 2 months ago) by perseant
Branch: MAIN
Changes since 1.20: +3 -3 lines
Diff to previous 1.20 (colored)

Added flags to lfs_check call

Revision 1.20 / (download) - annotate - [select for diffs], Fri Mar 5 21:09:50 1999 UTC (14 years, 2 months ago) by mycroft
Branch: MAIN
Changes since 1.19: +5 -11 lines
Diff to previous 1.19 (colored)

Pass null pointers to VOP_UPDATE rather than having all the callers fetch the
current time themselves.

Revision 1.19 / (download) - annotate - [select for diffs], Wed Feb 10 13:14:10 1999 UTC (14 years, 3 months ago) by bouyer
Branch: MAIN
Changes since 1.18: +2 -2 lines
Diff to previous 1.18 (colored)

Make sure a buffer optained from bread() is always bresle()'d in case of
error. Closes PR kern/1448 from Wolfgang Solfrank.

Revision 1.18 / (download) - annotate - [select for diffs], Sun Aug 2 18:57:24 1998 UTC (14 years, 9 months ago) by kleink
Branch: MAIN
CVS Tags: kenh-if-detach-base, kenh-if-detach, chs-ubc-base
Branch point for: chs-ubc
Changes since 1.17: +9 -3 lines
Diff to previous 1.17 (colored)

Implement support for IEEE Std 1003.1b-1993 synchronous I/O:
* in the read vnode operator, check for IO_SYNC being set in the ioflag and
  synchronously update the file's meta-data if appropriate.
* in the write vnode operator, update the appropriate checks for IO_SYNC being
  set in the ioflag to reflect that IO_DSYNC is now inclusive-or'ed into
  IO_SYNC, and require all IO_SYNC bits to be set for operations defined by
  synchronized I/O file integrity completion but not by synchronized I/O data
  integrity completion.

Revision 1.17 / (download) - annotate - [select for diffs], Tue Jun 9 07:46:34 1998 UTC (14 years, 11 months ago) by scottr
Branch: MAIN
CVS Tags: eeh-paddr_t-base, eeh-paddr_t
Changes since 1.16: +3 -1 lines
Diff to previous 1.16 (colored)

Protect various config(8)-generated files from inclusion while
building LKMs.  Fixes PR 5557.

Revision 1.16 / (download) - annotate - [select for diffs], Sun Mar 1 02:23:37 1998 UTC (15 years, 2 months ago) by fvdl
Branch: MAIN
Changes since 1.15: +6 -6 lines
Diff to previous 1.15 (colored)

Merge with Lite2 + local changes

Revision 1.15 / (download) - annotate - [select for diffs], Tue Feb 10 14:10:59 1998 UTC (15 years, 3 months ago) by mrg
Branch: MAIN
Changes since 1.14: +3 -1 lines
Diff to previous 1.14 (colored)

- add defopt's for UVM, UVMHIST and PMAP_NEW.
- remove unnecessary UVMHIST_DECL's.

Revision 1.14 / (download) - annotate - [select for diffs], Thu Feb 5 08:00:36 1998 UTC (15 years, 3 months ago) by mrg
Branch: MAIN
Changes since 1.13: +9 -1 lines
Diff to previous 1.13 (colored)

initial import of the new virtual memory system, UVM, into -current.

UVM was written by chuck cranor <chuck@maria.wustl.edu>, with some
minor portions derived from the old Mach code.  i provided some help
getting swap and paging working, and other bug fixes/ideas.  chuck
silvers <chuq@chuq.com> also provided some other fixes.

this is the rest of the MI portion changes.

this will be KNF'd shortly.  :-)

Revision 1.13 / (download) - annotate - [select for diffs], Fri Jul 4 20:22:19 1997 UTC (15 years, 10 months ago) by drochner
Branch: MAIN
CVS Tags: thorpej-signal-base, thorpej-signal, netbsd-1-3-base, netbsd-1-3-RELEASE, netbsd-1-3-PATCH003-CANDIDATE2, netbsd-1-3-PATCH003-CANDIDATE1, netbsd-1-3-PATCH003-CANDIDATE0, netbsd-1-3-PATCH003, netbsd-1-3-PATCH002, netbsd-1-3-PATCH001, netbsd-1-3-BETA, netbsd-1-3, marc-pcmcia-bp, marc-pcmcia-base, marc-pcmcia
Changes since 1.12: +2 -2 lines
Diff to previous 1.12 (colored)

Don't cast 64bit (off_t) file sizes to vm_offset_t (32bit on many
architectures), truncate them intelligently instead.
The truncation is done centralized in vnode_pager.c.
This prevents from wrap-over effects when parts of large (>2^32 byte) files
are mmapped.
Don't allow to mmap above the numerical range of vm_offset_t.
This is considered a temporary solution until the vm system handles the
object sizes/offsets more cleanly.

Revision 1.12 / (download) - annotate - [select for diffs], Wed Jun 11 10:10:16 1997 UTC (15 years, 11 months ago) by bouyer
Branch: MAIN
CVS Tags: bouyer-scsipi
Changes since 1.11: +16 -16 lines
Diff to previous 1.11 (colored)

Add support for ext2fs, this needed a few modifications to ufs/ufs/inode.h:
- added an "union inode_ext" to struct inode, for the per-fs extentions.
  For now only ext2fs uses it.
- i_din is now an union:
	union {
		struct  dinode ffs_din; /* 128 bytes of the on-disk dinode. */
		struct ext2fs_dinode e2fs_din; /* 128 bytes of the on-disk dinode. */
	} i_din
  Added a lot of #define i_ffs_* and i_e2fs_* to access the fields.
- Added two macros: FFS_ITIMES and EXT2FS_ITIMES. ITIMES calls the rigth
  macro, depending on the time of the inode. ITIMES is used where necessary,
  FFS_ITIMES and EXT2FS_ITIMES in other places.

Revision 1.11 / (download) - annotate - [select for diffs], Fri Apr 4 14:21:32 1997 UTC (16 years, 1 month ago) by kleink
Branch: MAIN
Changes since 1.10: +3 -1 lines
Diff to previous 1.10 (colored)

Return immediately upon zero byte reads, as updating st_atime in this case
violates POSIX.1 read() semantics.

Revision 1.10 / (download) - annotate - [select for diffs], Thu Jan 30 09:52:26 1997 UTC (16 years, 3 months ago) by tls
Branch: MAIN
CVS Tags: mrg-vm-swap, is-newarp-before-merge, is-newarp-base, is-newarp
Changes since 1.9: +3 -2 lines
Diff to previous 1.9 (colored)

add support for noatime mount flag

Revision 1.9 / (download) - annotate - [select for diffs], Sat May 11 18:27:57 1996 UTC (17 years ago) by mycroft
Branch: MAIN
CVS Tags: thorpej-setroot, netbsd-1-2-base, netbsd-1-2-RELEASE, netbsd-1-2-PATCH001, netbsd-1-2-BETA, netbsd-1-2
Changes since 1.8: +6 -4 lines
Diff to previous 1.8 (colored)

Change VOP_UPDATE() semantics:
* Make 2nd and 3rd args timespecs, not timevals.
* Consistently pass a Boolean as the 4th arg (except in LFS).
Also, fix ffs_update() and lfs_update() to actually change the nsec fields.

Revision 1.8 / (download) - annotate - [select for diffs], Fri Feb 9 22:36:11 1996 UTC (17 years, 3 months ago) by christos
Branch: MAIN
Changes since 1.7: +14 -9 lines
Diff to previous 1.7 (colored)

ufs prototype changes

Revision 1.7 / (download) - annotate - [select for diffs], Mon Jul 24 21:20:53 1995 UTC (17 years, 10 months ago) by cgd
Branch: MAIN
CVS Tags: netbsd-1-1-base, netbsd-1-1-RELEASE, netbsd-1-1-PATCH001, netbsd-1-1
Changes since 1.6: +2 -8 lines
Diff to previous 1.6 (colored)

avoid unnecessary aging of buffers.  This used to make sense, when buffer
caches were much smaller, but makes little sense now, and will become more
useless as RAM (and buffer cache) sizes grow.  Suggested by Bob Baron.

Revision 1.6 / (download) - annotate - [select for diffs], Fri Mar 24 15:33:31 1995 UTC (18 years, 2 months ago) by cgd
Branch: MAIN
Changes since 1.5: +3 -2 lines
Diff to previous 1.5 (colored)

explicitly cast &time to (struct timeval *) when passing it to VOP_UPDATE.
new prototypes and picky compilers make a volatile mess.

Revision 1.5 / (download) - annotate - [select for diffs], Wed Dec 14 13:04:02 1994 UTC (18 years, 5 months ago) by mycroft
Branch: MAIN
Changes since 1.4: +3 -3 lines
Diff to previous 1.4 (colored)

Sync with CSRG.

Revision 1.4 / (download) - annotate - [select for diffs], Thu Oct 20 04:21:23 1994 UTC (18 years, 7 months ago) by cgd
Branch: MAIN
Changes since 1.3: +3 -3 lines
Diff to previous 1.3 (colored)

update for new syscall args description mechanism, and deal safely
with wider types.

Revision 1.3 / (download) - annotate - [select for diffs], Wed Jun 29 06:47:32 1994 UTC (18 years, 10 months ago) by cgd
Branch: MAIN
CVS Tags: netbsd-1-0-base, netbsd-1-0-RELEASE, netbsd-1-0-PATCH1, netbsd-1-0-PATCH06, netbsd-1-0-PATCH05, netbsd-1-0-PATCH04, netbsd-1-0-PATCH03, netbsd-1-0-PATCH02, netbsd-1-0-PATCH0, netbsd-1-0
Changes since 1.2: +3 -2 lines
Diff to previous 1.2 (colored)

New RCS ID's, take two.  they're more aesthecially pleasant, and use 'NetBSD'

Revision 1.2 / (download) - annotate - [select for diffs], Tue Jun 14 22:56:33 1994 UTC (18 years, 11 months ago) by mycroft
Branch: MAIN
Changes since 1.1: +3 -2 lines
Diff to previous 1.1 (colored)

Fix compatibility with old fastlinks.

Revision 1.1 / (download) - annotate - [select for diffs], Wed Jun 8 11:43:21 1994 UTC (18 years, 11 months ago) by mycroft
Branch: MAIN

Update to 4.4-Lite fs code, with local changes.

This form allows you to request diff's between any two revisions of a file. You may select a symbolic revision name using the selection box or you may type in a numeric name using the type-in text box.




CVSweb <webmaster@jp.NetBSD.org>