<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Is Your Storage Bloated?</title>
	<atom:link href="http://www.j2fi.net/2008/04/17/is-your-storage-bloated/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.j2fi.net/2008/04/17/is-your-storage-bloated/</link>
	<description>Yet Another Gaijin In Japan</description>
	<pubDate>Wed, 20 Aug 2008 15:21:53 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6</generator>
		<item>
		<title>By: Jason</title>
		<link>http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1274</link>
		<dc:creator>Jason</dc:creator>
		<pubDate>Thu, 17 Apr 2008 22:13:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1274</guid>
		<description>Hmm ... I've had similar experiences to the label printing program in a previous life.  A specification would be written and approved then, after most of the software had been written with a very particular goal in mind, the company would change their mind and say "No, let's do it like this instead."

It's decision processes like this that quickly turn PMs, DBAs and APs into bitter people who will begin to plan for every eventuality, regardless of what the company actually wants.

Not that this is a bad thing, of course.

When it comes to blog entries, you're right that they don't take up very much space.  If I were to have just my entries and comments in a MySQL database, the total "cost" would be less than 400 kB.  With the size and cost of storage, this number is barely a concern.

But can the same be said about other online databases?  Twitter (whose popularity still confuses me) seems to have the same quality of content as we'd find on an internet chat room.  Do we really want to indefinately store something as trivial as [ "Went grocery shopping today.  The parking lot was packed" - April 18, 2008 ] ?

Perhaps I should have used that web application as an example rather than a blog :P

Archaeologists might be interested to know what our lives were like in excrutiating detail, but it will be another century before "the dawn of the internet and its effects on the human race" is an area of study at schools.

What I like about this subject is that there is no right answer, and it opens the door to a conversation of what we'd like to store, and for how long.</description>
		<content:encoded><![CDATA[<p>Hmm &#8230; I&#8217;ve had similar experiences to the label printing program in a previous life.  A specification would be written and approved then, after most of the software had been written with a very particular goal in mind, the company would change their mind and say &#8220;No, let&#8217;s do it like this instead.&#8221;</p>
<p>It&#8217;s decision processes like this that quickly turn PMs, DBAs and APs into bitter people who will begin to plan for every eventuality, regardless of what the company actually wants.</p>
<p>Not that this is a bad thing, of course.</p>
<p>When it comes to blog entries, you&#8217;re right that they don&#8217;t take up very much space.  If I were to have just my entries and comments in a MySQL database, the total &#8220;cost&#8221; would be less than 400 kB.  With the size and cost of storage, this number is barely a concern.</p>
<p>But can the same be said about other online databases?  Twitter (whose popularity still confuses me) seems to have the same quality of content as we&#8217;d find on an internet chat room.  Do we really want to indefinately store something as trivial as [ "Went grocery shopping today.  The parking lot was packed" - April 18, 2008 ] ?</p>
<p>Perhaps I should have used that web application as an example rather than a blog <img src='http://www.j2fi.net/wp-includes/images/smilies/icon_razz.gif' alt=':P' class='wp-smiley' /><br />
Archaeologists might be interested to know what our lives were like in excrutiating detail, but it will be another century before &#8220;the dawn of the internet and its effects on the human race&#8221; is an area of study at schools.</p>
<p>What I like about this subject is that there is no right answer, and it opens the door to a conversation of what we&#8217;d like to store, and for how long.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian Dunbar</title>
		<link>http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1273</link>
		<dc:creator>Brian Dunbar</dc:creator>
		<pubDate>Thu, 17 Apr 2008 19:13:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1273</guid>
		<description>&lt;i&gt;We can certainly understand why a company wouldn’t want to archive and remove data from a central location, as you never know when you’ll need to know the detailed inventory transactions for a specific box of doo-dads that you had purchased in 1997.&lt;/i&gt;

Most of the time the need to retain this data isn't to get at the record so much as for legal purposes.  Between contractual agreements with customers, legal requirements for various alphabet letter legislation and so on ... some of our data needs to be retained for up to a decade.

Other times it's just plain lack of foresight.  I used to own a label printing application.  Now this is a high-powered beast that gets data from the ERP system and sends jobs to hundreds of printers world-wide.  Yet .. they're just _labels_ with extracted data from elsewhere.

We were going to get rid of the data as soon as it was printed.  Then the users dropped in a requirement for label re-print without referring to ERP.

At that point the PM sort of threw up his hands and decided we'd revisit the get rid of the data problem later.  It's been five years now and there are records in that db that date from the very beginning of that system.

But .. blog entries.  As long as google has no problem with me it's not a biggie.  When I run it on my own server I'll worry about it.  Or maybe not - how many mb does a blog post take up in mysql?</description>
		<content:encoded><![CDATA[<p><i>We can certainly understand why a company wouldn’t want to archive and remove data from a central location, as you never know when you’ll need to know the detailed inventory transactions for a specific box of doo-dads that you had purchased in 1997.</i></p>
<p>Most of the time the need to retain this data isn&#8217;t to get at the record so much as for legal purposes.  Between contractual agreements with customers, legal requirements for various alphabet letter legislation and so on &#8230; some of our data needs to be retained for up to a decade.</p>
<p>Other times it&#8217;s just plain lack of foresight.  I used to own a label printing application.  Now this is a high-powered beast that gets data from the ERP system and sends jobs to hundreds of printers world-wide.  Yet .. they&#8217;re just _labels_ with extracted data from elsewhere.</p>
<p>We were going to get rid of the data as soon as it was printed.  Then the users dropped in a requirement for label re-print without referring to ERP.</p>
<p>At that point the PM sort of threw up his hands and decided we&#8217;d revisit the get rid of the data problem later.  It&#8217;s been five years now and there are records in that db that date from the very beginning of that system.</p>
<p>But .. blog entries.  As long as google has no problem with me it&#8217;s not a biggie.  When I run it on my own server I&#8217;ll worry about it.  Or maybe not - how many mb does a blog post take up in mysql?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason</title>
		<link>http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1272</link>
		<dc:creator>Jason</dc:creator>
		<pubDate>Thu, 17 Apr 2008 14:28:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1272</guid>
		<description>Hmm ... so a corporate database that is used by hundreds of people to report on thousands of possible sales or inventory variations can have a subset of its own data, with the archived daata remaining available on a slower or otherwise secondary system, while blogs are left 100% intact because of Google?

Sounds logical :roll:

I agree that, for the moment, most bloggers will not worry about the size of their databases, the number of entries they have, or the relevance of their content (Lord knows I don't), but as the segment continues to mature and people start building multi-decade blogs, it may get to a point where someone would want to start trimming their less popular archives :???:

Perhaps I'll make a note to check on this idea some time in the 2030's....</description>
		<content:encoded><![CDATA[<p>Hmm &#8230; so a corporate database that is used by hundreds of people to report on thousands of possible sales or inventory variations can have a subset of its own data, with the archived daata remaining available on a slower or otherwise secondary system, while blogs are left 100% intact because of Google?</p>
<p>Sounds logical <img src='http://www.j2fi.net/wp-includes/images/smilies/icon_rolleyes.gif' alt=':roll:' class='wp-smiley' /><br />
I agree that, for the moment, most bloggers will not worry about the size of their databases, the number of entries they have, or the relevance of their content (Lord knows I don&#8217;t), but as the segment continues to mature and people start building multi-decade blogs, it may get to a point where someone would want to start trimming their less popular archives <img src='http://www.j2fi.net/wp-includes/images/smilies/icon_confused.gif' alt=':???:' class='wp-smiley' /><br />
Perhaps I&#8217;ll make a note to check on this idea some time in the 2030&#8217;s&#8230;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Contamination</title>
		<link>http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1271</link>
		<dc:creator>Contamination</dc:creator>
		<pubDate>Thu, 17 Apr 2008 12:41:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.j2fi.net/2008/04/17/is-your-storage-bloated/#comment-1271</guid>
		<description>While for business I agree with the idea of moving data that's over a certain age to a secondary database to be called upon infrequently, but for blogging I would never delete an old post.

Many of my visitors that have come through a search engine do so on old posts, the last thing I'd want them to find is a 404 because I'd been cleaning up old posts.

Plus each old post is a single page as far as the almighty Google is concerned and thus counts as a small increment towards increasing my page rank and the value of doing a link exchange with me.</description>
		<content:encoded><![CDATA[<p>While for business I agree with the idea of moving data that&#8217;s over a certain age to a secondary database to be called upon infrequently, but for blogging I would never delete an old post.</p>
<p>Many of my visitors that have come through a search engine do so on old posts, the last thing I&#8217;d want them to find is a 404 because I&#8217;d been cleaning up old posts.</p>
<p>Plus each old post is a single page as far as the almighty Google is concerned and thus counts as a small increment towards increasing my page rank and the value of doing a link exchange with me.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
