Time Dimension Design Question
Time Dimension Design Question
am 19.12.2007 21:08:33 von jimbo
Im trying to design my time dimension and need to add a field to
handle null dates in the fact. So if at the time of ETL the date isnt
known, referential integrity will be preserved. Kimball suggests
insterting a record in the time dimension to handle this with a
description of 'Date not available' or something like that. However
if users are doing inner joins on the dimension they will obvisously
be pulling the datetime field..what should be in the datetime field
for this particular record?
Re: Time Dimension Design Question
am 20.12.2007 07:42:00 von jefftyzzer
On Dec 19, 12:08 pm, Jimbo wrote:
> Im trying to design my time dimension and need to add a field to
> handle null dates in the fact. So if at the time of ETL the date isnt
> known, referential integrity will be preserved. Kimball suggests
> insterting a record in the time dimension to handle this with a
> description of 'Date not available' or something like that. However
> if users are doing inner joins on the dimension they will obvisously
> be pulling the datetime field..what should be in the datetime field
> for this particular record?
James,
If I'm reading right (DW Toolkit 2d Ed, pp 60, 261), I don't think
Kimball is suggesting that you put a dummy date in the dimensional row
(e.g. 1/1/1900 or 12/21/9999)--although I suppose you could do that if
that's a convention your users would agree to--but rather that that
value be NULL, just as it is in the source data. (This would mean,
though, that your date column in the dimensional table would have to
be NULLable.)
I suppose that when you then join the fact to the time dimension, you
could apply a COALESCE to the date column, e.g.,
COALESCE(time_dim.date, 'Date not Available').
--Jeff
Re: Time Dimension Design Question
am 20.12.2007 18:27:00 von jimbo
On Dec 19, 10:42 pm, jefftyzzer wrote:
> On Dec 19, 12:08 pm, Jimbo wrote:
>
> > Im trying to design my time dimension and need to add a field to
> > handle null dates in the fact. So if at the time of ETL the date isnt
> > known, referential integrity will be preserved. Kimball suggests
> > insterting a record in the time dimension to handle this with a
> > description of 'Date not available' or something like that. However
> > if users are doing inner joins on the dimension they will obvisously
> > be pulling the datetime field..what should be in the datetime field
> > for this particular record?
>
> James,
>
> If I'm reading right (DW Toolkit 2d Ed, pp 60, 261), I don't think
> Kimball is suggesting that you put a dummy date in the dimensional row
> (e.g. 1/1/1900 or 12/21/9999)--although I suppose you could do that if
> that's a convention your users would agree to--but rather that that
> value be NULL, just as it is in the source data. (This would mean,
> though, that your date column in the dimensional table would have to
> be NULLable.)
>
> I suppose that when you then join the fact to the time dimension, you
> could apply a COALESCE to the date column, e.g.,
> COALESCE(time_dim.date, 'Date not Available').
>
> --Jeff
Ah that makes sense...thanks, thats what I'll do.
-Jim
Re: Time Dimension Design Question
am 24.12.2007 21:33:20 von Ed Murphy
jefftyzzer wrote:
> I suppose that when you then join the fact to the time dimension, you
> could apply a COALESCE to the date column, e.g.,
> COALESCE(time_dim.date, 'Date not Available').
Don't you have to CAST or CONVERT time_dim.date to CHAR or VARCHAR
first, lest it try and fail to convert 'Date not Available' to
DATETIME instead?
Re: Time Dimension Design Question
am 15.01.2008 04:36:33 von dmcmunn
On Dec 24 2007, 2:33 pm, Ed Murphy wrote:
> jefftyzzer wrote:
> > I suppose that when you then join the fact to thetimedimension, you
> > could apply a COALESCE to the date column, e.g.,
> > COALESCE(time_dim.date, 'Date not Available').
>
> Don't you have to CAST or CONVERT time_dim.date to CHAR or VARCHAR
> first, lest it try and fail to convert 'Date not Available' to
> DATETIME instead?
Please consider a web page I maintain dedicated to providing the
building blocks for kick starting a date dimension: http://www.ipcdesigns.com/dim_date/
All of the information is free including free downloads of scripts to
build a date dimension from scratch or download one that is pre-built.
If you find the downloads helpful, please leave a comment. You don't
even have to register, but I do appreciate feedback. There are also
other free tool kits for data profiling and one for ETL metadata
management should be ready by February 1, 2008. Best of luck, Don
McMunn