statsmodels.sandbox.regression.try_ols_anova.form2design
-
statsmodels.sandbox.regression.try_ols_anova.form2design(ss, data)
[source] -
convert string formula to data dictionary
-
ss : string
-
- I : add constant
- varname : for simple varnames data is used as is
- F:varname : create dummy variables for factor varname
- P:varname1*varname2 : create product dummy variables for varnames
- G:varname1*varname2 : create product between factor and continuous variable
-
data : dict or structured array
- data set, access of variables by name as in dictionaries
Returns: vars : dictionary
dictionary of variables with converted dummy variables
names : list
list of names, product (P:) and grouped continuous variables (G:) have name by joining individual names sorted according to input
Notes
with sorted dict, separate name list wouldn?t be necessary
Examples
12345>>> xx, n
=
form2design(
'I a F:b P:c*d G:c*f'
, testdata)
>>> xx.keys()
[
'a'
,
'b'
,
'const'
,
'cf'
,
'cd'
]
>>> n
[
'const'
,
'a'
,
'b'
,
'cd'
,
'cf'
]
-
Please login to continue.