Select Language

AI社区

公开数据集

QSAR鱼类生物浓缩因子(BCF)数据集

QSAR鱼类生物浓缩因子(BCF)数据集

144K
502 浏览
0 喜欢
0 次下载
0 条讨论
Life Regression

Francesca Grisoni, University of Milano-Bicocca, Dept. of Earth and Environmental Sciences, Milano ChemometricsQSAR Rese......

数据结构 ? 144K

    Data Structure ?

    * 以上分析是由系统提取分析形成的结果,具体实际数据为准。

    README.md

    Francesca Grisoni, University of Milano-Bicocca, Dept. of Earth and Environmental Sciences, Milano Chemometrics & QSAR Research Group, francesca.grisoni '@' unimib.it
    Viviana Consonni,  University of Milano-Bicocca, Dept. of Earth and Environmental Sciences, Milano Chemometrics & QSAR Research Group,  viviana.consonni '@' unimib.it
    Marco Vighi, University of Milano-Bicocca, Dept. of Earth and Environmental Sciences
    Sara Villa, University of Milano-Bicocca, Dept. of Earth and Environmental Sciences
    RobertoTodeschini,  University of Milano-Bicocca, Dept. of Earth and Environmental Sciences, Milano Chemometrics & QSAR Research Group, roberto.todeschini '@' unimib.it


    Data Set Information:

    This dataset contains manually-curated experimental bioconcentration factor (BCF) for 1058 molecules (continuous values). Each row contains a molecule, identified by a CAS number, a name (if available), and a SMILES string. Additionally, the KOW (experimental or predicted) is reported. In this database, you will also find Extended Connectivity Fingerprints (binary vectors of 1024 bits), to be used as independent variables to predict the BCF. You can find additional information in the referenced papers.
    In case you had questions, please do not hesitate to contact us!


    Attribute Information:

    The provided zip file contains two files.

    (I) The file 'QSAR BCF KOW' contains the following attributes:
    1. CAS number (molecule identifier)
    2. Molecule Name (if not available, marked as 'n.a.')
    3. SMILES string to identify the 2D molecular structure
    4. LogKOW: octanol water partitioning coefficient (experimental or predicted, as indicated by the column 'KOW Type'
    5. KOW Type: indicates whether the logKOW value is experimental or predicted
    6. Experimental logBCF (quantitative response): experimental fish bioconcentration factor (logarithm form)

    (II) The file 'ECFP_1024_m0-2_b2_c.txt' contains the following molecular descriptors (to be used to predict the BCF):
    - Extended Connectivity Fingerprints (ECFPs): binary descriptors useful to predict the experimental logBCF (computed with Dragon7, default settings --> details specified in the file)
    Each row corresponds to one molecule, as identified by the SMILES field. The molecules are in the same order as in the previous file.


    Relevant Papers:

    1. Grisoni, F., Consonni, V., Villa, S., Vighi, M. and Todeschini, R., 2015. QSAR models for bioconcentration: Is the increase in the complexity justified by more accurate predictions?. Chemosphere, 127, pp.171-179. --> Procedure for data curation.
    2. Grisoni, F., Consonni, V., Vighi, M., Villa, S. and Todeschini, R., 2016. Expert QSAR system for predicting the bioconcentration factor under the REACH regulation. Environmental research, 148, pp.507-512.  -->  Benchmark on the performance for this dataset
    3. Grisoni, F., Consonni, V., Vighi, M., Villa, S. and Todeschini, R., 2016. Investigating the mechanisms of bioconcentration through QSAR classification trees. Environment international, 88, pp.198-205. --> Relationship between KOW and BCF



    Citation Request:

    If you publish results based on this dataset or parts of it, please cite the following paper:
    @article{grisoni2015,
     title={QSAR models for bioconcentration: Is the increase in the complexity justified by more accurate predictions?},
     author={Grisoni, Francesca and Consonni, Viviana and Villa, Sara and Vighi, Marco and Todeschini, Roberto},
     journal={Chemosphere},
     volume={127},
     pages={171--179},
     year={2015},
     publisher={Elsevier}
    }

    If you use the ECFP values, additionally please cite the following software:

    Dragon (Software for Molecular Descriptor Calculation) Version 6.0 a€” 2012
    [Web link] (2012)

    And paper:

    @article{rogers2010,
     title={Extended-connectivity fingerprints},
     author={Rogers, David and Hahn, Mathew},
     journal={Journal of chemical information and modeling},
     volume={50},
     number={5},
     pages={742--754},
     year={2010},
     publisher={ACS Publications}
    }

    --> Thanks and happy predicting!

    ×

    帕依提提提温馨提示

    该数据集正在整理中,为您准备了其他渠道,请您使用

    注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
    暂无相关内容。
    暂无相关内容。
    • 分享你的想法
    去分享你的想法~~

    全部内容

      欢迎交流分享
      开始分享您的观点和意见,和大家一起交流分享.
    所需积分:6 去赚积分?
    • 502浏览
    • 0下载
    • 0点赞
    • 收藏
    • 分享