uu.seUppsala universitets publikasjoner
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Rule-based Models of Transcriptional Regulation and Complex Diseases: Applications and Development
Uppsala universitet, Teknisk-naturvetenskapliga vetenskapsområdet, Biologiska sektionen, Institutionen för cell- och molekylärbiologi, Beräknings- och systembiologi.
2014 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Abstract [en]

As we gain increased understanding of genetic disorders and gene regulation more focus has turned towards complex interactions. Combinations of genes or gene and environmental factors have been suggested to explain the missing heritability behind complex diseases. Furthermore, gene activation and splicing seem to be governed by a complex machinery of histone modification (HM), transcription factor (TF), and DNA sequence signals. This thesis aimed to apply and develop multivariate machine learning methods for use on such biological problems. Monte Carlo feature selection was combined with rule-based classification to identify interactions between HMs and to study the interplay of factors with importance for asthma and allergy.

Firstly, publicly available ChIP-seq data (Paper I) for 38 HMs was studied. We trained a classifier for predicting exon inclusion levels based on the HMs signals. We identified HMs important for splicing and illustrated that splicing could be predicted from the HM patterns. Next, we applied a similar methodology on data from two large birth cohorts describing asthma and allergy in children (Paper II). We identified genetic and environmental factors with importance for allergic diseases which confirmed earlier results and found candidate gene-gene and gene-environment interactions.

In order to interpret and present the classifiers we developed Ciruvis, a web-based tool for network visualization of classification rules (Paper III). We applied Ciruvis on classifiers trained on both simulated and real data and compared our tool to another methodology for interaction detection using classification. Finally, we continued the earlier study on epigenetics by analyzing HM and TF signals in genes with or without evidence of bidirectional transcription (Paper IV). We identified several HMs and TFs with different signals between unidirectional and bidirectional genes. Among these, the CTCF TF was shown to have a well-positioned peak 60-80 bp upstream of the transcription start site in unidirectional genes.

sted, utgiver, år, opplag, sider
Uppsala: Acta Universitatis Upsaliensis, 2014. , s. 69
Serie
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 1167
Emneord [en]
Histone modification, Transcription factor, Transcriptional regulation, Next-generation sequencing, Feature selection, Machine learning, Rule-based classification, Asthma, Allergy
HSV kategori
Forskningsprogram
Bioinformatik
Identifikatorer
URN: urn:nbn:se:uu:diva-230159ISBN: 978-91-554-9005-8 (tryckt)OAI: oai:DiVA.org:uu-230159DiVA, id: diva2:739042
Disputas
2014-10-03, BMC C8:301, Husargatan 3, Uppsala, 13:15 (engelsk)
Opponent
Veileder
Tilgjengelig fra: 2014-09-12 Laget: 2014-08-19 Sist oppdatert: 2018-01-11
Delarbeid
1. Combinations of histone modifications mark exon inclusion levels
Åpne denne publikasjonen i ny fane eller vindu >>Combinations of histone modifications mark exon inclusion levels
2012 (engelsk)Inngår i: PLoS ONE, ISSN 1932-6203, E-ISSN 1932-6203, Vol. 7, nr 1, artikkel-id e29911Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Splicing is a complex process regulated by sequence at the classical splice sites and other motifs in exons and introns with an enhancing or silencing effect. In addition, specific histone modifications on nucleosomes positioned over the exons have been shown to correlate both positively and negatively with exon expression. Here, we trained a model of "IF … THEN …" rules to predict exon inclusion levels in a transcript from histone modification patterns. Furthermore, we showed that combinations of histone modifications, in particular those residing on nucleosomes preceding or succeeding the exon, are better predictors of exon inclusion levels than single modifications. The resulting model was evaluated with cross validation and had an average accuracy of 72% for 27% of the exons, which demonstrates that epigenetic signals substantially mark alternative splicing.

HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-175875 (URN)10.1371/journal.pone.0029911 (DOI)000312662100045 ()22242188 (PubMedID)
Forskningsfinansiär
Knut and Alice Wallenberg FoundationSwedish Foundation for Strategic Research Swedish Research CouncilSwedish Cancer Society
Tilgjengelig fra: 2012-06-13 Laget: 2012-06-13 Sist oppdatert: 2018-01-12bibliografisk kontrollert
2. Rule-Based Models of the Interplay between Genetic and Environmental Factors in Childhood Allergy
Åpne denne publikasjonen i ny fane eller vindu >>Rule-Based Models of the Interplay between Genetic and Environmental Factors in Childhood Allergy
Vise andre…
2013 (engelsk)Inngår i: PLoS ONE, ISSN 1932-6203, E-ISSN 1932-6203, Vol. 8, nr 11, s. e80080-Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Both genetic and environmental factors are important for the development of allergic diseases. However, a detailed understanding of how such factors act together is lacking. To elucidate the interplay between genetic and environmental factors in allergic diseases, we used a novel bioinformatics approach that combines feature selection and machine learning. In two materials, PARSIFAL (a European cross-sectional study of 3113 children) and BAMSE (a Swedish birth-cohort including 2033 children), genetic variants as well as environmental and lifestyle factors were evaluated for their contribution to allergic phenotypes. Monte Carlo feature selection and rule based models were used to identify and rank rules describing how combinations of genetic and environmental factors affect the risk of allergic diseases. Novel interactions between genes were suggested and replicated, such as between ORMDL3 and RORA, where certain genotype combinations gave odds ratios for current asthma of 2.1 (95% CI 1.2-3.6) and 3.2 (95% CI 2.0-5.0) in the BAMSE and PARSIFAL children, respectively. Several combinations of environmental factors appeared to be important for the development of allergic disease in children. For example, use of baby formula and antibiotics early in life was associated with an odds ratio of 7.4 (95% CI 4.5-12.0) of developing asthma. Furthermore, genetic variants together with environmental factors seemed to play a role for allergic diseases, such as the use of antibiotics early in life and COL29A1 variants for asthma, and farm living and NPSR1 variants for allergic eczema. Overall, combinations of environmental and life style factors appeared more frequently in the models than combinations solely involving genes. In conclusion, a new bioinformatics approach is described for analyzing complex data, including extensive genetic and environmental information. Interactions identified with this approach could provide useful hints for further in-depth studies of etiological mechanisms and may also strengthen the basis for risk assessment and prevention.

HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-213817 (URN)10.1371/journal.pone.0080080 (DOI)000327311900057 ()
Tilgjengelig fra: 2014-01-05 Laget: 2014-01-04 Sist oppdatert: 2017-12-06bibliografisk kontrollert
3. Ciruvis: a web-based tool for rule networks and interaction detection using rule-based classifiers
Åpne denne publikasjonen i ny fane eller vindu >>Ciruvis: a web-based tool for rule networks and interaction detection using rule-based classifiers
2014 (engelsk)Inngår i: BMC Bioinformatics, ISSN 1471-2105, E-ISSN 1471-2105, Vol. 15, s. 139-Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Background: The use of classification algorithms is becoming increasingly important for the field of computational biology. However, not only the quality of the classification, but also its biological interpretation is important. This interpretation may be eased if interacting elements can be identified and visualized, something that requires appropriate tools and methods. Results: We developed a new approach to detecting interactions in complex systems based on classification. Using rule-based classifiers, we previously proposed a rule network visualization strategy that may be applied as a heuristic for finding interactions. We now complement this work with Ciruvis, a web-based tool for the construction of rule networks from classifiers made of IF-THEN rules. Simulated and biological data served as an illustration of how the tool may be used to visualize and interpret classifiers. Furthermore, we used the rule networks to identify feature interactions, compared them to alternative methods, and computationally validated the findings. Conclusions: Rule networks enable a fast method for model visualization and provide an exploratory heuristic to interaction detection. The tool is made freely available on the web and may thus be used to aid and improve rule-based classification.

Emneord
Visualization, Rules, Interactions, Interaction detection, Classification, Rule-based classification
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-228027 (URN)10.1186/1471-2105-15-139 (DOI)000336679600001 ()
Tilgjengelig fra: 2014-07-02 Laget: 2014-07-02 Sist oppdatert: 2017-12-05bibliografisk kontrollert
4. Different distribution of histone modifications in genes with unidirectional and bidirectional transcription and a role of CTCF and cohesin in directing transcription
Åpne denne publikasjonen i ny fane eller vindu >>Different distribution of histone modifications in genes with unidirectional and bidirectional transcription and a role of CTCF and cohesin in directing transcription
2015 (engelsk)Inngår i: BMC Genomics, ISSN 1471-2164, E-ISSN 1471-2164, Vol. 16, artikkel-id 300Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Background: Several post-translational histone modifications are mainly found in gene promoters and are associated with the promoter activity. It has been hypothesized that histone modifications regulate the transcription, as opposed to the traditional view with transcription factors as the key regulators. Promoters of most active genes do not only initiate transcription of the coding sequence, but also a substantial amount of transcription of the antisense strand upstream of the transcription start site (TSS). This promoter feature has generally not been considered in previous studies of histone modifications and transcription factor binding.

Results: We annotated protein-coding genes as bi- or unidirectional depending on their mode of transcription and compared histone modifications and transcription factor occurrences between them. We found that H3K4me3, H3K9ac, and H3K27ac were significantly more enriched upstream of the TSS in bidirectional genes compared with the unidirectional ones. In contrast, the downstream histone modification signals were similar, suggesting that the upstream histone modifications might be a consequence of transcription rather than a cause. Notably, we found well-positioned CTCF and RAD21 peaks approximately 60-80 bp upstream of the TSS in the unidirectional genes. The peak heights were related to the amount of antisense transcription and we hypothesized that CTCF and cohesin act as a barrier against antisense transcription.

Conclusions: Our results provide insights into the distribution of histone modifications at promoters and suggest a novel role of CTCF and cohesin as regulators of transcriptional direction.

Emneord
Antisense transcription, CTCF, RAD21, Cohesin, CAGE, Epigenetics, Transcription factor, Histone modification
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-230158 (URN)10.1186/s12864-015-1485-5 (DOI)000355166000001 ()25881024 (PubMedID)
Tilgjengelig fra: 2014-08-19 Laget: 2014-08-19 Sist oppdatert: 2017-12-05bibliografisk kontrollert

Open Access i DiVA

fulltext(3182 kB)331 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 3182 kBChecksum SHA-512
222c0e6938cf7ed5af4e3193d082cae78c056df8b645faac67386fbf26a62685be77582fe1366c29efec96848bf78a4c4019c2d54aeca9d1eaec0c0ecf2bc27b
Type fulltextMimetype application/pdf
Kjøp publikasjonen >>

Personposter BETA

Bornelöv, Susanne

Søk i DiVA

Av forfatter/redaktør
Bornelöv, Susanne
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 331 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

isbn
urn-nbn

Altmetric

isbn
urn-nbn
Totalt: 1112 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf