Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_90641 |
Symbol | CHD1 |
ID | 4840385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 772013 |
End bp | 776677 |
Gene Length | 4665 bp |
Protein Length | 1414 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640391700 |
Product | transcriptional regulator |
Protein accession | XP_001385507 |
Protein GI | 150866039 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.104973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCTGTCTTTC CACCACCAAG TTCTTATAGC TTTGTGATAT CTAGACTTGG AGAGGCTAGC CCGTACTGAG TGCATCTGAA TAACACCCAA TTTTCGGTTG TGATAGCTTG TGGGTATTCT CATTTTTCGT ATCTCCAGAT CCTTATACTT TCCATATCAG CTAGCATAAA CATTGTTGAA CAGGCAGGGT ATTTTTCTGC AAAATTCATA ATATTTTCAA TATTTATACT CTATTCAGTG TAGCGACTGG AATTCGCCAT TGGACAGAAG TTCAACTGTT ATAATTGAGA TTTTGTTGGT ATCTGATCAT TAGTTTATAA GCTATCTTGA TTTACAATCC CACAGTCTTC ACTTACTAGC TAATAATATC ATGGTCAAAC CGGTATTACA AGAGGCAACC CTCTTTCCGG AACTCTACGG ACTCAGACGG TCCCACAGAG AAAGACGTGC TGCCATCGTC GAAAGTGAAA GTGAAGAAGA ACCTATTCCA AAGAGAAGGA AGAAGAAAGC TCAACGAACT GAAGATTACG GTGATGAAAA CGAAGAAGAT GAAGAAGACG ACGCGGAAGA CTCTGAGGAC GAAGATTTTT ACGACAAGCC TCTTGCGAAG AAGAAGAAGA AGACAATTAT ACAGAAGAAG AAGAAGAAAT CATCTGTGAA ACTGGAACCT TCGGTTCCTG CTGAGATCAG ATTCTCGTCG AGAAATAACA AACAGGTTAA CTATGCCGTT GATTATGATG ACGACGACGA CTTGCTTGAA AGTGAACCAG AACTTGACGA AAACGACGAA GACGTAAATG GAGATTACTA CTACTATCAA CAAGCCAATG AAGATGAAAA CGAAAGAGGT ATCGATCTCG TCATGGATCA CAAACTAAAC GAAGAAAATC CGGAAAAGAC TGGTGACCCG AAGCTAGACT ACTTGTTCAG AATTAAGTGG ACAGACCAAT CGCATTTGCA TAATACTTGG GAGCAGTGGA ATGACTTGAA GGAGTACAAA GGTATCAGAA AAGTCGAAAA CTACATCAAG CAATTCATAA TATATGATCA AGAGATCAGA AACGATCCTT TGACGACTAG GGAAGACATA GAAGCCATGG ACATTGACCG CGAAAGAAAA CGCGATGAAC AAGAAGAATA CACTCGTGTT GAAAGAATCG TTGACTCCGA AAGAATTGAA ACCGAAAATG GCGAAACTAA GTTACAATAC TATTGTAAGT GGAGAAGACT CTACTATGAC GAATGTTCTT GGGAAGATGC TGAAGAAATT GCTAAGATTG CTCCCGATCA AGTAGCCAAG TATCAGCAAA GGTTGAAGTC TAAGATATTG CCAAACTTGT CAGCTAATTA CCCATTGTCA CAAAGACCTA GATTTGAAAA GTTGGTTAAA CAGCCTGTTT TTATCAAGAA TGGAGAGTTG AGAGATTTCC AGTTGACCGG TTTGAACTGG ATGGCTTTCT TGTGGTCTAG AAACGAAAAC GGTATTCTTG CTGATGAAAT GGGGTTGGGT AAGACTGTGC AAACTGTTTC ATTTCTCTCT TGGTTGATTT ATTCTAGAAG GCAGAATGGC CCTCATATTG TTGTTGTGCC ATTGTCAACA ATTCCAGCTT GGCAAGAGAC CTTTGAGAAG TGGTCTCCTG ATGTAAACTG TGTTTATTAC TTAGGTAATA CTCAGGCACG AAAGACCATC AGAGACTACG AATTCTACGG ATCCAACAAT AAGCCTAAAT TCAACATATT GTTAACTACA TATGAATACA TTTTGAAGGA CCGCAACGAA TTGGGTGCTT TCAAATGGCA ATTCTTGGCT GTTGATGAAG CTCATAGATT GAAGAATGCT GACTCCTCGT TATATGAATC GTTGAAGAGT TTTAAGGTTG CCAACAGATT GTTGATTACT GGGACACCTT TGCAAAACAA TATTAAGGAA TTGGCCGCTT TGGTTAACTT CTTGATGCCT GGCAAATTCG ATATTGAGCA AGAAATCGAT TTTGAAACTC CTGATGAAGA GCAGGAATTG TACATTAAAG ACTTACAGAA AAAGATCCAG CCTTTCATAT TGAGAAGATT GAAAAAGGAC GTGGAGAAAT CCCTTCCTTC CAAGACAGAA AGAATCTTGA GAGTAGAATT ATCAGATATC CAGACTGAGT ACTACAAGAA CATTATCACT AAAAACTATT CTGCTTTGAA TGCTGGAAAT AAGGGCGCTC AGATCTCGTT GTTGAACGTG ATGAGTGAGT TGAAGAAGGC GTCTAACCAT CCTTACCTTT TCGATGGTGC TGAGAACAGA GTTTTAGCCA AGGTTGGTTC TGCTACTAGG GATAACATCT TGAGAGGTAT GATTATGTCT TCTGGTAAGA TGGTTTTGCT TGAACAATTA TTAACCAGAT TGAAAAAGGA AGGTCACAGA GTCTTAATCT TCTCTCAAAT GGTCAGAATT TTGGATATTC TTGGCGATTA TTTATCGATC AAGGGTTACC AGTTCCAAAG ATTGGATGGT GGGATACCGT CTGCTCAAAG AAGAATCTCT ATTGACCATT TCAACGCTCC AGAATCAAAG GATTTCATCT TTTTGTTGTC CACTAGAGCT GGTGGGTTAG GAATTAATTT AATGACCGCT GACACTGTTA TCATATTTGA CTCTGATTGG AACCCTCAAG CTGATTTGCA GGCAATGGCC AGAGCTCATA GAATTGGTCA GAAGAATCAT GTGTCGGTTT ACAGATTCGT TAGTAAAGAT ACTGTTGAGG AGCAGATCTT GGAAAGGGCT AGGAAAAAGA TGATTTTGGA GTACGCCATT ATTTCGTTGG GAATCACAGA TCCCAACTCG AAGAAGAGCA AGACCGAACC ATCAACAGGT GAATTGAGTC AAATCTTGAA GTTTGGTGCT GGCAATATGT TCAAAGAAAA CGATAACCAA AAGAAGTTGG AAGATTTGAA CTTGGATGAT GTATTAAATC ATGCTGAAGA TCATGTCACT ACTCCTGACC TTGGAGAGTC CAACTTGGGT TCTGAAGAAT TTTTAAAACA GTTCGAAGTT ACAGATTACA AGGCCGATGT TGAATGGGAC GATATCATCC CTCAAGAAGA ATTGGCCAAG TTGAAGGACG AGGAGAAAAA GAAGGCGGAT GAGATCTACT TGGAAGAACA AATTGCTATG TACTCGAGAA GAAAAGCTGC CGTTCGTAAA TTTGAAAATG GATCGGTTGC TCCAAGTGAT GACGAAGATG AAGATTCTTC TCTGGGTAGA CAACCAAGAC TGAAGCATGC CGGCGACCAT CAACTTTCTG AAAAGGAAAT TAGAGGTATA TACAGATCAA TCTTGAGACT TGGTGATTTG ACTGGAAGGT GGGAGCAATT GGTTGAAGAA GGAAGTGTCA CAAACAAGAA TCCCGTTTTG GTCAAACATG CCTACAATGA GATTATCAAC ACTTCCAAAC AATTAGTAAA GGAAGAAGAA ATCAGAAGAA CTAAAGCCTT GGCCGATTTG GAAAGAAAGG CTATTGAGCA AAGAGAAAAG GGAGCTGTAG AAAATCCAAC GGCCTTGTGG ATTGCCAAGA AGAAGGAAAA GAAGGCCGTC TTATTCGAAT ACCAGGGGGT CAAGAATATT AATGCTGAGT TGGTATTGAA TAGACCTGTG GACATGAAGA TGCTTGACAG TATGATTCCT AAGGAAAACC CTACTTCATT TGAATTGCCC AGACCTCCTA AGCTGGTGTC AGCCTGGTCT TGCGATTGGT CTGAAAAGGA CGATGCCATG TTGTTGGTTG GTGTTTTCAA ATTCGGATAT GGATCTTGGG TTCAGATTCG TGACGACACT GTTTTGGGAT TGCAGACAAA GCTCTTCTTA GAAGGCTCTA CGAATCCTAA GGAAGCAAAT ACAGTGAGTA CTGCTCCTGC TGGTGAAGCT GGCTCTGGTG CTCCAGTTGA AGAAAAGGCA GTTAAAAAGG TGCCAGGCTC TGTGCATTTG GGTAGAAGAG TAGATTACTT GTTCTCGTTA CTCAGAGGTG ACGATGACCG TGCAAATGGA GGCTCCACTC CTTCTGGTGT TACTATTAAG AAGAAAGTGA GAAGACCAAA GACGGAAACG CCGGTTCCAG CAGGTAAAGC TAAAGCCGCA GCTAGGTCAC AATCTCCCGA AACTAACCTC AAGAAGTCTA AGGTGAGAAG CATTGCTCCA AAGGCGTCCC CATCTAGTCC CGCTGGTAAC TCTCCAAAGG TAGATTCTGG AGCTCACACA CACTCTAGTC AGCACCAGCC ACATCACAAC AACAATTCCA AGGATGATGA TAAAGATCTT GAATACGATA GCATGGATGA AGGGTACTGC AAGGACGTAT TGAAGCCCGT TGCAAAGTCG TTGATGAGAC TCCACAAAGG AAATCAAGGA TTTGAAAAAC ACGAATGGGC CAACATCTTG AAAACGGAAT TGCTCACCGT GGGCGACTTT ATTGGCACAG TTGTCAAGCC GTTGGACGGT AAACCAGAAG GAAATAAGTT GCAGAAGCAT TTATGGTCGT ATTCTAGACT CTACTGGCCA GCTAAAGTTC CTTCGAAAAA GATCTTCGAC ATGTACACTA GACTAAAGAC CAAATCTGGC CAAAACACTA AGTAATTACG TTTACAGATG TATGTATATA CAATACAAGA GAATTTGCTC TATAG
|
Protein sequence | MVKPVLQEAT LFPELYGLRR SHRERRAAIV ESESEEEPIP KRRKKKAQRT EDYGDENEED EEDDAEDSED EDFYDKPLAK KKKKTIIQKK KKKSSVKSEP SVPAEIRFSS RNNKQVNYAV DYDDDDDLLE SEPELDENDE DVNGDYYYYQ QANEDENERG IDLVMDHKLN EENPEKTGDP KLDYLFRIKW TDQSHLHNTW EQWNDLKEYK GIRKVENYIK QFIIYDQEIR NDPLTTREDI EAMDIDRERK RDEQEEYTRV ERIVDSERIE TENGETKLQY YCKWRRLYYD ECSWEDAEEI AKIAPDQVAK YQQRLKSKIL PNLSANYPLS QRPRFEKLVK QPVFIKNGEL RDFQLTGLNW MAFLWSRNEN GILADEMGLG KTVQTVSFLS WLIYSRRQNG PHIVVVPLST IPAWQETFEK WSPDVNCVYY LGNTQARKTI RDYEFYGSNN KPKFNILLTT YEYILKDRNE LGAFKWQFLA VDEAHRLKNA DSSLYESLKS FKVANRLLIT GTPLQNNIKE LAALVNFLMP GKFDIEQEID FETPDEEQEL YIKDLQKKIQ PFILRRLKKD VEKSLPSKTE RILRVELSDI QTEYYKNIIT KNYSALNAGN KGAQISLLNV MSELKKASNH PYLFDGAENR VLAKVGSATR DNILRGMIMS SGKMVLLEQL LTRLKKEGHR VLIFSQMVRI LDILGDYLSI KGYQFQRLDG GIPSAQRRIS IDHFNAPESK DFIFLLSTRA GGLGINLMTA DTVIIFDSDW NPQADLQAMA RAHRIGQKNH VSVYRFVSKD TVEEQILERA RKKMILEYAI ISLGITDPNS KKSKTEPSTG ELSQILKFGA GNMFKENDNQ KKLEDLNLDD VLNHAEDHVT TPDLGESNLG SEEFLKQFEV TDYKADVEWD DIIPQEELAK LKDEEKKKAD EIYLEEQIAM YSRRKAAVRK FENGSVAPSD DEDEDSSSGR QPRSKHAGDH QLSEKEIRGI YRSILRLGDL TGRWEQLVEE GSVTNKNPVL VKHAYNEIIN TSKQLVKEEE IRRTKALADL ERKAIEQREK GAVENPTALW IAKKKEKKAV LFEYQGVKNI NAELVLNRPV DMKMLDSMIP KENPTSFELP RPPKSVSAWS CDWSEKDDAM LLVGVFKFGY GSWVQIRDDT VLGLQTKLFL EGSTNPKEAN TVSTAPAGEA GSGAPVEEKA VKKVPGSVHL GRRVDYLFSL LRGDDDRANG GSTPSGVTIK KKVRRPKTET PVPAGKAKAA ARSQSPETNL KKSKVRSIAP KASPSSPAGN SPKVDSGAHT HSSQHQPHHN NNSKDDDKDL EYDSMDEGYC KDVLKPVAKS LMRLHKGNQG FEKHEWANIL KTELLTVGDF IGTVVKPLDG KPEGNKLQKH LWSYSRLYWP AKVPSKKIFD MYTRLKTKSG QNTK
|
| |