Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0918 |
Symbol | |
ID | 4811211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1101624 |
End bp | 1105253 |
Gene Length | 3630 bp |
Protein Length | 1209 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106337 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001037345 |
Protein GI | 125973435 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.447129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAGCA TTATACTTAA AGCATTTGAC GAGTCAACAG AAATTAATGG TGATTCCCTT GACTTGGGTG AAATTCAATT GACACCTTAC AAGGGCAGGT TGACCTTGGA ATTATTAAAG AAACCGGCAC ATACATCTCA GGAAGAGCCT GGATTGGAGC AAGAAAACTT ATACAATTAT ACTATTAGTA TATACAATAA AACAAAGCAG GCAAAAATCA AAAACTTTAA TATCAAACCA CCTTATATTT TCTTAAATGA TACTAACATA GAAACAGGTG ACGTTATTGA AGTAACGTTT ACACACAAGA AGGACTATAC TTTGCCGGCT ACGATCGAGC TTACTTTAGA CAGATTAATG AATGCGAGCG GGCAAGCTGT TGCAAAAGAG AACGGAAAAG GAATGGCATC AATACAGTCG GGGAACCAAA GATGTGAATT AATGGTTTAT AACTCTGACA AAAAGTTCAT AGAAAGCTGT TCGATGAACA GAGAGGGTAA AATAATCTCC AATCCTTTGA AAGAAGGAGT GTATACTTTT GTATTCATCT CCAGCAACAA CTACTTGTGG AGACTTAATA CACTGGAGCA ATTTGAGGAA ATAGGTCTGA AAGAAGGTAT ACATTTCGTA AAGAAGGAAG TAGAAATAAA AAATGGGGAA ATTAAAGATT TGGGACTGAT TGTTGTTCCT GAACTAAACG AAAAAGAGTT GTTCTTCACA GATCCCGATG TTGCAAGCTA TGCTGTGGTT TCGGAAAATA ATCGTATAGG AAAAGATATA GTGCTTAGAG CCAAATATTC ATTCAGAAAA CCTATTGAAT CACAGACAGT TAAATTACTG TTCAGGGTTC CGGATCAAAC ATTTTATGTA AAAGATACAT TAACTATTGA CGGAGAACTT GTTGATTCAG TAAATGTATC CGGTAGTTTT ATTGAAGTAA CAACTACAAA AACAGAAGGT GTTGTACGCT TTAATATCAG ACCTACTGAA GCGGTTGTTT GTTACAGTAA TGCTTATGTG GAATTTAAAG CGGAAGGAAG GACTGTAAGA GAACCTCTGG GTGTCGTTAA CGAAGTTGTT CCTTACCTCA CAATAGGAGC TCCTAATGTT ACAAGCCTGG AAAAAGTTTA TGTAACCGGT CAGACACTTC CCAACCTGGA AGTGAAGATA TATGATGAGG GAAGCTTGGT TGGAGTTACC AAAGCTAATT CGACCGGTGC TTGGAATTTC CAAGTTTCGC TGGGCAAAGA CAAATCTTAT TCAGTTCATA ACTTGTCAGC AAGGATTGAA TACAATGACC AGGTTATTGA ATCTGAGACA CTTGAAGTTA TTTATAAAAA GTCCGTTCCG GAGATTGAGA AAGTAACCAT GATCCACTCA AATCAGTCGC TGATATTGTA TAAAAATGGT GAAATAGTTC CTAAAGGAGT ATATACTTAT GTTCCTTCTC AATCCTATAC TTTTATTGTG GACTTTGTTG GTGATTCAGC AGAAGAAATT GAAAATTTGT GTATTGTAAG CAACCGCAAC GGCGAAGAAA GAAAACTGGA TGCAAAATAT GATCCACAAT CGAAGAGCTG GATTGCAACA GGAAGTTTTG GTTACAATCA TGTTCCGGGT TCGTTTACTG TAGAGTACAA TTTAAAAAAG AAAAGATTAA TTAAAGAGGA TGGTGAATTG AATTTAGAGG TTTTAGAGCA GATTTCAACT TTTGCAGATC AATTTCTTTA TGAATTAAAA AAAGAAAATG TCATGGTAGG ATTACTTGAT GAAAATGAAA TAGGATTTAT GCTGAAAGAA GATACAGGTG GCATTGCAAT TGTAACAATT GTAATAGGTG ATCAGCCAAT TGAACGAGAA AAATACGAAA AAATGGGGTA TTTATTTAAA GAAATAGATG GAAAACTAGG AGCAGTATTA TTAATAAAAG ACGAGACTGA AAATAAAATT TTTACATCAA AAGTAAGATT AGCCAGCAAT AAGACAGAAG ATTCCGTAAA GTCTGAAGAC GATGATGAAT CTTCACAAAA AGAGACGCCT TTAGTTACAA AAGATGAGCG TATTAGTATG GCTGCTACAA TAATTGGGGA ATTAGGTAAT ATTGCTGGTA GTGGAAAGGT TGTGGAAGCA ATAGATTTTA TTTCCGGAAT TGGTACAGCG GTGAATACTA TTGCATTTAC ACCGGCAAAC GCAGTTACAG ATATGGGTTA TGTTCAGAAA AAATTGCAAG AATCTGAGAA CATTTCAAGC GAAGATAAAG AAAGGCTCCG CGATGATTTG TTTTTAGCAT CTGGTGTGTA TAATTTTACT GTGAATGCAA ACTTTATTGC ATCACTTATG GATGGACATA CCAGAAATGG CACGATGCAT AAAATAATAG ACATAGTTAT TGATAAATAT GATCAGTACA GTCTTGGTGA TAAGTTATGG GAAGATATAT TTGGAAAAAA TGAAGATGAT AGAGAATCAG GCAAAAGCAC AAAATCTGGA AAAGGATCAT CTATTAAGCT ACCATTACAA GAACAAAAAA CTCTGGTAGA TCCAAGTGGA TATGTATACG AAACTGTACC TGACAACAGA ATAACCGATG CTACAGTCAC AGTATACTAT AAAGATGAAA ACGGAGAAAT GATATTATGG AATGCGGAAG AATTCGGCCA GAAGAATCCA TTGCTGACAG ACGAGAATGG TTTTTATGCC TGGGATGTTC CTGAAGGAAT GTGGCAGGTA AAGGTCGAAA AGGAAGGATA TGAAACAGCA TATAGTGAGA TTTTACCTGT GCCTCCTGTG CAGACCAATG TTAATATTCC TTTGGTTTCG TATGAACCAC CGAAGGTTGG ACACATTTAT GCATATCCGG AATATATAGA AATTGCATTT AGCAAATATG TAAAGCACGC AACGCTGGAT TCAGAGTCTA TTCAGCTGAA ACAGGGAGAA AATAAGGTAA ACATTAAAAT AGTATATGAA GATGAAGAGG GAAATTACTC TAAGAAAATT AAACTAATTC CTGCAGAAAA GACTTCTTTT GAAGGAAAAT ACACTCTTAA TATTTCTAAG AGCATAACAA GCTATGCTGG AGTTGCGATG CAAAAAGCTG AAATCCGCGA TATAGAGATT GTAGCTGAAC CAAAATCAAT CGAAATATTA GATAAAGTAG AAATTGAATT AAGAAAAACT GTTGCAATAG AGATACGTGT ACTGCCGGAA GAAGCAGCAA AAGGCAAAAA GATAATCGTA ACATCCGGAA TGGAAGAAAT AGTATCTGCA GAGGATGTTC TACTGGATGA AAGGGGACGC GGAAAACTCA AACTGAAAGG TAATCTTCCC GGAACGGTAG ATATCGATTT GCGTTTGGAA GGTAGCGCTG TTGAAAAGAA AATACAAGCG ACAGTTAAAT TTCCGGAAGA TAATGGAGAG ATTATTGAGC CGCCTGTTGT GCTTAATGGT GACTTAAACA GAAATGGAAT TGTTAACGAC GAAGATTATA TACTGCTGAA GAATTACTTG TTAAGGGGGA ATAAATTAGT AATAGATTTG AATGTGGCTG ATGTCAATAA AGACGGAAAA GTTAATTCTA CTGACTGTTT ATTCCTTAAG AAGTATATTT TGGGACTTAT AACTATATAG
|
Protein sequence | MGSIILKAFD ESTEINGDSL DLGEIQLTPY KGRLTLELLK KPAHTSQEEP GLEQENLYNY TISIYNKTKQ AKIKNFNIKP PYIFLNDTNI ETGDVIEVTF THKKDYTLPA TIELTLDRLM NASGQAVAKE NGKGMASIQS GNQRCELMVY NSDKKFIESC SMNREGKIIS NPLKEGVYTF VFISSNNYLW RLNTLEQFEE IGLKEGIHFV KKEVEIKNGE IKDLGLIVVP ELNEKELFFT DPDVASYAVV SENNRIGKDI VLRAKYSFRK PIESQTVKLL FRVPDQTFYV KDTLTIDGEL VDSVNVSGSF IEVTTTKTEG VVRFNIRPTE AVVCYSNAYV EFKAEGRTVR EPLGVVNEVV PYLTIGAPNV TSLEKVYVTG QTLPNLEVKI YDEGSLVGVT KANSTGAWNF QVSLGKDKSY SVHNLSARIE YNDQVIESET LEVIYKKSVP EIEKVTMIHS NQSLILYKNG EIVPKGVYTY VPSQSYTFIV DFVGDSAEEI ENLCIVSNRN GEERKLDAKY DPQSKSWIAT GSFGYNHVPG SFTVEYNLKK KRLIKEDGEL NLEVLEQIST FADQFLYELK KENVMVGLLD ENEIGFMLKE DTGGIAIVTI VIGDQPIERE KYEKMGYLFK EIDGKLGAVL LIKDETENKI FTSKVRLASN KTEDSVKSED DDESSQKETP LVTKDERISM AATIIGELGN IAGSGKVVEA IDFISGIGTA VNTIAFTPAN AVTDMGYVQK KLQESENISS EDKERLRDDL FLASGVYNFT VNANFIASLM DGHTRNGTMH KIIDIVIDKY DQYSLGDKLW EDIFGKNEDD RESGKSTKSG KGSSIKLPLQ EQKTLVDPSG YVYETVPDNR ITDATVTVYY KDENGEMILW NAEEFGQKNP LLTDENGFYA WDVPEGMWQV KVEKEGYETA YSEILPVPPV QTNVNIPLVS YEPPKVGHIY AYPEYIEIAF SKYVKHATLD SESIQLKQGE NKVNIKIVYE DEEGNYSKKI KLIPAEKTSF EGKYTLNISK SITSYAGVAM QKAEIRDIEI VAEPKSIEIL DKVEIELRKT VAIEIRVLPE EAAKGKKIIV TSGMEEIVSA EDVLLDERGR GKLKLKGNLP GTVDIDLRLE GSAVEKKIQA TVKFPEDNGE IIEPPVVLNG DLNRNGIVND EDYILLKNYL LRGNKLVIDL NVADVNKDGK VNSTDCLFLK KYILGLITI
|
| |