Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0433 |
Symbol | |
ID | 4808361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 543569 |
End bp | 545938 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105847 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036864 |
Protein GI | 125972954 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.371525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGAGA AAAAAATCTT TTCAAAAAGA ACAAAAGCAT TAATAGTTAG TTTTGTAATT CTTGCTTTGA TGGTTTTCCC TGTCGGTACT GTCAATATTA GTGCCGCTAA TGTGGAATAC AACTATGCAA AGGCGTTGCA GTATTCCATA TACTTTTACG ATGCAAATAT GTGCGGTACC GGGGTTGACG AGAACGGACA GTACAACTGG AGAGGCGACT GCCATGTTTA TGATGCGGAA CTTCCGCTCG ATTCTGTAAA CACCAATATG TCCGATGCAT TTATCAGAGA GAACATAAGT GTACTTGATC CTGACGGAGA CGGTAAAGTT GATGTTTCCG GCGGTTTTCA TGATGCCGGT GACCATGTGA AATTCGGTAT GCCTGAAGCA TATTCCGGTT CCACACTGGG ATGGGGATAT TATGAATTCA GGGAACAGTA TAAGCAGACA GGACAGGATC AGCATATCGA GACGATTTTG CGTTATTTTA ATGATTATTT CATGAGATGT ACTTTCAGAG ATAAAGACGG CAATGTAGTT GCGTTTTGTT ATCAAGTGGG TGACGGTGAC ATTGACCATG CTTACTGGAA TCCACCTGAA ATTGATGATA TGTTCCGTAG AGGATGGTTT GCCACTAAAC ATTTGCCTTC AACGGATTGC GTGTCCGCAG CTGCTGCTTC TCTTGCAGTA AACTATTTCA ACTTCAAAGA TACAGACCCT GAATATGCCG AAAAAAGCCT TGACTATGCA AAAGCATTGT TTGACTTTGC ACAAAAAAAT GACAAAGAAG TTAATGCAGA CGGTCCTAAA GGATATTATA CTTCTTCAAA ATGGCAGGAT GACTACTGCT GGGCTGCGGC ATGGCTTTAC CTTGCAACCC AGGATGACAA TTATTTGAAT GAACTGTTTA AGTATTATGA TTATTATGCA CCTTCCTGCT GGACTCACTG CTGGAATGAC GTTTGGGCAG GTACGGCATG CATTTTGGCT CAAATAGACG ATCTTTACGA CAAAGACAGC GAGGAGTTTG AAAACAGGTA CAGACAGGCT GCAAATAAGA GTCCTTATGA ACCGATAGAT TTTTGGGCCG AGGTTGCAAA ACTGGTAGAG AACTGGATGT ACGGTAAGAC TGTTACAATT ACTCCCGGTG GATATGCATT CCTTAACAAA TGGGGTTCGG CAAGATACAA CACAGCTACA CAGTTTGTAG CTCTTGTGTA TGACAAACAC CATGGTGATG CGCCTTCAGC ATACAGTCAA TGGGCAAGGT CGCAAATGGA GTACCTTATG GGAAACAATC CTCTTAATCG TTGCTACATT GTAGGATACA GCGATATTTC CGTAAAATTC CCGCACCATA GGGCGGCATC AGGTTTGTCA AAATGTGAAG ACCCTGATCC TCACAAATAT GTATTGTATG GTGCGCTGGT CGGCGGACCG GATGAGAATG ACCAACATAT AGATATGACA TCAGACTGGG TTTACAATGA AGTTACAATA GACTACAATG CTGCTTTTGT TGGTGCATGT GCCGGCCTTT ACAGATACTT TGGGGATCCT TCAATGGAGA TTACACCTAA TTTCCCGCCG AAGGTCGAGA TATCGGACCC TGACAACGGA GGTTCCTATT GGGTAGAAGC ATTTGGTGTG GACATAGTAC AAAGCGATGG ACCAAAAGCA ACTGAAGTCA CTTTGTATGT ACGTTCGGAT TCAAGGAAAC CGTCAAAGAA TATTTCCGTC AGGTACTTCT TCGATGCCAC GGGGATGTCA TCGGTTGACC CTGACAAGAT GGAGATAAGA CAGCTTTATG ACCAGACAGC GGCAGAGACG GATTATGCGG CCAAACTTAC AGGTCCTCAC CATTATAAAG ATAATATTTA CTATGTGGAA ATATCGTGGG AAGGATTTGC AATCGCTAAT TCCAATAAAA AATACCAGTT TGCGTTGGGT ACATACACAT GGGGCAACAG TTGGGATCCG ACCGACGACT GGAGTTATCA GGAATTGAAG ATAGAAGAGA GTAATTATAC AGGAACTCCT GCGAGAAACA ACAGAATATG TGTTTATGAT GCCGGTGTTC TTGTGGGAGG AATTGAGCCG GACGGAACAA CACCTCAATC ACCTACTCCG TCGCCGACTC CCACACCTCC ACAGGAACCG GAATTCACAT ATGGAGATTT AAACGGGGAC GGCAGGGTTA ATTCATCAGA CTTGGCTTTG ATGAAGAGGT ATGTGGTTAA ACAAATTGAG AAATTAAATG TTCCGGTAAA AGCAGCTGAC CTTAATGGAG ACGATAAAGT AAATTCAACC GATTATTCAG TCTTAAAGAG ATATTTGCTC CGTTCAATCG AGGTTATTCC GATAAAATAA
|
Protein sequence | MREKKIFSKR TKALIVSFVI LALMVFPVGT VNISAANVEY NYAKALQYSI YFYDANMCGT GVDENGQYNW RGDCHVYDAE LPLDSVNTNM SDAFIRENIS VLDPDGDGKV DVSGGFHDAG DHVKFGMPEA YSGSTLGWGY YEFREQYKQT GQDQHIETIL RYFNDYFMRC TFRDKDGNVV AFCYQVGDGD IDHAYWNPPE IDDMFRRGWF ATKHLPSTDC VSAAAASLAV NYFNFKDTDP EYAEKSLDYA KALFDFAQKN DKEVNADGPK GYYTSSKWQD DYCWAAAWLY LATQDDNYLN ELFKYYDYYA PSCWTHCWND VWAGTACILA QIDDLYDKDS EEFENRYRQA ANKSPYEPID FWAEVAKLVE NWMYGKTVTI TPGGYAFLNK WGSARYNTAT QFVALVYDKH HGDAPSAYSQ WARSQMEYLM GNNPLNRCYI VGYSDISVKF PHHRAASGLS KCEDPDPHKY VLYGALVGGP DENDQHIDMT SDWVYNEVTI DYNAAFVGAC AGLYRYFGDP SMEITPNFPP KVEISDPDNG GSYWVEAFGV DIVQSDGPKA TEVTLYVRSD SRKPSKNISV RYFFDATGMS SVDPDKMEIR QLYDQTAAET DYAAKLTGPH HYKDNIYYVE ISWEGFAIAN SNKKYQFALG TYTWGNSWDP TDDWSYQELK IEESNYTGTP ARNNRICVYD AGVLVGGIEP DGTTPQSPTP SPTPTPPQEP EFTYGDLNGD GRVNSSDLAL MKRYVVKQIE KLNVPVKAAD LNGDDKVNST DYSVLKRYLL RSIEVIPIK
|
| |