Gene Information |
Locus tag | Cthe_0543 |
Symbol | |
ID | 4808292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 663788 |
End bp | 666007 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640105957 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036972 |
Protein GI | 125973062 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
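The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but both are consistent with the reported values. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(663788, 666007)
print(length_bp)                  # 2220, matching "Gene Length"
print(protein_length(length_bp))  # 739, matching "Protein Length"
```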
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGAAAA TTTTGGCGTT TTTGCTGACA GTTGCGCTGG TGGCAGTAGT GGCCATTCCA CAAGCCGTGG TAAGTTTTGC TGCGGATTTC AACTATGGTG AGGCACTTCA GAAAGCAATA ATGTTTTATG AGTTCCAGCG CTCGGGAAAA CTGCCCGAAA ACAAAAGAAA CAACTGGCGT GGAGATTCCG CTCTTAATGA CGGCGCAGAC AACGGTTTGG ACCTTACAGG CGGTTGGTAT GATGCCGGTG ACCATGTAAA GTTCAACCTT CCGATGGCCT ATGCCGTTAC CATGCTCGCA TGGAGTGTTT ATGAATCCCG GGATGCGTAT GTACAAAGCG GACAGCTTCC TTACATACTG GACAATATTA AATGGGCTAC CGACTACTTT ATAAAATGCC ATCCAAGTCC AAATGTATAT TATTATCAGG TGGGAGACGG AGCATTGGAC CATTCATGGT GGGGACCTGC TGAAGTAATG CAGATGCCAA GACCGTCCTT CAAAGTGGAT TTGACCAATC CGGGTTCGAC TGTGGTTGCT GAGACGGCAG CGGCTATGGC TGCATCCTCA ATTGTTTTCA AGCCTACAGA CCCGGAATAT GCTGCCACAC TTTTAAGGCA TGCGAAAGAA CTCTTTACTT TTGCCGACAC CACAAGAAGT GACGCAGGAT ATAGAGCGGC AGAGGGATAC TATTCATCCC ACAGCGGTTT TTATGATGAA CTTACCTGGG CGAGTATATG GCTGTATCTT GCAACAGGAG ACCAGTCTTA TCTTGATAAA GCAGAATCCT ATGAACCTCA TTGGGAAAGG GAAAGAGGTA CAACTTTAAT TAGTTATTCC TGGGCTCATT GCTGGGATAA CAAATTGTAC GGTTCTTTGC TTTTGTTAGC AAAAATTACC GGCAAGTCTT ATTACAAGCA ATGTATTGAA AACCATCTTG ACTATTGGAC CGTCGGATTT AACGGAAGCA GAGTTCAATA TACTCCAAAA GGACTCGCAT ATCTTGACAG ATGGGGTTCA TTGAGATATG CAACCACACA GGCGTTCCTT GCCAGCGTTT ATGCGGACTG GTCCGGCTGT GACCCGGCTA AGGCGGCTGT CTACAAGGAA TTTGCAAAAA AACAGGTGGA TTATGCATTA GGAAGCACAG GAAGAAGCTT TGTAGTAGGT TTTGGAAAAA ATCCGCCAAG AAATCCTCAC CACAGGACGG CCCACAGCTC ATGGAGCGCT TTAATGACCG AACCTGCGGA GTGCAGACAT ATTCTGGTGG GTGCATTGGT TGGCGGACCG GACGGTTCGG ATTCATATGT TGACAGGCTC GATGATTATC AGTGCAATGA GGTGGCCAAC GACTATAATG CTGGATTTGT AGGTGCTCTT GCCAAGATGT ATGAGAAGTA TGGCGGAGAA CCGATTCCGA ATTTCGTTGC TTTTGAAACA CCGGGGGAAG AATTTTATGT TGAAGCTGCG GTAAATGCTG CAGGACCCGG TTTTGTAAAT ATCAAAGCTT CAATAATCAA CAAGTCCGGT TGGCCGGCAA GAGGTTCAGA TAAATTGTCA GCCAAGTATT TTGTCGATAT TTCCGAAGCT GTTGCAAAAG GCATTACTTT GGATCAAATT ACCGTTCAGT CGACTACTAA TGGCGGAGCC AAGGTTTCAC AGCTTCTTCC GTGGGATCCG GACAATCATA TTTATTATGT AAACATTGAC TTTACGGGAA TAAACATATT CCCCGGAGGA ATAAATGAAT ACAAGAGGGA TGTATATTTC ACTATTACGG CGCCGTATGG AGAGGGTAAC 
TGGGACAATA CCAACGACTT CTCCTTCCAG GGACTTGAGC AGGGCTTTAC AAGCAAAAAG ACTGAATATA TACCGTTGTA TGACGGTAAT GTGAGAGTAT GGGGTAAAGT ACCGGACGGA GGTTCGGAGC CCGATCCGAC GCCGACAATC ACCGTTGGCC CCACTCCTTC GGTTACACCG ACATCAGTAC CTGGAATAAT GCTCGGAGAT GTGAATTTTG ACGGAAGAAT AAACTCGACG GATTATTCAC GCTTAAAAAG ATATGTAATA AAGTCTTTGG AATTCACAGA TCCTGAAGAG CACCAGAAGT TCATTGCAGC TGCGGATGTT GACGGGAACG GAAGAATAAA CTCCACAGAT TTGTATGTGC TCAACAGGTA CATATTAAAA CTTATTGAAA AATTCCCGGC TGAACAGTAA
|
Protein sequence | MKKILAFLLT VALVAVVAIP QAVVSFAADF NYGEALQKAI MFYEFQRSGK LPENKRNNWR GDSALNDGAD NGLDLTGGWY DAGDHVKFNL PMAYAVTMLA WSVYESRDAY VQSGQLPYIL DNIKWATDYF IKCHPSPNVY YYQVGDGALD HSWWGPAEVM QMPRPSFKVD LTNPGSTVVA ETAAAMAASS IVFKPTDPEY AATLLRHAKE LFTFADTTRS DAGYRAAEGY YSSHSGFYDE LTWASIWLYL ATGDQSYLDK AESYEPHWER ERGTTLISYS WAHCWDNKLY GSLLLLAKIT GKSYYKQCIE NHLDYWTVGF NGSRVQYTPK GLAYLDRWGS LRYATTQAFL ASVYADWSGC DPAKAAVYKE FAKKQVDYAL GSTGRSFVVG FGKNPPRNPH HRTAHSSWSA LMTEPAECRH ILVGALVGGP DGSDSYVDRL DDYQCNEVAN DYNAGFVGAL AKMYEKYGGE PIPNFVAFET PGEEFYVEAA VNAAGPGFVN IKASIINKSG WPARGSDKLS AKYFVDISEA VAKGITLDQI TVQSTTNGGA KVSQLLPWDP DNHIYYVNID FTGINIFPGG INEYKRDVYF TITAPYGEGN WDNTNDFSFQ GLEQGFTSKK TEYIPLYDGN VRVWGKVPDG GSEPDPTPTI TVGPTPSVTP TSVPGIMLGD VNFDGRINST DYSRLKRYVI KSLEFTDPEE HQKFIAAADV DGNGRINSTD LYVLNRYILK LIEKFPAEQ
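The gene sequence begins with TTG, yet the protein sequence begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where TTG is a permitted start codon and an initiating codon is translated as methionine. The sketch below illustrates this on the first and last 30 bases of the gene sequence. It is a minimal illustration, not the database's own pipeline: the function name `translate_cds` is hypothetical, and the code assumes the input is an in-frame CDS containing only A, C, G and T.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            residues.append("M")        # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


first_30 = "TTGAAGAAAA TTTTGGCGTT TTTGCTGACA"
last_30 = "CTTATTGAAA AATTCCCGGC TGAACAGTAA"
print(translate_cds(first_30))            # MKKILAFLLT
print(translate_cds("TTG" + last_30)[1:])  # LIEKFPAEQ
```

The two outputs match the first ten and last nine residues of the protein sequence listed above. In the second call, the leading TTG is prepended only so that the fragment is treated as starting at an initiation codon, and the resulting M is sliced off.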