Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2807 |
Symbol | |
ID | 4809644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3308398 |
End bp | 3309429 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640108227 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001039199 |
Protein GI | 125975289 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000151011 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAGTT TTAAAGCAGG TATAAATTTA GGCGGATGGA TATCACAATA TCAAGTTTTC AGCAAAGAGC ATTTCGATAC ATTCATTACG GAGAAGGACA TTGAAACTAT TGCAGAAGCA GGGTTTGACC ATGTCAGACT GCCTTTTGAT TATCCAATTA TCGAGTCTGA TGACAATGTG GGAGAATATA AAGAAGATGG GCTTTCTTAT ATTGACCGGT GCCTTGAGTG GTGTAAAAAA TACAATTTGG GGCTTGTGTT GGATATGCAT CACGCTCCCG GGTACCGCTT TCAAGATTTT AAGACAAGCA CCTTGTTTGA AGATCCGAAC CAGCAAAAGA GATTTGTTGA CATATGGAGA TTTTTAGCCA AGCGTTACAT AAATGAACGG GAACATATTG CCTTTGAACT GTTAAATGAA GTTGTTGAGC CTGACAGTAC CCGCTGGAAC AAGTTGATGC TTGAGTGTGT AAAAGCAATC AGGGAAATTG ATTCCACCAG GTGGCTTTAC ATTGGGGGCA ATAACTATAA CAGTCCTGAT GAGCTTAAAA ACCTTGCAGA TATTGATGAT GATTACATAG TTTACAATTT CCATTTTTAC AATCCTTTTT TCTTTACGCA TCAGAAAGCC CACTGGTCGG AAAGTGCCAT GGCGTACAAC AGGACTGTAA AATATCCGGG ACAATATGAG GGAATTGAAG AGTTTGTGAA AAATAATCCT AAGTACAGTT TTATGATGGA ATTGAATAAC CTGAAGCTGA ATAAAGAGCT TTTGCGCAAA GATTTAAAAC CAGCAATTGA GTTCAGGGAA AAGAAAAAAT GCAAACTATA TTGCGGGGAG TTTGGCGTAA TTGCCATTGC TGACCTGGAG TCCAGGATAA AATGGCATGA AGATTATATA AGTCTTCTAG AGGAGTATGA TATCGGCGGC GCGGTGTGGA ACTACAAAAA AATGGATTTT GAAATTTATA ATGAGGATAG AAAACCTGTC TCGCAAGAAT TGGTAAATAT ACTGGCGAGA AGAAAAACTT GA
|
Protein sequence | MVSFKAGINL GGWISQYQVF SKEHFDTFIT EKDIETIAEA GFDHVRLPFD YPIIESDDNV GEYKEDGLSY IDRCLEWCKK YNLGLVLDMH HAPGYRFQDF KTSTLFEDPN QQKRFVDIWR FLAKRYINER EHIAFELLNE VVEPDSTRWN KLMLECVKAI REIDSTRWLY IGGNNYNSPD ELKNLADIDD DYIVYNFHFY NPFFFTHQKA HWSESAMAYN RTVKYPGQYE GIEEFVKNNP KYSFMMELNN LKLNKELLRK DLKPAIEFRE KKKCKLYCGE FGVIAIADLE SRIKWHEDYI SLLEEYDIGG AVWNYKKMDF EIYNEDRKPV SQELVNILAR RKT
|
| |