Gene Information |
Locus tag | Cthe_0314 |
Symbol | |
ID | 4808532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 396564 |
End bp | 397742 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105725 |
Product | glycosyltransferase 28-like protein |
Protein accession | YP_001036745 |
Protein GI | 125972835 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
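The coordinate and length fields in this record are linked by simple arithmetic, which makes for a quick consistency check. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length of an unspliced CDS counts the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(396564, 397742)
print(length)                  # 1179, matching the Gene Length field
print(protein_length(length))  # 392, matching the Protein Length field
```

Under these assumptions both values agree with the Gene Length and Protein Length fields reported for Cthe_0314.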
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCTCGAA GCCTCCCGAT TGCCCTGGCT TTGGCTGAGG CCGGATATGA AATAAAATAT TTGGGTTATG ACATGGCCAA AGAATATATG AAAAAAGCCG GGATAGAAGA ACTGTGCCCT GAGTTCAGCA TAAGCGATAT TAAAAAGGGA AGTCCCAACC CGTACTGGAA CACGGCAGAA GAATTCTGGT CGATGATTGG TTATGGCAAC ATGCCGTGGG TTGAAAGAAA AGTCGATGAA TTAATAAATT TGTTAAAACA ATTTTTCCCC GATTACATAC TGTCCGACCT GGGCATTTTA GCATGTCTTG CTGCAAGAAT AACGGGAATT CCATTAATTG CGATAAACCA GAGCTGTTAT CATCCAAATG TAAAATTAAA ATGGTGGGAA AACAATTATG AAGCCGAAAA CTATAAAGAT AAAGACAGTC TTTTAAATAA ACTGAATGCA TTTCTAAAGA AAAAAGGCGC ACAGCAATTA AATACTTTTA CGGAAATATT TACAGGAAGG CTTACAATCA TTCCCGGTTT CTATGATTTT GATCCGATAC CGAATCTTGA AAAATATAAT ACCCATTATG TAGGGCCTGT TCTGTATACT CCAAAGGAAA ATGTTTCCGA AAAGCTTTTA AAACTTTTTG ACGCCGATCA ACCGATAATC TTTTGCTATA CGGCAAGGTT CTATGATAAT GTGGGAGAAA GCGGGAAAGC AATTTTCGAT AATATGATTA AAATTGCCGA TAAAATAGAT GCCTCCATTA TTATTTCGAC AGGGAATAAA AAGGATGAAT TGCTTGCCTT GGATATTGCG TCAAAGGAAT TGAAAAGCGG CAAAGTCAGT ATCGTTGATT ACGTGCCTTT GGATATGGCT TATGAAAAGT CTGACCTGGT GATTCACCAC GGCGGTCATG GAAGCTGTCT TGCACAATTT TACTATGGTG TTCCTTCTGT CATAATACCT ACTCATACTG AACGGGAATA TAACGCAAGA ATGTGTGAAA AACTGCATGT TGGCAAAATG CTCCCCAGAA GAGAATTAAA CAGTGCAAAT TTGAAGAATT GCATCAATGA TGTGCTAAAT GACATTACTT ATAAGAAAAG TGTCCGGGAT TGGAAAGAGA AAGTGTCCGG TGATTTTAAC AACCTTGATA AAGTGGTAAA ACTCGTTGAT TCGCTGTAA
Protein sequence | MSRSLPIALA LAEAGYEIKY LGYDMAKEYM KKAGIEELCP EFSISDIKKG SPNPYWNTAE EFWSMIGYGN MPWVERKVDE LINLLKQFFP DYILSDLGIL ACLAARITGI PLIAINQSCY HPNVKLKWWE NNYEAENYKD KDSLLNKLNA FLKKKGAQQL NTFTEIFTGR LTIIPGFYDF DPIPNLEKYN THYVGPVLYT PKENVSEKLL KLFDADQPII FCYTARFYDN VGESGKAIFD NMIKIADKID ASIIISTGNK KDELLALDIA SKELKSGKVS IVDYVPLDMA YEKSDLVIHH GGHGSCLAQF YYGVPSVIIP THTEREYNAR MCEKLHVGKM LPRRELNSAN LKNCINDVLN DITYKKSVRD WKEKVSGDFN NLDKVVKLVD SL
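The gene and protein sequences can be related to each other, and to the GC content field, with a short script. This is a minimal sketch rather than the database's own method. It assumes the space-separated 10-character blocks are purely cosmetic, and it relies on translation table 11 sharing the standard code's amino acid assignments (the tables differ in which start codons are permitted). All function names are hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G, with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
print(translate("ATGTCTCGAA GCCTCCCGAT TGCCCTGGCT"))  # MSRSLPIALA
```

Passing the full Gene sequence value to `translate` and comparing the result with the Protein sequence value (after removing the spaces) checks that the two fields agree, and `gc_content` on the same input can be compared with the GC content field.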