Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2222 |
Symbol | |
ID | 4811087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2651422 |
End bp | 2652486 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107628 |
Product | glycosyltransferase 28-like protein |
Protein accession | YP_001038617 |
Protein GI | 125974707 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3980] Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase |
TIGRFAM ID | [TIGR03590] pseudaminic acid biosynthesis-associated protein PseG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAATA TAGGAATCAG AGTTGACGGA AGTGCCAATA TCGGCATGGG ACATATAATG CGCTGCCTGT CGCTGGCAAA AGGATTTAGA AATGCCGGCG CCAATGTATA TTTCTTAAGC CGGTTTGAAC AGGGAATTTC AAGGATAAGG CAGGACAACT TTGAAGTTTT GGAAATGCCG TACCGAAAAA GCAGGAATTC GGGAGGCTTT TTCTATGGAG ATGCTTCGGA GCTGGAGGAA GACGCGGAAG AAATAATCTG CCGAATTAGA GCATTTAATC TGGATGTGCT GATTATTGAC TCCTATAACG TCAGCCGGGA GTTTTTTTTG AAGCTGAAGC CGCATGTAAG AAAGCTTTGC TACATTGATG ATCTTAATAA ATTTGTATAT CCTGTGGATG TGCTGATAAA CGGAAACATT ACAGCCCCAG CATTAAATTA TGCCAAATAC AGCGATGACG AGCTTATGCT TTTGGGCTTG AAATATAATC TCATAAGGGA TGAATTTAAA AATTTGCCCG AGAGAATAAT AAACAGGGAT GTGCGGGAAA TAATGATAAC AACAGGAGGC TCAGACCCTT TTAACCTGAC TCTGAGGCTT GCAAATGCCA TCCTGCCGGA AGAAGAATTT AAAGATGTGA GAATCAATAT TGTTGTGGGC AGCGGTTTTA CCAATGCGGA CAAGTTTAGA GAGCTGTCCG AAAGAAACCC GAATGTTGTA TTGCATGAAA ATGTTTTGCG AATGTCGGAA GTAATGCTAA AATCCGATGT TGCAATATCT GCAGGGGGAA GCACATTGTA TGAGCTTTGC GCCTGCGGGA CACCTGCCCT GGCTGTTGTT ATTGCTGATA ACCAAAGGGA AATGGTGGAT ATGTTGTCTT CCGAAGGTTA CATAATCAGC CTGGGCTGGC ATGAAGAGCT TGATGACAGG GAGCTTTTGC GAAAGGTTAA GTCTTTGTGC GGGGATTATG AAAAAAGAGT GCTTTTCAGC AGAAAGATGC AAAAGCTGGT GGACGGAGAA GGGGTAAAAC GTGTGGTTGA GGAAATAATG AAAATAACTT CGTGA
|
Protein sequence | MLNIGIRVDG SANIGMGHIM RCLSLAKGFR NAGANVYFLS RFEQGISRIR QDNFEVLEMP YRKSRNSGGF FYGDASELEE DAEEIICRIR AFNLDVLIID SYNVSREFFL KLKPHVRKLC YIDDLNKFVY PVDVLINGNI TAPALNYAKY SDDELMLLGL KYNLIRDEFK NLPERIINRD VREIMITTGG SDPFNLTLRL ANAILPEEEF KDVRINIVVG SGFTNADKFR ELSERNPNVV LHENVLRMSE VMLKSDVAIS AGGSTLYELC ACGTPALAVV IADNQREMVD MLSSEGYIIS LGWHEELDDR ELLRKVKSLC GDYEKRVLFS RKMQKLVDGE GVKRVVEEIM KITS
|
| |