Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1303 |
Symbol | |
ID | 4809555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1581724 |
End bp | 1582950 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106726 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001037728 |
Protein GI | 125973818 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATTT TGATGCTTTC ATGGGAATAT CCTCCCAGAA TTGTCGGAGG GATATCGAGA GTCGTTCACG GCCTGGCGCA GAAGCTTGGA GCGCGGGGCT GTGACGTTCA TGTTATAACA TGTTGGGAAA TGGGAACTCG GGAGTTTGAA AGGGATAAAT ATGTAAAGGT ACACAGACTT CATTCCTATG ACGTAACTCC AAATAATTTT GTTGACTGGG TTCTTCATTT GAATTTTGCC ATTGTTGAGC ATGCCACCCG GCTTATAAAT GAAACCGGAA AGTTTGACAT TATACATGCC CATGACTGGC TGGTTGCTTT TGCGGCGAGA GTTTTGAAGC ATGCCTATTC AACACCTCTT GTTGCAACTA TACATGCCAC GGAACATGGC AGGAACTGGG GTATACATAA TGACACCCAG CGCTATATAA ACAATGTGGA ATGGTGGCTT GCGTTCGAGG CCTGGAGGCT GATTGTCAAC AGCGAATATA TGAAGAATGA AGTTATGTCC ATATTCAAGA TTCCCAATGA CAAAATAGAC GTGATTCCCA ATGGGGTTGA TTTGGATAAA TTCAAAGGCT ACGAGAAGGA TATGGAATTT AGAAGACGGT TTGCGCAGGA CAACGAGAAA ATAGTGTTCT TTGTTGGAAG ACTGGTAAAC GAGAAAGGTG TACATGTACT TATAGATGCG CTCCCGAAGG TGTGCCATTA TTACAATGAT GTCAAGTTTG TGATTGCAGG GAAAGGTCCG CAGTTTGACC ATCTGAAGTG GAAGGCCGAG AGCATGGGAA TGGCGCACAA GGTCTACTTC ACCGGATACA TAAGTGACGA GGAACTTTTA AAGCTTTATA AATGTGTTGA TGTTGCAGTT TTTCCAAGTC TTTACGAACC TTTTGGAATT GTTGCTTTGG AAGGGATGGT TGCAAATGTT CCGGTTGTCG TTTCCGACAC CGGAGGCCTT GGAGAGATTG TGGAACACGG CGTCGACGGC ATGAAGTCTT ACACGGGAAA TCCCAATTCC CTTGCAGACA GTATATTGGA AATACTTCAC AATCCCGATA AAGCGGAGAG AATGAAGAAA AAAGCGTTGG AGAAAGTTCG TTCAATTTAT AATTGGGATG TGGTTGCGGA AAAAACGCTA AATGTGTATA AAACCATTTT GGAAGAAAAC AAGCATATTT ATTGGGGTTC CCCGATTATG AAGGAGGAAA CGGAAAGGCT CAACTGA
|
Protein sequence | MRILMLSWEY PPRIVGGISR VVHGLAQKLG ARGCDVHVIT CWEMGTREFE RDKYVKVHRL HSYDVTPNNF VDWVLHLNFA IVEHATRLIN ETGKFDIIHA HDWLVAFAAR VLKHAYSTPL VATIHATEHG RNWGIHNDTQ RYINNVEWWL AFEAWRLIVN SEYMKNEVMS IFKIPNDKID VIPNGVDLDK FKGYEKDMEF RRRFAQDNEK IVFFVGRLVN EKGVHVLIDA LPKVCHYYND VKFVIAGKGP QFDHLKWKAE SMGMAHKVYF TGYISDEELL KLYKCVDVAV FPSLYEPFGI VALEGMVANV PVVVSDTGGL GEIVEHGVDG MKSYTGNPNS LADSILEILH NPDKAERMKK KALEKVRSIY NWDVVAEKTL NVYKTILEEN KHIYWGSPIM KEETERLN
|
| |