Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2600 |
Symbol | |
ID | 4809022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3070549 |
End bp | 3071679 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108014 |
Product | glycosyl transferase family protein |
Protein accession | YP_001038993 |
Protein GI | 125975083 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTTATG AATATATAAT ATTCTTTGCA CTGGCATTTA TCGTGTCGTT TTCTTCTACA CCGATTGCCA AAAAAATAGC TTTTAGTGTA GGAGCTGTTG ACGTGCCGAA TGATGCAAGG AGAATACATA AAAAGCCTGT TGCCAGACTT GGCGGTTTGG CCATATTTAC CGGCTTTTTG GTTTCTTTGC TGTTCGGAAT TTTAAGTACA TATCTCAATA TGAAGGGAAT TGTTCCCTCA AGACAGCTTT TCGGAATGCT TATCGGCGCT TTTATTATAG TTGCCGTTGG TATTGTTGAT GATATAAAGC AGCTTGGACC AAGACTCAAG TTAGTTTTTC AAATAATGGC TGCCCTTTCT GTCATATTTA TATCAGACAT TAGAATTGTG AATATTACCA ATCCTTTTGC AGAGGGCGGC ACAACCCAGT TTAATGATTT TGTGTCCTAT CCCCTTACTG TTTTGTGGAT AGTCGGTGTA ACCAATGCAA TCAATTTGAT TGACGGCCTT GATGGTCTTG CGGCGGGAAT ATCGTCCATA TCATACCTTT CTTTGTTCTT TGTTTCACTT ATTACGGGTG ATACGGCAAG CGCAATGCTT ACCGTGGTGC TGGCAGGTGC CACATTGGGA TTTTTGCCCT ATAATTTCAA TCCTGCAAAG ATATTTATGG GAGACACAGG GGCCACATTC CTGGGCTTTA CCCTGGCGGT TATTTCCGTG CAAGGTATGC TCAAATCCTA TGCGGCTCTT TCTATTGCCG TGCCATTGCT TGTGCTGGGA CTTCCGCTGT TTGACACGAT ATCCACCATA TTCCGCAGGG CTTTGAACAG AAAGCCTATT ATGCAGCCCG ACAGGGGACA TCTGCATCAC AAGCTTATAG ACATGGGATA CAGTCAGAAG CAGACGGTGC TTGTTATGTA TTCGCTTAGC GGTGCATTGG GACTGAGTGC CATAGTCCTG GCGGACAAAG GCTTTGTCAG CGCCATTATA CTTGTAATAA CCGTTGCGGC GTTTGTAATC GGCGGCGCGA GATACATGCT GGAGATGGAT GACGTGGAAA TACCTGAAAA GTATAAGCCT AAAGAAGAAA TGGTAAGTCC GGTTGTAAAT GAAATAAAAG ACCAACAGTA G
|
Protein sequence | MLYEYIIFFA LAFIVSFSST PIAKKIAFSV GAVDVPNDAR RIHKKPVARL GGLAIFTGFL VSLLFGILST YLNMKGIVPS RQLFGMLIGA FIIVAVGIVD DIKQLGPRLK LVFQIMAALS VIFISDIRIV NITNPFAEGG TTQFNDFVSY PLTVLWIVGV TNAINLIDGL DGLAAGISSI SYLSLFFVSL ITGDTASAML TVVLAGATLG FLPYNFNPAK IFMGDTGATF LGFTLAVISV QGMLKSYAAL SIAVPLLVLG LPLFDTISTI FRRALNRKPI MQPDRGHLHH KLIDMGYSQK QTVLVMYSLS GALGLSAIVL ADKGFVSAII LVITVAAFVI GGARYMLEMD DVEIPEKYKP KEEMVSPVVN EIKDQQ
|
| |