Gene Cthe_2600 details


Gene Information

Locus tag: Cthe_2600
Symbol:
ID: 4809022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3070549
End bp: 3071679
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 44%
IMG OID: 640108014
Product: glycosyl transferase family protein
Protein accession: YP_001038993
Protein GI: 125975083
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase
TIGRFAM ID:
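
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the stop codon counts toward the gene length but not the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3070549
end_bp = 3071679

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1131
print(protein_length)  # 376
```

Both values match the reported Gene Length (1131 bp) and Protein Length (376 aa).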


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTTTATG AATATATAAT ATTCTTTGCA CTGGCATTTA TCGTGTCGTT TTCTTCTACA 
CCGATTGCCA AAAAAATAGC TTTTAGTGTA GGAGCTGTTG ACGTGCCGAA TGATGCAAGG
AGAATACATA AAAAGCCTGT TGCCAGACTT GGCGGTTTGG CCATATTTAC CGGCTTTTTG
GTTTCTTTGC TGTTCGGAAT TTTAAGTACA TATCTCAATA TGAAGGGAAT TGTTCCCTCA
AGACAGCTTT TCGGAATGCT TATCGGCGCT TTTATTATAG TTGCCGTTGG TATTGTTGAT
GATATAAAGC AGCTTGGACC AAGACTCAAG TTAGTTTTTC AAATAATGGC TGCCCTTTCT
GTCATATTTA TATCAGACAT TAGAATTGTG AATATTACCA ATCCTTTTGC AGAGGGCGGC
ACAACCCAGT TTAATGATTT TGTGTCCTAT CCCCTTACTG TTTTGTGGAT AGTCGGTGTA
ACCAATGCAA TCAATTTGAT TGACGGCCTT GATGGTCTTG CGGCGGGAAT ATCGTCCATA
TCATACCTTT CTTTGTTCTT TGTTTCACTT ATTACGGGTG ATACGGCAAG CGCAATGCTT
ACCGTGGTGC TGGCAGGTGC CACATTGGGA TTTTTGCCCT ATAATTTCAA TCCTGCAAAG
ATATTTATGG GAGACACAGG GGCCACATTC CTGGGCTTTA CCCTGGCGGT TATTTCCGTG
CAAGGTATGC TCAAATCCTA TGCGGCTCTT TCTATTGCCG TGCCATTGCT TGTGCTGGGA
CTTCCGCTGT TTGACACGAT ATCCACCATA TTCCGCAGGG CTTTGAACAG AAAGCCTATT
ATGCAGCCCG ACAGGGGACA TCTGCATCAC AAGCTTATAG ACATGGGATA CAGTCAGAAG
CAGACGGTGC TTGTTATGTA TTCGCTTAGC GGTGCATTGG GACTGAGTGC CATAGTCCTG
GCGGACAAAG GCTTTGTCAG CGCCATTATA CTTGTAATAA CCGTTGCGGC GTTTGTAATC
GGCGGCGCGA GATACATGCT GGAGATGGAT GACGTGGAAA TACCTGAAAA GTATAAGCCT
AAAGAAGAAA TGGTAAGTCC GGTTGTAAAT GAAATAAAAG ACCAACAGTA G
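
The record reports a GC content of 44% but does not say how it was computed. A common definition is the percentage of G and C bases in the sequence, sketched below in Python. The function name `gc_percent` is hypothetical, and the example runs only on the first 10-base block of the gene sequence, not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 3 G + 1 C out of 10 bases.
print(gc_percent("GTGCTTTATG"))  # 40.0
```

Passing the full gene sequence (with or without the spaces between blocks) to the same function gives a figure that can be compared against the reported 44%, assuming the database uses this definition.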
 
Protein sequence
MLYEYIIFFA LAFIVSFSST PIAKKIAFSV GAVDVPNDAR RIHKKPVARL GGLAIFTGFL 
VSLLFGILST YLNMKGIVPS RQLFGMLIGA FIIVAVGIVD DIKQLGPRLK LVFQIMAALS
VIFISDIRIV NITNPFAEGG TTQFNDFVSY PLTVLWIVGV TNAINLIDGL DGLAAGISSI
SYLSLFFVSL ITGDTASAML TVVLAGATLG FLPYNFNPAK IFMGDTGATF LGFTLAVISV
QGMLKSYAAL SIAVPLLVLG LPLFDTISTI FRRALNRKPI MQPDRGHLHH KLIDMGYSQK
QTVLVMYSLS GALGLSAIVL ADKGFVSAII LVITVAAFVI GGARYMLEMD DVEIPEKYKP
KEEMVSPVVN EIKDQQ
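
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11, GTG is one of the permitted alternative start codons, and an initiator codon is read as methionine. The Python sketch below illustrates this. The function name `translate`, its `is_cds_start` parameter, and the reduced start-codon set are assumptions made for illustration. The codon-to-amino-acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Standard codon assignments in TCAG order (table 11 uses the same ones).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common table 11 start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGCTTTATG AATATATAAT ATTCTTTGCA"))  # MLYEYIIFFA
```

The output matches the first ten residues of the protein sequence above. Translating the last line of the gene sequence with `is_cds_start=False` gives KEEMVSPVVNEIKDQQ, ending at the TAG stop codon.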