Gene Cthe_2340 details

Gene Information

Locus tag: Cthe_2340
Symbol:
ID: 4809268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2789803
End bp: 2791122
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 35%
IMG OID: 640107747
Product: UDP-glucose/GDP-mannose dehydrogenase
Protein accession: YP_001038735
Protein GI: 125974825
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
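
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2789803
end_bp = 2791122

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1320
print(protein_length)  # 439
```

Both results agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields in the record.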


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATTGT ATGAAAAAAT TGTTAATAAG GAAGAGAAAA TTTCTGTGAT TGGATTAGGC 
TACGTGGGAA TGCCATTAGC CATTGCATTT GCTAAATATG CACAGGTAAT TGGATTTGAT
GTGGACAAAA AAAAGATTGA AATATATAAA TCAGGAATCG ATCCAACAAA AGAGATAGGG
AATGAAGCTA TAAAAAATAC AACTGTTGAA TTTACGTCCG ATGAAACAAA GTTAAAAGAA
GCAAAATTTC ATATAGTGGC AGTTCCCACC CCGATTAATA TAGATAAAAC TCCGGATTTG
ACACCGGTTG AAAGTGCAAG TGTTATTGTG GGAAGAAATA TGGCTAGAGG TTCATATGTG
GTCTATGAAT CAACAGTTTA TCCTGGAGTA ACTGAGGATA TTTGTATTCC AATACTTGAA
AGAGAATCGG GACTTAAATG CGGAGTTGAT TTTAAAGTTG GATATTCTCC GGAACGTATT
AATCCAGGAG ACAAGAAGCA TACGCTGGAA ACAATAAAAA AAATTGTATC TGCTATTGAT
GATGAAAGTT TAGATGAGAT TGCAAAAATT TATAGCCTTG TGATAAAAGC AGGAGTGCAT
AAGACCAGCT CGATTAAAGT TGCCGAGGCG GCTAAAGTTG TGGAAAACAG CCAAAGGGAT
ATAAATATTG CCTTTATGAA TGAACTTGCG ATGGTATTTG ATCGTATGGG CATTGATACG
AAAGAAGTTA TAGATGCCAT GAATACAAAG TGGAATGCAC TGGGATTTTA TCCTGGTTTG
GTGGGTGGAC ATTGCATTGG TGTGGATCCC TATTATTTTA TTTATCAGGC GGAAAAGCTG
GGATATCACA GTCAAATTAT TCTTTCCGGA AGAAAAATTA ATGATGGAAT GGGTGAATTT
ATAGCTAACA CCATAATTAA GAAGCTTATA CATGTCAATA AAGTAGTAAA AAAATCAAAA
GTAGTTATTT TCGGTATTAC TTATAAAGAA AATTGCCCGG ATACAAGAAA CTCCAAGGTT
GTAGATATAA TTAAAGGATT GAATGAGTAT GGAATAGAAC CCATAGTGGT TGATCCTCAG
GCAAACAGGG AAGATACAAA ACAAGAATAT GGCATTGAAC TGATGGATAT AGATGATGTG
AAAGATGCGG ACTGCCTTGT ATTTGCCGTT GCCCATGATG AATTTAAAAA TATGGCTTGG
GAGCAGATAG AAAAATTCTT TAAAAATATT GATAATAGCG AAAAAGTAAT TATTGATGTA
AAGGGAATAT TTGATAAAAA TAAAATCGAG AAACTAGGCT ATTGCTACTG GAGGCTATGA
 
Protein sequence
MGLYEKIVNK EEKISVIGLG YVGMPLAIAF AKYAQVIGFD VDKKKIEIYK SGIDPTKEIG 
NEAIKNTTVE FTSDETKLKE AKFHIVAVPT PINIDKTPDL TPVESASVIV GRNMARGSYV
VYESTVYPGV TEDICIPILE RESGLKCGVD FKVGYSPERI NPGDKKHTLE TIKKIVSAID
DESLDEIAKI YSLVIKAGVH KTSSIKVAEA AKVVENSQRD INIAFMNELA MVFDRMGIDT
KEVIDAMNTK WNALGFYPGL VGGHCIGVDP YYFIYQAEKL GYHSQIILSG RKINDGMGEF
IANTIIKKLI HVNKVVKKSK VVIFGITYKE NCPDTRNSKV VDIIKGLNEY GIEPIVVDPQ
ANREDTKQEY GIELMDIDDV KDADCLVFAV AHDEFKNMAW EQIEKFFKNI DNSEKVIIDV
KGIFDKNKIE KLGYCYWRL
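
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a rough sketch only: it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and that does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence above.
first_codons = "ATGGGATTGT ATGAAAAAAT TGTTAATAAG"
last_codons = "TGGAGGCTATGA"

print(translate(first_codons))  # MGLYEKIVNK
print(translate(last_codons))   # WRL
```

The two outputs match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.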