Gene Cthe_1352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1352 
Symbol 
ID4809347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1644326 
End bp1645657 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content40% 
IMG OID640106776 
ProductUDP-glucose 6-dehydrogenase 
Protein accessionYP_001037777 
Protein GI125973867 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000465012 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATGA TACCAAAAGT TGCAATGTTT GGTACCGGTT ATGTTGGGCT TGTATCCGGA 
GTATGTATAG CCGATTTCGG CATAAACGTC ATTTGTGTTG ATGTTGACAA AGAAAAGATT
GACGGGCTCA ATAATGGGAA GATTCCTATT TACGAACCAG GACTTGACGT TTTCCTTGAA
AGAAATATAA AAGCAGGAAG AATACAATTT ACTACAGATG CAAAAATGGC AATAGAAGAA
TCCAATGTTT TGTTTATTGC TGTAGGCACA CCTCCGAAGG AAAACGGAGA GGCTGACATG
CAGTATGTAT ATGCTGTTGC TGAAACTATC GGACAGTATA TGAACGGATA TAAAGTTATA
GTTGATAAAA GCACTGTACC TGTTGGTACA GGTCAGGTTG TTAAGAAAAT AATAGCCGAC
AAGCTTAAAG AAAGAGGAGT CGAATACTCT TTTGATGTTG TTTCAAATCC GGAGTTTCTT
CGTGAAGGAA AAGCGCTTTA CGACTTTACT CATCCTGACA GGGTTGTTAT AGGCGTTGAA
AGTGAAGAAG TTGCAGAGAT AATGAAAAAG GTATACAGGC CTCTGTATAT CAATGAAACA
CCCTTTGTAA TAACCAACAT AGAAACTGCG GAAATGATTA AGTATGCATC CAATGCATTT
CTTGCAACCA AGATAACTTT TATAAATGAA ATTGCAAACC TTTGTGAGAA AGTGGGGGCA
AATGTTCAGC AGGTCGCAAT GGCCATGGGA AGAGACGGAA GAATAGGTCC AAAGTTCCTG
CATGCAGGAC CGGGTTTTGG AGGAAGCTGC TTCCCAAAGG ATACAAAGGC CCTTGTACAA
ATAGCTGAGA AGCATGGGGT TCAAATGTCT GTGGTAAATG CGGTAATAGA AGCAAACGAG
AGGCAGAAGA AAATGGTGGC TGAGAAACTC GAAAAATTTG CAGGAGATTT AAAAGGTAAA
ACAATAGGCA TACTTGGACT TGCGTTCAAA CCCGAAACGG ATGACGTGAG GGAAGCTCCT
GCGTTAACAA TAATAGCCGA TTTGATTGAA AGGGGAGCAA GTATCCGCGC GTATGACCCT
CAGGCCATGG AGGAGGCTAA AAAAGCTCTC AGAAAATACG CGGATAATAT TACTTACTGC
AAGCATGCCT ATGATACTGC CGAGAGTGTG GATGCATTGG TTATAGTTAC GGAATGGCAT
GAGTTTCGCA ACATGGACTT GACACTGCTG AAAAAAATAA TGAGGGGAAA TATTTTCTAT
GACGCCAGAA ATATATACTC GAGAAAGGAT ATAGAAGAAA AAGGATTTGT GTTTATAGGT
ACCGGAGTAT AA
 
Protein sequence
MNMIPKVAMF GTGYVGLVSG VCIADFGINV ICVDVDKEKI DGLNNGKIPI YEPGLDVFLE 
RNIKAGRIQF TTDAKMAIEE SNVLFIAVGT PPKENGEADM QYVYAVAETI GQYMNGYKVI
VDKSTVPVGT GQVVKKIIAD KLKERGVEYS FDVVSNPEFL REGKALYDFT HPDRVVIGVE
SEEVAEIMKK VYRPLYINET PFVITNIETA EMIKYASNAF LATKITFINE IANLCEKVGA
NVQQVAMAMG RDGRIGPKFL HAGPGFGGSC FPKDTKALVQ IAEKHGVQMS VVNAVIEANE
RQKKMVAEKL EKFAGDLKGK TIGILGLAFK PETDDVREAP ALTIIADLIE RGASIRAYDP
QAMEEAKKAL RKYADNITYC KHAYDTAESV DALVIVTEWH EFRNMDLTLL KKIMRGNIFY
DARNIYSRKD IEEKGFVFIG TGV