Gene Cthe_1022 details

Gene Information

Locus tag: Cthe_1022
Symbol:
ID: 4811316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1224150
End bp: 1225169
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 45%
IMG OID: 640106440
Product: glycerol-3-phosphate dehydrogenase (NAD(P)(+))
Protein accession: YP_001037447
Protein GI: 125973537
COG category: [C] Energy production and conversion
COG ID: [COG0240] Glycerol-3-phosphate dehydrogenase
TIGRFAM ID:
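
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1224150, 1225169
gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1020 bp) and Protein Length (339 aa).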


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0256064
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA AAATATCCAT AATTGGTGCG GGAAGCTGGG GGACCGCTCT GGCGGTGTTA 
TTGGCCAACA ACGGCATGAG TGTTACCATG TGGTCGATTT TTGAAGACGA AATTAAGATG
CTGAACGAAA AAAGAGAGCA TGTACATAAG TTGCCGGGTG TTATTGTTCC GGAGAATGTC
ACATTTACAT CGGATCTTGA AAAAGCTGTG TGTGATGCCG AGGTTGTGGT TGTGGTGGTA
CCTTCCCAAA CTGTCAGGCA GACTGCAAAG GATATATCGA AATATATAAG GGATGATACG
GTAATTGTTA GTTGTTCCAA AGGGTTGGAG GAAGGAACGG GGCTTAGAAT GTCCGAGGTA
ATAGGTCAGG AGATAAAAGA CGCAAAAACC GTTATCCTTT CAGGTCCAAG CCATGCCGAA
GAAGTGGGAA GAGGTGTGCC CACGGCAATT GTGGCGGCAT CTTGTGATAT CAAAGCGGCG
GAACTTATTC AGGATATATT CATGTCACCG GAATTTAGAG TTTACACCAA CACGGATGTT
GTCGGAGTGG AGCTTGGAGG TGCCTTGAAA AATGTAATAG CATTGTGTGC CGGAATATCG
GATGGTTTGG GTTTTGGGGA CAATACCAAG GCTGCGCTTA TGACAAGAGG AATAACCGAA
ATTTCAAGGC TGGGAGTTTC CATGGGGGCA AATCCCCAGA CTTTTGCCGG ACTTACGGGT
ATAGGAGACC TTATTGTGAC TTGTACCAGC ATGCACAGCA GAAACAGGCG TGCCGGAATT
TTGATCGGTC AGGGAAAATC ACCGCAGGAA GCAATGGATG AAGTTAAAAT GGTTGTTGAG
GGTGTTACAA CGACAAAAGC AGCTTATGAA CTTGCACGGA AAATGGATGT TGCGATGCCC
ATAACCTTCG AAGCATACGA AGTATTGTTT AACGGAAAGA ATCCAAGACA GGCAGTGTAT
GATCTTATGA TGAGGAACAA GAAAAATGAG GTTGAAGAAT TGGATGCCAA ATGGCTTTGA
 
Protein sequence
MNKKISIIGA GSWGTALAVL LANNGMSVTM WSIFEDEIKM LNEKREHVHK LPGVIVPENV 
TFTSDLEKAV CDAEVVVVVV PSQTVRQTAK DISKYIRDDT VIVSCSKGLE EGTGLRMSEV
IGQEIKDAKT VILSGPSHAE EVGRGVPTAI VAASCDIKAA ELIQDIFMSP EFRVYTNTDV
VGVELGGALK NVIALCAGIS DGLGFGDNTK AALMTRGITE ISRLGVSMGA NPQTFAGLTG
IGDLIVTCTS MHSRNRRAGI LIGQGKSPQE AMDEVKMVVE GVTTTKAAYE LARKMDVAMP
ITFEAYEVLF NGKNPRQAVY DLMMRNKKNE VEELDAKWL
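
The sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes a GC fraction, and translates DNA codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The amino-acid assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs in the start codons it permits, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; stop codons are shown as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the gene sequence above.
first_30 = clean("ATGAACAAAA AAATATCCAT AATTGGTGCG")
print(translate(first_30))  # MNKKISIIGA, the first ten residues of the protein
```

Passing the full gene sequence through `clean` and `translate` in the same way would allow a comparison against the listed protein sequence, and `gc_fraction` against the listed GC content.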