Gene GWCH70_3324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3324 
Symbol 
ID7979221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3347541 
End bp3348593 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content43% 
IMG OID644800091 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_002951230 
Protein GI239828606 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0421123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAG CGCGATGGTA TAATGCTAGA GACATTCGAG TAGAAGAAGT AGAAGAACCG 
AAAGTAGGAA AAGGAAAAGT AAAAATTAAA GTCGAATGGG CGGGAATTTG CGGAAGCGAT
TTACACGAAT ATGCGGCAGG CCCGATTTTT ATTCCTGTCC AAAATCCTCA TCCCGTCAGT
AAAGATGTCG CTCCGATTAT CATGGGTCAC GAATTTTCGG GACGAGTCGT GGAAGTTGGG
GAAGGAGTTA CTAAAGTCAA AGTTGGCGAT CCTGTCGTTG TTGAACCGAT TCTTCGCTGT
GGAGAATGCC CAGCTTGCAA AAAAGGAAAA TACAATCTTT GCGATCATTT AGGATTTCAT
GGTCTATCCG GAGGAGGCGG CGGCTTCTCC GAATATACCG TTGTTGATGA ATATATGGTG
CACAAAATGC CTGAAGGGCT TTCTTTTGAA CAAGGAGCGC TGGTGGAACC GGCAGCTGTC
GCTTTACATG CGGTTAGATT AAGCAAAATC AAGCCTGGCG ATAAAGCAGC TGTTTTTGGC
ACGGGGCCTA TAGGTCTTCT CGTTATTGAA GCATTAAAAG CAGCTGGCGC CTCGGAAATT
TATGCAGTAG AAGTTTCTAA AGAACGTTTG CAAAAAGCGA AAGAGCTCGG CGCTACATCT
GTCATCAATC CAAAAGAGGA AGATCCGGTT CAAAAGCTTG TCGAATTGAC CGATGGCGGC
GTCGATGTTG CGTTTGAAGT AACAGGAGTG CCGGCCGTTT TACAACAGGC CATTGATAGT
ACTACATTTG AAGGTGAAAC GATTATCGTT AGTATATGGG AAAAAGAAGC GAACATTCAG
CCAAATAATA TCGTATTAAA AGAAAGAAAC GTAAAAGGAA TCATTGCGTA CCGCGATATT
TTCCCTGCGG TAATGGAGTT AATGAAACGA GGCTACTTCC AAGCCGAAAA GCTCGTTACG
AAACGAATTA AGCTAGATGA TATTGTAACA GAAGGATTTG AAACGCTCAT GAAAGAAAAA
GACCAAGTGA AAATTTTGGT CAAACCAGAA TAA
 
Protein sequence
MKAARWYNAR DIRVEEVEEP KVGKGKVKIK VEWAGICGSD LHEYAAGPIF IPVQNPHPVS 
KDVAPIIMGH EFSGRVVEVG EGVTKVKVGD PVVVEPILRC GECPACKKGK YNLCDHLGFH
GLSGGGGGFS EYTVVDEYMV HKMPEGLSFE QGALVEPAAV ALHAVRLSKI KPGDKAAVFG
TGPIGLLVIE ALKAAGASEI YAVEVSKERL QKAKELGATS VINPKEEDPV QKLVELTDGG
VDVAFEVTGV PAVLQQAIDS TTFEGETIIV SIWEKEANIQ PNNIVLKERN VKGIIAYRDI
FPAVMELMKR GYFQAEKLVT KRIKLDDIVT EGFETLMKEK DQVKILVKPE