Gene Information |
Locus tag | GWCH70_3324 |
Symbol | |
ID | 7979221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3347541 |
End bp | 3348593 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644800091 |
Product | Alcohol dehydrogenase zinc-binding domain protein |
Protein accession | YP_002951230 |
Protein GI | 239828606 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
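The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 3347541
END_BP = 3348593


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1053 350
```

Both results agree with the Gene Length (1053 bp) and Protein Length (350 aa) rows.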
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0421123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAG CGCGATGGTA TAATGCTAGA GACATTCGAG TAGAAGAAGT AGAAGAACCG AAAGTAGGAA AAGGAAAAGT AAAAATTAAA GTCGAATGGG CGGGAATTTG CGGAAGCGAT TTACACGAAT ATGCGGCAGG CCCGATTTTT ATTCCTGTCC AAAATCCTCA TCCCGTCAGT AAAGATGTCG CTCCGATTAT CATGGGTCAC GAATTTTCGG GACGAGTCGT GGAAGTTGGG GAAGGAGTTA CTAAAGTCAA AGTTGGCGAT CCTGTCGTTG TTGAACCGAT TCTTCGCTGT GGAGAATGCC CAGCTTGCAA AAAAGGAAAA TACAATCTTT GCGATCATTT AGGATTTCAT GGTCTATCCG GAGGAGGCGG CGGCTTCTCC GAATATACCG TTGTTGATGA ATATATGGTG CACAAAATGC CTGAAGGGCT TTCTTTTGAA CAAGGAGCGC TGGTGGAACC GGCAGCTGTC GCTTTACATG CGGTTAGATT AAGCAAAATC AAGCCTGGCG ATAAAGCAGC TGTTTTTGGC ACGGGGCCTA TAGGTCTTCT CGTTATTGAA GCATTAAAAG CAGCTGGCGC CTCGGAAATT TATGCAGTAG AAGTTTCTAA AGAACGTTTG CAAAAAGCGA AAGAGCTCGG CGCTACATCT GTCATCAATC CAAAAGAGGA AGATCCGGTT CAAAAGCTTG TCGAATTGAC CGATGGCGGC GTCGATGTTG CGTTTGAAGT AACAGGAGTG CCGGCCGTTT TACAACAGGC CATTGATAGT ACTACATTTG AAGGTGAAAC GATTATCGTT AGTATATGGG AAAAAGAAGC GAACATTCAG CCAAATAATA TCGTATTAAA AGAAAGAAAC GTAAAAGGAA TCATTGCGTA CCGCGATATT TTCCCTGCGG TAATGGAGTT AATGAAACGA GGCTACTTCC AAGCCGAAAA GCTCGTTACG AAACGAATTA AGCTAGATGA TATTGTAACA GAAGGATTTG AAACGCTCAT GAAAGAAAAA GACCAAGTGA AAATTTTGGT CAAACCAGAA TAA
|
Protein sequence | MKAARWYNAR DIRVEEVEEP KVGKGKVKIK VEWAGICGSD LHEYAAGPIF IPVQNPHPVS KDVAPIIMGH EFSGRVVEVG EGVTKVKVGD PVVVEPILRC GECPACKKGK YNLCDHLGFH GLSGGGGGFS EYTVVDEYMV HKMPEGLSFE QGALVEPAAV ALHAVRLSKI KPGDKAAVFG TGPIGLLVIE ALKAAGASEI YAVEVSKERL QKAKELGATS VINPKEEDPV QKLVELTDGG VDVAFEVTGV PAVLQQAIDS TTFEGETIIV SIWEKEANIQ PNNIVLKERN VKGIIAYRDI FPAVMELMKR GYFQAEKLVT KRIKLDDIVT EGFETLMKEK DQVKILVKPE
|
| |
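The gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand instead of relying on a library, uses the amino acid assignments that translation table 11 shares with the standard code, and, to stay short, works on only the first three 10-base blocks of the gene sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
fragment = "ATGAAAGCAG CGCGATGGTA TAATGCTAGA"
print(translate(fragment))  # prints: MKAARWYNAR
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content row.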