Gene Information |
Locus tag | GWCH70_0885 |
Symbol | |
ID | 7979339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 946882 |
End bp | 947937 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644797848 |
Product | Alcohol dehydrogenase GroES domain protein |
Protein accession | YP_002949021 |
Protein GI | 239826397 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
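The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon. The function names are hypothetical and only illustrate the arithmetic.

```python
# Values copied from the Gene Information table.
START_BP = 946882
END_BP = 947937
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the record: the span from 946882 to 947937 covers 1056 bp, and 1056 bp corresponds to 351 codons plus one stop codon.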
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCCG CGAAAATTAT TCAGCACAAA AAACCATTGG AAATTTTAAA TGTCCCGGAT CCTAAACCAG GGCCGGAAGA TGCGGTAATA AAAATCGAAG CTTGCGGGGT TTGCCGCAGC GACTGGCATG CGTGGCAAGG GGATTGGTCG TGGATCGGCT TGTCCCCAGA ATTACCGATA ACACCTGGAC ATGAATTTGG AGGTGTAATC GAAGAAGTTG GCAAAGATGT GAAGTCGTTT CGTCCGGGAG ATCGTGTCAC AGTTCCGTTC CATTCCGCCT GTGGACGATG TGAATACTGC AAAAAAGGTG TGCCGAATTT ATGCGAAAAC CTCCAAATTT ATGGTCTTGT TTCTGGACTT GAAGGAGGAT ATGCGGAGTA TGTATTGGTT CGCAATGCCG ATTTTAACCT GATCAGACTC CCTGAAAATG TTGACAGTTT AACAGCTGCA GCTTTAGGCT GCCGTTACAT GACTGGATAT CACGGTATTG TCAGAGGCAA TGTCAAGCCT GGTGATTGGG TGGCCGTGCA TGGAGCTGGC GGAGTAGGCC TTTCTGCGAT TCAGGTGGCC AATGCATTAG GAGCGCAAGT TATAGCCGTT GATATTGATG ATCAAAAACT TGAAATCGCA AAACAGGAAG GCGCCATTGC GGTTGTCAAT GCCAGAAAAG AAAACGTTGT CGAAGCAATT AAAGAAATCA CAAAAGGAGG CGCGCATGTC GGACTGGATG CTTTAGGCAT CAAGGATACC GTTCTTAATT CGGTTCTGTC CTTAAGAAAA GGAGGAAGAC ATGTTCAAGT AGGTTTAACC ACATCAGAAG AAGGAGGTTT TGTATCGCTC CCTGTCGATC TGATTACGGC ATCTGAAATT GAGTTTGTAG GAAGTATCGG CAATCCTCAT CCTGACTATC GCGGCTTATT GAGCTTGATT TCTTCCGGAC GATTGAATCC GAAGCGCTTG GTTGAACGTG AAATCAAATT GGAAGATGTA AACGCTGTAT TCGAAAATAT GTCACAATAT AATACGAAAG GGTTTAACGT CATCACGAAA TTTTAA
|
Protein sequence | MKAAKIIQHK KPLEILNVPD PKPGPEDAVI KIEACGVCRS DWHAWQGDWS WIGLSPELPI TPGHEFGGVI EEVGKDVKSF RPGDRVTVPF HSACGRCEYC KKGVPNLCEN LQIYGLVSGL EGGYAEYVLV RNADFNLIRL PENVDSLTAA ALGCRYMTGY HGIVRGNVKP GDWVAVHGAG GVGLSAIQVA NALGAQVIAV DIDDQKLEIA KQEGAIAVVN ARKENVVEAI KEITKGGAHV GLDALGIKDT VLNSVLSLRK GGRHVQVGLT TSEEGGFVSL PVDLITASEI EFVGSIGNPH PDYRGLLSLI SSGRLNPKRL VEREIKLEDV NAVFENMSQY NTKGFNVITK F
|
| |
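The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of NCBI translation table 11, which for elongation codons match the standard genetic code; the table's alternative start codons are not handled because this gene begins with ATG. The helper names are hypothetical, and only the first 30 and last 6 bases of the gene sequence are used here. The same helpers can be applied to the full sequence, for example to compare a computed GC percentage with the GC content field.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, keeping '*' for stop codons."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 6 bases of the gene sequence in this record.
FIRST_30 = "ATGAAAGCCG CGAAAATTAT TCAGCACAAA"
LAST_6 = "TTTTAA"

if __name__ == "__main__":
    print(translate(FIRST_30))  # first ten residues of the protein
    print(translate(LAST_6))    # final residue followed by the stop codon
```

The first ten codons translate to MKAAKIIQHK, which matches the start of the protein sequence, and the final six bases give the terminal F followed by a stop codon.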