Gene GWCH70_0808 details


Gene Information

Locus tag: GWCH70_0808
Symbol:
ID: 7977756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 874002
End bp: 875462
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 39%
IMG OID: 644797786
Product: Aldehyde Dehydrogenase
Protein accession: YP_002948959
Protein GI: 239826335
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
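
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but that reading matches the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 874002
end_bp = 875462

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1461 bp
print(protein_length, "aa")  # 486 aa
```

Both results agree with the Gene Length and Protein Length fields.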


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAACAGT ATCTGGAACT AAATAAAAGT TTTATTAATG GTGAGTGGGT TGAAGGATTA 
AGTTCGAACT CTTATAATAT TTTAAACCCG TATGATGATT CTGTCATTAC AACAGTTAAA
TTAGCTACAA AAGAACAAAC ACAGGAAGCT TTTGAAGCAG CGCAGCAAGT GCAGAAAAAA
TGGGCGAAAT CTTCGGTAGA GGAAAGAAAA GCGGTCATCC GGAAAGCATT AGAGTATTTT
AAAGAGAATA AAGAAGCAAT TTTAGAGACC ATGGTAGTAG AAACAGGCAG CACATATATA
AAAGCGGAAA TGGAATATCA AATTACGTTA GATGAATTAG TAGAAGCGGA AAAAATGACA
GAAGAGATTT ATACGTATAG AGAAGTTCCT TCTCCTATTG AAGGAAAAAC AAATCGAATT
TATCGATTGC CGCTCGGAGT TATTTCATCG ATTTCTCCAT TTAATTTCCC GTTATTTTTA
TCGATGAGAA CCATTGCCCC TGCGATCGCA TTAGGTAATG CGGTCGTTCA TAAACCGGAT
TTGAAAACCG GCTTAACCGG AGGTTCGATT ATTGCGGCAG CGTTTGAATA TGCTGGATTG
CCTAAAGGTG TACTCAATGT GATTTTAACG AATTCGAGAG AAATTGGAGA TGAAATGTTA
ACAAACCCTC ATGCTAAGTT GGTCAGCTTC ACTGGATCGA CAGAGGCAGG AAAGCATGTC
GGAGCTGTCG CTGGTGGAGC TTTAAAACGC GTTGCATTAG AATTAGGCGG AAACAGTCCG
TTTGTCGTGT TGAGCGATGC TGATGTAGAT CGCGCAGTGG ATGCAGCGAT ATTCGGAAAG
TATTTACACC AAGGCCAAAT TTGTATGATC GTGAATAGAT TCATTGTTCA TAAAGATTTG
TATGATGAAT TTGTACAGAA GTTTGTGGAG CGGGCTAAAA ACTTGCCGTA CGGAGATCCA
CGCAATCGTA AAAACGTGAT TGGGCCGATC ATCGACCAAC GTCAGTTAGA AAAAGCATTA
AAGGTAATTG AAGAAGCAAA AGCAGAAGGA ATTCCGTTGG CATTAGAAGG AAAACGCGTA
GGAAATATAT TAACGCCATA TGTATTTGTA GATGTGGATA ATGATAGTAA GTTAGCTCAA
ACGGAAGTGT TCGCACCGAT TGCGATCATG ATTAAAGCGG AAACGGATGA ACAAGCGATT
GAATTGGCAA ATGAAACCGA ATATGGTCTA AGTTCGGCTA TATTTACATC TGATCTTGAA
AAAGGAACAG AACTAGCTTT AGAGATTGAT AGCGGAATGA CTCATGTAAA TGACCAAACG
GTTAACCTTC AATCGAATAC ACCATTTGGG GGAACAAAAG CGAGCGGATT AGGCCGTTTT
GGAAATCCAT GGATTGTAGA AGAGTTTACA GTGACAAAAT GGGTTTCCGT TCAACATCAA
TATCGAAAAT TTCCGTTTTA A
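
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation such as GC content. The sketch below is one possible way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool used to produce this record, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "TTGAAACAGT ATCTGGAACT AAATAAAAGT TTTATTAATG GTGAGTGGGT TGAAGGATTA"
seq = clean_sequence(first_row)
print(len(seq), "bases in the first row")
print(round(gc_percent(seq), 1), "% GC in the first row")
```

Applying the same two functions to the full 1461 bp sequence gives a value that can be compared with the GC content field in the Gene Information section.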
 
Protein sequence
MKQYLELNKS FINGEWVEGL SSNSYNILNP YDDSVITTVK LATKEQTQEA FEAAQQVQKK 
WAKSSVEERK AVIRKALEYF KENKEAILET MVVETGSTYI KAEMEYQITL DELVEAEKMT
EEIYTYREVP SPIEGKTNRI YRLPLGVISS ISPFNFPLFL SMRTIAPAIA LGNAVVHKPD
LKTGLTGGSI IAAAFEYAGL PKGVLNVILT NSREIGDEML TNPHAKLVSF TGSTEAGKHV
GAVAGGALKR VALELGGNSP FVVLSDADVD RAVDAAIFGK YLHQGQICMI VNRFIVHKDL
YDEFVQKFVE RAKNLPYGDP RNRKNVIGPI IDQRQLEKAL KVIEEAKAEG IPLALEGKRV
GNILTPYVFV DVDNDSKLAQ TEVFAPIAIM IKAETDEQAI ELANETEYGL SSAIFTSDLE
KGTELALEID SGMTHVNDQT VNLQSNTPFG GTKASGLGRF GNPWIVEEFT VTKWVSVQHQ
YRKFPF
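
The protein sequence can be related back to the gene sequence by translating it with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a self-contained illustration rather than the pipeline that generated this record. It assumes the codon assignments of table 11 match the standard code and that an alternative start codon such as TTG is read as methionine when it is the first codon, which is why the protein begins with M although the gene begins with TTG. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon assignments in TCAG order (assumed identical to the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons listed for table 11; each is read as M in the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGAAACAGT ATCTGGAACT AAATAAAAGT"))  # MKQYLELNKS
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to the same function gives a translation that can be compared with the complete protein sequence.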