Gene GBAA_3609 details

Gene Information

Locus tag: GBAA_3609
Symbol: dhaS
ID: 2815001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3318404
End bp: 3319888
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 42%
IMG OID: 637790350
Product: aldehyde dehydrogenase
Protein accession: YP_020244
Protein GI: 47528895
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
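
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3318404
END_BP = 3319888
GENE_LENGTH_BP = 1485
PROTEIN_LENGTH_AA = 494


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 3319888 - 3318404 + 1 = 1485 bp; 1485 / 3 = 495 codons, minus 1 stop = 494 aa.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks print True for this record, so the listed start, end, gene length, and protein length agree with one another under those assumptions.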


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCAAC TAGCTGTAAA TCTTCATGAA AAGGTAGAAA AGTTTCTTCA AGGTACGAAA 
AAGTTATATG TGAATGGATC ATTCATTGAA AGCGCTTCCG GTAAGACGTT TAATACACCT
AATCCAGCAA CTGGCGAAAC ACTTGCCGTC GTTTCTGAAG CCGGTCGCGA AGATATTCAT
AAAGCTGTAG TTGCAGCTCG CATGGCTTTT GACGAAGGTC CTTGGTCTCG CATGAGCACT
GCGGAGCGAA GCCGTCTTAT GTACAAGTTA GCTGATTTAA TGGAAGAACA TAAAGAAGAG
CTTGCACAGC TCGAGACGTT AGATAACGGA AAGCCAATCC GTGAAACAAT GGCAGCAGAC
ATACCACTTG CAATTGAGCA CATGCGCTAT TATGCTGGCT GGGCGACGAA AATCGTTGGT
CAAACAATCC CTGTTTCCGG TGATTTCTTT AACTATACAC GCCATGAAGC TGTTGGTGTC
GTTGGTCAAA TTATCCCTTG GAACTTCCCG CTTCTTATGG CCATGTGGAA AATGGGAGCA
GCGCTTGCTA CAGGATGTAC AATCGTTTTA AAACCTGCAG AACAAACTCC ACTATCTGCT
CTATACTTAG CTGAATTAAT TGAAGAAGCT GGATTCCCGA AAGGCGTTAT TAATATCGTT
CCTGGATTCG GTGAATCAGC TGGACAAGCT CTCGTTAATC ATCCACTCGT TGATAAAATT
GCATTTACCG GTTCTACTCC AGTCGGTAAA CAAATTATGC GACAAGCATC TGAATCCTTG
AAACGTGTTA CTTTAGAGCT TGGTGGTAAA TCACCGAACA TTATTTTACC AGACGCTGAT
TTATCTCGCG CAATTCCTGG TGCACTTTCT GGTGTTATGT TTAACCAAGG GCAAGTATGC
TCTGCTGGAT CACGCCTATT TGTTCCGAAG AAAATGTATG ATAATGTCAT GGCTGATCTC
GTCCTCTATT CTAAAAAACT AAATCAAGGT GTCGGTCTTG ACCCTGAAAC GACAATTGGT
CCTCTCGTTT CCGAAGAACA ACAAAAACGT GTAATGGGCT ACATTGAAAA AGGGATTGAA
GAAGGCGCTG AAGTACTTTG CGGAGGAAAT AATCCATTCG ATCAAGGCTA CTTCATTTCT
CCTACAGTAT TCGCTGACGT AAATGACGAA ATGACAATCG CAAAAGAAGA AATTTTCGGT
CCAGTTATTT CTGCAATACC TTTTAACGAT ATTGATGAAG TAATTGAACG AGCAAATAAA
TCACAATTCG GCTTAGCGGC TGGTGTGTGG ACAGAAAATG TTAAAACAGC ACACTATGTT
GCAAGTAAAG TACGTGCAGG TACAGTATGG GTTAACTGTT ACAACGTCTT TGATGCAGCA
TCTCCATTTG GAGGATTTAA ACAATCTGGT CTCGGCCGTG AAATGGGATC TTACGCATTA
AATAACTATA CAGAAGTGAA GAGCGTTTGG CTTAACTTAA ATTAA
 
Protein sequence
MSQLAVNLHE KVEKFLQGTK KLYVNGSFIE SASGKTFNTP NPATGETLAV VSEAGREDIH 
KAVVAARMAF DEGPWSRMST AERSRLMYKL ADLMEEHKEE LAQLETLDNG KPIRETMAAD
IPLAIEHMRY YAGWATKIVG QTIPVSGDFF NYTRHEAVGV VGQIIPWNFP LLMAMWKMGA
ALATGCTIVL KPAEQTPLSA LYLAELIEEA GFPKGVINIV PGFGESAGQA LVNHPLVDKI
AFTGSTPVGK QIMRQASESL KRVTLELGGK SPNIILPDAD LSRAIPGALS GVMFNQGQVC
SAGSRLFVPK KMYDNVMADL VLYSKKLNQG VGLDPETTIG PLVSEEQQKR VMGYIEKGIE
EGAEVLCGGN NPFDQGYFIS PTVFADVNDE MTIAKEEIFG PVISAIPFND IDEVIERANK
SQFGLAAGVW TENVKTAHYV ASKVRAGTVW VNCYNVFDAA SPFGGFKQSG LGREMGSYAL
NNYTEVKSVW LNLN
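
The gene and protein sequences above can be cross-checked by translating the DNA. The sketch below is a minimal, standard-library illustration and not the database's own tooling. It builds the codon table used by translation table 11, whose codon-to-amino-acid assignments match the standard code (table 11 differs in the start codons it permits, which this sketch does not model). The helper names `clean` and `translate` are hypothetical. Only the first and last rows of the gene sequence are included; the last row starts on a codon boundary because the 24 rows before it hold 1440 bases.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGAGTCAAC TAGCTGTAAA TCTTCATGAA"
last_row = "AATAACTATA CAGAAGTGAA GAGCGTTTGG CTTAACTTAA ATTAA"

print(translate(first_row))  # MSQLAVNLHE
print(translate(last_row))   # NNYTEVKSVWLNLN
```

The two outputs match the first ten and last fourteen residues of the protein sequence listed above. Passing the full gene sequence to `translate` would extend the same check to all 494 residues.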