Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_3609 |
Symbol | dhaS |
ID | 2815001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 3318404 |
End bp | 3319888 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637790350 |
Product | aldehyde dehydrogenase |
Protein accession | YP_020244 |
Protein GI | 47528895 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAC TAGCTGTAAA TCTTCATGAA AAGGTAGAAA AGTTTCTTCA AGGTACGAAA AAGTTATATG TGAATGGATC ATTCATTGAA AGCGCTTCCG GTAAGACGTT TAATACACCT AATCCAGCAA CTGGCGAAAC ACTTGCCGTC GTTTCTGAAG CCGGTCGCGA AGATATTCAT AAAGCTGTAG TTGCAGCTCG CATGGCTTTT GACGAAGGTC CTTGGTCTCG CATGAGCACT GCGGAGCGAA GCCGTCTTAT GTACAAGTTA GCTGATTTAA TGGAAGAACA TAAAGAAGAG CTTGCACAGC TCGAGACGTT AGATAACGGA AAGCCAATCC GTGAAACAAT GGCAGCAGAC ATACCACTTG CAATTGAGCA CATGCGCTAT TATGCTGGCT GGGCGACGAA AATCGTTGGT CAAACAATCC CTGTTTCCGG TGATTTCTTT AACTATACAC GCCATGAAGC TGTTGGTGTC GTTGGTCAAA TTATCCCTTG GAACTTCCCG CTTCTTATGG CCATGTGGAA AATGGGAGCA GCGCTTGCTA CAGGATGTAC AATCGTTTTA AAACCTGCAG AACAAACTCC ACTATCTGCT CTATACTTAG CTGAATTAAT TGAAGAAGCT GGATTCCCGA AAGGCGTTAT TAATATCGTT CCTGGATTCG GTGAATCAGC TGGACAAGCT CTCGTTAATC ATCCACTCGT TGATAAAATT GCATTTACCG GTTCTACTCC AGTCGGTAAA CAAATTATGC GACAAGCATC TGAATCCTTG AAACGTGTTA CTTTAGAGCT TGGTGGTAAA TCACCGAACA TTATTTTACC AGACGCTGAT TTATCTCGCG CAATTCCTGG TGCACTTTCT GGTGTTATGT TTAACCAAGG GCAAGTATGC TCTGCTGGAT CACGCCTATT TGTTCCGAAG AAAATGTATG ATAATGTCAT GGCTGATCTC GTCCTCTATT CTAAAAAACT AAATCAAGGT GTCGGTCTTG ACCCTGAAAC GACAATTGGT CCTCTCGTTT CCGAAGAACA ACAAAAACGT GTAATGGGCT ACATTGAAAA AGGGATTGAA GAAGGCGCTG AAGTACTTTG CGGAGGAAAT AATCCATTCG ATCAAGGCTA CTTCATTTCT CCTACAGTAT TCGCTGACGT AAATGACGAA ATGACAATCG CAAAAGAAGA AATTTTCGGT CCAGTTATTT CTGCAATACC TTTTAACGAT ATTGATGAAG TAATTGAACG AGCAAATAAA TCACAATTCG GCTTAGCGGC TGGTGTGTGG ACAGAAAATG TTAAAACAGC ACACTATGTT GCAAGTAAAG TACGTGCAGG TACAGTATGG GTTAACTGTT ACAACGTCTT TGATGCAGCA TCTCCATTTG GAGGATTTAA ACAATCTGGT CTCGGCCGTG AAATGGGATC TTACGCATTA AATAACTATA CAGAAGTGAA GAGCGTTTGG CTTAACTTAA ATTAA
|
Protein sequence | MSQLAVNLHE KVEKFLQGTK KLYVNGSFIE SASGKTFNTP NPATGETLAV VSEAGREDIH KAVVAARMAF DEGPWSRMST AERSRLMYKL ADLMEEHKEE LAQLETLDNG KPIRETMAAD IPLAIEHMRY YAGWATKIVG QTIPVSGDFF NYTRHEAVGV VGQIIPWNFP LLMAMWKMGA ALATGCTIVL KPAEQTPLSA LYLAELIEEA GFPKGVINIV PGFGESAGQA LVNHPLVDKI AFTGSTPVGK QIMRQASESL KRVTLELGGK SPNIILPDAD LSRAIPGALS GVMFNQGQVC SAGSRLFVPK KMYDNVMADL VLYSKKLNQG VGLDPETTIG PLVSEEQQKR VMGYIEKGIE EGAEVLCGGN NPFDQGYFIS PTVFADVNDE MTIAKEEIFG PVISAIPFND IDEVIERANK SQFGLAAGVW TENVKTAHYV ASKVRAGTVW VNCYNVFDAA SPFGGFKQSG LGREMGSYAL NNYTEVKSVW LNLN
|
| |