Gene BAS1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1198 
Symbol 
ID2849417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1242946 
End bp1244313 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content39% 
IMG OID637504455 
Productaldehyde dehydrogenase 
Protein accessionYP_027468 
Protein GI49184216 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.869515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTTT CTTCTATTGT AAATAAACAA AAGGCATATT TTTATAATGG CCATACGAGA 
AGTGTAGAAG TGAGAAAGAA TAATTTGAAG AAGCTTTATG AAGGCATTCA GCGCTTTGAA
GAAGAAATAT TTCAGGCATT GAAATTAGAT TTAAATAAAT CAGTTCACGA GTCATTTACA
ACGGAAGTTG GATATGTATT AAAAGAAATT TCCTTTCAAT TGAAACATAT GTCATCGTGG
AGTAAACCGA AGCGTGTTCG AACAGCACTC ACTCATTTTG GATCAAAGGG GAAAGTAGTG
CCCGAACCGT ACGGTGTGAC ACTTATTATT GCACCGTGGA ACTATCCGTT TCAATTAGCA
ATTGCACCGC TTGTAGGAGC TTTGGCAGCT GGAAATACAA TCGTTTTAAA GCCATCAGAG
CTAACCCCAA ACGTTTCTAA AGTGATTACG AGAATGTTAG CAGAATTATT CCAAGAAGAG
CTTGTAGCGG TAGTAGAAGG TGGCGTTGAA GAGAGTACGG CTTTGCTGAA GGAACCATTT
GATTATATCT TCTTTACAGG TAGTGTGGGG GTTGGGAAGG TTGTCATGGA AGCAGCTGCG
AAACAGCTGA CGCCGCTTAC GTTAGAACTT GGCGGAAAGA GCCCATGTAT TGTACATAAA
GATGCAAAAG TAGATGTAAC AGCGAGACGA ATTGTGTGGG GCAAGTTTTT AAATGCAGGG
CAAACGTGTG TAGCACCTGA TTATCTGTAT GTGCATGCTT CTGTGAAAGA GCAGTTAATC
GAGGCATTGC GACACGAAAT TGCGGAGCAG TATGGGAACG AGCCTTTGCA AAATGAAAAT
TACGTACGCA TTGTAAGTGA GCGGCATTTT GAACGATTAT GCCGGTTTTT ACAAGATGGT
CAAGTCGCAA TCGGCGGTAA CTATAAGCGA GATACATTAC ATATTGAACC GACAGTAGTG
AAGGATATTA CATGGCAAGA TGCTGTCATG GAAGATGAAA TTTTTGGTCC GATTTTACCA
ATTATAGAGT ACGAAAACAT AGAAGAGGTA ATTGACACAA TTCAGCAACA TCCGAAACCG
TTAGCGTTAT ATGTATTCTC TGAAGATAAA GAAATGCAAA AGAAAGTAAC GAGTAATATT
TCGTATGGCG GAGGCTGTGT GAATGATGTT GTTTATCATC TTGCAACCCC ATATTTACCT
TTTGGAGGCG TCGGAAGTAG TGGATTAGGG AGTTACCATG GGGAAGAAAG TTTTCGGACT
TTTTCACATT ATAAAAGCAT TTTAGCCCAA TCTACGGCAT TCGATATGAA AATTCGTTAC
TCTTCTACAA AAAGTGCTTT AAAATTCATA CGAAAGTTGT TAAAATGA
 
Protein sequence
MSVSSIVNKQ KAYFYNGHTR SVEVRKNNLK KLYEGIQRFE EEIFQALKLD LNKSVHESFT 
TEVGYVLKEI SFQLKHMSSW SKPKRVRTAL THFGSKGKVV PEPYGVTLII APWNYPFQLA
IAPLVGALAA GNTIVLKPSE LTPNVSKVIT RMLAELFQEE LVAVVEGGVE ESTALLKEPF
DYIFFTGSVG VGKVVMEAAA KQLTPLTLEL GGKSPCIVHK DAKVDVTARR IVWGKFLNAG
QTCVAPDYLY VHASVKEQLI EALRHEIAEQ YGNEPLQNEN YVRIVSERHF ERLCRFLQDG
QVAIGGNYKR DTLHIEPTVV KDITWQDAVM EDEIFGPILP IIEYENIEEV IDTIQQHPKP
LALYVFSEDK EMQKKVTSNI SYGGGCVNDV VYHLATPYLP FGGVGSSGLG SYHGEESFRT
FSHYKSILAQ STAFDMKIRY SSTKSALKFI RKLLK