Gene BAS2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2028 
Symbol 
ID2849262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2036375 
End bp2037886 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content38% 
IMG OID637505278 
Productneutral metalloprotease 
Protein accessionYP_028291 
Protein GI49185039 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.12889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAATA AAAAAATGGT GGCAATGGCG ATGACTGTGC CATTAGTAAT GGGGACGCTT 
TCTACTGTTT CTGCAGTAGA GACAAAGAAA CAAGTAAAGG TGGAAGCTTA TTCGCCACAG
AAAAAAGCTA TTGAATATTT AAAAGGGAAT GCCAATCAGT ACGGATTAAA GCCAGACCTT
TCAGACTTGC AATATATTTC TACAACAGAA ACACCTGTAG CTTCATATGT TAGGTTCCAA
CAAGTTGTGA ATGGTGCACC TGTATTTTCA AATCAAATTA CAGTAACTCT AAACGGAGAA
GGAAAGGGTG TACTTGCTGT TTCTGATTAT CAACCTATTA AAGCTGTGAA GCAAGTAACG
GAAAAAATTA GTGAAAAAGA TGCAATACAA AAATCAATGG CGTATGTTGG AGAAGCAAGT
GAGCAAAACT TATGGGCTCC TACAGAGAAA GAATTTGGAT ACATTATTGA AGAGGGAATT
GCTCGTCCAG TATATAAAGT GGTTGTCCAT TCTAATAATC CATTTGGTGC ATGGGAAACA
TTTATTGATG CGGAAAATGG AAAGTTAATT AAAAAAGTTG ATATAAACCG AAAAGTAGAA
GGAACAGGAA AAGTATTTTT ACCTAACCCT GTCGTATCAA GTGGGAGTTT AGCAGGATTA
AAAGATAATA ACGATGCAGA TTCAACAGCG CTAACGAATC AATTAAAAAC TGTTACGTTA
AAAGGTTTAG ATGGGACAGG ATTTTTAATT GGAGAGTATG TAACGATATC TTCAAAAGCG
AGAACAAAAT CTACAAACTT ACAATTTAAC TATACACGCG CAAATGATAG CTTTGAAGAT
GTGATGTCGT ATTATCATAT CGATACTTTA CAACGTTACA TTCAAAGTCT AGGTTTTCAA
AATATTAATA ATCGCTCCAT TAAAGTGAAT GTGAATGGAA CGACGGCGGA CAACTCATTT
TATTCTCCGA CAACGAAAGC TTTAACTTTT GGTACTGGCG GAGTAGATGA TGCAGAAGAT
GCGGGTATTA TCGCACATGA ATATGGGCAT TCAATTCAAG ATAATCAAGT CCCTGGCTTC
GGTAGTTCTG CAGAAGGCGG AGCGATGGGA GAAGGATTTG GTGACTTCTT AGGTGCTACT
TATGAAGATG CTGTATCGAC TACAGGTTAC GGAAAAGCGT GTGTTGGAGA GTGGGATGCA
GCGGCTTATT CTAGTTCAGA TCCAACGTGC CTACGCCGAT TAGATACGAA TAAAGTGTAT
CCGAAAGATA TGAAAAATCA AGTACATGCT GACGGTGAAA TTTGGTCACA AGGACAATAT
GAGATGGCAC AAGCTTTTGG CCGTGATGTT GCAACAAAAA TTATTTTACA ATCACATTGG
TCATTAACAC CAAACTCTAA ATTTAGCGAT GGAGCAAAAG CAATTAAGCA AGCAGATGCT
CTTTTATACG GTGGACAACA CGCTGCTGAT ATTGATCGTA TTTGGGCAGC GAGAGGAATT
AGTACGAATT AA
 
Protein sequence
MFNKKMVAMA MTVPLVMGTL STVSAVETKK QVKVEAYSPQ KKAIEYLKGN ANQYGLKPDL 
SDLQYISTTE TPVASYVRFQ QVVNGAPVFS NQITVTLNGE GKGVLAVSDY QPIKAVKQVT
EKISEKDAIQ KSMAYVGEAS EQNLWAPTEK EFGYIIEEGI ARPVYKVVVH SNNPFGAWET
FIDAENGKLI KKVDINRKVE GTGKVFLPNP VVSSGSLAGL KDNNDADSTA LTNQLKTVTL
KGLDGTGFLI GEYVTISSKA RTKSTNLQFN YTRANDSFED VMSYYHIDTL QRYIQSLGFQ
NINNRSIKVN VNGTTADNSF YSPTTKALTF GTGGVDDAED AGIIAHEYGH SIQDNQVPGF
GSSAEGGAMG EGFGDFLGAT YEDAVSTTGY GKACVGEWDA AAYSSSDPTC LRRLDTNKVY
PKDMKNQVHA DGEIWSQGQY EMAQAFGRDV ATKIILQSHW SLTPNSKFSD GAKAIKQADA
LLYGGQHAAD IDRIWAARGI STN