Gene BAS2392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2392 
Symbol 
ID2852825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2388844 
End bp2389920 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content34% 
IMG OID637505639 
Productthermolysin metallopeptidase catalytic subunit 
Protein accessionYP_028652 
Protein GI49185400 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00577856 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA AAAAAGAAAT AGCTATAGTT GCATTAACAA CAGGATTGGT TTTAACAAGC 
ATAGTACCAT ACGGGATAGG TTATGCAGAG GAAACGGGTC AAATGCAAGT TGATATTCAA
GAGGATTCGT TCCGTACAGG TGAACTTACA AAACCATCAA AAAAAACGCC AGAGAGTGTA
GTGAAAGATG CACTTAAGGA AAAAACGGAG CATGTTTTGT CTCCAAAACA AGTTAGTGGA
GACAAAGGGG TAGATTACAA GGTCCTTCAA AAACGTGGTT CATATGATGG GACTACACTT
GTGCGTTTGC AACAAATATA TGAAGGAAAA GAAGTATATG GACACCAATT GACTGCTCAT
GTAGATAAAA AGGGTATTAT TAAAAGTGTT TCAGGGGAAA GCGCACAAAA TTTAGAAAAA
GAAGATTTAA AGAATCCTAT TAATTTATCA AAAGAAGAAG CAAAACAATA TATTTATAAA
AAGTACGGAA ATGATATTAA ATTTATTTCT GAGCCAGAAG TTAAGGAAGT TATTTTTGTT
GATGAAAATA ATGGACAGGC TAGCAATGCA TATCAAGTTA CATTTGCGGC TGCCACACCA
AACTATGTAT CTGGAACTTA TTTAGTGGAT GCCCATAATG GTGTTATGTT GAAAAATACG
TTACAAGAAT CCGATTTAAA AGTAAGTGAA GAGCAAGTTG AATCTTTAAA GGAGAATAAA
AAAAGCAATT CCATATCATT AACTGGAACA GGAAAAGATG ATTTAGGGAT AACTCGCATA
TTTGGTATTT CTGAACAGAG TAACGGAAAA TATGCGCTTG CTGATTATAC AAGAGGGCAA
GGAATTGAAA CGTACGATGT AAATTATAGA GATATTAATT TTGAAGAAAG ATATTATCCT
GGTATATTAG CAACTAGCAC TTCAACAACC TTTGATGATC CAAAGGCGGT CAGTGCTCAT
TTCTTAGCAA CAAAGGTATA TGATTTTTAT AAAGACAAAT ATAAGCGTAA TAGTTTTGAT
AATAAGGGAA AAAATAGTAT CAGTTGTACA TGCATGGCAT TCAGGAGAAA CAGATGA
 
Protein sequence
MKNKKEIAIV ALTTGLVLTS IVPYGIGYAE ETGQMQVDIQ EDSFRTGELT KPSKKTPESV 
VKDALKEKTE HVLSPKQVSG DKGVDYKVLQ KRGSYDGTTL VRLQQIYEGK EVYGHQLTAH
VDKKGIIKSV SGESAQNLEK EDLKNPINLS KEEAKQYIYK KYGNDIKFIS EPEVKEVIFV
DENNGQASNA YQVTFAAATP NYVSGTYLVD AHNGVMLKNT LQESDLKVSE EQVESLKENK
KSNSISLTGT GKDDLGITRI FGISEQSNGK YALADYTRGQ GIETYDVNYR DINFEERYYP
GILATSTSTT FDDPKAVSAH FLATKVYDFY KDKYKRNSFD NKGKNSISCT CMAFRRNR