Gene BAS4907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4907 
Symbol 
ID2852335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4789136 
End bp4790794 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content37% 
IMG OID637508163 
Productneutral protease B 
Protein accessionYP_031148 
Protein GI49187895 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAATG TTATAGTGAA AAAGCAAGTC ATTTCATCAG CATTAGCGTT AACTGTTATC 
GCCGGGGGAT TTGGAACATT TGGAGCAACG ACAACGAAAG CGGAAGAACA AAAAATTCAA
TATCATCAAG AATTTAAAAC GCCTGCATAC ATAGGTGAAG AATGGAAAGC ACCGGAAGGA
CTAGATAAAA AAGAGACAGT CTTTCAATAT TTAGAGAGTA AAAAAGACAT GTTTAAATTA
GCAGGAAATA TTGATAAACA TTTCAATGTC GTTGGGGAAG AAAAAGATGC TGAATCTGGC
ACAACACACG TGAAGCTAGT TGAGAAACAT AATAACATTC CTGTGTATGG TTCAGACCAA
ACTGTTACAC TAGATAAAGA AAATAATGTA AAAGCATTCT TCGGACAAGT TATTCCGAAT
TTAGAGGATA AAAATATTCC TGAGTTTGCA AGCATTAGTG CGGAACAAGC AGAAACGATT
GCAAAGGCAG ATATTGAAAA AGAATTGGGT AAAGTAAAGA ATTATGACGG TGTGAAAAAA
GATTTATTTG TTTATGAAAA AGATGGAAAA TACTATCTTG CATATTTAGT GAAAGCGTCG
ATTTCAAAAC CAGCTCCAGG ATATTGGCAT TATTTTGTTG ATGCAACGAA TGGAAATGTG
ATTGAGAAAT ATAATGCTGT AGATCATATT ACAGGATTTG GTTACGAAGT ACTAGGAGCT
AGACAATCAT TTGAAATTGC TCAAGATGAG AAAACAGGAG CATTCAACTT ATTTGACGGG
AAACGAGGAC AAGGTGTTCA TACATTTGAT GCAGATAATA TGGATGAAAA TTTATTTAAC
ATATTCTCGC AATGGTTTGG ATATACAGGT CTAGAAGTAG AAAGTAAAAA TAAATTCTTT
GATGACAAAG CGGCGGTTGA TGCGCATGTA AACGCAGGAA AAGTATATGA TTACTACAAA
AAGACATTTA ATCGTAACTC GTTCGATGAT AAAGGTGCGA AACTTATTTC CTCTGTTCAC
GTAGGTGAAA ACTGGAATAA CGCAGCTTGG AATGGTGTGC AAATGATGTA TGGTGATGGC
GATGGTACAA CATTTATTTC ATTATCTGCT GGATTAGATG TTATCGGTCA CGAATTAACG
CATGCTGTAA CGGAACATAC AGCAAATCTT GTTTATAAAA ATGAGTCAGG TGCGTTAAAT
GAATCGTTAT CTGATATTAT GGGTGTCATG GTTGAGAAGA AGAGTTGGGA TTTAGGTGCT
GACATTTATA CACCTGGAAA ACCTGGTGAT GCACTTCGTT CTCTGAAAGA TCCAGCGTCT
ATTCCAAATC CATTAAAACC AGGTGAAGGT TACCCTGATC ATTACAATAA ACGCTACACT
GGAACAGCTG ATAATGGCGG CGTTCATATT AACAGTAGTA TTAACAATAA AGCTGCCTAT
TTAGTGTCTG ATGGTGGAGA GCATTACGGT GTGAAAGTAA CTGGAGTTGG CCGTGAAGCG
ACAGAGAAAA TTTATTACCG TGCTCTTACG AAATATTTAA CTGCAAACTC TGACTTCAAA
ATGATGCGTC AAGCGGCACT TCAGTCAGCT GAAGATTTAT ATGGTAAAGA TTCTAAAGCT
GTACAAGCTG TAACGAAAGC TTATGATGCA GTAGCGTAA
 
Protein sequence
MGNVIVKKQV ISSALALTVI AGGFGTFGAT TTKAEEQKIQ YHQEFKTPAY IGEEWKAPEG 
LDKKETVFQY LESKKDMFKL AGNIDKHFNV VGEEKDAESG TTHVKLVEKH NNIPVYGSDQ
TVTLDKENNV KAFFGQVIPN LEDKNIPEFA SISAEQAETI AKADIEKELG KVKNYDGVKK
DLFVYEKDGK YYLAYLVKAS ISKPAPGYWH YFVDATNGNV IEKYNAVDHI TGFGYEVLGA
RQSFEIAQDE KTGAFNLFDG KRGQGVHTFD ADNMDENLFN IFSQWFGYTG LEVESKNKFF
DDKAAVDAHV NAGKVYDYYK KTFNRNSFDD KGAKLISSVH VGENWNNAAW NGVQMMYGDG
DGTTFISLSA GLDVIGHELT HAVTEHTANL VYKNESGALN ESLSDIMGVM VEKKSWDLGA
DIYTPGKPGD ALRSLKDPAS IPNPLKPGEG YPDHYNKRYT GTADNGGVHI NSSINNKAAY
LVSDGGEHYG VKVTGVGREA TEKIYYRALT KYLTANSDFK MMRQAALQSA EDLYGKDSKA
VQAVTKAYDA VA