Gene BAS5314 details

Gene Information

Locus tag: BAS5314
Symbol:
ID: 2852943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 5200937
End bp: 5202112
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 39%
IMG OID: 637508567
Product: serine protease
Protein accession: YP_031551
Protein GI: 49188298
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
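
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5200937
end_bp = 5202112
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein = codons - 1

print(span, span % 3, expected_protein)  # prints: 1176 0 391
```

Under these assumptions the computed span matches the listed gene length, the length is a whole number of codons, and the codon count minus the stop matches the listed protein length.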


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTTTA TTGATGGAGA AAATTATCGT ATGAAACGTG CAAGGAAAAA GAAGCATAAA 
GGTATTGTTA TTTCTAGCAT AGCAGGAACA ATTGTAGGAG CTTCGTTATT TGCATTTGGA
GCTCCTATAT TTTCAAATGA TACGGGCGCA CTTCCGCAAG CTGAAGCAAG TGGAAGTAAT
ATGGCCGAAG CTCAAGGAAT TAAACAGATT AGCTTTGTGG ATGCTGTTGA TCGTGCATCT
GAAGCTGTTG TTGGTATTAT TAATATTCAA CGAGATAATT TGTCAGAGGC AGATTCAGAA
GCTGGCACAG GCTCAGGTGT AATTTATAAG AAGACAAATG ATCAAGCTTA TATTGTAACG
AATAATCATG TTGTTGCTGG GGCAAATCGT ATTGAAGTAA GTTTAAGTGA CGGTAAGAAG
GTTCCAGGAA AGGTATTAGG AACCGATGTA GTCACAGATT TGGCTGTACT AGAGATAGAT
GCAAAGCATG TGAAAAAGGT CATTGAGATT GGCGATTCTA ATGCTGTTCG TAGAGGAGAA
CCAGTCATTG CGATTGGGAA CCCGCTCGGG CTACAATTTT CTGGAACCGT CACACAAGGT
ATTATTTCGG CTAATGAGCG TATTGTTCCT GTAGATTTAG ATCAAGATGG ACATTATGAT
TGGCAAGTAG AAGTATTGCA AACAGACGCA GCAATTAATC CGGGTAATAG TGGTGGGGCG
CTTGTAAATG CAGCAGGTCA ATTAATTGGT ATTAACTCAA TGAAAATTGC CGCAAAAGAA
GTAGAAGGAA TTGGTCTAGC TATTCCAGTG ACTAGAGCTG TTCCAATTAT GAATGAATTA
GAGAAGTACG GAAAAGTAAG AAGACCGTAT GTTGGAATTG AACTGAGATC ATTAAATGAG
ATTCCAAACT ATTATTGGTC AAAAACATTG CATTTACCAG GCAATGTAAC AGAGGGAGTT
TGCATTTTAG ATGTGAAAAG TCCTTCGCCA GGCACAGATG CTGGTTTACG AGAACACGAT
GTAATTGTAG CAGTAGATGG AAAACCGGTT CGTGATATTA TCGGATTCCG TACGGCCTTA
TATGATAAAA AAATTAATGA TAAAATGACT CTTACGTTTT ATCGTGGTAC GAAACGAGCA
ACAACAACGG TTAAACTAGG CATTCAAAAG TATTAA
 
Protein sequence
MSFIDGENYR MKRARKKKHK GIVISSIAGT IVGASLFAFG APIFSNDTGA LPQAEASGSN 
MAEAQGIKQI SFVDAVDRAS EAVVGIINIQ RDNLSEADSE AGTGSGVIYK KTNDQAYIVT
NNHVVAGANR IEVSLSDGKK VPGKVLGTDV VTDLAVLEID AKHVKKVIEI GDSNAVRRGE
PVIAIGNPLG LQFSGTVTQG IISANERIVP VDLDQDGHYD WQVEVLQTDA AINPGNSGGA
LVNAAGQLIG INSMKIAAKE VEGIGLAIPV TRAVPIMNEL EKYGKVRRPY VGIELRSLNE
IPNYYWSKTL HLPGNVTEGV CILDVKSPSP GTDAGLREHD VIVAVDGKPV RDIIGFRTAL
YDKKINDKMT LTFYRGTKRA TTTVKLGIQK Y
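
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a sketch only, not the tool used to produce the record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons).

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid string used for NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
head = "ATGTCCTTTA TTGATGGAGA AAATTATCGT ATGAAACGTG CAAGGAAAAA GAAGCATAAA"
print(translate(head))  # prints: MSFIDGENYRMKRARKKKHK
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.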