Gene BAS3395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3395 
Symbol 
ID2849056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3368530 
End bp3369771 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content37% 
IMG OID637506638 
Productserine protease 
Protein accessionYP_029651 
Protein GI49186399 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0740174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATATT ACGACGGACC AAATTTAAAT GAAGAGCATA GTGAAACGAG AGAAGTGAGA 
AAATCGGGCA GTAAAAAAGG CTATTTTTTC ACAGGTTTAG TCGGAGCTGT AGTTGGGGCT
GTTTCAATTA GTTTTGCGGC ACCATATATG CCATGGGCTC AAAATAATGG AGCGACTGTA
TCATCTTTTA GTTCAGATTC AAAAGTTGAA GGTACTGTAG TTCCTGTTGT CAATAAAGCA
AAAAATGAAA CGGATTTACC TGGTATGATT GAAGGCGCGA AAGATGTTGT TGTAGGTGTT
ATTAACATGC AACAAAGCAT TGATCCATTT GCAATGCAAC CGACAGGTCA AGAGCAACAA
GCTGGTTCAG GATCAGGTGT TATTTATAAA AAGGCAGGAA ATAAAGCATA TATTGTAACG
AACAATCATG TAGTAGATGG TGCAAATAAA CTTGCTGTAA AACTGAGTGA TGGCAAGAAG
GTAGATGCAA AGCTTGTAGG GAAAGACCCT TGGTTAGATT TAGCTGTTGT TGAAATTGAT
GGTGCTAATG TTAATAAAGT TGCCACTTTA GGTGATTCTA GTAAAATCCG TGCGGGTGAA
AAAGCAATTG CAATCGGTAA CCCATTAGGA TTTGACGGAA GTGTAACGGA AGGTATTATT
AGTAGTAAAG AACGTGAAAT TCCAGTAGAT ATCGATGGCG ATAAGCGTGC AGATTGGAAT
GCTCAAGTTA TTCAAACAGA TGCAGCAATT AACCCTGGGA ACAGTGGTGG TGCGTTATTT
AACCAAAACG GAGAAATAAT TGGGATTAAT TCAAGTAAAA TTGCACAACA AGAAGTTGAA
GGAATTGGAT TTGCTATTCC AATTAATATC GCAAAACCAG TTATTGAATC ACTTGAAAAA
GACGGAGTAG TGAAACGTCC AGCTCTTGGA GTAGGTGTCG TTTCATTAGA AGATGTGCAA
GCTTATGCAG TAAATCAATT GAAAGTGCCA AAAGAAGTAA CAAACGGTGT TGTATTAGGT
AAAATTTACC CAATATCACC TGCAGAAAAA GCTGGTTTAG AGCAATATGA TATTGTAGTA
GCATTAGATA ATCAAAAAGT AGAAAACTCA CTTCAATTCC GTAAATATTT ATATGAGAAG
AAAAAAGTAG GCGAGAAAGT GGAAGTTACA TTCTACCGTA ACGGTCAAAA AATGACGAAA
ACAGCTACTT TAGCAGATAA CTCAGCTACA AAGAATCAAT AA
 
Protein sequence
MGYYDGPNLN EEHSETREVR KSGSKKGYFF TGLVGAVVGA VSISFAAPYM PWAQNNGATV 
SSFSSDSKVE GTVVPVVNKA KNETDLPGMI EGAKDVVVGV INMQQSIDPF AMQPTGQEQQ
AGSGSGVIYK KAGNKAYIVT NNHVVDGANK LAVKLSDGKK VDAKLVGKDP WLDLAVVEID
GANVNKVATL GDSSKIRAGE KAIAIGNPLG FDGSVTEGII SSKEREIPVD IDGDKRADWN
AQVIQTDAAI NPGNSGGALF NQNGEIIGIN SSKIAQQEVE GIGFAIPINI AKPVIESLEK
DGVVKRPALG VGVVSLEDVQ AYAVNQLKVP KEVTNGVVLG KIYPISPAEK AGLEQYDIVV
ALDNQKVENS LQFRKYLYEK KKVGEKVEVT FYRNGQKMTK TATLADNSAT KNQ