Gene BAS4220 details

Gene Information

Locus tag: BAS4220
Symbol:
ID: 2847942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4135815
End bp: 4136921
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 38%
IMG OID: 637507456
Product: germination protease
Protein accession: YP_030468
Protein GI: 49187216
COG category:
COG ID:
TIGRFAM ID: [TIGR01441] GPR endopeptidase
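
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4135815
END_BP = 4136921
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_length(protein_length: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumed)."""
    return (protein_length + 1) * 3


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)
```

Under these assumptions both checks agree with the listed gene length of 1107 bp.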


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0681612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAC CATTAGATTT AAGTAAATAT AGCGTTAGAA CTGACCTTGC TGTAGAGGCA 
CATCAAATGT TGCAAGAGCG TCAAGAAGAG CAACAACAAG GGATACAAGG AGTTATTGTA
AAAGAGAGGG AAGAAGAAGG TATTATCATT ACGAAAGTAA CGATTGATGA AGTTGCCTCT
GAATCGATGG GTAAAAAACC TGGAAATTAT TTAACACTTG AAGTACAAGG TATACGTCAA
CAAGATACGG AATTGCAACA AAAAGTAGAG CGCATTTTTG CAAAAGAATT TTCTTATTTC
TTAGAAGAGG TTGGCGTTAC GAAAGAAGCG AGTTGTTTAA TTGTTGGTCT TGGAAATTGG
AATGTAACCC CTGATGCGCT TGGACCGATA GTGGTAGAAA ATGTATTGGT AACGAGACAT
TTGTTTCAAT TGCAGCCTGA AAGTGTAGAA GAAGGCTTTA GGCCTGTTAG TGCAATTCGG
CCGGGGGTAA TGGGGATTAC AGGAATTGAA ACGAGCGATG TCATTTATGG AATCATTGAG
AAGACAAAAC CAGACTTTGT CATTGCAATT GATGCATTAG CTGCTCGTTC TATTGAAAGG
GTAAATAGTA CGATACAAAT TTCTGATACA GGAATTCATC CTGGATCGGG TGTTGGGAAT
AAACGTAAGG AACTGAGTAA AGAAACATTA GGTATTCCTG TTATCGCAAT TGGTGTTCCG
ACTGTGGTGG ATGCCGTTTC AATTACAAGC GATACAATTG ATTTTATTTT GAAACATTTT
GGCCGGGAGA TGAAAGAAGG AAACAAACCT TCTCGCTCTT TGTTACCAGC TGGTTTTACA
TTTGGAGAAA AGAAAAAATT AACAGAAGAG GATATGCCGG ATGAAAAGAG CCGAAATATG
TTTTTAGGTG CTGTAGGTAC ACTGGAAGAT GAAGAGAAGA GAAAATTAAT TTATGAAGTG
TTATCTCCTC TAGGTCATAA TTTAATGGTG ACTCCGAAAG AAGTGGATGC TTTCATAGAA
GATATGGCAA ATGTAATCGC AAGTGGTTTA AATGCAGCGC TGCATCATCA AATTGACCAA
GATAATACAG GAGCGTATAC ACATTGA
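
The GC content reported for this gene can in principle be recomputed from the sequence above. A minimal sketch, assuming the sequence is pasted in as a string that still contains the 10-base grouping spaces and line breaks; `gc_content` is a hypothetical helper name, not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to group bases on this
    page) is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First group of the gene sequence, used only as a short demonstration.
    print(round(gc_content("ATGAAAGAAC"), 1))
```

Applying the function to the full gene sequence should give a value close to the rounded figure listed in the record.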
 
Protein sequence
MKEPLDLSKY SVRTDLAVEA HQMLQERQEE QQQGIQGVIV KEREEEGIII TKVTIDEVAS 
ESMGKKPGNY LTLEVQGIRQ QDTELQQKVE RIFAKEFSYF LEEVGVTKEA SCLIVGLGNW
NVTPDALGPI VVENVLVTRH LFQLQPESVE EGFRPVSAIR PGVMGITGIE TSDVIYGIIE
KTKPDFVIAI DALAARSIER VNSTIQISDT GIHPGSGVGN KRKELSKETL GIPVIAIGVP
TVVDAVSITS DTIDFILKHF GREMKEGNKP SRSLLPAGFT FGEKKKLTEE DMPDEKSRNM
FLGAVGTLED EEKRKLIYEV LSPLGHNLMV TPKEVDAFIE DMANVIASGL NAALHHQIDQ
DNTGAYTH
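
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than using a library; it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model). The `translate` function and its stop-at-first-stop behaviour are illustrative assumptions.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = (
        "ATGAAAGAAC CATTAGATTT AAGTAAATAT "
        "AGCGTTAGAA CTGACCTTGC TGTAGAGGCA"
    )
    print(translate(first_line))  # MKEPLDLSKYSVRTDLAVEA
```

The first 60 bases of the gene translate to the first 20 residues of the protein sequence shown above.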