Gene BAS5310 details


Gene Information

Locus tag: BAS5310
Symbol:
ID: 2852954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 5197244
End bp: 5199061
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 37%
IMG OID: 637508563
Product: oligoendopeptidase F
Protein accession: YP_031547
Protein GI: 49188294
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID: [TIGR00181] oligoendopeptidase F
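
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5197244, 5199061)
print(length)                     # 1818
print(protein_length_aa(length))  # 605
```

Both results agree with the Gene Length and Protein Length fields of this record.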


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGATG TAATTGAGAA ACGCCATATT CGCGCAGAAG TTCCTATTGA ATTAACATGG 
GACCTCTCTG ATTTATACGA ATCTGATAAA AAGTGGGAAA CTGCATTACG TGTATTAACA
GATGATATAA AAAAACTTGA TGCGTTTAAA GGACAATTAC ACACTAGCCC TACCACCTTA
TTACATTGCC TACTTTTAGA AGAAGAGCTT TTAATGAAGT TAACAAAACT ATACTCCTAT
GCAAATTTAA AAGAATCTAC TGATCGTACA AATCCAGTTA TTCAAGCGAA CTCTTCCAAA
ATTGCTGCTT TATGGACGAA AGTACATACG GCGCTATCCT TTATTCATAA TGAAATTCTC
TCATTAGAAG AAGGCACAAT TGAAAAATAT TTAACTAAAG AAACAAAACT TGAACCTTTC
CGTAAATCAT TACTAGACAT ATTACAAAAA AGGCAGTACA CGCTCTCTCC TGAAACAGAA
GAAGCACTTG CTGCACTTGG CGAAGTACAT AGTTCTCCAT ACAAAATTTA CGGTATGACT
AAATTAGCCG ATATGGATTT TAACTCCATA CAAGACGAAC AAGGAAATGA ACTCCCTGTC
TCATTTTCAT TATTTGAAAG TAAATATGAG TTTTCTCCAA GCGTAGACAT ACGCAGAAAG
GCATACTCAT CATTTGTGTC CACCTTGAAG CGATATAAAA ATACCGTGGC AACAACATAT
GCTACCGAGG TAAAAAAACA AGTAACACTC TCTCGTTTAC GCAAATTTGA ATCTGTTACT
CATATGCTTT TAGAGCCTCA AAACGTTCCA CTTGAAATGT ATAACAATCA ACTTGATATT
ATTTATAAAG AATTAGCGCC TCATATGCGC CGTTTTGCAG ATTTAAAAAA GAAAGTATTA
GGACTCGATC AAATGCTGTT CTGCGACTTA CATGCACCTT TAGATCCCGA ATTTAATCCA
GCAATTACTT ACGAGGAAGC TGGAAAACTT ATTCAAGACT CTTTACAAGT ACTTGGCGAT
GAATATAGTG CCATTATCAA AAAAGGGTTC AAAGAAAGAT GGGTCGATCT TGCAGATAAT
GTAGGGAAAT CAACAGGAGC ATTTTGTTCA AGCCCATATG GTTCTCATCC ATACATTTTA
ATTACATGGC AAAATACGAT GCGTGGCTGC TTCACATTGG CTCATGAATT TGGACATGCT
GGTCATTTTT ATTTAGCAAA TAAAAACCAG CGTATTATGA ATGTACGTCC ATCTATGTAC
TTTGTTGAAG CGCCATCTAC AATGAATGAA TTACTATTAG CCCAGCATTT ATTAGCGACG
ACTGACGATA AGAGAATGCG TAGATGGGTT ATTCTGCAAC TACTCGGCAC GTATTATCAT
AACTTTGTTA CCCACTTACT TGAGGGAGAA TATCAAAGGA GGGTATATAG CCTAGCAGAA
GAAGGAGAAG CACTTACAGC TACAACTTTA ACTGAAATAA AAACAAATGT CCTTTCAACA
TTCTGGGGAA ATTCCGTAGA AATTGATGAA GGTGCTGGCT TAACTTGGAT GCGTCAACCT
CATTATTATA TGGGCTTATA TTCTTACACG TATTCCGCAG GCCTCACTGC ATCTACTGCG
GTAGCTCAAA TGATTAAAGA AGAAGGACAA CCTGCCGTTG ATCGCTGGTT AGATGTACTT
CGCGCTGGTG GTACGATGAA ACCACTTGAA TTAATGAAAC ATGCCGGAGT CGATATGTCA
AAACCAGATG CAATCCGGAA AGCTGTTTCT TACGTCGGTT CCTTAATTGA TGAATTAGAA
CGCTCTTATG AAGAATAA
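
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure could be derived from such a listing, the helpers below strip the spacing and count G and C bases. The function names are hypothetical, and the record does not state how its own GC content value was computed or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above (60 bases).
first_line = (
    "ATGAAAGATG TAATTGAGAA ACGCCATATT "
    "CGCGCAGAAG TTCCTATTGA ATTAACATGG"
)
print(len(clean_sequence(first_line)))
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full listing as one multi-line string works the same way, since all whitespace is discarded before counting.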
 
Protein sequence
MKDVIEKRHI RAEVPIELTW DLSDLYESDK KWETALRVLT DDIKKLDAFK GQLHTSPTTL 
LHCLLLEEEL LMKLTKLYSY ANLKESTDRT NPVIQANSSK IAALWTKVHT ALSFIHNEIL
SLEEGTIEKY LTKETKLEPF RKSLLDILQK RQYTLSPETE EALAALGEVH SSPYKIYGMT
KLADMDFNSI QDEQGNELPV SFSLFESKYE FSPSVDIRRK AYSSFVSTLK RYKNTVATTY
ATEVKKQVTL SRLRKFESVT HMLLEPQNVP LEMYNNQLDI IYKELAPHMR RFADLKKKVL
GLDQMLFCDL HAPLDPEFNP AITYEEAGKL IQDSLQVLGD EYSAIIKKGF KERWVDLADN
VGKSTGAFCS SPYGSHPYIL ITWQNTMRGC FTLAHEFGHA GHFYLANKNQ RIMNVRPSMY
FVEAPSTMNE LLLAQHLLAT TDDKRMRRWV ILQLLGTYYH NFVTHLLEGE YQRRVYSLAE
EGEALTATTL TEIKTNVLST FWGNSVEIDE GAGLTWMRQP HYYMGLYSYT YSAGLTASTA
VAQMIKEEGQ PAVDRWLDVL RAGGTMKPLE LMKHAGVDMS KPDAIRKAVS YVGSLIDELE
RSYEE