Gene BAS5043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5043 
Symbol 
ID2853086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4915846 
End bp4917156 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content37% 
IMG OID637508298 
Productendopeptidase lytE 
Protein accessionYP_031282 
Protein GI49188029 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TAAAAATGGC ATCTTGCGCA TTAGTTGCAG GGTTAATGTT TTCAGGGCTA 
ACACCAAATG CATTTGCAGA AGATAAAATT TCTGACGTGA AATCACAAAT TAATACACAA
AATGACACTT TACATAAACA ACAACAAGAG CGTGATGAAT TACAAAAACA AATGAATGAC
TTAAACAAAA CAATTCAAGG TTTAGATAAG TCTGTTCAAG AGAATGCTGC AAAACTTGAT
GAAACAACGA AAAAAGTTGC AGATACTGAG CAATTAATCG AAAAGAAAAA TAAAGATATT
GCAGAATTAC AAACGAAAAT TGCAAAACGT GAAGAGTTAT TAAGAAAACG TTTAGTTGCA
CTTCAAGAGC AACCAAACAC GAACGTTGTA ACAGAAGTTC TTGTAAACTC TAAAAACGTT
GCAGATTTAG TTGATCGTTT AACTTCTGTT TCTAAAATTC TTGAGTCTGA TGAAGATATT
ATGAAAACAC AACAAGAAGA TCAAGCGAAC GTGAAAAAAG ATGTTGAAAC GGTAAAAACG
AAGCAAAAAG AATTAAAAGA AGCACAAGCT CAAATTGAAA CTGCTAAGAA AGAACTTGAC
GCTGAAAAAG CGAAAAAAGC AACAGCAGTA AATGATTTAA GCGGTAAAAT GGATACAGTT
GTAACTTCAA TGACAAGTAC GGAAAGTCAA TTGAAAGATC TTGAAAAACA AGCATTACAA
TTACAACGTA TTGCTGAACA AGAAGCGCAA GAAAAAGCTG CACAAGAAGC TGCAGCACAA
AAACAAGCAG AGCAAGCTGC TAAAGATGCG CAAGCTCAAC CAGCACAAGC GGCACCTGCT
CAGCCAGCAG CGCCAGCAAA CAACGGTGGA CAAGCTCAAA AAGAAGAGCC TAAAAAGGAA
GAGCCTAAAA AAGAAGCACC TAAAAAAGAA GATAAAAAAC CAGAACCGAC ACCAGGTCCA
GCTCCGGCTC CTGGCGTAAT TGGTAAAGCA CAACAATACT TAGGTTTACC ATATGTTTGG
GGAAGTGCAT CTCCATCAAA CGGTGGTTTT GACTGTAGTG GATTTATTTC TTACGTATTC
GGTGTAGGTC GTCAAGACGT TAATGGTTAC TGGCATTCAG TTTCAAAAGT AAGTAGCCCA
CAGCCAGGGG ACTTAGTATT CTTCCAAAAT ACTTATAAAA ATGGTCCATC TCACATCGGT
ATTTATGTTG GTAATGGCCA AATGATTCAT GCTGGTGATA AAGGTATTGC TTACTCTAGC
TTAAGCAGTA GCTACAACCA AAAACATTTC TTAGGATACG GTAGATTCTA G
 
Protein sequence
MKKLKMASCA LVAGLMFSGL TPNAFAEDKI SDVKSQINTQ NDTLHKQQQE RDELQKQMND 
LNKTIQGLDK SVQENAAKLD ETTKKVADTE QLIEKKNKDI AELQTKIAKR EELLRKRLVA
LQEQPNTNVV TEVLVNSKNV ADLVDRLTSV SKILESDEDI MKTQQEDQAN VKKDVETVKT
KQKELKEAQA QIETAKKELD AEKAKKATAV NDLSGKMDTV VTSMTSTESQ LKDLEKQALQ
LQRIAEQEAQ EKAAQEAAAQ KQAEQAAKDA QAQPAQAAPA QPAAPANNGG QAQKEEPKKE
EPKKEAPKKE DKKPEPTPGP APAPGVIGKA QQYLGLPYVW GSASPSNGGF DCSGFISYVF
GVGRQDVNGY WHSVSKVSSP QPGDLVFFQN TYKNGPSHIG IYVGNGQMIH AGDKGIAYSS
LSSSYNQKHF LGYGRF