Gene BAS3986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3986 
Symbol 
ID2847994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3926657 
End bp3927847 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content36% 
IMG OID637507223 
ProductD-alanyl-D-alanine carboxypeptidase 
Protein accessionYP_030236 
Protein GI49186984 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.997885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGAG TTTTTGGAAT ACTTGTTTGT TTCATGTTAT TGCTTTCGGG TACTTCAGTT 
AGTTTCGCAC AATCTGAGAA AACGAAGCAG GATAAAACAG AAGAAACAAC GCCGAAGTTA
GCAGAACAAG CGTCGTCAGC AATTGTAATT GAACAAGATA CAGGTAAAGT TTTGTTTGAT
AAAAATCCAA ATGAGAAATT ACCGCCTGCT AGTATGACGA AGATTATGAC AATGCTATTA
ATTATGGAAC AAGTTGAAAA AGGAAAATTA AAACTGACCG ATAAAGTTAG AGCGAGTGAG
CATGCAGCTT CAATGGGTGG ATCACAAATC TTTTTAGAGC CTGGAGAAGA GATGACTGTA
AATGAAATGC TAAAGGGTAT TGCAATTGCA TCTGGAAATG ATGCGTCTGT TGCAGTAGCG
GAGCATATCG CTGGTTCAGA AGAAGGTTTT GTAAATATGA TGAACAAAAA AGCGAAAGAT
TTAGGGCTAA AAAATACTCA TTTTCAAAAT CCAACAGGTC TTCCGGCTAA AGACCATTAT
TCTACAGCAA ATGATATGGC TATCATGGCG AAAGAATTGA TGAAGTACCC ACTTATTCGC
AAATACACAG GTAAATACGA AGACTATTTA CGTGAAGATA CGGATAAGAA GTTTTGGCTC
GTTAATACGA ATAAGTTAGT ACGTTTTTAT CCTGGAGTAG ATGGTGTAAA AACGGGCTTT
ACGACAGAAG CAAAATATTG TTTAACAGCA TCGGCTGAGA AGAATGGGAT GCGTGTTATT
TCAGTTGTTA TGGGAGCACC TACATCAAAA GAACGGAACA ATCAAGTAAC GAAGCTTCTT
GACTACGCAT TTGGTCAATA TATGACAAAA AAATTGTATA CACGAGGCGA AAAAATTAAA
ACTGTCCAAG TAGGAAAAGG GAAAAAAGAA AAAGTAGATT TAGTTGCGTC AGACAATGTA
TCTCTTCTTA TGAAGAAGGG CGAAAATATG GACAAGGTAA AGCAAGAAGT AATTGCTGAA
AAGAAAGTGA AAGCGCCGAT TAAAAAAGGT GATGCACTTG GTACGCTTGT TATTAAAAAA
GATAAAGATG TTTTATTAAA ACAAACAATT GTAGCAAAAG AAGATGTTGC TGCAGCGAGC
TGGTGGGAGT TATTTAAAAG AAGTTTTGGG ATGTTTTCAA CATCAAAATA G
 
Protein sequence
MKRVFGILVC FMLLLSGTSV SFAQSEKTKQ DKTEETTPKL AEQASSAIVI EQDTGKVLFD 
KNPNEKLPPA SMTKIMTMLL IMEQVEKGKL KLTDKVRASE HAASMGGSQI FLEPGEEMTV
NEMLKGIAIA SGNDASVAVA EHIAGSEEGF VNMMNKKAKD LGLKNTHFQN PTGLPAKDHY
STANDMAIMA KELMKYPLIR KYTGKYEDYL REDTDKKFWL VNTNKLVRFY PGVDGVKTGF
TTEAKYCLTA SAEKNGMRVI SVVMGAPTSK ERNNQVTKLL DYAFGQYMTK KLYTRGEKIK
TVQVGKGKKE KVDLVASDNV SLLMKKGENM DKVKQEVIAE KKVKAPIKKG DALGTLVIKK
DKDVLLKQTI VAKEDVAAAS WWELFKRSFG MFSTSK