Gene BAS5240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5240 
Symbol 
ID2848451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5125708 
End bp5126979 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content35% 
IMG OID637508494 
ProductD-alanyl-D-alanine carboxypeptidase 
Protein accessionYP_031478 
Protein GI49188225 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA TCTTGTCTAT TTTACTAGCG CTTTTATTGT CAATTTCTTC CTTAGGAGTA 
ACAACATCCC ATGCAGAAGA GAAAATACAC ATTGAGGCGG CGGCTGCACT TCTATTTGAT
GCAGATACAG GAAAAATACT GCATGAACAA AATCCTGATG AATTACTAGC TATCGCTAGT
ATGTCAAAGT TAATTGTTGT TTATGCTGTA TTAGAAGCAA TTAAAGAAGG AAAAATCACA
TGGGATACAA AGGTAAACAT TTCTGATTAT GCTTATGAAG TTTCACGTAA TAACGAATTC
TCTAACGTAC CGTTCGAAAA AGGACGTCAA TATACTGTAA GAGAACTATA TCATTCTATC
GTTATCTTCT CTGCAAACGG ATCTAGTATT GCCTTAGCAG AGCTTCTTGC AGGAAGTGAG
AAAAACTTCT TAAACCTTGC AAATGAACAT GCAAAAAAAC TAGGGTTAAA GAAATATAAA
TTTGTAAACG CTACTGGTTT AAATAACACT GACTTAAAAG GAAAACATCC TGAGGGTACT
GATCCAAATG CAGAAAACTC TATGTCAGCT CGCGATATGG GTATACTTTC AAAAGCAATG
ATTACAAAGT ATCCGGAAAT GCTAGAGGAT ACAAAACAAA GATTTAGAAA CTTCCCAGAT
AATCATCCGA AACCAATTCG TATGGAAAAC TGGAACTGGA TGTTACCAGG GGCTGCTTTC
GCTTATGAAG GTACTGATGG TTTAAAAACC GGAAGTTCTG ATACAGCTGG ATACGGATTT
ACCATTACTG CTAAACGTGG TGATGTACGT CTTATTTCGG TTATTATTAA AACAAAATCA
ATGGACGAGC GCTTCACAGA ATCTCGTGAA TTAATCGAAT ACGGGTTTAA TAACTTTGAA
AAACAAAAGC TAAAGGTTGA TAAAAACAAT ACACTCTCTG TCGCACAAGG GAAAGAGGAT
CAAGTAACTG TTGCACCGGA AAAAGAAATA ACCGTAATTG CGAAGAAAGG TAGCAAAAAT
CCTTATAAAA TTGGAACTGA AGTAGATAAA TCTCTTGCTG AAGATGGGCA TTTAGTTGCT
CCTATTAAGA AGGATGCCAA AGTAGGCTCA ATTACTTTAG AATCAACTGA TAAATATGGT
TTCTTAGACG GTAGTAACAG TATGAAAGTT ACTGCAAAAA CGACAGAAGA AGTTGAAAAA
GCAAATTGGT TTGTTTTAAC AATGCGCTCT ATTGGAGATT TCTTCTCAAA CTTATGGTCT
AAAGTTTTTT AA
 
Protein sequence
MKKILSILLA LLLSISSLGV TTSHAEEKIH IEAAAALLFD ADTGKILHEQ NPDELLAIAS 
MSKLIVVYAV LEAIKEGKIT WDTKVNISDY AYEVSRNNEF SNVPFEKGRQ YTVRELYHSI
VIFSANGSSI ALAELLAGSE KNFLNLANEH AKKLGLKKYK FVNATGLNNT DLKGKHPEGT
DPNAENSMSA RDMGILSKAM ITKYPEMLED TKQRFRNFPD NHPKPIRMEN WNWMLPGAAF
AYEGTDGLKT GSSDTAGYGF TITAKRGDVR LISVIIKTKS MDERFTESRE LIEYGFNNFE
KQKLKVDKNN TLSVAQGKED QVTVAPEKEI TVIAKKGSKN PYKIGTEVDK SLAEDGHLVA
PIKKDAKVGS ITLESTDKYG FLDGSNSMKV TAKTTEEVEK ANWFVLTMRS IGDFFSNLWS
KVF