Gene BAS4052 details

Gene Information

Locus tag: BAS4052
Symbol:
ID: 2851068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3986067
End bp: 3987185
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 38%
IMG OID: 637507289
Product: peptidase T
Protein accession: YP_030302
Protein GI: 49187050
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01883] peptidase T-like protein
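
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the gene sequence on this page ends in TAA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3986067
END_BP = 3987185
GENE_LENGTH_BP = 1119
PROTEIN_LENGTH_AA = 372


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3987185 - 3986067 + 1 gives 1119 bp, and 1119 / 3 - 1 gives 372 aa, which agrees with the listed lengths.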


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAATC AAGAACGTTT AGTAAATGAA TTCATGGAAT TAGTACAAGT AGATTCTGAA 
ACGAAATTTG AAGCAGAAAT TTGCAAAGTA TTAACAAAGA AATTTACAGA TTTAGGTGTA
GAAGTATTTG AAGATGACAC AATGGCTGTT ACTGGGCATG GTGCAGGTAA CTTAATTTGT
ACATTACCAG CAACAAAAGA TGGTGTTGAT ACAATTTACT TTACTTCTCA TATGGATACA
GTAGTTCCTG GTAATGGAAT TAAGCCTTCT ATTAAAGATG GATATATCGT ATCAGATGGT
ACTACGATTT TAGGTGCGGA TGATAAAGCG GGATTAGCAT CAATGTTTGA AGCAATCCGT
GTTTTAAAAG AGAAAAATAT CCCTCACGGC ACAATTGAAT TTATTATTAC AGTTGGAGAA
GAATCTGGTC TTGTTGGTGC AAAAGCATTA GATCGTGAGC GCATTACAGC GAAATATGGT
TACGCGTTAG ATAGCGATGG GAAAGTTGGC GAAATCGTTG TTGCAGCTCC AACACAAGCG
AAAGTGAACG CGATTATTCG CGGGAAAACA GCTCATGCAG GTGTAGCACC GGAAAAAGGC
GTATCTGCAA TTACGATCGC AGCGAAAGCA ATTGCGAAGA TGCCACTTGG TCGTATTGAT
TCTGAAACAA CTGCAAATAT TGGACGTTTT GAAGGTGGTA CACAAACGAA TATCGTTTGC
GATCATGTAC AAATCTTTGC AGAAGCGCGT TCTTTAATCA ATGAAAAAAT GGAAGTACAA
GTTGCGAAAA TGAAAGAAGC ATTTGAAACA ACTGCAAAAG AAATGGGCGG CCAAGCAGAT
GTTGAAGTAA AGGTTATGTA CCCAGGATTT AAATTTGCTG ATGGGGATCA CGTTGTAGAA
GTTGCAAAAC GCGCAGCTGA AAAAATTGGT CGTACACCTT CTCTTCACCA AAGTGGTGGC
GGAAGTGATG CAAACGTAAT TGCTGGACAC GGAATTCCAA CAGTTAACTT AGCAGTTGGT
TATGAAGAAA TTCATACAAC AAACGAAAAG ATTCCTGTTG AAGAATTAGC GAAAACAGCA
GAATTAGTTG TTGCAATCAT AGAGGAAGTA GCGAAATAA
 
Protein sequence
MINQERLVNE FMELVQVDSE TKFEAEICKV LTKKFTDLGV EVFEDDTMAV TGHGAGNLIC 
TLPATKDGVD TIYFTSHMDT VVPGNGIKPS IKDGYIVSDG TTILGADDKA GLASMFEAIR
VLKEKNIPHG TIEFIITVGE ESGLVGAKAL DRERITAKYG YALDSDGKVG EIVVAAPTQA
KVNAIIRGKT AHAGVAPEKG VSAITIAAKA IAKMPLGRID SETTANIGRF EGGTQTNIVC
DHVQIFAEAR SLINEKMEVQ VAKMKEAFET TAKEMGGQAD VEVKVMYPGF KFADGDHVVE
VAKRAAEKIG RTPSLHQSGG GSDANVIAGH GIPTVNLAVG YEEIHTTNEK IPVEELAKTA
ELVVAIIEEV AK
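
The GC content field in the gene information can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as shown on this page, in space-separated blocks of ten bases. The function name `gc_percent` is hypothetical, and the short string used in the demonstration is only the first twenty bases of the gene, not the full sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a demonstration.
    first_blocks = "ATGATTAATC AAGAACGTTT"
    print(gc_percent(first_blocks))
```

Applying the same function to the complete gene sequence would give a value to compare against the rounded figure listed in the record.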