Gene BAS4508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4508 
Symbol 
ID2850384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4423147 
End bp4424244 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content39% 
IMG OID637507746 
Productproline dipeptidase 
Protein accessionYP_030756 
Protein GI49187503 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCTA GATTAGAAAA TTTAATGCAA TGGCTAAAAG AAAAAAACGT AGAAGCTGCG 
TTCTTAACTT CTACACCAAA CGTCTTCTAC ATGACAAACT TCCACTGTGA ACCACACGAA
AGACTTCTTG GTATGTTTGT ATTCCAAGAA AAAGAGCCTA TTTTAATTTG CCCTAAAATG
GAAGAAGGCC AAGCACGTAA CGCCGGCTGG GCACATGAAA TTATCGGATT TACTGATACT
GACAGACCAT GGGATATGAT TGCAAAAGCA ATTAAAGACC GCGGCATCAA TGCAAACGCA
GTTGCAATTG AAAAAGAACT TTTAAACGTA GAGCGCTACG AAGAATTAAC AAAACTATTC
CCAAATGCAG CTTTCACATC AGCTGAGGAA AAAGTTCGTG AACTTCGTTT AATTAAAGAT
GAAAAAGAAC TTTCTATTTT ACGCGAAGCA GCTAAAATGG CAGACTATGC TGTTGAAGTT
GGTGTAAATG CAATTAAAGA AGATCGTAGC GAACTAGAAG TATTAGCAAT TATTGAACAT
GAATTAAAAA CAAAAGGCAT ACATAAAATG TCATTTGATA CGATGGTATT AGCTGGTGCA
AACTCTGCTC TTCCACACGG TATTCCAGGT GCAAACAAAA TGAAACGCGG CGATTTCGTA
CTATTTGATT TAGGCGTAAT CATTGACGGT TATTGCTCTG ACATTACACG TACAGTGGCA
TTTGGCGAGA TTTCTGAAGA ACAAACTCGC ATTTACAACA CTGTACTTGC TGGACAACTA
CAAGCAGTTG AAGCATGTAA ACCAGGTGTT ACACTTGGCG CAATCGACAA CGCTGCTCGT
TCTGTTATCG CAGATGCAGG TTATGGTGAC TTCTTCCCGC ACCGCCTTGG TCACGGACTT
GGAATTAGCG TGCACGAATA TCCAGATGTA AAAGCTGGTA ACGAATCTCC ATTAAAAGAA
GGTATGGTCT TCACAATTGA GCCAGGTATT TACGTACCAA ACGTAGGTGG CGTTCGTATT
GAAGATGATA TTTATATCAC AAAAGACGGG TCAGAAATTT TAACGAAGTT CCCGAAAGAA
TTACAATTTG TAAAATAA
 
Protein sequence
MNARLENLMQ WLKEKNVEAA FLTSTPNVFY MTNFHCEPHE RLLGMFVFQE KEPILICPKM 
EEGQARNAGW AHEIIGFTDT DRPWDMIAKA IKDRGINANA VAIEKELLNV ERYEELTKLF
PNAAFTSAEE KVRELRLIKD EKELSILREA AKMADYAVEV GVNAIKEDRS ELEVLAIIEH
ELKTKGIHKM SFDTMVLAGA NSALPHGIPG ANKMKRGDFV LFDLGVIIDG YCSDITRTVA
FGEISEEQTR IYNTVLAGQL QAVEACKPGV TLGAIDNAAR SVIADAGYGD FFPHRLGHGL
GISVHEYPDV KAGNESPLKE GMVFTIEPGI YVPNVGGVRI EDDIYITKDG SEILTKFPKE
LQFVK