Gene BAS4102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4102 
Symbol 
ID2852298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4031941 
End bp4033002 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content39% 
IMG OID637507339 
Productproline dipeptidase 
Protein accessionYP_030352 
Protein GI49187100 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0184718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAAA TCGAAAGATT AAGAAGTGCA TTTGATGAGG CTGGTATTGA CGGTATTTTG 
TTAACAAATG AACATAGTCG TAGATATATG GCTAACTTCA CAGGAACAGC TGGTGTTGTC
CTGATTTCGA AAAAACGCGC CCAATTTATT ACAGATTTCC GTTACGTAGA GCAGGCTAGT
AAACAAGCGG TTGGATACGA GATTGTACAG CATGCAGGAT TAATTATCGA TGAAGTTGCA
AAGCAAGTGA AAGAACTAGG AATTCAAAAG CTTGGCTTTG AGCAAGATAC TCTTACATAT
AGTTCTTATT CAGCTCATAA AGAAGCGATC GATGCTGAAT TTATCCCAAC TTCTGGGCTT
GTAGAAAAGT TACGCTTGAT AAAGACTGAT TCAGAGATTA AGATATTAAA GGAAGCTGCA
CAGATTGCAG ATGCTGCCTT TGAACATATT CTATCATTCA TTCGCCCGGG AGTATCTGAA
ATTGAAGTGT CAAATGAACT TGAATTTTTC ATGAGAAAAC AAGGAGCAAC ATCTTCTTCG
TTTGATATTA TCGTTGCTTC AGGTCTTCGT TCGGCATTAC CGCACGGCGT GGCATCTGAA
AAAGTGATAG AAACAGGAGA TTTCGTTACA TTAGACTTCG GCGCTTATTA CAAAGGATAT
TGCTCTGATA TTACTCGTAC GATTGCAGTT GGTGAACCAT CTGATAAATT GAAAGAAATT
TATAATATCG TTTTAGAAGC ACAATTACGT GGTGTGAACG GTATTAAAGC TGGTTTAACT
GGCCGTGAAG CGGATGCGTT AACGCGTGAT TACATAACGG AAAAAGGATA CGGTGAATAC
TTCGGACATT CTACTGGTCA TGGAATCGGT CTTGAAATCC ATGAAGCACC AGGTTTAGCA
TTCCGTTCTG ATACAGTACT TGAACCAGGT ATGGCTGTAA CAGTAGAGCC AGGTATTTAT
ATTCCAGGTA TTGGCGGCGT ACGTATTGAA GATGATATCA TTGTGACAAG TGAAGGTAAT
GAAGTAATTA CGAAATCACC AAAAGAACTT ATTATTTTGT AA
 
Protein sequence
MEKIERLRSA FDEAGIDGIL LTNEHSRRYM ANFTGTAGVV LISKKRAQFI TDFRYVEQAS 
KQAVGYEIVQ HAGLIIDEVA KQVKELGIQK LGFEQDTLTY SSYSAHKEAI DAEFIPTSGL
VEKLRLIKTD SEIKILKEAA QIADAAFEHI LSFIRPGVSE IEVSNELEFF MRKQGATSSS
FDIIVASGLR SALPHGVASE KVIETGDFVT LDFGAYYKGY CSDITRTIAV GEPSDKLKEI
YNIVLEAQLR GVNGIKAGLT GREADALTRD YITEKGYGEY FGHSTGHGIG LEIHEAPGLA
FRSDTVLEPG MAVTVEPGIY IPGIGGVRIE DDIIVTSEGN EVITKSPKEL IIL