Gene GBAA_4422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_4422 
SymbolpepQ-1 
ID2818369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4031569 
End bp4032630 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content39% 
IMG OID637791121 
Productproline dipeptidase 
Protein accessionYP_021064 
Protein GI47529715 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAAA TCGAAAGATT AAGAAGTGCA TTTGATGAGG CTGGTATTGA CGGTATTTTG 
TTAACAAATG AACATAGTCG TAGATATATG GCTAACTTCA CAGGAACAGC TGGTGTTGTC
CTGATTTCGA AAAAACGCGC CCAATTTATT ACAGATTTCC GTTACGTAGA GCAGGCTAGT
AAACAAGCGG TTGGATACGA GATTGTACAG CATGCAGGAT TAATTATCGA TGAAGTTGCA
AAGCAAGTGA AAGAACTAGG AATTCAAAAG CTTGGCTTTG AGCAAGATAC TCTTACATAT
AGTTCTTATT CAGCTCATAA AGAAGCGATC GATGCTGAAT TTATCCCAAC TTCTGGGCTT
GTAGAAAAGT TACGCTTGAT AAAGACTGAT TCAGAGATTA AGATATTAAA GGAAGCTGCA
CAGATTGCAG ATGCTGCCTT TGAACATATT CTATCATTCA TTCGCCCGGG AGTATCTGAA
ATTGAAGTGT CAAATGAACT TGAATTTTTC ATGAGAAAAC AAGGAGCAAC ATCTTCTTCG
TTTGATATTA TCGTTGCTTC AGGTCTTCGT TCGGCATTAC CGCACGGCGT GGCATCTGAA
AAAGTGATAG AAACAGGAGA TTTCGTTACA TTAGACTTCG GCGCTTATTA CAAAGGATAT
TGCTCTGATA TTACTCGTAC GATTGCAGTT GGTGAACCAT CTGATAAATT GAAAGAAATT
TATAATATCG TTTTAGAAGC ACAATTACGT GGTGTGAACG GTATTAAAGC TGGTTTAACT
GGCCGTGAAG CGGATGCGTT AACGCGTGAT TACATAACGG AAAAAGGATA CGGTGAATAC
TTCGGACATT CTACTGGTCA TGGAATCGGT CTTGAAATCC ATGAAGCACC AGGTTTAGCA
TTCCGTTCTG ATACAGTACT TGAACCAGGT ATGGCTGTAA CAGTAGAGCC AGGTATTTAT
ATTCCAGGTA TTGGCGGCGT ACGTATTGAA GATGATATCA TTGTGACAAG TGAAGGTAAT
GAAGTAATTA CGAAATCACC AAAAGAACTT ATTATTTTGT AA
 
Protein sequence
MEKIERLRSA FDEAGIDGIL LTNEHSRRYM ANFTGTAGVV LISKKRAQFI TDFRYVEQAS 
KQAVGYEIVQ HAGLIIDEVA KQVKELGIQK LGFEQDTLTY SSYSAHKEAI DAEFIPTSGL
VEKLRLIKTD SEIKILKEAA QIADAAFEHI LSFIRPGVSE IEVSNELEFF MRKQGATSSS
FDIIVASGLR SALPHGVASE KVIETGDFVT LDFGAYYKGY CSDITRTIAV GEPSDKLKEI
YNIVLEAQLR GVNGIKAGLT GREADALTRD YITEKGYGEY FGHSTGHGIG LEIHEAPGLA
FRSDTVLEPG MAVTVEPGIY IPGIGGVRIE DDIIVTSEGN EVITKSPKEL IIL