Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_4422 |
Symbol | pepQ-1 |
ID | 2818369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | - |
Start bp | 4031569 |
End bp | 4032630 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637791121 |
Product | proline dipeptidase |
Protein accession | YP_021064 |
Protein GI | 47529715 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAAA TCGAAAGATT AAGAAGTGCA TTTGATGAGG CTGGTATTGA CGGTATTTTG TTAACAAATG AACATAGTCG TAGATATATG GCTAACTTCA CAGGAACAGC TGGTGTTGTC CTGATTTCGA AAAAACGCGC CCAATTTATT ACAGATTTCC GTTACGTAGA GCAGGCTAGT AAACAAGCGG TTGGATACGA GATTGTACAG CATGCAGGAT TAATTATCGA TGAAGTTGCA AAGCAAGTGA AAGAACTAGG AATTCAAAAG CTTGGCTTTG AGCAAGATAC TCTTACATAT AGTTCTTATT CAGCTCATAA AGAAGCGATC GATGCTGAAT TTATCCCAAC TTCTGGGCTT GTAGAAAAGT TACGCTTGAT AAAGACTGAT TCAGAGATTA AGATATTAAA GGAAGCTGCA CAGATTGCAG ATGCTGCCTT TGAACATATT CTATCATTCA TTCGCCCGGG AGTATCTGAA ATTGAAGTGT CAAATGAACT TGAATTTTTC ATGAGAAAAC AAGGAGCAAC ATCTTCTTCG TTTGATATTA TCGTTGCTTC AGGTCTTCGT TCGGCATTAC CGCACGGCGT GGCATCTGAA AAAGTGATAG AAACAGGAGA TTTCGTTACA TTAGACTTCG GCGCTTATTA CAAAGGATAT TGCTCTGATA TTACTCGTAC GATTGCAGTT GGTGAACCAT CTGATAAATT GAAAGAAATT TATAATATCG TTTTAGAAGC ACAATTACGT GGTGTGAACG GTATTAAAGC TGGTTTAACT GGCCGTGAAG CGGATGCGTT AACGCGTGAT TACATAACGG AAAAAGGATA CGGTGAATAC TTCGGACATT CTACTGGTCA TGGAATCGGT CTTGAAATCC ATGAAGCACC AGGTTTAGCA TTCCGTTCTG ATACAGTACT TGAACCAGGT ATGGCTGTAA CAGTAGAGCC AGGTATTTAT ATTCCAGGTA TTGGCGGCGT ACGTATTGAA GATGATATCA TTGTGACAAG TGAAGGTAAT GAAGTAATTA CGAAATCACC AAAAGAACTT ATTATTTTGT AA
|
Protein sequence | MEKIERLRSA FDEAGIDGIL LTNEHSRRYM ANFTGTAGVV LISKKRAQFI TDFRYVEQAS KQAVGYEIVQ HAGLIIDEVA KQVKELGIQK LGFEQDTLTY SSYSAHKEAI DAEFIPTSGL VEKLRLIKTD SEIKILKEAA QIADAAFEHI LSFIRPGVSE IEVSNELEFF MRKQGATSSS FDIIVASGLR SALPHGVASE KVIETGDFVT LDFGAYYKGY CSDITRTIAV GEPSDKLKEI YNIVLEAQLR GVNGIKAGLT GREADALTRD YITEKGYGEY FGHSTGHGIG LEIHEAPGLA FRSDTVLEPG MAVTVEPGIY IPGIGGVRIE DDIIVTSEGN EVITKSPKEL IIL
|
| |