Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A3307 |
Symbol | pepD |
ID | 5801784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 3514085 |
End bp | 3515545 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641341130 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_001607652 |
Protein GI | 162418581 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00393893 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00000164378 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCTGAAC TGTCTCAACT TTCGCCTCAG CCGTTGTGGG ATATTTTTGC AAAAATTTGT TCTATCCCAC ATCCCTCTTA CCATGAAGAA GCGCTGGCAC AATATATTGT TACCTGGGCC AAAGAGAAAG GCCTGCATGC CGAGCGTGAT CAGGTCGGTA ATATCCTGCT GCGTAAACCT GCCACTAAAG GGATGGAAAA CCGCAAGCCT GTTGCACTGC AAGCGCATTT AGATATGGTG CCACAGAAAA ATAATGACAC GGTACATGAC TTTACCAAAG ATCCTATCCA GCCTTATATC GACGGCGAAT GGGTGAAAGC CCGTGGTACC ACATTAGGCG CAGATCATGG TATTGGTATG GCATCCGCTC TGGCAGTGTT ATCTGATGAC CGCGTTGAGC ATGGGCCACT AGAAGTGTTG TTAACCATGA CCGAAGAAGC CGGTATGGAT GGTGCCTTCG GCCTGCAACC TAATTGGCTG AAAGCCGATA TTCTGATCAA TACCGATTCT GAGCAGGAAG GCGAAATCTA CATGGGCTGT GCTGGTGGTA TTGATTTCAT CACCACCATG CCGCTACAGC GGGAAGCTAT CCCTGCGGGC TATCAAACAC TAAAATTGAC GATCAAAGGC CTAAAAGGCG GCCACTCAGG TGCGGATATC CATTTAGGTT TAGGTAACGC CAACAAACTG CTGGCCCGCT TCCTGTTTGA ACAAGCAAAA GATTTAGATC TGCGGGTGCT GGATCTGAAT GGCGGTACTT TACGTAACGC AATTCCACGG GAAGGTCACG TAACACTGGC TGTTGCCGCA GACAAAGTGG AACAGCTGAA AAAACTGAGC CAGAATTACC TGGCAACGTT GAAAGACGAG TTGATCGCGG TCGAAAAAAA TCTGACCCTA GTACTCGAAC CCGTCTCTAC CGAGACGAAA GCACTGACTA AAGAGACTCA ACAGCGTTTT GTCGCATTAC TGAATGCCAC GCCAAATGGT GTGATCCGCA TGAGTGATGC GGTTAAAGGT GTCGTAGAAA CCTCACTTAA CGTGGGTGTT GTTACGATGA ATGAGCATGA AGCAGAAATT GTCTGCCTGA TTCGTTCCCT GATCGACAGC GGTAAAGATT ACGTAGCCAG TATGCTGACC GCAATTGGTG AATTAGCGGG TGCCAAAACA TCGCCAAGCG GCGGCTATCC TGGCTGGCAA CCGGATCCAA CGTCACCGGT CATGGCACTG GTACGGGAAA CCTACCAAAA ACTGTTCAAC AAAACGCCTA ACATCATGGT TATCCATGCC GGTCTGGAAT GTGGTTTGTT CAAAAAACCC CATCCTAACA TGGACATGGT GTCGATTGGG CCAACCATGA CCGGCCCGCA TTCACCAGAT GAACAAGTTC ATATTGAGAG CGTTGGTCAA TATTGGCAGT TATTAACCGC CCTGCTGAAA GCGATACCTG AACGTACATA A
|
Protein sequence | MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE ALAQYIVTWA KEKGLHAERD QVGNILLRKP ATKGMENRKP VALQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADHGIGM ASALAVLSDD RVEHGPLEVL LTMTEEAGMD GAFGLQPNWL KADILINTDS EQEGEIYMGC AGGIDFITTM PLQREAIPAG YQTLKLTIKG LKGGHSGADI HLGLGNANKL LARFLFEQAK DLDLRVLDLN GGTLRNAIPR EGHVTLAVAA DKVEQLKKLS QNYLATLKDE LIAVEKNLTL VLEPVSTETK ALTKETQQRF VALLNATPNG VIRMSDAVKG VVETSLNVGV VTMNEHEAEI VCLIRSLIDS GKDYVASMLT AIGELAGAKT SPSGGYPGWQ PDPTSPVMAL VRETYQKLFN KTPNIMVIHA GLECGLFKKP HPNMDMVSIG PTMTGPHSPD EQVHIESVGQ YWQLLTALLK AIPERT
|
| |