Gene YpAngola_A3307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3307 
SymbolpepD 
ID5801784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3514085 
End bp3515545 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content49% 
IMG OID641341130 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_001607652 
Protein GI162418581 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00393893 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00000164378 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAACT TTCGCCTCAG CCGTTGTGGG ATATTTTTGC AAAAATTTGT 
TCTATCCCAC ATCCCTCTTA CCATGAAGAA GCGCTGGCAC AATATATTGT TACCTGGGCC
AAAGAGAAAG GCCTGCATGC CGAGCGTGAT CAGGTCGGTA ATATCCTGCT GCGTAAACCT
GCCACTAAAG GGATGGAAAA CCGCAAGCCT GTTGCACTGC AAGCGCATTT AGATATGGTG
CCACAGAAAA ATAATGACAC GGTACATGAC TTTACCAAAG ATCCTATCCA GCCTTATATC
GACGGCGAAT GGGTGAAAGC CCGTGGTACC ACATTAGGCG CAGATCATGG TATTGGTATG
GCATCCGCTC TGGCAGTGTT ATCTGATGAC CGCGTTGAGC ATGGGCCACT AGAAGTGTTG
TTAACCATGA CCGAAGAAGC CGGTATGGAT GGTGCCTTCG GCCTGCAACC TAATTGGCTG
AAAGCCGATA TTCTGATCAA TACCGATTCT GAGCAGGAAG GCGAAATCTA CATGGGCTGT
GCTGGTGGTA TTGATTTCAT CACCACCATG CCGCTACAGC GGGAAGCTAT CCCTGCGGGC
TATCAAACAC TAAAATTGAC GATCAAAGGC CTAAAAGGCG GCCACTCAGG TGCGGATATC
CATTTAGGTT TAGGTAACGC CAACAAACTG CTGGCCCGCT TCCTGTTTGA ACAAGCAAAA
GATTTAGATC TGCGGGTGCT GGATCTGAAT GGCGGTACTT TACGTAACGC AATTCCACGG
GAAGGTCACG TAACACTGGC TGTTGCCGCA GACAAAGTGG AACAGCTGAA AAAACTGAGC
CAGAATTACC TGGCAACGTT GAAAGACGAG TTGATCGCGG TCGAAAAAAA TCTGACCCTA
GTACTCGAAC CCGTCTCTAC CGAGACGAAA GCACTGACTA AAGAGACTCA ACAGCGTTTT
GTCGCATTAC TGAATGCCAC GCCAAATGGT GTGATCCGCA TGAGTGATGC GGTTAAAGGT
GTCGTAGAAA CCTCACTTAA CGTGGGTGTT GTTACGATGA ATGAGCATGA AGCAGAAATT
GTCTGCCTGA TTCGTTCCCT GATCGACAGC GGTAAAGATT ACGTAGCCAG TATGCTGACC
GCAATTGGTG AATTAGCGGG TGCCAAAACA TCGCCAAGCG GCGGCTATCC TGGCTGGCAA
CCGGATCCAA CGTCACCGGT CATGGCACTG GTACGGGAAA CCTACCAAAA ACTGTTCAAC
AAAACGCCTA ACATCATGGT TATCCATGCC GGTCTGGAAT GTGGTTTGTT CAAAAAACCC
CATCCTAACA TGGACATGGT GTCGATTGGG CCAACCATGA CCGGCCCGCA TTCACCAGAT
GAACAAGTTC ATATTGAGAG CGTTGGTCAA TATTGGCAGT TATTAACCGC CCTGCTGAAA
GCGATACCTG AACGTACATA A
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE ALAQYIVTWA KEKGLHAERD QVGNILLRKP 
ATKGMENRKP VALQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADHGIGM
ASALAVLSDD RVEHGPLEVL LTMTEEAGMD GAFGLQPNWL KADILINTDS EQEGEIYMGC
AGGIDFITTM PLQREAIPAG YQTLKLTIKG LKGGHSGADI HLGLGNANKL LARFLFEQAK
DLDLRVLDLN GGTLRNAIPR EGHVTLAVAA DKVEQLKKLS QNYLATLKDE LIAVEKNLTL
VLEPVSTETK ALTKETQQRF VALLNATPNG VIRMSDAVKG VVETSLNVGV VTMNEHEAEI
VCLIRSLIDS GKDYVASMLT AIGELAGAKT SPSGGYPGWQ PDPTSPVMAL VRETYQKLFN
KTPNIMVIHA GLECGLFKKP HPNMDMVSIG PTMTGPHSPD EQVHIESVGQ YWQLLTALLK
AIPERT