Gene YpsIP31758_3156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3156 
SymbolpepD 
ID5386787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3554042 
End bp3555502 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content49% 
IMG OID640866163 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_001402116 
Protein GI153948494 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAACT TTCGCCTCAG CCGTTGTGGG ATATTTTTGC AAAAATTTGT 
TCTATCCCAC ATCCCTCTTA CCATGAAGAA GCGCTGGCAC AATATATTGT TACCTGGGCC
AAAGAGAAAG GCCTGCACGC CGAGCGTGAT CAGGTCGGTA ATATCCTGCT GCGTAAACCT
GCCACTAAAG GGATGGAAAA CCGCAAGCCT GTTGCACTGC AAGCGCATTT AGATATGGTG
CCACAGAAAA ATAATGACAC GGTACATGAC TTTACCAAAG ATCCTATCCA GCCTTATATC
GACGGCGAAT GGGTGAAAGC CCGTGGTACC ACATTAGGCG CAGATAATGG TATTGGTATG
GCATCCGCTC TGGCAGTGTT ATCTGATGAT CGCGTTGAGC ATGGGCCACT AGAAGTGTTG
TTAACCATGA CCGAAGAAGC CGGTATGGAT GGTGCCTTCG GCCTGCAACC TAATTGGCTG
AAAGCCGATA TTCTGATCAA TACCGATTCT GAGCAGGAAG GCGAAATCTA CATGGGCTGT
GCCGGTGGTA TTGATTTCAT CACCACCATG CCGCTACAGC GGGAAGCTAT CCCTGTGGGC
TATCAAACAC TAAAATTGAC GATCAAAGGC CTAAAAGGCG GCCACTCAGG TGCGGATATC
CATTTAGGTT TAGGTAACGC CAACAAACTG CTGGCCCGCT TCCTGTTTGA ACAAGCAAAA
GATTTAGATC TGCGGGTGCT GGATCTGAAT GGCGGTACTT TACGTAACGC AATTCCACGG
GAAGGTCACG TAACACTGGC TGTTGCCGCA GACAAAGTGG AACAGCTGAA AAACCTGAGC
CAGAATTACC TGGCAACGTT GAAAGACGAG TTGATCGCGG TCGAAAAAAA TCTGACCCTA
GTACTCGAAC CCGTCTCTAC CGAGACGAAA GCACTGACTA AAGAGACTCA ACAGCGTTTT
GTCGCATTAC TGAATGCCAC GCCAAATGGT GTGATCCGCA TGAGTGATGC GGTTAAAGGT
GTCGTAGAAA CCTCACTTAA CGTGGGTGTT GTTACGATGA ATGAGCATGA AGCAGAAATT
GTCTGCCTGA TTCGTTCCCT GATCGACAGC GGTAAAGATT ACGTAGCCAG TATGCTGACC
GCAATTGGTG AATTAGCGGG TGCCAAAACA TCGCCAAGCG GCGGCTATCC TGGCTGGCAA
CCGGATCCAA CGTCACCGGT CATGGCACTG GTACGGGAAA CCTACCAAAA ACTGTTCAAC
AAAACGCCTA ACATCATGGT TATCCATGCC GGTCTGGAAT GTGGTTTGTT CAAAAAACCC
TATCCTAACA TGGACATGGT GTCGATTGGG CCAACCATGA CCGGCCCGCA TTCACCAGAT
GAACAAGTTC ATATTGAGAG CGTTGGTCAA TATTGGCAGT TATTAACCGC CCTGCTGAAA
GCGATACCTG AACGTACATA A
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE ALAQYIVTWA KEKGLHAERD QVGNILLRKP 
ATKGMENRKP VALQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADNGIGM
ASALAVLSDD RVEHGPLEVL LTMTEEAGMD GAFGLQPNWL KADILINTDS EQEGEIYMGC
AGGIDFITTM PLQREAIPVG YQTLKLTIKG LKGGHSGADI HLGLGNANKL LARFLFEQAK
DLDLRVLDLN GGTLRNAIPR EGHVTLAVAA DKVEQLKNLS QNYLATLKDE LIAVEKNLTL
VLEPVSTETK ALTKETQQRF VALLNATPNG VIRMSDAVKG VVETSLNVGV VTMNEHEAEI
VCLIRSLIDS GKDYVASMLT AIGELAGAKT SPSGGYPGWQ PDPTSPVMAL VRETYQKLFN
KTPNIMVIHA GLECGLFKKP YPNMDMVSIG PTMTGPHSPD EQVHIESVGQ YWQLLTALLK
AIPERT