Gene YpsIP31758_0284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0284 
SymbolpepQ 
ID5388552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp318329 
End bp319660 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content45% 
IMG OID640863250 
Productproline dipeptidase 
Protein accessionYP_001399281 
Protein GI153950393 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000682682 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACGC TGGCTTCTTT ATATAACGAA CATTTATCCA CTCTACAACA GCGCACCCGC 
GATGTGCTGG AGCGTCATCA ATTAGATGCG TTGCTGATTC ACTCTGGTGA ATTACAGCGG
CTTTTTCTTG ATGACCGTGA CTATCCCTTT AAGGTTAATC CACAGTTTAA AGCTTGGGTC
CCTGTGACTG AGGTCCCCAA TTGCTGGTTA TGGGTTGATG GAGTCAATAC ACCAAAATTA
TGGTTCTACT CACCGGTAGA TTACTGGCAC AGTGTCGAAC CGCTACCTGA TAGCTTCTGG
ACGAAGAATA TTGATGTTCA GCCATTGCTG AATGCTGATG ATATTGCGCA ACAATTACCT
GTTCAGCGTG AACGTGTTGC TTATATTGGT TATGCCCAGC AGCGTGCACA AGCTTTAGGC
TTCAGTGCTG AGAACATTAA CCCGCAGCCG GTCTTGGATT ATCTTCATTA TTATCGCTCT
TATAAAACGG ATTACGAACT GGCGTGCATG CGTGAAGCAC AAAAAACTGC AGTCGTAGGG
CATCGTGCGG CCTATGAAGC ATTCCAGTCG GGTATGAGTG AGTTTGATAT TAATCTGGCT
TACCTGATGG CTACCGGACA TCGTGATACT GATGTTCCTT ACGATAATAT TGTCGCGCTG
AATGAACACT CCGCAGTACT TCATTATACG ATTTTACAGC ATCAACCTCC GGCAGAGATA
CGTAGTTTCC TGATTGATGC CGGAGCTGAA TATAATGGCT ATGCTGCCGA TCTGACTCGT
ACTTACGCAG CAGACCGTGA CAGTGATTTT GCGGCTTTAA TTAGCGACCT TAATACTGAG
CAATTGGCGC TGATCGATAC GATTAAAAGT GGTGAACGCT ATACTGATTA TCACGTTCAG
ATGCATCAAC GCATTGCTAA GCTTTTGCGT ACACATAATT TAGTCACGGG GATCAGCGAA
GAGGCGATGG TCGAACAGGG AATTACCTGC CCATTCCTGC CACATGGTTT GGGTCATCCA
CTTGGTTTGC AAGTGCATGA TACTGCTGGT TTTATGCAGG ACGATAAAGG TACGAACCTG
AACGCGCCAT CTAAGTATCC TTATCTACGT TGCACACGTG TTCTGCAACC GCGCATGGTG
CTGACTATTG AGCCGGGCCT GTACTTTATC GATTCTTTGT TGGCTCCTTG GCGCATTGGT
GAGTTCAGCA AACATTTTAA TTGGGATCGT ATTGATGCAC TGAAGCCTTA TGGTGGTATT
CGTATAGAAG ACAATATTGT TATTCATGAT AAACGGGTCG AAAATATGAC GCGTGATCTG
AAACTGGCCT GA
 
Protein sequence
METLASLYNE HLSTLQQRTR DVLERHQLDA LLIHSGELQR LFLDDRDYPF KVNPQFKAWV 
PVTEVPNCWL WVDGVNTPKL WFYSPVDYWH SVEPLPDSFW TKNIDVQPLL NADDIAQQLP
VQRERVAYIG YAQQRAQALG FSAENINPQP VLDYLHYYRS YKTDYELACM REAQKTAVVG
HRAAYEAFQS GMSEFDINLA YLMATGHRDT DVPYDNIVAL NEHSAVLHYT ILQHQPPAEI
RSFLIDAGAE YNGYAADLTR TYAADRDSDF AALISDLNTE QLALIDTIKS GERYTDYHVQ
MHQRIAKLLR THNLVTGISE EAMVEQGITC PFLPHGLGHP LGLQVHDTAG FMQDDKGTNL
NAPSKYPYLR CTRVLQPRMV LTIEPGLYFI DSLLAPWRIG EFSKHFNWDR IDALKPYGGI
RIEDNIVIHD KRVENMTRDL KLA