Gene YpsIP31758_2344 details

Gene Information

Locus tag: YpsIP31758_2344
Symbol: pip
ID: 5386412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 2642788
End bp: 2643738
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 49%
IMG OID: 640865333
Product: proline iminopeptidase
Protein accession: YP_001401313
Protein GI: 153950754
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
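
The start, end, gene length and protein length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2642788, 2643738)
print(length, protein_length_aa(length))  # prints "951 316"
```

Under these assumptions the result matches the 951 bp and 316 aa reported in the record.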


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value: 0.634485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAAT TACGTGGACT TTATCCTGCA TATGAACCTT ACGACAGTGG TTTATTAGAC 
ACCGGGGACG GGCATCAAAT TTATTGGGAG CTCTGCGGCA ATCCGAAGGG CAAGCCCGCG
ATCTTTATTC ACGGGGGGCC AGGGGGCGGG ATTGCACCTT ATCATCGGCA GCTATTCAAC
CCTGCAAAAT ATAATGTGAT GTTATTTGAT CAACGTGGCT GTGGGCGCTC TAAACCCCAT
GCCAGCTTGG ATAACAACAC GACTTGGCAT CTGGTGGAGG ATATTGAACG TCTGCGCAAG
ATGGCCGGGA TTGAACAATG GCTGGTGTTT GGCGGTTCTT GGGGATCGAC TCTGGCATTG
GCTTATGGCG AAACACACCC TGAACGTGTC AGTGAGATGG TTCTGCGTGG GATCTTCACT
TTACGCAGGA AAGAACTGCA TTGGTACTAT CAAGAGGGGG CCTCGCGCTT TTTCCCCGAG
AAATGGCAGC GGGTACTGTC AATTTTATCC CCAGAAGAGC AGGGCGATGT GATAGCGGCT
TATCGTAAAC GGCTGACATC ACCTGATCGG GCAATACAGC TAGAGGCCGC TAAAATATGG
AGTTTGTGGG AAGGCGAAAC AGTGACCTTA TTACCAACTA AAAGCTCGGC TTCCTTTGGT
GAAGAGCATT TTGCACTGGC GTTTGCCCGG ATTGAAAATC ACTATTTCAC GCATCTTGGC
TTCTTGGACA GTGATAACCA GTTGTTAGAC AATGTGACAC GTATACGGCA TATCCCAGCT
GTAATTATTC ATGGTCGATA TGATATGGCG TGTCAGCCAC AGAACGCCTG GGATTTAGCA
CAGGCTTGGC CTGAAGCTGA GCTCTATATC GTTGAAGGTG CCGGGCACTC CTTTGATGAG
CCAGGGATAC TGCATCAACT TATTCTAGCT ACTGATAAAT TTGCTCACTG A
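
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction, which a reader could compare with the GC content field of the record. The helper names are hypothetical, and how the database itself derives its GC figure is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First ten bases of the gene sequence: 3 of 10 are G or C.
print(gc_fraction("ATGGAACAAT"))  # prints 0.3
```

Passing the full gene sequence, copied with its spaces and line breaks, works the same way.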
 
Protein sequence
MEQLRGLYPA YEPYDSGLLD TGDGHQIYWE LCGNPKGKPA IFIHGGPGGG IAPYHRQLFN 
PAKYNVMLFD QRGCGRSKPH ASLDNNTTWH LVEDIERLRK MAGIEQWLVF GGSWGSTLAL
AYGETHPERV SEMVLRGIFT LRRKELHWYY QEGASRFFPE KWQRVLSILS PEEQGDVIAA
YRKRLTSPDR AIQLEAAKIW SLWEGETVTL LPTKSSASFG EEHFALAFAR IENHYFTHLG
FLDSDNQLLD NVTRIRHIPA VIIHGRYDMA CQPQNAWDLA QAWPEAELYI VEGAGHSFDE
PGILHQLILA TDKFAH
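
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the sequence is given in coding orientation and starts with ATG. It uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in TCAG order (first base varies slowest, third fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = "".join(text.split()).upper()  # drop the block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGAACAAT TACGTGGACT TTATCCTGCA"))  # prints MEQLRGLYPA
```

This sketch does not handle alternative start codons, which table 11 would translate as methionine at the initiation position. That does not affect this gene, which begins with ATG.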