Gene YPK_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_4037 
Symbol 
ID6090284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp4453331 
End bp4454827 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content51% 
IMG OID641599134 
Productguanosine pentaphosphate phosphohydrolase 
Protein accessionYP_001722752 
Protein GI170026247 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.868911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCTAA GTTCCACCTC ACTTTATGCT GCCATCGATC TTGGCTCCAA TAGTTTTCAT 
ATGTTGGTAG TACGTGAGGT GGCTGGCAGT ATCCAAACGC TGGCACGTAT TAAGCGGAAG
GTCCGCCTGG CGGCTGGTCT GGATAACCAA AATCATCTAT CGCAGGAAGC GATGGAACGA
GGCTGGCAAT GCCTAAAACT TTTCTCAGAG CGTTTACAGG ATATTCCTCT GGATCAAATC
CGCGTGGTCG CAACGGCAAC CTTGCGCCTG GCCTCTAATG CCGACGAATT CTTGCGTACT
GCAACCGAGA TCCTCGGCTG CCCTATTCAA GTCATCAGTG GCGAAGAAGA AGCCCGTCTG
ATTTATCATG GCGTAGCGCA TACGACTGGC GGGCCAGAAC AGCGGTTGGT CGTCGATATT
GGGGGGGGCA GCACTGAGTT GGTTACAGGC AATGGGGCTC AGGCGAATAT TTTGGTCAGC
CTATCAATGG GTTGTGTTAC CTGGTTAGAA CGTTATTTTG GTGACCGCCA TCTGGCAAAG
GAAAATTTTG AACGCGCTGA ATTGGCCGCT CATGAGATGA TCAAGCCCGT CGCCCAACGT
TTTCGTGAAC ATGGCTGGCA AGTTTGTGTC GGCGCTTCAG GCACCGTTCA GGCACTACAA
GAGATCATGG TCGCTCAAGG TATGGACGAG CTGATCACCT TAGCCAAGCT GCAACAGCTC
AAACAAAGAG CGATTCAGTG TGGCAAATTA GAAGAGTTGG AAATCCCTGG TTTAACCTTG
GAACGAGCGC TGGTCTTCCC CAGTGGTCTG TCCATCTTAA TTGCGATATT CCAGGAGTTG
TCCATTGAAA GCATGACACT GGCAGGTGGC GCACTGCGCG AAGGGCTGGT CTATGGCATG
CTCCATTTAC CGGTCGAGCA AGATATTCGC CGCCGGACAC TACGTAATTT ACAGCGCCGC
TATTTACTGG ATACCGAGCA AGCTAAGCGC GTCAGTTGTT TGGCGGATAA CTTTTTCCTA
CAAGTGGAAA AAGAGTGGCA TCTCGATGGC CGATGTCGCG AATTTTTGCA AAACGCCTGT
TTGATCCATG AAATTGGCCT CAGTGTCGAT TTTAAACATG CTCCGCAACA TGCCGCTTAT
CTGATCCGTA ATCTGGATCT ACCCGGTTTT ACCCCTGCAC AAAAGCTGCT ACTTTCTGCT
CTGTTACAAA ACCAGAGTGA CACTATCGAC CTATCGCTCT TGAACCAGCA GAATGCATTA
CCTGCCGACA TGGCACAGCA TTTGTGTCGT CTACTGCGCT TGGCCATTAT TTTTTCCAGC
CGTCGCCGGG ATGATACCCT GCCAGCAGTC AGGTTGCGGG CCGATAATAA TGCGCTTTAT
GTGCTGGTCC CCCAAGGTTG GTTGGAACAG CACCCCTACC GCGCCGAAGC GTTAGAACAA
GAGAGTCACT GGCAAAGTTA TGTTCAATGG CCACTGCTAT TGGAAGAGCT TAGCTAA
 
Protein sequence
MMLSSTSLYA AIDLGSNSFH MLVVREVAGS IQTLARIKRK VRLAAGLDNQ NHLSQEAMER 
GWQCLKLFSE RLQDIPLDQI RVVATATLRL ASNADEFLRT ATEILGCPIQ VISGEEEARL
IYHGVAHTTG GPEQRLVVDI GGGSTELVTG NGAQANILVS LSMGCVTWLE RYFGDRHLAK
ENFERAELAA HEMIKPVAQR FREHGWQVCV GASGTVQALQ EIMVAQGMDE LITLAKLQQL
KQRAIQCGKL EELEIPGLTL ERALVFPSGL SILIAIFQEL SIESMTLAGG ALREGLVYGM
LHLPVEQDIR RRTLRNLQRR YLLDTEQAKR VSCLADNFFL QVEKEWHLDG RCREFLQNAC
LIHEIGLSVD FKHAPQHAAY LIRNLDLPGF TPAQKLLLSA LLQNQSDTID LSLLNQQNAL
PADMAQHLCR LLRLAIIFSS RRRDDTLPAV RLRADNNALY VLVPQGWLEQ HPYRAEALEQ
ESHWQSYVQW PLLLEELS