Gene YPK_0587 details

Gene Information

Locus tag: YPK_0587
Symbol:
ID: 6089968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 648344
End bp: 651697
Gene Length: 3354 bp
Protein Length: 1117 aa
Translation table: 11
GC content: 53%
IMG OID: 641595649
Product: hypothetical protein
Protein accession: YP_001719343
Protein GI: 170022838
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID:
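
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values taken from the record above.
start_bp, end_bp = 648344, 651697
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 3354 1117
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.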


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.720861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAACCA AGTCAATCAC CAACGTCGTA CACAGCGCAC AGGGCGGCGT ACTGCAGTTT 
AATTTCATGC AGGGCAACAT GTTGAAAAAC ATGCTGGCCG GCGACATCAT GCTCAACCTG
TTTGACACCC CAAGGCTGGA TATGGCGATT GCCAATATCT TCCTGCGCCA GTTGGACGAT
AGCGGTATTG TGCGGGTTAC CCCACTGTTG TTTCACAATG ACCAACTCGT CACCTATAAG
AACGCGCAGG ATGAAATCAT CTGGCAGACC ACGGCTCCAG ACTTCACTGC CTTTGTGACT
CTCTCCTTTA GCCATGAGCA GGAAGAGACT TACTACTACA GTGTGCGGGT TGAAAATCAC
AGCGCGCAGG CGCTACGCTA TGATCTGATC TACGGTCAGG ATCTCTCACT GTCCGATGCC
GGTGCGACCA AGACCAATGA ATCCTACTGC AGTCAGTATC TGGATCATAA GGTCTTCTCG
CTGGATAAGT ATGGCTATAC CGTATCCTCT CGGCAAAACC TACCACAGAG CACCGGTAAT
CCCCTGCTAC AACTCGGTAG CTTCTCACCC GCAGTGGGTT TCTCTACCGA TGGTTACCAG
TTCTTTGCCA AGCAGTACAA GTTTAGCCAC TTGCCCACCA TCGTCACTGA ACCATCGCTG
GAGAACCGGA ACTACCAGTA TGAAATGGCC TATGTCGCTC TACAGCTACA GCCAGTCACG
CTGCCCGCGG GTGACAGCGC AGACAGCGTA TTTTACGGTT TCTATCTCAG CCACCAGCCA
GAGGCCAATA TTGCACAGGC CTTTGATGTC GCGCGGATCC GGGCTAACTA TCGCCAGCCA
CAGCGAGAAC CCGCTACCGA GCAGGCATCC CCAACTCAGC AGGGCTATGA TCAGCGGCCA
CTGTCGGGAG ACAAGCTAAC GGCAGAAGAG ATCGAACAAC TGTTTGACGG TGAAAAACAG
TTTGTTGAGC AGTTGGACGG TGAACTGCTA TCGTTTTTCT ATCAGGAGGC TAACTACGTC
ACGTTGGCAG AAAAAGAGCG CCATCTGGAG CGACCGACCG GCCATATTAT TTCCTCTGGC
AATAATATCG ACTTCACTAA TCCCATCATG AATTCTACCC ACTATATCTA TGGGGTATTC
AACTCGCATC TGACGCTGGG TAACACCTCC TTTAATAAGC TGTTGGGCGT CAATCGTAAT
ATGCTCAATC AGTTCAAGAG CAGTGGTCAG CGCATTCTGG TACAGATCGG CGGCGAATAT
CGTATTCTGG CGATGCCCTC TGCATATGAA GTGGGTGCCA ACTTCTCTCG CTGGATATAC
AAGCTGGCCG AGGGCATGAT TCAGGTCCGC GCCTTCGCCA GCCAGAGTGA GCCGGTTATT
CAATTGGATA TTGCTGTCTC AGGCCATAGC CAACCGCTCA ACGTTATTGT CAGCCACCAG
CTGATCATGG GCAATATGGA GGAGGAAGCG ACCGTACAGG TTGAACGCCA TGGCGATCTG
TTGCAGATCA GCCGCACGGG CGATGATCAT GGCCCCCGCT TTAGCATCGC CACTCGCGGC
GGGTTCACGG GTGTCGAAAG ATACGTCGAT AGCGACAGCC AAGGAGTACA GTACCTGTTG
CTACAAGGAC AGATTGCCCA GCAGGCCAGC ATCGCCTTCG GTGGCGTGCT AAACGGCGTA
GATAGCCGCG GAAAGTGGCT GGATTTTGAA CACGAGCGGC AGGCTTATCA CGCACAGTAT
CGTGCACTGC TAAACGACTT CTCGGTGAGT TTTTCTGCGG CACCGCAGCA AGCGCAGAAG
CTGAATCACG CCATGCACTG GTTCACCCAT AATGCGCTCA CCCACTACAG CTCACCGCAT
GGTTTAGAGC AGCCAGCCGG TGCCGCTTGG GGGACCCGCG ACGTTTCACA GGGGCCGATA
GAGTTCTTCA TGGCCATGGG GCGCTATCAG CAGGTTGAAG CGATTCTGTG CCAGACTTAT
CGCCATCAGT ATCTGGAAAC CGGAACCTGG CCGCAGTGGT TTATGTTTGA TGAATATGCT
CAGGTACAAC AGCAGGAATC GCACGGCGAT ATTGTGGTCT GGCCGCTGAA AGCGTTGGCC
GATTATCTGT TAGCCACGGA TCGCGTCGCG TTATTGGACA CGCGTCTGTC CTACACCAGC
ATCAAACAGA ATTTCGCCTT TACCGGCGAG CAGGAGACGC TGCTGCAACA TGTTCAGCGG
CAGATAGACC ATATCGTGGC GCATCTGGTA CCCGGTACCT ATCTTTCCAG CTACGGTGAC
GGTGACTGGG ATGATACGTT GCAGCCCGCC AATCAGTCGC TGCGGGAAAA TATGGTCAGT
GGCTGGACTA TTCCGCTGAC GCTGCAAACG CTGAAAACCT TGACCAAGGC GCTACAGGCT
TATCCGCAGT TTGCCGATTT TATCGCGCGT ATCGTGACGC TAACCAGCAA CATGGAGGCG
GATTACCATA AGTATTTGAT CAAGGACGGC GTGATCAGCG GCTTTATTCA CTTTAATCAG
GGGGAGGCGG AATACCTACT GCATCCTACC GATACCACGA CCCAGATCAA GTACCGGCTG
TTGCCCGCCA AGCGCTCAAT TATCTCCGAG TCGTTCGATA AAGAGATGGC CGAGCAGCAC
ATGAAGATCA TCATGGATAA CCTAATGTAT CCTGATGGCG TGCGCCTGAT GGACCGGATG
GCGGAGTACA AGGCCGGTAA GCAGACTTAC TTCAAACGGG CCGAACTGTC CGCTAATCTG
GGGCGTGAAA TCGGCCTACA ATACTGCCAC GCCCATATTC GCTTTATAGA AGCACTCTGC
AAAATGGGGA TGGCGCAGGA GCTGTACGAT AACCTGTTTA AAACCATCCC TGTGGGGATC
CAGGAGAGCG TGCCTAACGC CGAGCTGCGC CAGGCAAACA GCTACTTCTC CAGTTCCGAT
GCCAAGTTTG ATGATCGCTA CCAGGCTTAT AACAACTTCG ATCAATTGAA AACCGGTGCG
GTAGCCGCGA AAGCGGGCTG GCGTATTTAC TCCAGCGGCC CTGGGATCTA TATCAACCAG
ATCGTTTCCA ACGTACTTGG TGTGCGTTAT CAGGCCGGCG ATCTGTTGCT GGATCCGGTG
ATCAGTCGGC AGTTTGGTGA TGTGACGCTA AACTATCAAC TCTATAACCT TCCGGTCACG
CTGCGCATCT ATCCACAACA GGGGGAGTTT ACCCCGAAGC GTGTGCTACT CGATGGTCAG
CCGCTGGCGT TTACGTTGCA GGATAATCCC TATCGTAGTG GGGCCGCACT GATTCACCGC
CAGGAGATAG AAGGGCGTCT GACGGCACAC AGTCAGCTAG AGATTTACCT GTAG
 
Protein sequence
MITKSITNVV HSAQGGVLQF NFMQGNMLKN MLAGDIMLNL FDTPRLDMAI ANIFLRQLDD 
SGIVRVTPLL FHNDQLVTYK NAQDEIIWQT TAPDFTAFVT LSFSHEQEET YYYSVRVENH
SAQALRYDLI YGQDLSLSDA GATKTNESYC SQYLDHKVFS LDKYGYTVSS RQNLPQSTGN
PLLQLGSFSP AVGFSTDGYQ FFAKQYKFSH LPTIVTEPSL ENRNYQYEMA YVALQLQPVT
LPAGDSADSV FYGFYLSHQP EANIAQAFDV ARIRANYRQP QREPATEQAS PTQQGYDQRP
LSGDKLTAEE IEQLFDGEKQ FVEQLDGELL SFFYQEANYV TLAEKERHLE RPTGHIISSG
NNIDFTNPIM NSTHYIYGVF NSHLTLGNTS FNKLLGVNRN MLNQFKSSGQ RILVQIGGEY
RILAMPSAYE VGANFSRWIY KLAEGMIQVR AFASQSEPVI QLDIAVSGHS QPLNVIVSHQ
LIMGNMEEEA TVQVERHGDL LQISRTGDDH GPRFSIATRG GFTGVERYVD SDSQGVQYLL
LQGQIAQQAS IAFGGVLNGV DSRGKWLDFE HERQAYHAQY RALLNDFSVS FSAAPQQAQK
LNHAMHWFTH NALTHYSSPH GLEQPAGAAW GTRDVSQGPI EFFMAMGRYQ QVEAILCQTY
RHQYLETGTW PQWFMFDEYA QVQQQESHGD IVVWPLKALA DYLLATDRVA LLDTRLSYTS
IKQNFAFTGE QETLLQHVQR QIDHIVAHLV PGTYLSSYGD GDWDDTLQPA NQSLRENMVS
GWTIPLTLQT LKTLTKALQA YPQFADFIAR IVTLTSNMEA DYHKYLIKDG VISGFIHFNQ
GEAEYLLHPT DTTTQIKYRL LPAKRSIISE SFDKEMAEQH MKIIMDNLMY PDGVRLMDRM
AEYKAGKQTY FKRAELSANL GREIGLQYCH AHIRFIEALC KMGMAQELYD NLFKTIPVGI
QESVPNAELR QANSYFSSSD AKFDDRYQAY NNFDQLKTGA VAAKAGWRIY SSGPGIYINQ
IVSNVLGVRY QAGDLLLDPV ISRQFGDVTL NYQLYNLPVT LRIYPQQGEF TPKRVLLDGQ
PLAFTLQDNP YRSGAALIHR QEIEGRLTAH SQLEIYL
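
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), so the start codon is simply translated as a regular codon here. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon; unknown codons give X."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above, copied with its block spacing.
first_line = "ATGATAACCA AGTCAATCAC CAACGTCGTA CACAGCGCAC AGGGCGGCGT ACTGCAGTTT"
dna = clean_sequence(first_line)
print(translate(dna))  # MITKSITNVVHSAQGGVLQF
```

The printed residues match the first 20 residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.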