Gene YpsIP31758_0526 details

Gene Information

Locus tag: YpsIP31758_0526
Symbol:
ID: 5387963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 621191
End bp: 624544
Gene Length: 3354 bp
Protein Length: 1117 aa
Translation table: 11
GC content: 53%
IMG OID: 640863497
Product: hypothetical protein
Protein accession: YP_001399519
Protein GI: 153948172
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID:
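
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length only matches under that convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(621191, 624544)
print(length)                     # 3354
print(protein_length_aa(length))  # 1117
```

Both results agree with the Gene Length and Protein Length fields of the record.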


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAACCA AGTCAATCAC CAACGTCGTA CACAGCGCAC AGGGCGGCGT ACTGCAGTTT 
AATTTCATGC AGGGCAACAT GTTGAAAAAC ATGCTGGCCG GCGACATCAT GCTCAACCTG
TTTGACACCC CAAGGCTGGA TATGGCGATT GCCAATATCT TCCTGCGCCA GTTGGACGAT
AGCGGTATTG TGCGGGTTAC CCCACTGTTG TTTCACAATG ACCAACTCGT CACCTATAAG
AACGCGCAGG ATGAAATCAT CTGGCAGACC ACGACTCCAG ACTTCACTGC CTTTGTTACT
CTCTCCTTTA GCCATGGGCA GGAAGAGACT TACTACTACA GTGTACGGGT TGAGAATCAC
GGCGCGCAGG CGCTACGCTA TGATCTGATC TACGGGCAGG ATCTCTCACT GTCCGATGCC
GGTGCGACCA AGACCAATGA ATCCTACTGC AGTCAGTATC TGGATCATAA GGTCTTCTCG
CTGGATAAGT ATGGCTATAC CGTATCCTCT CGGCAAAACC TACCACAGAG CACCGGTAAT
CCCCTGCTAC AACTCGGTAG CTTCTCACCC GCAGTAGGTT TCTCTACCGA TGGTTACCAG
TTCTTTGCTA AGCAGTATAA GTTTAGCCAC TTGCCCACCA TCGTCACTGA ACCATCGCTG
GAGAACCGGA ACTACCAGTA TGAAATGGCC TATGTCGCTC TACAGCTACA GCCAGTCACG
CTGTCCGCGG GTGACAGCGC AGACAGCGTA TTTTACGGTT TCTATCTCAG CCACCAGCCA
GAGGCCAATA TTGCACAGGC CTTTGATGTT GCGCGGATCC GGGCTAACTA TCGCCAGCCA
CAGCGAGAAC CCGCTACCGA GCAGGCATCC CCAACTCAGC AGGGCTATGA TCAGCGGCCA
CTGTCGGGAG ACAAGCTAAC GGCAGAAGAG ATCGAACAAC TGTTTGACGG TGAAAAACAG
TTTGTTGAGC AGTTGGACGG TGAACTGCTA TCGTTTTTCT ATCAGGAGGC TAACTACGTC
ACGTTGGCAG AAAAAGAGCG CCATCTGGAG CGACCGACCG GCCATATTAT TTCCTCTGGC
AATAATATCG ACTTCACTAA TCCCATCATG AATTCTACCC ACTATATCTA TGGGGTATTC
AACTCGCATC TGACGCTGGG TAACACCTCC TTTAATAAGC TGTTGGGCGT CAATCGTAAT
ATGCTCAATC AGTTCAAGAG CAGTGGTCAG CGCATTCTGG TACAGATCGG CGGCGAATAT
CGCATTCTGG CGATGCCCTC TGCATATGAA GTGGGTGCCA ACTTCTCTCG CTGGATATAC
AAGCTGGCCG AGGGCATGAT TCAGGTCCGC GCCTTCGCCA GCCAGAGTGA GCCGGTTATT
CAATTGGATA TTGCTGTCTC AGGCCATAGC CAACCGCTCA ACGTTATTGT CAGCCACCAA
CTGATCATGG GCAATATGGA GGAGGAAGCG ACCGTACAGG TTGAACGCCA TGGCGATCTG
TTGCAGATCA GCCGCACGGG CGATGATCAT GGCCCCCGCT TTAGCATCGC CACTCGCGGC
GGGTTCACGG GTGTCGAAAG ATACGTCGAT AGCGACAGCC AAGGGGTACA GTACCTGTTG
CTACAAGGAC AGATTGCCCA GCAGGCCAGC ATCGCCTTCG GTGGCGTGCT AAACGGCGTA
GATAGCCGCG GAAAGTGGCT GGATTTTGAA CACGAGCGGC AGGCTTATCA CGCACAGTAT
CGTGCACTGC TAAACGACTT CTCGGTGAGT TTTTCTGCGG CACCGCAGCA AGCGCAGAAG
CTGAATCACG CCATGCACTG GTTCACCCAT AATGCGCTCA CCCACTACAG CTCACCGCAT
GGTTTAGAGC AGCCAGCCGG TGCCGCTTGG GGGACCCGCG ACGTTTCACA GGGGCCGATA
GAGTTCTTCA TGGCCATGGG GCGCTATCAG CAGGTTGAAG CGATTCTGTG CCAGACTTAT
CGCCATCAGT ATCTGGAAAC CGGAACCTGG CCGCAGTGGT TTATGTTTGA TGAATATGCT
CAGGTACAAC AGCAGGAATC GCACGGCGAT ATTGTGGTCT GGCCGCTGAA AGCGTTGGCC
GATTATCTGT TAGCCACGGA TCGCGTCGCG TTATTGGACA CGCGTCTGCC CTACACCAGC
ATCAAACAGA ATTTCGCCTT TACCGGCGAG CAGGAGACGC TGCTGCAACA TGTTCAGCGG
CAGATAGACC ATATCGTGGC GCATCTGGTA CCCGGTACCT ATCTTTCCAG CTACGGTGAC
GGTGACTGGG ATGATACGTT GCAGCCCGCC AATCAGTCGC TGCGGGAAAA TATGGTCAGT
GGCTGGACTA TTCCGCTGAC GCTGCAAACG CTGAAAACCT TGACCAAGGC GCTACAGGCT
TATCCGCAGT TTGCCGATTT TATCGCGCGT ATCGTGACGC TAACCAGCAA CATGGAGGCG
GATTACCATA AGTATTTGAT CAAGGACGGC GTGATCAGCG GCTTTATTCA CTTTAATCAG
GGGGAGGCGG AATACCTACT GCATCCTACC GATACCACGA CCCAGATCAA GTACCGGCTG
TTGCCCGCCA AGCGCTCAAT TATCTCCGAG TCGTTCGATA AAGAGATGGC CGAGCAGCAC
ATGAAGATCA TCATGGATAA CCTAATGTAT CCTGATGGCG TGCGCCTGAT GGACCGGATG
GCGGAGTACA AGGCCGGTAA GCAGACTTAC TTCAAACGGG CCGAACTGTC CGCTAATCTG
GGGCGTGAAA TCGGCCTACA ATACTGCCAC GCCCATATTC GCTTTATAGA AGCACTCTGC
AAAATGGGGA TGGCGCAGGC GCTGTACGAT AACCTGTTTA AAACCATCCC TGTGGGGATC
CAGGAGAGCG TGCCTAACGC CGAGCTGCGC CAGGCAAACA GCTACTTCTC CAGTTCCGAT
GCCAAGTTTG ATGATCGCTA CCAGGCTTAT AACAACTTCG ATCAATTGAA AACCGGTGCC
GTAGCCGCGA AAGCGGGCTG GCGTATTTAC TCCAGCGGCC CTGGGATCTA TATCAACCAG
ATCGTTTCCA ACGTACTTGG TGTGCGTTAT CAGGCCGGCG ATCTGTTGCT GGATCCGGTG
ATCAGTCGGC AGTTTGGTGA TGTGACGCTA AACTATCAAC TCTATAACCT TCCGGTCACG
CTGCGCATCT ATCCACAACA GGGGGAGTTT ACCCCGAAGC GTGTGCTACT CGATGGTCAG
CCGCTGGCGT TTACGTTGCA GGATAATCCC TATCGTAGTG GGGCCGCACT GATTCACCGC
CAGGAGATAG AAGGGCGTCT GACGGCACAC AGTCAGCTAG AGATTTACCT GTAG
 
Protein sequence
MITKSITNVV HSAQGGVLQF NFMQGNMLKN MLAGDIMLNL FDTPRLDMAI ANIFLRQLDD 
SGIVRVTPLL FHNDQLVTYK NAQDEIIWQT TTPDFTAFVT LSFSHGQEET YYYSVRVENH
GAQALRYDLI YGQDLSLSDA GATKTNESYC SQYLDHKVFS LDKYGYTVSS RQNLPQSTGN
PLLQLGSFSP AVGFSTDGYQ FFAKQYKFSH LPTIVTEPSL ENRNYQYEMA YVALQLQPVT
LSAGDSADSV FYGFYLSHQP EANIAQAFDV ARIRANYRQP QREPATEQAS PTQQGYDQRP
LSGDKLTAEE IEQLFDGEKQ FVEQLDGELL SFFYQEANYV TLAEKERHLE RPTGHIISSG
NNIDFTNPIM NSTHYIYGVF NSHLTLGNTS FNKLLGVNRN MLNQFKSSGQ RILVQIGGEY
RILAMPSAYE VGANFSRWIY KLAEGMIQVR AFASQSEPVI QLDIAVSGHS QPLNVIVSHQ
LIMGNMEEEA TVQVERHGDL LQISRTGDDH GPRFSIATRG GFTGVERYVD SDSQGVQYLL
LQGQIAQQAS IAFGGVLNGV DSRGKWLDFE HERQAYHAQY RALLNDFSVS FSAAPQQAQK
LNHAMHWFTH NALTHYSSPH GLEQPAGAAW GTRDVSQGPI EFFMAMGRYQ QVEAILCQTY
RHQYLETGTW PQWFMFDEYA QVQQQESHGD IVVWPLKALA DYLLATDRVA LLDTRLPYTS
IKQNFAFTGE QETLLQHVQR QIDHIVAHLV PGTYLSSYGD GDWDDTLQPA NQSLRENMVS
GWTIPLTLQT LKTLTKALQA YPQFADFIAR IVTLTSNMEA DYHKYLIKDG VISGFIHFNQ
GEAEYLLHPT DTTTQIKYRL LPAKRSIISE SFDKEMAEQH MKIIMDNLMY PDGVRLMDRM
AEYKAGKQTY FKRAELSANL GREIGLQYCH AHIRFIEALC KMGMAQALYD NLFKTIPVGI
QESVPNAELR QANSYFSSSD AKFDDRYQAY NNFDQLKTGA VAAKAGWRIY SSGPGIYINQ
IVSNVLGVRY QAGDLLLDPV ISRQFGDVTL NYQLYNLPVT LRIYPQQGEF TPKRVLLDGQ
PLAFTLQDNP YRSGAALIHR QEIEGRLTAH SQLEIYL
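
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes the GC fraction, and translates the DNA codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares with table 1; table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# the same assignments apply to translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon; a stop codon is rendered as '*'."""
    dna = clean(seq)
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First three blocks and last three blocks of the gene sequence above.
head = "ATGATAACCA AGTCAATCAC CAACGTCGTA"
tail = "AGTCAGCTAG AGATTTACCT GTAG"
print(translate(head))  # MITKSITNVV
print(translate(tail))  # SQLEIYL*
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field of the record.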