Gene Information |
Locus tag | YpsIP31758_0526 |
Symbol | |
ID | 5387963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 621191 |
End bp | 624544 |
Gene Length | 3354 bp |
Protein Length | 1117 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640863497 |
Product | hypothetical protein |
Protein accession | YP_001399519 |
Protein GI | 153948172 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
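The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of code. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(621191, 624544)
print(length)                     # 3354
print(protein_length_aa(length))  # 1117
```

Both results agree with the Gene Length (3354 bp) and Protein Length (1117 aa) fields in the record.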
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAACCA AGTCAATCAC CAACGTCGTA CACAGCGCAC AGGGCGGCGT ACTGCAGTTT AATTTCATGC AGGGCAACAT GTTGAAAAAC ATGCTGGCCG GCGACATCAT GCTCAACCTG TTTGACACCC CAAGGCTGGA TATGGCGATT GCCAATATCT TCCTGCGCCA GTTGGACGAT AGCGGTATTG TGCGGGTTAC CCCACTGTTG TTTCACAATG ACCAACTCGT CACCTATAAG AACGCGCAGG ATGAAATCAT CTGGCAGACC ACGACTCCAG ACTTCACTGC CTTTGTTACT CTCTCCTTTA GCCATGGGCA GGAAGAGACT TACTACTACA GTGTACGGGT TGAGAATCAC GGCGCGCAGG CGCTACGCTA TGATCTGATC TACGGGCAGG ATCTCTCACT GTCCGATGCC GGTGCGACCA AGACCAATGA ATCCTACTGC AGTCAGTATC TGGATCATAA GGTCTTCTCG CTGGATAAGT ATGGCTATAC CGTATCCTCT CGGCAAAACC TACCACAGAG CACCGGTAAT CCCCTGCTAC AACTCGGTAG CTTCTCACCC GCAGTAGGTT TCTCTACCGA TGGTTACCAG TTCTTTGCTA AGCAGTATAA GTTTAGCCAC TTGCCCACCA TCGTCACTGA ACCATCGCTG GAGAACCGGA ACTACCAGTA TGAAATGGCC TATGTCGCTC TACAGCTACA GCCAGTCACG CTGTCCGCGG GTGACAGCGC AGACAGCGTA TTTTACGGTT TCTATCTCAG CCACCAGCCA GAGGCCAATA TTGCACAGGC CTTTGATGTT GCGCGGATCC GGGCTAACTA TCGCCAGCCA CAGCGAGAAC CCGCTACCGA GCAGGCATCC CCAACTCAGC AGGGCTATGA TCAGCGGCCA CTGTCGGGAG ACAAGCTAAC GGCAGAAGAG ATCGAACAAC TGTTTGACGG TGAAAAACAG TTTGTTGAGC AGTTGGACGG TGAACTGCTA TCGTTTTTCT ATCAGGAGGC TAACTACGTC ACGTTGGCAG AAAAAGAGCG CCATCTGGAG CGACCGACCG GCCATATTAT TTCCTCTGGC AATAATATCG ACTTCACTAA TCCCATCATG AATTCTACCC ACTATATCTA TGGGGTATTC AACTCGCATC TGACGCTGGG TAACACCTCC TTTAATAAGC TGTTGGGCGT CAATCGTAAT ATGCTCAATC AGTTCAAGAG CAGTGGTCAG CGCATTCTGG TACAGATCGG CGGCGAATAT CGCATTCTGG CGATGCCCTC TGCATATGAA GTGGGTGCCA ACTTCTCTCG CTGGATATAC AAGCTGGCCG AGGGCATGAT TCAGGTCCGC GCCTTCGCCA GCCAGAGTGA GCCGGTTATT CAATTGGATA TTGCTGTCTC AGGCCATAGC CAACCGCTCA ACGTTATTGT CAGCCACCAA CTGATCATGG GCAATATGGA GGAGGAAGCG ACCGTACAGG TTGAACGCCA TGGCGATCTG TTGCAGATCA GCCGCACGGG CGATGATCAT GGCCCCCGCT TTAGCATCGC CACTCGCGGC GGGTTCACGG GTGTCGAAAG ATACGTCGAT AGCGACAGCC AAGGGGTACA GTACCTGTTG CTACAAGGAC AGATTGCCCA GCAGGCCAGC ATCGCCTTCG GTGGCGTGCT AAACGGCGTA GATAGCCGCG GAAAGTGGCT GGATTTTGAA CACGAGCGGC AGGCTTATCA CGCACAGTAT CGTGCACTGC TAAACGACTT CTCGGTGAGT TTTTCTGCGG CACCGCAGCA AGCGCAGAAG 
CTGAATCACG CCATGCACTG GTTCACCCAT AATGCGCTCA CCCACTACAG CTCACCGCAT GGTTTAGAGC AGCCAGCCGG TGCCGCTTGG GGGACCCGCG ACGTTTCACA GGGGCCGATA GAGTTCTTCA TGGCCATGGG GCGCTATCAG CAGGTTGAAG CGATTCTGTG CCAGACTTAT CGCCATCAGT ATCTGGAAAC CGGAACCTGG CCGCAGTGGT TTATGTTTGA TGAATATGCT CAGGTACAAC AGCAGGAATC GCACGGCGAT ATTGTGGTCT GGCCGCTGAA AGCGTTGGCC GATTATCTGT TAGCCACGGA TCGCGTCGCG TTATTGGACA CGCGTCTGCC CTACACCAGC ATCAAACAGA ATTTCGCCTT TACCGGCGAG CAGGAGACGC TGCTGCAACA TGTTCAGCGG CAGATAGACC ATATCGTGGC GCATCTGGTA CCCGGTACCT ATCTTTCCAG CTACGGTGAC GGTGACTGGG ATGATACGTT GCAGCCCGCC AATCAGTCGC TGCGGGAAAA TATGGTCAGT GGCTGGACTA TTCCGCTGAC GCTGCAAACG CTGAAAACCT TGACCAAGGC GCTACAGGCT TATCCGCAGT TTGCCGATTT TATCGCGCGT ATCGTGACGC TAACCAGCAA CATGGAGGCG GATTACCATA AGTATTTGAT CAAGGACGGC GTGATCAGCG GCTTTATTCA CTTTAATCAG GGGGAGGCGG AATACCTACT GCATCCTACC GATACCACGA CCCAGATCAA GTACCGGCTG TTGCCCGCCA AGCGCTCAAT TATCTCCGAG TCGTTCGATA AAGAGATGGC CGAGCAGCAC ATGAAGATCA TCATGGATAA CCTAATGTAT CCTGATGGCG TGCGCCTGAT GGACCGGATG GCGGAGTACA AGGCCGGTAA GCAGACTTAC TTCAAACGGG CCGAACTGTC CGCTAATCTG GGGCGTGAAA TCGGCCTACA ATACTGCCAC GCCCATATTC GCTTTATAGA AGCACTCTGC AAAATGGGGA TGGCGCAGGC GCTGTACGAT AACCTGTTTA AAACCATCCC TGTGGGGATC CAGGAGAGCG TGCCTAACGC CGAGCTGCGC CAGGCAAACA GCTACTTCTC CAGTTCCGAT GCCAAGTTTG ATGATCGCTA CCAGGCTTAT AACAACTTCG ATCAATTGAA AACCGGTGCC GTAGCCGCGA AAGCGGGCTG GCGTATTTAC TCCAGCGGCC CTGGGATCTA TATCAACCAG ATCGTTTCCA ACGTACTTGG TGTGCGTTAT CAGGCCGGCG ATCTGTTGCT GGATCCGGTG ATCAGTCGGC AGTTTGGTGA TGTGACGCTA AACTATCAAC TCTATAACCT TCCGGTCACG CTGCGCATCT ATCCACAACA GGGGGAGTTT ACCCCGAAGC GTGTGCTACT CGATGGTCAG CCGCTGGCGT TTACGTTGCA GGATAATCCC TATCGTAGTG GGGCCGCACT GATTCACCGC CAGGAGATAG AAGGGCGTCT GACGGCACAC AGTCAGCTAG AGATTTACCT GTAG
|
Protein sequence | MITKSITNVV HSAQGGVLQF NFMQGNMLKN MLAGDIMLNL FDTPRLDMAI ANIFLRQLDD SGIVRVTPLL FHNDQLVTYK NAQDEIIWQT TTPDFTAFVT LSFSHGQEET YYYSVRVENH GAQALRYDLI YGQDLSLSDA GATKTNESYC SQYLDHKVFS LDKYGYTVSS RQNLPQSTGN PLLQLGSFSP AVGFSTDGYQ FFAKQYKFSH LPTIVTEPSL ENRNYQYEMA YVALQLQPVT LSAGDSADSV FYGFYLSHQP EANIAQAFDV ARIRANYRQP QREPATEQAS PTQQGYDQRP LSGDKLTAEE IEQLFDGEKQ FVEQLDGELL SFFYQEANYV TLAEKERHLE RPTGHIISSG NNIDFTNPIM NSTHYIYGVF NSHLTLGNTS FNKLLGVNRN MLNQFKSSGQ RILVQIGGEY RILAMPSAYE VGANFSRWIY KLAEGMIQVR AFASQSEPVI QLDIAVSGHS QPLNVIVSHQ LIMGNMEEEA TVQVERHGDL LQISRTGDDH GPRFSIATRG GFTGVERYVD SDSQGVQYLL LQGQIAQQAS IAFGGVLNGV DSRGKWLDFE HERQAYHAQY RALLNDFSVS FSAAPQQAQK LNHAMHWFTH NALTHYSSPH GLEQPAGAAW GTRDVSQGPI EFFMAMGRYQ QVEAILCQTY RHQYLETGTW PQWFMFDEYA QVQQQESHGD IVVWPLKALA DYLLATDRVA LLDTRLPYTS IKQNFAFTGE QETLLQHVQR QIDHIVAHLV PGTYLSSYGD GDWDDTLQPA NQSLRENMVS GWTIPLTLQT LKTLTKALQA YPQFADFIAR IVTLTSNMEA DYHKYLIKDG VISGFIHFNQ GEAEYLLHPT DTTTQIKYRL LPAKRSIISE SFDKEMAEQH MKIIMDNLMY PDGVRLMDRM AEYKAGKQTY FKRAELSANL GREIGLQYCH AHIRFIEALC KMGMAQALYD NLFKTIPVGI QESVPNAELR QANSYFSSSD AKFDDRYQAY NNFDQLKTGA VAAKAGWRIY SSGPGIYINQ IVSNVLGVRY QAGDLLLDPV ISRQFGDVTL NYQLYNLPVT LRIYPQQGEF TPKRVLLDGQ PLAFTLQDNP YRSGAALIHR QEIEGRLTAH SQLEIYL
|
| |
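The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be cleaned before any analysis. The sketch below is one possible way to strip the display whitespace, compute the GC fraction, and check that a CDS ends in a stop codon under translation table 11. The helper names are hypothetical, and the example strings are short fragments used only for illustration.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(display_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(display_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The first two display blocks of the gene sequence above.
fragment = clean_sequence("ATGATAACCA AGTCAATCAC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.35
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the GC content field (53%) in the record, and `ends_with_stop` offers a quick sanity check that the displayed sequence was copied in full.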