Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3562 |
Symbol | |
ID | 5324450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3767908 |
End bp | 3769875 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640792511 |
Product | ribokinase-like domain-containing protein |
Protein accession | YP_001329212 |
Protein GI | 150398745 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0524] Sugar kinases, ribokinase family [COG3892] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.249096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCAGA TTTCATCGGC TGAAGTTCCA TCGGCTGACA TTCCTGCAAC AGAGGGAACC AGGCCTCTCG ACATCATCAC GATCGGCAGA GCCTCGGTCG ATCTTTACGG GCAGCAGATC GGCACGCGGC TGGAAGACGT GGCGAGCTTT GCCAAGTCGG TCGGCGGCTG CCCCTGCAAC ATCTCGGTCG GTACCGCGCG ACTTGGCCTG AGATCCGCCT TGCTGACCCG CGTCGGCAAC GAGCAGATGG GCCGCTTCAT TCGCGAGCAG CTTCAGCGTG AAGGGGTCGA GACGCGCGGC ATCGTCACCG ATCCGGAGCG GTTGACCGCG CTTGCGATTC TTTCCGTCGA AAACGAAAAA TCCTTCCCGC TGCTCTTTTA CCGCGACAAT TGCGCCGATA ACGCGCTCAG CGAGGATGAT GTGGCGGAAG ACTTCATTCG CTCCGCGCAT GCGATCCTCG TTACCGGCAC GCATTTCTCG AAGCCCAATA CGGACGCAGC CCAGCGCAAG GCGATCAGGA TTGCCAAGGA AAACGGTTCC AGGATCGTCT TCGACATCGA CTACCGCCCT AATCTCTGGG GCCTTGCGGG CCACGATGCG GGCGAGAGCC GTTATATAGC CTCCGACCGC GTCTCCGCAC ATCTGCGAAC CGTCCTTGGC GATTGCGACC TGATCGTCGG CACCGAGGAA GAAGTGCTGA TCGCATCAGG CGAAAACGAT CTGCTCGCGG CGCTCAAGTC CATCCGCTCG CTTTCCAAGG CCACGATCGT GCTCAAACGC GGACCGATGG GCTGCATCGT CTATGACGGA CCGATCTCGG ACGACCTCGA AGACGGTATC GTCGGCAAGG GCTTCCCAAT CGAGGTTTAC AACGTCCTTG GCGCCGGCGA TGCTTTCATG TCCGGTTTCC TGCGCGGCTG GCTGAGGGGC GAGCCGCATG CGACCTGCGC GACCTGGGCG AATGCCTGCG GCGCCTTCGC GGTTTCCCGC CTGCTCTGCG CGCCTGAAAT CCCGACCTGG ACCGAGCTGC AGTACTTCCT CGAGCACGGC AGCAAGGTGA AGGCGCTTCG CAAGGACGAG GCGATCAACC ACGTGCATTG GGCAACGACG CGCAGGCGCG AGATACCGCT GCTGATGGCG CTTGCCGTCG ATCACCGCAG CCAGCTCGAA GACATTGCCG AGGGAAATCC GGAACTGCTC TCGCGCATAC CGGCCTTCAA GGTCCTCGCC GTCAAGGCGG CGGCGGAGGT GGCCGCCGGC CGCTCCGGCT TCGGGATGCT CATCGACGAC AAATACGGAC GCGATGCGCT TTATGCTGCC GGCGCCTATC GCGATTTCTG GATCGGAAAG CCCGTCGAGC TGCCGGGCTC GCGGCCGTTG CAGTTCGAAT TCAGCCAGGA TCTCGGCAGC CGCCTTATCG AGTGGCCGGT CGACCATTGC ATCAAAGTGC TTTCCTTCTA CCACCCTGAC GATCCGGCCG AACTCAAGAC CGCCCAGATT GCCAAGCTTC GTTCGGCCTT CGAGGCGGCG CGCAAGGTCG GACGCGAGAT CCTGATCGAG ATCATCGCCG GCAAGCATGG ACCACTCGAC GACCGGACTG TACCGAGAGC GCTCGAGGAA CTCTATGATG CAGGCTTGAA GCCGGACTGG TGGAAGCTCG AGCCCCAGGC AAGCCGCGCA GCCTGGAGAG CCATCGATGC CGTGATCGAG CGGCGCGACC CGCTTTGCCG GGGCGTGGTG CTCCTCGGCC TGGAAGCACC CTATGAAGTG CTGAAGAATG GGTTCGCGGC GGCCAGAACA TCGAAGACGG TCAGGGGATT TGCCGTCGGA AGGACGATCT TCGCCGATGC CGCCAGAGCC TGGCTCTCCG GCGGGATGAC CGACGAACAG GCGATCACCG ACATGGCGGC AAAGTTCAAG GCACTCGTGG ATCTTTGGCT GCAACTGGGC GAGACCAGGG ATCTATAG
|
Protein sequence | MSQISSAEVP SADIPATEGT RPLDIITIGR ASVDLYGQQI GTRLEDVASF AKSVGGCPCN ISVGTARLGL RSALLTRVGN EQMGRFIREQ LQREGVETRG IVTDPERLTA LAILSVENEK SFPLLFYRDN CADNALSEDD VAEDFIRSAH AILVTGTHFS KPNTDAAQRK AIRIAKENGS RIVFDIDYRP NLWGLAGHDA GESRYIASDR VSAHLRTVLG DCDLIVGTEE EVLIASGEND LLAALKSIRS LSKATIVLKR GPMGCIVYDG PISDDLEDGI VGKGFPIEVY NVLGAGDAFM SGFLRGWLRG EPHATCATWA NACGAFAVSR LLCAPEIPTW TELQYFLEHG SKVKALRKDE AINHVHWATT RRREIPLLMA LAVDHRSQLE DIAEGNPELL SRIPAFKVLA VKAAAEVAAG RSGFGMLIDD KYGRDALYAA GAYRDFWIGK PVELPGSRPL QFEFSQDLGS RLIEWPVDHC IKVLSFYHPD DPAELKTAQI AKLRSAFEAA RKVGREILIE IIAGKHGPLD DRTVPRALEE LYDAGLKPDW WKLEPQASRA AWRAIDAVIE RRDPLCRGVV LLGLEAPYEV LKNGFAAART SKTVRGFAVG RTIFADAARA WLSGGMTDEQ AITDMAAKFK ALVDLWLQLG ETRDL
|
| |