Gene Smed_3562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3562 
Symbol 
ID5324450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3767908 
End bp3769875 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content62% 
IMG OID640792511 
Productribokinase-like domain-containing protein 
Protein accessionYP_001329212 
Protein GI150398745 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0524] Sugar kinases, ribokinase family
[COG3892] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.249096 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCAGA TTTCATCGGC TGAAGTTCCA TCGGCTGACA TTCCTGCAAC AGAGGGAACC 
AGGCCTCTCG ACATCATCAC GATCGGCAGA GCCTCGGTCG ATCTTTACGG GCAGCAGATC
GGCACGCGGC TGGAAGACGT GGCGAGCTTT GCCAAGTCGG TCGGCGGCTG CCCCTGCAAC
ATCTCGGTCG GTACCGCGCG ACTTGGCCTG AGATCCGCCT TGCTGACCCG CGTCGGCAAC
GAGCAGATGG GCCGCTTCAT TCGCGAGCAG CTTCAGCGTG AAGGGGTCGA GACGCGCGGC
ATCGTCACCG ATCCGGAGCG GTTGACCGCG CTTGCGATTC TTTCCGTCGA AAACGAAAAA
TCCTTCCCGC TGCTCTTTTA CCGCGACAAT TGCGCCGATA ACGCGCTCAG CGAGGATGAT
GTGGCGGAAG ACTTCATTCG CTCCGCGCAT GCGATCCTCG TTACCGGCAC GCATTTCTCG
AAGCCCAATA CGGACGCAGC CCAGCGCAAG GCGATCAGGA TTGCCAAGGA AAACGGTTCC
AGGATCGTCT TCGACATCGA CTACCGCCCT AATCTCTGGG GCCTTGCGGG CCACGATGCG
GGCGAGAGCC GTTATATAGC CTCCGACCGC GTCTCCGCAC ATCTGCGAAC CGTCCTTGGC
GATTGCGACC TGATCGTCGG CACCGAGGAA GAAGTGCTGA TCGCATCAGG CGAAAACGAT
CTGCTCGCGG CGCTCAAGTC CATCCGCTCG CTTTCCAAGG CCACGATCGT GCTCAAACGC
GGACCGATGG GCTGCATCGT CTATGACGGA CCGATCTCGG ACGACCTCGA AGACGGTATC
GTCGGCAAGG GCTTCCCAAT CGAGGTTTAC AACGTCCTTG GCGCCGGCGA TGCTTTCATG
TCCGGTTTCC TGCGCGGCTG GCTGAGGGGC GAGCCGCATG CGACCTGCGC GACCTGGGCG
AATGCCTGCG GCGCCTTCGC GGTTTCCCGC CTGCTCTGCG CGCCTGAAAT CCCGACCTGG
ACCGAGCTGC AGTACTTCCT CGAGCACGGC AGCAAGGTGA AGGCGCTTCG CAAGGACGAG
GCGATCAACC ACGTGCATTG GGCAACGACG CGCAGGCGCG AGATACCGCT GCTGATGGCG
CTTGCCGTCG ATCACCGCAG CCAGCTCGAA GACATTGCCG AGGGAAATCC GGAACTGCTC
TCGCGCATAC CGGCCTTCAA GGTCCTCGCC GTCAAGGCGG CGGCGGAGGT GGCCGCCGGC
CGCTCCGGCT TCGGGATGCT CATCGACGAC AAATACGGAC GCGATGCGCT TTATGCTGCC
GGCGCCTATC GCGATTTCTG GATCGGAAAG CCCGTCGAGC TGCCGGGCTC GCGGCCGTTG
CAGTTCGAAT TCAGCCAGGA TCTCGGCAGC CGCCTTATCG AGTGGCCGGT CGACCATTGC
ATCAAAGTGC TTTCCTTCTA CCACCCTGAC GATCCGGCCG AACTCAAGAC CGCCCAGATT
GCCAAGCTTC GTTCGGCCTT CGAGGCGGCG CGCAAGGTCG GACGCGAGAT CCTGATCGAG
ATCATCGCCG GCAAGCATGG ACCACTCGAC GACCGGACTG TACCGAGAGC GCTCGAGGAA
CTCTATGATG CAGGCTTGAA GCCGGACTGG TGGAAGCTCG AGCCCCAGGC AAGCCGCGCA
GCCTGGAGAG CCATCGATGC CGTGATCGAG CGGCGCGACC CGCTTTGCCG GGGCGTGGTG
CTCCTCGGCC TGGAAGCACC CTATGAAGTG CTGAAGAATG GGTTCGCGGC GGCCAGAACA
TCGAAGACGG TCAGGGGATT TGCCGTCGGA AGGACGATCT TCGCCGATGC CGCCAGAGCC
TGGCTCTCCG GCGGGATGAC CGACGAACAG GCGATCACCG ACATGGCGGC AAAGTTCAAG
GCACTCGTGG ATCTTTGGCT GCAACTGGGC GAGACCAGGG ATCTATAG
 
Protein sequence
MSQISSAEVP SADIPATEGT RPLDIITIGR ASVDLYGQQI GTRLEDVASF AKSVGGCPCN 
ISVGTARLGL RSALLTRVGN EQMGRFIREQ LQREGVETRG IVTDPERLTA LAILSVENEK
SFPLLFYRDN CADNALSEDD VAEDFIRSAH AILVTGTHFS KPNTDAAQRK AIRIAKENGS
RIVFDIDYRP NLWGLAGHDA GESRYIASDR VSAHLRTVLG DCDLIVGTEE EVLIASGEND
LLAALKSIRS LSKATIVLKR GPMGCIVYDG PISDDLEDGI VGKGFPIEVY NVLGAGDAFM
SGFLRGWLRG EPHATCATWA NACGAFAVSR LLCAPEIPTW TELQYFLEHG SKVKALRKDE
AINHVHWATT RRREIPLLMA LAVDHRSQLE DIAEGNPELL SRIPAFKVLA VKAAAEVAAG
RSGFGMLIDD KYGRDALYAA GAYRDFWIGK PVELPGSRPL QFEFSQDLGS RLIEWPVDHC
IKVLSFYHPD DPAELKTAQI AKLRSAFEAA RKVGREILIE IIAGKHGPLD DRTVPRALEE
LYDAGLKPDW WKLEPQASRA AWRAIDAVIE RRDPLCRGVV LLGLEAPYEV LKNGFAAART
SKTVRGFAVG RTIFADAARA WLSGGMTDEQ AITDMAAKFK ALVDLWLQLG ETRDL