Gene Smed_2696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2696 
Symbol 
ID5323565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2802494 
End bp2803990 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content62% 
IMG OID640791640 
Productextracellular solute-binding protein 
Protein accessionYP_001328361 
Protein GI150397894 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGA AAGACAGAAA GTTTTATCTC GCGTCTGTGA CCGACCGCTT CGTCCGCGGC 
CAGATGGACC GCCGCAGCTT CCTGCGTACG GCCGGTACGC TCGGCCTCGG CGCAACCGCG
CTCGGGATGG GCTTCGGCAG CCGGCCCTTC GGGGTGAGCC GGGCGCTTGC CCAGGAACAG
CTCGAGCCGT CCGCCGAAGT GATCTCCTGG CTCAAGGACG TCGCCAAACC GTTTGCGGGG
ACGACGCTGA AGCTGGCGAC GGAATCGACC CCGCCGTCGA ACGCCATCAA CTCCCAGCTT
AAGAAATATT TCGAGGAAGC GACCGGCATC AGGGTCGAGA TCGAGGTCCT GCCGCTTGAG
CAGGTTCTGC AGAAGCTGAC ACTCGATGTC GCCTCCTCAC TTGGCAGCTA TGACCTCTAC
TATATCGACC AGAGCTGGTC GGCATCTTTC AGCCAGGACG TGTTCGATCC GCGCGAGCAG
CTTCAGGAAA AGCCGGATCT CGCCATGCCG AACTACAATA TCGATGACTT CATGCCTGCG
CTCGTCGACG GGATCGCCAA ATACGAGGAC CGTTGGGTCG GCGTACCCTA CGACATCCCC
ATCTTCATCA TGATCTACCG CAAGGACATC TACGAGAAGC TCGGCCTCAA GGCTCCGGCA
ACATTCGAGG ATCTGCTGAA CAATTCCGTC ACCATCACCA AGGAAATGGG ACCGAACCTC
TATGGCTACG CCGGTCAGAT GAAGTCCGGC CACTACGCGC TGGAATGCGA ATGGACTTCC
ATGCTCTGGG GCAATGGCGG CTCGATCTTC AATGCCGACA AGAAGTTCGT GGGCAATGAC
GAGCGGGGCA TTGCCGCGCT CGACTACTAT ACCAAGCTCA AGGAAACCAT GCCGCCGGGC
GTGGATCAGT GGACCTGGGA CGGTCAGGGC CAGGCGATCG CTCAGGGCGT TGCCGCCTCG
ATGCTTTCCT GGGGCGAGTT CTTCCCCTAC TTCGACGATG CCAGCCAGAC CAAGGTTTCC
GGCCTCTGCG AGGCGGTCGT TCCGCCGCAG CCGGTGGCGC TCAGGAAACC CGAAGAATGC
GGGTATGGCG AAATTCCCGG CACCGGCCAC CAGGGCGGCT CGTCACTCGC GGTGTCCAGA
TACTCCAAGA GCCCGGATGC GGCATGGATC TTCATGCAGT GGGCGACTTG CGCCGACACG
CAGGCCCTCA TCACCACGCT CGGCGGCGGT ACCGGCCCCA CCCGCGCCTC TGTCTATGAC
GACCCGCGCG TCAAGGCAAA TGCCCGCGTC GGCGCCGGCA CGACCCGCCA CCTGTCGGTC
GTCCGCGAGA CCATCGACAA ATATATGGGC TCGGAGCCGG ACCTGCCGGA ATGGGCCCAG
CTGTCGAGCG ATACAATTCC CGTGGGCCTC GGCAAGTACT TCGCCGGCAG CTACGGCTCC
TCCAAGGAAG CGATGGACGA CATCGCCTCT CAGGTCGAGA CCGTGCTGAA GGGCTGA
 
Protein sequence
MKEKDRKFYL ASVTDRFVRG QMDRRSFLRT AGTLGLGATA LGMGFGSRPF GVSRALAQEQ 
LEPSAEVISW LKDVAKPFAG TTLKLATEST PPSNAINSQL KKYFEEATGI RVEIEVLPLE
QVLQKLTLDV ASSLGSYDLY YIDQSWSASF SQDVFDPREQ LQEKPDLAMP NYNIDDFMPA
LVDGIAKYED RWVGVPYDIP IFIMIYRKDI YEKLGLKAPA TFEDLLNNSV TITKEMGPNL
YGYAGQMKSG HYALECEWTS MLWGNGGSIF NADKKFVGND ERGIAALDYY TKLKETMPPG
VDQWTWDGQG QAIAQGVAAS MLSWGEFFPY FDDASQTKVS GLCEAVVPPQ PVALRKPEEC
GYGEIPGTGH QGGSSLAVSR YSKSPDAAWI FMQWATCADT QALITTLGGG TGPTRASVYD
DPRVKANARV GAGTTRHLSV VRETIDKYMG SEPDLPEWAQ LSSDTIPVGL GKYFAGSYGS
SKEAMDDIAS QVETVLKG