Gene Smed_4159 details

Gene Information

Locus tag: Smed_4159
Symbol:
ID: 5319208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 632630
End bp: 633895
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 62%
IMG OID: 640775964
Product: extracellular solute-binding protein
Protein accession: YP_001312897
Protein GI: 150376301
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
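
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record; names are illustrative only.
start_bp = 632630
end_bp = 633895

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the listed Gene Length (1266 bp) and Protein Length (421 aa).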


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.119299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.873951
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCCCC GCATTTGGGA GGAATGCAAA ATGTTCACGA AACTTATGGC GGCAACGGCC 
CTCATATCGG CCAGCATGAT CTCGGTCGCC TCGGCCGAGA CGATCAGCAT GTGGGTGCGC
TCGGGTATAG GCGACTCGTT CAAGGAAGTC GTGAAGGCCT ACAACACAGC GCACGAGAAC
AAGGTGGAAC TCACCGAGGT GCCCTTTGCC GAGCTCGTGC AGAAATATGC GACGGCGATC
GCCGGCGGGC AGGCACCCGA CGCGCTGTCC CTCGACCTGA TCTACACGCC GGCCTTTGCA
GCTGCCGGCC AGCTGGAAGA CCTCACCGAC TGGGCGAAGG CGCTACCCTA TTTCAACTCG
CTGTCGCCGT CGCACGTCAA GCTCGGCACC TATGAGGACA AGATCTACGG CCTGCCGCTG
ACGGTCGAGA CCTCAATTTT CGCCTGGAAC AAGGACCTCT ACAAGAAGGC CGGCCTCGAT
CCCGAAAAAG CGCCGGCCAC CTGGGAGGAG ATCACTGCCA ACGCCGAGAA AATCCGTGGG
CTCGGCGGCG ATACATACGG CTTCTATTTC TCCGGCGGCG GCTGCGGCGG CTGCATGATA
TTCACCTTCA CGCCCCTGAC CTGGGGTGCG GGTGCGGATA TCCTTTCGGC CGACGGCAAG
ACGGCGACGC TCGACACGCC CCCGATGCGC AAGGCCGTCG ACATCTACCG CAACATGATC
GCGAAGGATC TGGTGCCGGC GGGTGCTGCA AGCGACAACG GGGTGAATTT CCTGAGCTTC
ACCAATGGCA AGATCGGCCA GCAAAGCCTC GGCGCCTTCG CGATCGGCAC ACTGGTGACG
CAGCATCCGG AGATTGATTT CGGCGTGACG CTGATCCCGG GCGTCGACGG CAAGCCGTCC
TCCTTTGCCG GCGGGGACAA TTTCGTCGTC ACCAAGGGCA CGCCTAAGCT CGCGGACGTC
AAGGAATTCC TTGAATACAC CTATTCGCCC GAAGGCCAGA AGATCATGGC GAAGTATGGC
AGCCTGCCGA CCCGCGGCGA CATCGCCAAT GAGGTGCTCG AGGGCCTTGA CCCGCGCCTG
AAGGTCGGTC TCGACGCGAT CGCCGTCGCC AAGACGCCCT ACACGCTGCA GTTCAACGAT
CTGATCAACA GCGCAAACGG CCCATGGGCG ACCTTCACCA ATGCCGCGAT CTACGGCGAC
GACGTCGACG GTGCCTTCGC GGATGCGCAA GCAGAGATGC AGTCGATCAT CGACGCGGGG
CAGTAA
 
Protein sequence
MWPRIWEECK MFTKLMAATA LISASMISVA SAETISMWVR SGIGDSFKEV VKAYNTAHEN 
KVELTEVPFA ELVQKYATAI AGGQAPDALS LDLIYTPAFA AAGQLEDLTD WAKALPYFNS
LSPSHVKLGT YEDKIYGLPL TVETSIFAWN KDLYKKAGLD PEKAPATWEE ITANAEKIRG
LGGDTYGFYF SGGGCGGCMI FTFTPLTWGA GADILSADGK TATLDTPPMR KAVDIYRNMI
AKDLVPAGAA SDNGVNFLSF TNGKIGQQSL GAFAIGTLVT QHPEIDFGVT LIPGVDGKPS
SFAGGDNFVV TKGTPKLADV KEFLEYTYSP EGQKIMAKYG SLPTRGDIAN EVLEGLDPRL
KVGLDAIAVA KTPYTLQFND LINSANGPWA TFTNAAIYGD DVDGAFADAQ AEMQSIIDAG
Q
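
As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates a DNA string codon by codon and computes its GC fraction. It is a minimal example, not the pipeline that produced this record. The helper names (clean, translate, gc_content) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, the sketch does not handle alternative starts.

```python
# Codon table in TCAG order; amino-acid assignments shared by the standard
# code and translation table 11 (alternative start codons are NOT handled).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGTGGCCCC GCATTTGGGA GGAATGCAAA"))  # MWPRIWEECK
```

Passing the full gene sequence to translate should reproduce the protein sequence listed above, and gc_content on the same input can be compared against the GC content field.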