Gene Smed_2358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2358 
Symbol 
ID5323219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2433714 
End bp2435039 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content62% 
IMG OID640791296 
Productextracellular solute-binding protein 
Protein accessionYP_001328025 
Protein GI150397558 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTGA GAACTTTCCT GCTGGGCACG TGCTCGGCAG CCGCACTGGC CGGCTTGACT 
CACGCGGGCT GGGCCCAAGC GGAGACCCTG ACCATTGCCA CCGTGAACAA TGGCGACATG
ATCCGGATGC AGAAGCTGAC GGACGATTTC ACGTCGAAGA ACCCGGACAT CCAACTCGAG
TGGGTCACTC TTGAGGAAAA CGTCCTGCGC CAGCGCGTCA CGACGGACAT TGCGACCAAG
GGCGGTCAGT ACGACATCAT GACGATCGGC ACCTATGAAG TGCCGATCTG GGCGAAACAG
GGCTGGCTCC TGCCTCTGGA CAATCTCGGC CCCGAATACG ACGTAGACGA CCTTCTGCCG
GCGATCCGCA GCGGCCTGAC CATCGATGGC AAGCTTTATG CCGCGCCCTT CTACGGCGAA
AGCTCGATGG TCATGTATCG CAAGGACCTG TTCGAGAAGG CGGGTCTCAC CATGCCCGAT
GCGCCGACCT GGGAATTCGT TGCCGAAGCG GCTCGCAAGA TCACCGACAA GAGCAACGAG
ATCTACGGCA TCTGCCTTCG CGGAAAGGCT GGATGGGGCG AGAACATGGC CTTCCTGACG
GCCACGGCAA ACGCCTTTGG CGCCCGCTGG TTCGATGAGA ACTGGAAGCC GCAATTCGAT
CAGCCGGAGT GGAAGAACGC TCTCGACTTC TACGTCAAGC TGATGAATGA CGCCGGCCCC
CCCGGTGCCT CGTCCAACGG CTTCAACGAA AACCTGTCGC TGTTCCAGAC CGGCAAGTGC
GGGATGTGGA TCGACGCGAC CGTCGCCGCC TCCTTCGTCA CAAATCCGAA GGAGTCGACT
GTCGCCGACA AGGTTGGTTT CGCGCTCGCT CCCGATACCG GCCTCGGAAA GCGCGGCAAC
TGGCTCTGGG CCTGGAACCT CGCGGTTCCG GCGGGCACGC AGAAGGCCGA AGCGGCGCAG
AAGTTCATCG CCTGGGCAAC GGGCAAGGAA TATCTGAATC TGGTTGCCGA GAAGGAGGGC
TGGGCGAATG TTCCTCCCGG CACCCGCATC TCTCTCTATG AGAACCCGGA ATACCAGAAG
GCGGCGCCCT TCGCGAAGAT GACGCTGGAC TCGATCAATG CGGCCGACCC GAAGAACCCG
GCGGTGAAGC CGGTGCCATA TGTCGGCGTT CAGTTCGTGG CGATCCCGGA ATTCCAGGGC
CTCGGCACGG CGGTCGGGCA GGTATTCTCG GCAGCTTTGG CCGGCCAGAT GAGCGTCGAC
CAGGCACTCG CGAGCGCACA GCAGCTGTCG ACCCGCGAAA TGACCAAGGC CGGCTACATC
AAGTGA
 
Protein sequence
MNLRTFLLGT CSAAALAGLT HAGWAQAETL TIATVNNGDM IRMQKLTDDF TSKNPDIQLE 
WVTLEENVLR QRVTTDIATK GGQYDIMTIG TYEVPIWAKQ GWLLPLDNLG PEYDVDDLLP
AIRSGLTIDG KLYAAPFYGE SSMVMYRKDL FEKAGLTMPD APTWEFVAEA ARKITDKSNE
IYGICLRGKA GWGENMAFLT ATANAFGARW FDENWKPQFD QPEWKNALDF YVKLMNDAGP
PGASSNGFNE NLSLFQTGKC GMWIDATVAA SFVTNPKEST VADKVGFALA PDTGLGKRGN
WLWAWNLAVP AGTQKAEAAQ KFIAWATGKE YLNLVAEKEG WANVPPGTRI SLYENPEYQK
AAPFAKMTLD SINAADPKNP AVKPVPYVGV QFVAIPEFQG LGTAVGQVFS AALAGQMSVD
QALASAQQLS TREMTKAGYI K