Gene Smed_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3042 
Symbol 
ID5323920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3191144 
End bp3192091 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content61% 
IMG OID640791991 
ProductABC-type sugar transport system periplasmic component-like protein 
Protein accessionYP_001328703 
Protein GI150398236 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTA ATTGGAGGAA GGCGGCGGCC GCCGCCGTCG TAGGATTTCT ATTGAATGCA 
ACCGGCGCGA ATGCGCTGAA TCTCGCCTGG GTGCACGCAA ATGCCGCCGC CCAATCCGAG
CAGCGGGTCA AGGCCGGCTT CGATGCCTGG CTGAAGGAAA CCGGCAAGGA CTGGAATGTG
AGCCTGCTCG ACAGCGGCGG CTCCGGCGAA CGCACCGCAT CCAACCTTCA GGACGCGGCT
TCCCGCGGCG TCGATGCGAT CATCATCACC ATGGCGGATC TGCGCGCGTC CCGCGCCGCA
ATCGATGCCG CGGTCGACGC AAAAATCCCG ATCATCACCG TCGACAGCGG TTACATTCCG
GGCGTTCTGG TCGACGTCAC CACAAATAAC TGGGCCATGT CTTCGGATGT TTCGCCCTAT
CTGCTGAACG AACTGGGTGG GAAGGGCCGC ATCATATTCC TTCGCATGGC CGAACATCAC
GGCACCCGCA AGCGCGGCGA CGTGATGGAG ACCATCCTCA GGGAATACCC GGACGTGAAG
GTTCTGGCCG AGCACAACAT CGACTACACC GCCTTCTTCG AGGATACGAC ATCGACGATG
CAGGATTATG CATCCCGGTT CGGAGACGAG ATCGACGCCG TCTGGGCTCC CTGGGACGAG
CCTGCGCAAG CGGCGATCAA CGTGCTGCAG GCTGCCGGCC TCAAGAACGT GAAGGTTATC
GGCATCGACG GCCATCCCAA TGCCGTCACC GAGGTCTGCA AGCCGGACGG TCTGATGATC
GCCACAGTCA GTCAGCCCTT CGAGAAGATG GGTGCACAGG CCGGCGCGTG GATCGAGGAG
ATCGTCGTCA GGAAAGAAGA CCCAGCCAAG GTCATACCGG CGAAGACGGT CTATATGGAC
GCCCCGTTGG TCACCAAGCA GAACTGCAAG GACTTCCTCC CGAAGTGA
 
Protein sequence
MSANWRKAAA AAVVGFLLNA TGANALNLAW VHANAAAQSE QRVKAGFDAW LKETGKDWNV 
SLLDSGGSGE RTASNLQDAA SRGVDAIIIT MADLRASRAA IDAAVDAKIP IITVDSGYIP
GVLVDVTTNN WAMSSDVSPY LLNELGGKGR IIFLRMAEHH GTRKRGDVME TILREYPDVK
VLAEHNIDYT AFFEDTTSTM QDYASRFGDE IDAVWAPWDE PAQAAINVLQ AAGLKNVKVI
GIDGHPNAVT EVCKPDGLMI ATVSQPFEKM GAQAGAWIEE IVVRKEDPAK VIPAKTVYMD
APLVTKQNCK DFLPK