Gene Smed_4930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4930 
Symbol 
ID5318246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1439595 
End bp1440584 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content61% 
IMG OID640776713 
Productputative sugar uptake ABC transporter periplasmic solute-binding protein precursor 
Protein accessionYP_001313645 
Protein GI150377049 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.947336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGG CACTATTGCT TGCCGTTGCC GCTCTGGCAC TCAGCGCCGG AACGACCATG 
GCCCAGAAGA AACAGCTCGT CATTGTGGTG AAGGGGCTCG ACAATCCCTT CTTTGAAGCC
ATCAACCAGG GTTGCCAGAA ATGGAACAAG GAAAATCCGG ACTCCGAATA CGAATGTTTC
TACACAGGCC CGGCTTCAAC TTCCGATGAG GCAGGCGAGG CCCAGATCGT CCAGGACATG
CTGGGCAAAG CTGAAACCGC CGCCATCGCC ATCTCGCCGT CCAATGCGAA ACTCATCGCC
CAGACGCTGA AAACCTCAAA CCCTACCGTC CCGGTGATGA CCGTGGATGC GGATCTCGCG
GCCGAGGATT CGGCCCTGCG CAAAACATAT CTGGGAACCG ACAACTACCT GATGGGCTAC
CGCATCGGCG AGTACATCAA GAAAGCCAAG CCCGATGGCG GCAAGATCTG CACCATCGAG
GGTAACCCGG GGGCCGACAA CATTCTGCGG CGCGCCCAGG GCATGCGCGA CGCGCTGACC
GGCCAGAAGG ACCTGGCGGA GCTCAAGGGC GAAGGCGGCT GGACCGAAGT GGCCGGTTGC
CCCGTCTTCA CCAATGACGA CGGCGCCAAG GGCGTGCAGG CGATGACGGA CATCCTTGCC
GCCAACCCCG ACCTGGACGC TTTCGGGATC ATGGGGGGAT GGCCGCTGTT CGGCGCGCCG
CAGCCCTATC GCGACCTGTT CAGGCCGGTG GCCGACAAGA TCGCCAAGAA CGAATTCGTC
ATCGGTGCCG CCGACACGAT CGGCGAGGAG GTCGCGATCG CGCGGGAAGG ATTGGTCACC
GCTCTGGTTG GACAGCGGCC GTTCGAAATG GGCTATAAGG CACCTCAGGT GATGCTCGAC
CTGATCGCCG GTAAACCTGT CGAAGACCCG GTCTTTACCG GCCTCGACGA GTGCACAAAA
GAGACCGCGG ACACCTGCAT TCAGAAATAG
 
Protein sequence
MRKALLLAVA ALALSAGTTM AQKKQLVIVV KGLDNPFFEA INQGCQKWNK ENPDSEYECF 
YTGPASTSDE AGEAQIVQDM LGKAETAAIA ISPSNAKLIA QTLKTSNPTV PVMTVDADLA
AEDSALRKTY LGTDNYLMGY RIGEYIKKAK PDGGKICTIE GNPGADNILR RAQGMRDALT
GQKDLAELKG EGGWTEVAGC PVFTNDDGAK GVQAMTDILA ANPDLDAFGI MGGWPLFGAP
QPYRDLFRPV ADKIAKNEFV IGAADTIGEE VAIAREGLVT ALVGQRPFEM GYKAPQVMLD
LIAGKPVEDP VFTGLDECTK ETADTCIQK