Gene Smed_4473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4473 
Symbol 
ID5318175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp954054 
End bp955121 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content60% 
IMG OID640776274 
Productputative sugar uptake ABC transporter periplasmic solute-binding protein precursor 
Protein accessionYP_001313206 
Protein GI150376610 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.659853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCT TTACTTCGCT TCTCGCAGCT GCGGCGATGA CGGTCGCCGG CTTCGCTGCG 
CCGGCAGTCG CCCAGGACAA GGGCATGGTC GGCATCTCCA TGCCGACAAA GACGTCGACG
CGCTGGATTT CCGACGGCGA AACCATGGAG AAGCTGTTCA AGGATGCAGG CTATACGCCG
GACCTGCAAT TCGCCGACGA CGATATTCCG AACCAGCTCG CGCAGATCGA GAACATGGTG
ACCAAGGGCG CGAAGGTCCT CGTCATCGGC GCCATCGACG GCACGACGCT CTCCGACATT
CTGCAGAAGG CCGCCGACGC CGGCGTCAAG GTCATCGCCT ATGATCGCCT GATCCGCGAT
TCCGGCAATG TCGACTACTA TGCCACCTTC GACAACTTCC AGGTTGGCGT CCTGCAGGCG
ACCTCGCTCG TCGAGGGTCT GAAGCTCGAC AGCGCGACCG AGCCGAAGAA CATCGAACTT
TTCGGCGGCT CGCCGGACGA CAACAACGCC TTCTTCTTTT ACGACGGTGC AATGTCCGTT
CTGCAGCCTC TGATTGACAG CGGCAAGCTT GTCGTCAAGT CCGGCCAGAT GGGCATGGAC
CAGGTCGGTA CGCTGCGCTG GGACGGTGCT GTGGCTCAGG CCCGCATGGA AAACCTGCTG
TCGTCGGCCT ATACCGATGC GAAGGTCGAC GGCGTTCTGT CGCCCTATGA CGGACTGTCG
ATCGGCATCA TCTCTGCTCT CAAGGGCGTC GGTTACGGCT CCGGCGACAT GCCGATGCCG
ATCGTCACCG GTCAGGACGC CGAGCTGCCT TCGGTCAAGT CCATCCTTGC GGGCGAACAA
CATTCCACGG TCTTCAAGGA CACCCGTGAA CTCGCCAAGG TCACGGTCAA CATGGTCAAC
GCGATCATGG ACGGCAAGGA GCCGGAAGTT AACGACACCA AGACGTATGA AAACGGAGTC
AAGGTCGTTC CGTCCTATCT GCTGAAGCCC GTTTCCGTAG ACAAGTCGAA CGCCAAGGAC
GTTCTTGTCG GCTCCGGCTA CTACACGGAA GATCAGCTCA ACAACTGA
 
Protein sequence
MKFFTSLLAA AAMTVAGFAA PAVAQDKGMV GISMPTKTST RWISDGETME KLFKDAGYTP 
DLQFADDDIP NQLAQIENMV TKGAKVLVIG AIDGTTLSDI LQKAADAGVK VIAYDRLIRD
SGNVDYYATF DNFQVGVLQA TSLVEGLKLD SATEPKNIEL FGGSPDDNNA FFFYDGAMSV
LQPLIDSGKL VVKSGQMGMD QVGTLRWDGA VAQARMENLL SSAYTDAKVD GVLSPYDGLS
IGIISALKGV GYGSGDMPMP IVTGQDAELP SVKSILAGEQ HSTVFKDTRE LAKVTVNMVN
AIMDGKEPEV NDTKTYENGV KVVPSYLLKP VSVDKSNAKD VLVGSGYYTE DQLNN