Gene Smed_5858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5858 
Symbol 
ID5320160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp820037 
End bp821344 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content56% 
IMG OID640777553 
Productextracellular solute-binding protein 
Protein accessionYP_001314485 
Protein GI150377890 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.437643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA CAGTGGCCGG ACTAATGACC GGCATCGGTT TCATGTTTGC CTGCGGAACA 
TCGGCGCAGT CGCAAGAACT GACGATCTTC TGGGCGGAAT GGGATCCAGC CAACTACCTC
CAGGAGCTTG TAAACGAGTA CGAAGCCGAG ACCGGTGTGA CGATCACCGT GGAGACGACA
CCTTGGGCGG ATTTCCAGAC GAAGGCCTTC ACCGAGTTCA ATGCCAAGGG ATCAGCCTAT
GATATGGTCG TCGGCGACTC TCAATGGATC GGGGCCGCGT CGGAGGCCGG CCACTACGTG
GATCTCACGG AGTTCTTCAA CAAGCACAAG CTAAAGGAGG TAATGGCCCC GGCGACCGTG
AAGTACTACG CGGAGTACCC CGCAAACTCC GGTAAATACT GGTCGATACC GGCCGAAGGC
GACGCTGTCG GTTGGTCCTA TCGTAAGGAT TGGTTTGAAG ATCCAAAGGA AATGGAGGCC
TTCAAGGCGA AGTATGGCTA TGACCTTGCT CCTCCGAAGG ATTGGAAACA ACTGCGTGAC
ATCGCCGAAT TCTTCCATCG TCCGGACCAG AAGCGCTACG GCATCGCAAT CTACACCGAC
AACTCCTATG ACGGATTGGT CATGGGCGTA GAGAACGCCA TCTTCTCTTT CGGTGGAGAA
CTCGGCGACT ACAGCACCTA CAAGGTGGAC AGCATCATCA ACTCGGAGAA GAACGTCAAG
GCTCTGGAAA CTTACCGCGA GCTCTATGGT TTCACGCCTC CGGGCTGGGC CAAGTCTTTC
TTTGTCGAGA ACAACCAGGC TATCACCGAG AACCTGGCAG CGATGAGCAT GAACTACTTC
GCCTTCTTCC CGGCGCTGGT CAATGAAGCT TCGAACCCGA ACGCGAAGGT CACCGGCTTC
TTCGCCAATC CCGCCGGTCC AGACGGGGAC CAGTATGCCG CTCTTGGCGG CCAGGGCATT
TCGATTGTCT CCTATTCCCA GAACAAGGAA GAGGCGATGA AATTTCTCGA ATGGTTCATC
AAGGACGAGA CACAGAAGCG CTGGGCCGAG CTCGGCGGCT ATACGGCAAG CGCCAAGGTT
CTGGAGTCGG AAGAGTTCCA GAACGCGACC CCATACAACA AGGCATTTTA CGAGACCATG
TTCCGGGTGA AGGACTTCTG GGCAACACCG GAATATGCCG AGTTGCTGAT CCAGATGAAC
CAGCGCATCT ATCCTTATGT CACCGCGGGC CAGGGCACGG CAAAGGAGGC GCTCGACGCA
CTTGCTGAGG ACTGGAATGC AACCTTCAAG AAGTACGGCC GCCACTAA
 
Protein sequence
MRKTVAGLMT GIGFMFACGT SAQSQELTIF WAEWDPANYL QELVNEYEAE TGVTITVETT 
PWADFQTKAF TEFNAKGSAY DMVVGDSQWI GAASEAGHYV DLTEFFNKHK LKEVMAPATV
KYYAEYPANS GKYWSIPAEG DAVGWSYRKD WFEDPKEMEA FKAKYGYDLA PPKDWKQLRD
IAEFFHRPDQ KRYGIAIYTD NSYDGLVMGV ENAIFSFGGE LGDYSTYKVD SIINSEKNVK
ALETYRELYG FTPPGWAKSF FVENNQAITE NLAAMSMNYF AFFPALVNEA SNPNAKVTGF
FANPAGPDGD QYAALGGQGI SIVSYSQNKE EAMKFLEWFI KDETQKRWAE LGGYTASAKV
LESEEFQNAT PYNKAFYETM FRVKDFWATP EYAELLIQMN QRIYPYVTAG QGTAKEALDA
LAEDWNATFK KYGRH