Gene Smed_5582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5582 
Symbol 
ID5319884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp548777 
End bp550051 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content58% 
IMG OID640777329 
Productextracellular solute-binding protein 
Protein accessionYP_001314261 
Protein GI150377666 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.152115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.231128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAATT GGTTTAAACT TACGAAGGCA GTGCTTTTGG CGGGAGGTCT CATGACCTCC 
GTCGCCCACG CCGAGGAGAC CGTCACATGG TGGGACTTCT TCGGCGGCGG TGACGGCGTT
CGCATGAAAC AATTGGTCGC CGATTTCAAC GAGGCCCACA AGGGCAAGAT TAAGATCGAT
GCGACCACGC TGGAATGGGG AACGCCATTT TATTCCAAGG TCCAAACCTC TGCGGCGGTT
GGCGAAGCCC CTGATATCAT GACGTATCAC GCCAGCCGCA TCCCGCTAGC GGTGAAGCAG
GGCATTCTGC AGGAAATCAC CGCCGACGAC TGGAAGACGA TGGGCCTCGG ACGATCTGAC
TACGCCCCCG CGACTTGGGA GGCTGTGAGC ATCGACGGCA AACAGTATGC GGTGCCGCTC
GATACACACC CAATCGTCCT TTATTATAAC AAGGACCTGC TCAAGAAAGC CGGAGTGCTC
GGCGACGACG GCAAGCCGAA GGGCATGGAC TCGCGTGAGA ACTTCACCGC GACGCTGAAG
GCCTTGAAGG CCGCTGGGGT CAAGTTCCCG CTTGGCTCGG TGACTGCGGA CGGCAACTTC
ATGTTCCGCA CCGTCTATTC CTTCATGGGC CAGCAGGATG GTGAGCTCAT GACGGACGGT
GAGTTTCTTG CTGGCGACAG CGCAAAGAAG CTCGAAAACT CTCTTGCCGT TCTGTCCGAA
TGGACGAAGG AAGGGCTTCA ATCCACCTAT ACCGACTACC CTGCGACCGT GGCGCTCTTC
ACATCGGGTG AAGCTGCGAT GATGATCAAC GGGGTTTGGG AAGTCCCCAC TATGACGGAC
CTCAAGAACA ACGGCAAGCT CTTCGAATGG GGAGCCGTCG AACTCCCTGT AATCTTCGAT
CATCCCTCCA CCTACGCCGA CAGTCACACC TTCGCACTCC CTGCCAACAA GGGCGAGGAG
ATGAGTCCCG AGAAGCGCGC CGCGGTCCTC GAGGTCATGA GCTGGATGTC AAAGAATTCG
TTGTTCTGGG CGACCGCGGG GCATATCCCC GCATATGGTC CGGTCACGAA CTCGGCCGAG
TACAAGGCGA TGGAGCCGAA CCACACATAT TCTTCGCTCA CCGCGAACAT CATCTTCGAT
CCCAAGACTC CGCTGGCGGG CATCGCTGGG CCAATTTTTG ACGTGATGTC GAATTTCTTC
GTGCCGACGC TCAATGGTGA AATGGAGCCC GCGAAAGCGG TCGAGGAGAT CAAGGCCGGT
TTGGCCGAGC TTTGA
 
Protein sequence
MPNWFKLTKA VLLAGGLMTS VAHAEETVTW WDFFGGGDGV RMKQLVADFN EAHKGKIKID 
ATTLEWGTPF YSKVQTSAAV GEAPDIMTYH ASRIPLAVKQ GILQEITADD WKTMGLGRSD
YAPATWEAVS IDGKQYAVPL DTHPIVLYYN KDLLKKAGVL GDDGKPKGMD SRENFTATLK
ALKAAGVKFP LGSVTADGNF MFRTVYSFMG QQDGELMTDG EFLAGDSAKK LENSLAVLSE
WTKEGLQSTY TDYPATVALF TSGEAAMMIN GVWEVPTMTD LKNNGKLFEW GAVELPVIFD
HPSTYADSHT FALPANKGEE MSPEKRAAVL EVMSWMSKNS LFWATAGHIP AYGPVTNSAE
YKAMEPNHTY SSLTANIIFD PKTPLAGIAG PIFDVMSNFF VPTLNGEMEP AKAVEEIKAG
LAEL