Gene Smed_4979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4979 
Symbol 
ID5318792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1492278 
End bp1493597 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content62% 
IMG OID640776761 
Productextracellular solute-binding protein 
Protein accessionYP_001313693 
Protein GI150377097 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.336131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.060129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGGC TGCTTTCCGG CGTATCGGCC GGTGTAATCA TGCTTGTCTG CGCTATGGGG 
GCGGCAAGTG CCGCTGACCT GCCGGGCAAG TTCGAGGGTG TTACCATCGA CGCCAAGCTG
ATTGGCGGCC AGCAATATGA AAAGCTCTAC GAGCGCATCG GCGAATGGGA GAAGGCGACC
GGCGCCAAGG TCAACATTCT GTCGAAGAAG AACCACTTCG AACTCGACAA GGAGATCAAA
TCCGACATCG CCACCGGCGG CATCACCTGG TGCATCGGTT CGAATCATTC CTCATTCGCA
CCGCAATATC CGGATATCTA CGCCGATCTT TTCGGCCTCG TCCCGTCCGA GGAGGTCGCC
AGGTTCGTGC CGGCCGTGAT CGATGCCTCG ACGCTCGAGG GCAAGCTTGT CATGCTGCCG
CGGGCGCAAT TCGACGTCTC GGCACTCTAT TACCAGAAGA GCCTTTATCA GGACGAGGCG
AAGAAGGCCG AGTTCAAGGC CAAGTACGGC TATGATCTCG CACCGCCCGA CACCTGGGCT
CAGGTGAGCG ATCAGGCGGA ATTCTTCGCC GCGCCGCCGA ACTTCTACGG CACGCAGTTC
GCCGGCAAGG AGGAAGCGAT CAACGGCCGC TTCTACGAGA TGCTGGTCGC CGAGGGTGGA
GAATATCTCG ACAAGGACGG CCGACCGGCG TTCAATTCGG AAGCGGGCGT GCGCGCCCTC
GATTGGTTCG TCAAGCTCTA CAAAAGCAAA GCGGTTCCGC CGGGCACCAC CAACTATCTC
TGGGACGATC TCGGCCAGGG CTTCGCTTCG GGCTCGATCG CGGTCAATCT CGACTGGCCT
GGCTGGGCGT CCTTCTTCAA CGATCCGAAA TCCTCGAAGG TTGCCGGCAA TGTCGGGGTG
AAGGTTCAGC CGGCCGGGTC GTCCGGCAAG CGCACCGGCT GGTCCGGGCA TCACGGCTTT
TCGGTAACGG AGTCCTGCGG GAGCAAGGAG GCAGCCGCCT CGCTCGTCTG GTGGCTGACC
AACGAAGACA GCCAGATGCT CGAATCCGCA GCCGGCCCTC TTCCGACCCG CAGCGCCGTA
TGGGATCACA ACATCAAGGC CGCCGAGGGC GACGCCTACA AGACCGAGGT GCTGCAGGCC
TTCCAGGAGG CAGCAAAGCA CGCCTTCCCG GTTCCGCAGA CGGCCGAGTG GATCGAGATA
TCCAATGCCG TCTATCCGGA GCTTCAGGCG GCCATCCTCG GCGACAAGAC GTCGAAGGAA
GCGCTCGATG CTGCCGCCGA AAAGGCGACC GGCATCCTCG AAGACGCCGG CAAGCTCTAG
 
Protein sequence
MNRLLSGVSA GVIMLVCAMG AASAADLPGK FEGVTIDAKL IGGQQYEKLY ERIGEWEKAT 
GAKVNILSKK NHFELDKEIK SDIATGGITW CIGSNHSSFA PQYPDIYADL FGLVPSEEVA
RFVPAVIDAS TLEGKLVMLP RAQFDVSALY YQKSLYQDEA KKAEFKAKYG YDLAPPDTWA
QVSDQAEFFA APPNFYGTQF AGKEEAINGR FYEMLVAEGG EYLDKDGRPA FNSEAGVRAL
DWFVKLYKSK AVPPGTTNYL WDDLGQGFAS GSIAVNLDWP GWASFFNDPK SSKVAGNVGV
KVQPAGSSGK RTGWSGHHGF SVTESCGSKE AAASLVWWLT NEDSQMLESA AGPLPTRSAV
WDHNIKAAEG DAYKTEVLQA FQEAAKHAFP VPQTAEWIEI SNAVYPELQA AILGDKTSKE
ALDAAAEKAT GILEDAGKL