Gene Smed_2909 details


Gene Information

Locus tag: Smed_2909
Symbol:
ID: 5323786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3052335
End bp: 3054059
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 61%
IMG OID: 640791861
Product: extracellular solute-binding protein
Protein accession: YP_001328574
Protein GI: 150398107
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
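
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3052335
end_bp = 3054059

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1725
print(remainder)       # 0
print(protein_length)  # 574
```

Under these assumptions the computed values agree with the Gene Length (1725 bp) and Protein Length (574 aa) fields.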


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.175958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGGC ACCTTTTGAC AACGACGGCG GCCATGCTGC TGGCTTTCAC CGGCTCGGCC 
TTTGCCGGCA TGGACGAGGC AAAACAGTTT CTGGACAAGG AAGTCGGCGA CATGTCCTCG
CTCGACCGCG CTGCCCAGGA AGCGGAAATG CAGTGGTTCG TCGATGCGGC GAAACCCTTT
GCGGGCATGG AGATCAAGGT CGTCTCCGAA ACGATCACCA CCCACGAATA TGAATCGAAG
GTGCTGGCCC CGGCCTTCAC TGCGATCACC GGCATCAAAA TCACCCATGA CCTGATCGGC
GAAGGCGACG TTGTCGAAAA ACTGCAGACG CAGATGCAGT CGGGCGAAAA CGTCTATGAC
GCCTACATCA ACGACAGCGA CCTGATCGGC ACCCACTGGC GCTACCAGCA GGCGCGAAGC
CTGACCGACT TCATGGCGAA CGAGGGCAAG GACGTCACCA ATCCCAACCT CGACATCGAC
GACTTCATCG GCAAGTCCTT CACCACGGCT CCGGACGGCA AGCTCTACCA GCTTCCCGAC
CAGCAGTTCG CGAACCTCTA CTGGTTCCGC TACGACTGGT TCAACGACGA GAAGAACAAA
GCGGATTTCA AGGCGAAGTA CGGCTACGAT CTCGGCGTTC CCGTCAACTG GTCGGCCTAC
GAGGACATCG CCGAGTTCTT CACCGGCCGC GAAGTCGACG GCAAGAAGGT CTATGGCCAC
ATGGACTACG GCAAGAAGGA CCCGTCGCTC GGCTGGCGCT TCACCGACGC TTGGCTCTCG
ATGGCCGGCA ATGGCGACAA GGGCATTCCG AACGGCAAGC CGGTCGACGA GTGGGGCATC
AAGGTCGACG AAAACTCAAG ACCCGTCGGA TCCTGCGTTG CACGCGGCGG CGATACCAAT
GGCCCGGCAT CCGTCTATGC CATCCAGAAA TATCTCGACT GGATGAAGGC CTATGCACCG
GCCGCTGCTC AGGGCATGAC CTTCTCGGAA TCTGGCCCCG TTCCATCGCA AGGTGAGGTC
GCCCAGCAGA TGTTCACCTA TACGGCCTTC ACCGCCGATT TCGTGAAGGA AGGCCTGCCA
GTCGTGAACG AGGACGGTAC GCCGAAGTGG CGTTTCGCTC CGAGCCCGCA TGGCGTCTAC
TGGAAGGAGG GCATGAAGCT CGGCTATCAG GACGCCGGTT CCTGGACGCT GCTTAAGTCG
ACGCCGGACG ATCGCGCCAA GGCCGCGTGG CTCTACGCGC AGTTCGTGAC CTCGAAAACG
GTAGACGTGA AGAAGAGCCA TGTCGGCCTC ACCTTCATCC GTCAATCGAC GCTCGACCAT
CAGAGCTTCA CCGACCGTGC GCCGAAGCTC GGCGGTCTGA TCGAGTTCTA CCGTTCGCCG
GCCCGCCTGC AGTGGTCGCC GACCGGAACG AACGTCCCTG ACTATCCGAA GCTGGCACAG
CTCTGGTGGC AGGCGATCGG CGATGCTTCT TCCGGTGCGA AGACCGCACA GGAGGCCATG
GACTCGCTCT GCGCCGAACA GGAAAAAGTG ATGCAGCGTC TCGAGCGCGC CGGAGTCCAG
GGCGAAATCG GGCCGAAGCT CGCCGATGAG CACGACCTCG AATACTGGAA CGCGGAAGCC
GTCAAGGCCG GCAACCTCGC ACCGCAGCTC AAGGTCGAGA ACGAGAAGGA TAAGCCGGTC
ACCGTCAATT ACGACGAACT GGTCAAGAGC TGGCAGACCA ACTGA
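
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before its length or GC content can be computed. The following is a minimal sketch of that clean-up; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as a demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above, used as a small demonstration.
first_line = "ATGCGACGGC ACCTTTTGAC AACGACGGCG GCCATGCTGC TGGCTTTCAC CGGCTCGGCC"
seq = clean_sequence(first_line)

print(len(seq))                     # number of bases after removing the spaces
print(round(gc_fraction(seq), 2))   # GC fraction of this line only
```

Passing the full sequence text to the same two functions gives values that can be compared with the Gene Length and GC content fields of the record.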
 
Protein sequence
MRRHLLTTTA AMLLAFTGSA FAGMDEAKQF LDKEVGDMSS LDRAAQEAEM QWFVDAAKPF 
AGMEIKVVSE TITTHEYESK VLAPAFTAIT GIKITHDLIG EGDVVEKLQT QMQSGENVYD
AYINDSDLIG THWRYQQARS LTDFMANEGK DVTNPNLDID DFIGKSFTTA PDGKLYQLPD
QQFANLYWFR YDWFNDEKNK ADFKAKYGYD LGVPVNWSAY EDIAEFFTGR EVDGKKVYGH
MDYGKKDPSL GWRFTDAWLS MAGNGDKGIP NGKPVDEWGI KVDENSRPVG SCVARGGDTN
GPASVYAIQK YLDWMKAYAP AAAQGMTFSE SGPVPSQGEV AQQMFTYTAF TADFVKEGLP
VVNEDGTPKW RFAPSPHGVY WKEGMKLGYQ DAGSWTLLKS TPDDRAKAAW LYAQFVTSKT
VDVKKSHVGL TFIRQSTLDH QSFTDRAPKL GGLIEFYRSP ARLQWSPTGT NVPDYPKLAQ
LWWQAIGDAS SGAKTAQEAM DSLCAEQEKV MQRLERAGVQ GEIGPKLADE HDLEYWNAEA
VKAGNLAPQL KVENEKDKPV TVNYDELVKS WQTN
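
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and `translate` is a hypothetical helper written with the Python standard library only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Spaces and line breaks are ignored; unknown codons become 'X'.
    Alternative start codons are not handled specially here, which is
    sufficient for this gene because it begins with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGCGACGGC ACCTTTTGAC AACGACGGCG"))  # MRRHLLTTTA
print(translate("ACCAACTGA"))                         # TN
```

The two fragments reproduce the first ten residues (MRRHLLTTTA) and the last two residues (TN) of the protein sequence listed above.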