Gene Smed_1702 details


Gene Information

Locus tag: Smed_1702
Symbol:
ID: 5322560
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1780799
End bp: 1782067
Gene length: 1269 bp
Protein length: 422 aa
Translation table: 11
GC content: 59%
IMG OID: 640790641
Product: extracellular solute-binding protein
Protein accession: YP_001327373
Protein GI: 150396906
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
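
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic is shown here. It assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Gene Information fields.
start_bp = 1780799
end_bp = 1782067

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS with one terminal stop codon, so the
# protein has one residue fewer than the number of codons.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1269
print(protein_length_aa)  # 422
```

Both values agree with the listed gene length (1269 bp) and protein length (422 aa).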


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.57478
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACGCA TCGATCTCAA AAGAATTTGC AGCGTGGCGG TGATTGCCAT GCTGGCAACG 
CCGGCGCTTG GGGAGCCCGT GACCATCGCG GTATGGATGC ATGAACATCC ACCAAGACTG
GCGCTGGACG AGAAGTTGGT CGCCGCATTC GAGCAGGCCA ATCCCGACAT AAATGTCGAC
CTGACCATTT TCCCGAACGC TCAGTTCGAG CAGCGGCTTC AGATTGGATT CGCCAGCGGC
GACGGGCCGG ACCTGTTCAA CAGTGGCTCG TTCAATATCG GGCAATATCG CCATTCCCGA
TTGTTGGCTC CGGTCGATCT GAAGACGGTG GGTGTCGATG ATTTGAATGA ACTCAAGGCG
AAATTCGGGA TCGGCATCGC CGGCGCCGAG TTTGACGGTG TCGCTTACGG ACTGCCGACG
GAGGTGAGCA ATTATGCGTG CGTCGCCAAC AATGCGCTGT GGCGTACGGC GGGGCTCGAT
CCCGCAAAAG ATGCACCGGC CACTTGGGAG GAGATGGTTG AAGTCGCCCG CAAGCTGACC
CGCCGGGATG ACGGAAATGT TCCGGTCGTA CGCGGCTTCG ACTTCAACTG GTCCGACCCG
ATCTTCATGT GGTTGACGTT CAACGCGATG GTGAACCAGC TTGGCGGCAC CGTCATCGAT
GAGGCGGCGT TGACTGCGGA TTTCGACTCG GTTCAGGTCC GCACGGTCAT GGACTTCTGG
AGCGCCTGGG CCAATGATTG GGCACTGGGC GGACCGCAAT ATACCGCGAG CCGCGATGCT
TTTCTGGCTG GCGAACTGGC TACGGAATGC ACCTTCGGCT CATGGGGACG CGATCAGTTC
AAAGCGGCAG GGATCGATTA TACCTTCTTT CCTGTACCGC GCTGGCGTGA AGGAACGGTC
GATACCGGTT TCAACGCCTA TGCCTATTAC ATGATGGTCA ATGCCAATGC CCCGCCGGAA
CGCCAGGAGG CAGCGTGGCG CTTTGCCGCG TTCTATGCGA GCCAGGGCGA GGCCCTCTTC
GAAGAAGCTG GGCTCTTCAC CACGGTGCCG GCAGTCCAGG AGCTGGAAAG CTATACATCC
GATGGTAGTA ACACGATCTT CACCGACGAG CTGGACAAGG CGGTGTTTTC GCCCCGTGTG
CCGGGTTTCA ACGAGCTTGG CGACGCTCTG GCTCGCGCCC GCGACCGGAT CGTTATAAAC
CATGAAGATG CTTCCGCCGC TCTCGGCGAG CTCGAGGCTG AGGCGTCGAC CATTCTCGGA
CGGTTTTGA
 
Protein sequence
MIRIDLKRIC SVAVIAMLAT PALGEPVTIA VWMHEHPPRL ALDEKLVAAF EQANPDINVD 
LTIFPNAQFE QRLQIGFASG DGPDLFNSGS FNIGQYRHSR LLAPVDLKTV GVDDLNELKA
KFGIGIAGAE FDGVAYGLPT EVSNYACVAN NALWRTAGLD PAKDAPATWE EMVEVARKLT
RRDDGNVPVV RGFDFNWSDP IFMWLTFNAM VNQLGGTVID EAALTADFDS VQVRTVMDFW
SAWANDWALG GPQYTASRDA FLAGELATEC TFGSWGRDQF KAAGIDYTFF PVPRWREGTV
DTGFNAYAYY MMVNANAPPE RQEAAWRFAA FYASQGEALF EEAGLFTTVP AVQELESYTS
DGSNTIFTDE LDKAVFSPRV PGFNELGDAL ARARDRIVIN HEDASAALGE LEAEASTILG
RF
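
The gene and protein sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one way such blocks might be cleaned up, checked for GC content, and translated. It is a sketch under stated assumptions, not the database's own method: the function names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments for
# internal codons and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGATACGCA TCGATCTCAA AAGAATTTGC"
print(translate(first_blocks))  # MIRIDLKRIC
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide sequence.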