Gene Smed_1751 details


Gene Information

Locus tag: Smed_1751
Symbol:
ID: 5322609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1831471
End bp: 1833087
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 60%
IMG OID: 640790689
Product: extracellular solute-binding protein
Protein accession: YP_001327421
Protein GI: 150396954
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
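
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1831471
end_bp = 1833087
gene_length_bp = 1617
protein_length_aa = 538

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the last one is assumed to be the stop codon.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks hold: 1833087 - 1831471 + 1 = 1617, and 1617 / 3 - 1 = 538.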


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.150343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0954392
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAACT TGTTACTGGC CGGCGTCTGC GCCGCCGCAC TGATGGGAAA TCCCGCATTC 
GCCGACGACA TCAAGCAGGG TGGCGAAATG ACCGTCACCT ATAAGGACGA TGTTTCGACA
CTCGATCCGG CGATCGGCTA CGACTGGCAG AACTGGTCGA TGATCAAGTC GCTGTTCGAC
GGCCTGATGG ATTATGTCCC GGGCACGACC GAGTTGCGTC CCGATCTTGC CGAAGCCTAT
GAAATCTCCG GGGACGGCAA AATCTTCACG TTCAAGCTGC GCCAGGGCGT CAAGTTTCAC
AATGGTCGTG AGCTGACTGC CGAGGACGTG AAATATTCGA TTGAGCGCGT GGTGAATCCG
ACGACCCAGA GCCCGGGTGC CGGGTTCTTC TCATCGATCA AAGGCTTCGA AGATGTCTCG
GCCGGAAAGG GGGGTGATCT GTCCGGCATC GCCGTGCAGG ATCCGCACAC AATCAGGTTC
GAACTGAGCC GGCCGGACGC CACCTTCCTC CACGTCATGG CGCTCAACTT TGCCCATGTC
GTGCCAAAGG AAGAGGTCGA GAAACACGGT GCGGATTTCG GGAAAAATCC CGTCGGTTCC
GGAGCGTTCA AGCTTGCCGA GTGGACGCTT GGGCAACGCC TGGTGTTCGA ACGCTTCGCC
GACTACTGGA ACGAAGGGCT TCCGAAGCTT GACCGCATTA CCTTCGAGGT CGGCCAGGAG
CCCGTTGTCG CGCTCCTTCG CCTGCAGAAC GGCGAAATCG ACGTGCCCGG AGACGGCATT
CCGCCGGCGA AGTTCGTCGA GGTGACCAAA GATCCTAATT TCAAGGAGCT GATCATTCAG
GGCGGTCAGT TGCACACCGG CTATGTGACG ATGAACGTCA AGATGGCCCC CTTCGACAAG
GTCGAGGTGC GCAAGGCTGT GAACATGGCC ATCAACAAGG ATCGCATCCT GCGCATCATC
AACGGTCGCG CAGTCGCCGC CAATCAGCCG CTGCCGCCCT CGATGCCGGG ATATGCCAAG
GATTATAAAG GATATGCCTA TGATCCCGAG GGCGCCAAGA AGCTGCTCGA ACAGGCCGGC
CTGGGCGACG GGTTCTCGAC CGAACTCTAT GTCATGAACA CCGACCCTCA GCCGCGTATC
GCCCAGGCCA TCCAGCAGGA CCTGAAGGCG ATCGGCATCA CGGCATCGAT AAAGTCGCTG
GCACAGGCCA ATGTCATCGC GGCGGGCGGC GAGGAGAACC AGGCGCCGAT GGTCTGGTCG
GGCGGCATGG CATGGATTGC CGACTTCCCG GATCCCTCGA ATTTCTACGG CCCCATTCTG
GGGTGCGGCG GTGCCGTGCC GGGAGGCTGG AACTGGTCCT GGTACTGCAA TGAGGAGCTC
GACAAGAAGG CAGCCGAAGC CGATGCCATC GTAGACCCGG CAAAGGCCGC CGAGCGCGAG
GCCATGTGGC GCGACATCTA TGTGAAGATC ATGGAGGACG CACCCTGGGC ACCGATCTTC
AACGAGGAGC GCTTTACCAT TCGCTCGGAG CGTATCGGCG GCGACGACAA GCTGTTCGTC
GATCCGGTCC ACATTCCCGT TCACTACGAT CAGGTATATG CAAAAGATGT GCAGTAA
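
The gene sequence above is displayed in blocks of ten bases separated by spaces, so it needs to be cleaned before any length or composition check. The sketch below shows one way to do that and to compute GC content; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a small example input.
first_line = "ATGAGGAACT TGTTACTGGC CGGCGTCTGC GCCGCCGCAC TGATGGGAAA TCCCGCATTC"
seq = clean_sequence(first_line)

print(len(seq))          # number of bases after removing the spacing
print(gc_fraction(seq))  # GC fraction of this line only
```

Passing the complete sequence block to `clean_sequence` in the same way would allow its length and GC fraction to be compared with the Gene Length and GC content fields of the record.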
 
Protein sequence
MRNLLLAGVC AAALMGNPAF ADDIKQGGEM TVTYKDDVST LDPAIGYDWQ NWSMIKSLFD 
GLMDYVPGTT ELRPDLAEAY EISGDGKIFT FKLRQGVKFH NGRELTAEDV KYSIERVVNP
TTQSPGAGFF SSIKGFEDVS AGKGGDLSGI AVQDPHTIRF ELSRPDATFL HVMALNFAHV
VPKEEVEKHG ADFGKNPVGS GAFKLAEWTL GQRLVFERFA DYWNEGLPKL DRITFEVGQE
PVVALLRLQN GEIDVPGDGI PPAKFVEVTK DPNFKELIIQ GGQLHTGYVT MNVKMAPFDK
VEVRKAVNMA INKDRILRII NGRAVAANQP LPPSMPGYAK DYKGYAYDPE GAKKLLEQAG
LGDGFSTELY VMNTDPQPRI AQAIQQDLKA IGITASIKSL AQANVIAAGG EENQAPMVWS
GGMAWIADFP DPSNFYGPIL GCGGAVPGGW NWSWYCNEEL DKKAAEADAI VDPAKAAERE
AMWRDIYVKI MEDAPWAPIF NEERFTIRSE RIGGDDKLFV DPVHIPVHYD QVYAKDVQ