Gene Smed_2315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2315 
Symbol 
ID5323176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2394167 
End bp2395534 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content62% 
IMG OID640791253 
Productextracellular solute-binding protein 
Protein accessionYP_001327982 
Protein GI150397515 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.271919 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATA GATCCAGCCT TGTTGCGGCA TCCGGCAAGT CGCGCCGGGA ATTTCTCCGC 
AACGGCGCAA CCTTCGCCGC CGCCGGACTG GCCGGCGGCC TGAGCGGCTT TCCGTTCATC
AACCGGCTGC CGGTGCGTGC CCAGGACGCC CCGTTGAAGT TCTGGCAATT TTACGCGCCC
GGCGGCCAGG TGAAGCCGCA GGTCGAATGG TTCGAGAAAA CCGTCGCCGA TTGGAACGCA
ACGCACGACC AGAAGGTCGA GCTCGAATTC ATCCCGAACA AGGAATACAT CAACGGTCCG
AAGCTCGCGA CCGCCTTCGC CTCCGGTGAC GGGCCGGACA TCTTCATCAT CTCGCCCGGC
GACTTCCTGC GCTATTACAA TGGCGGGGTT CTGCAAGATC TGACGCCCTA TATCGACGAG
AAGGCCCGGG CCGATTTCCC GGAAAGCGTG CTTGCGAACC GTATGGTCGA CGGCAAGATC
TTCGGCCTTC CGATGGAAGT CGAGCCGATG GCGATGTTCT ACTCCATCAA GGCCTTTGAG
GATGCCGGCC TCAATGAGAA TGACGTGCCG AAGACCTGGG ACGAACTGTT GGAACTGGGC
AAGAAACTGA CCACGCCGGA ACGTTATGGC CTGCTGTTCC AGACCGCGCC GGGCTATTAC
CAGAACTTCA CCTGGTATCC CTTCCTCTGG CAGGGCGGCG GCGAATTCCA GAACGCCGAG
GGAAAAAGCG CGTTCGACTC GCCCGCGACC GTGCAGGCGC TGAAGCTCTG GCAGGATGCC
GTGAATTCGG GCGCCGCACC CCGGCAGGTA CTCGGCAACG GCGCCAACGA CAGTGTCGCC
AATCTCGCCT CCGGCTATTG CGCGATCCAG AATGTCGGTA TCTGGGCCAT TTCCCAATTG
AAGAACAACG CCAAGGACTT CCCCTACGGC GTGTTCCGCT TGCCGACGCC GGCGAATGGC
AAGTACGTCA CAGTCGGCGG CGGATGGGCT TTCGTCGCCA ATTCCAAAGG CAAGAACCCG
GAAGCTGCCG GGCAGTTCTG CGCCTGGGCG CTGGCATCGA TGGATCAAGG CTCGATCGAT
CGTGTCGCGA GCTGGTGCAC CGAAGCGAAA TCCGACATGC CCCCGCGCGA CAGCGCTCTG
AAAGCACGTG AAGCGGCATT CAGCGAAGGC ATAATCGGCC AGTTCGCCAA AGAGATTCAC
CCGGGTACGC GCGCCGAGCC GCGGGTGCCG CCGGAAGTCT ACAAGATCAT CTCGGACGCC
GTACAACAGG CCATGCTCGG TGGCGCCGAC CCGCAGGCGA CCGCGACCAC GGCCTCGCAG
CGGCTCGACG CCTACCTGGC CTCCTATTCC GGCGCGCCGA TTCTTTAA
 
Protein sequence
MKNRSSLVAA SGKSRREFLR NGATFAAAGL AGGLSGFPFI NRLPVRAQDA PLKFWQFYAP 
GGQVKPQVEW FEKTVADWNA THDQKVELEF IPNKEYINGP KLATAFASGD GPDIFIISPG
DFLRYYNGGV LQDLTPYIDE KARADFPESV LANRMVDGKI FGLPMEVEPM AMFYSIKAFE
DAGLNENDVP KTWDELLELG KKLTTPERYG LLFQTAPGYY QNFTWYPFLW QGGGEFQNAE
GKSAFDSPAT VQALKLWQDA VNSGAAPRQV LGNGANDSVA NLASGYCAIQ NVGIWAISQL
KNNAKDFPYG VFRLPTPANG KYVTVGGGWA FVANSKGKNP EAAGQFCAWA LASMDQGSID
RVASWCTEAK SDMPPRDSAL KAREAAFSEG IIGQFAKEIH PGTRAEPRVP PEVYKIISDA
VQQAMLGGAD PQATATTASQ RLDAYLASYS GAPIL