Gene Smed_3812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3812 
Symbol 
ID5318366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp264933 
End bp266204 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content60% 
IMG OID640775624 
Productextracellular solute-binding protein 
Protein accessionYP_001312557 
Protein GI150375961 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.353807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTGA AGCCTTTTAT CAGGACGCTG ATTTCCTGTG CCGCTATCGT CGGTGCAATC 
GATGTTGCTG CCGCCACCGA ATTGTCGATG GCGGCCAATT CGACCGGTAA GAATCTGAGC
TTTCTGCGCG AGCAGATCGC CAGGTTCGAG AAGGAGACGG GCCATAAGGT CAATCTGGTG
ACGATGCCGG CGTCGAGCAG CGAGCAGTTC AGCCAATACC GGCTCTGGCT CGCGGCCGGC
AACAAGGACG TCGACGTCTA CCAGACGGAT GTCATCTGGG CTCCACAGCT CGCCGAGCAG
TTCGTAGACC TGACCGAGGC CACGAAGGAC GTCGTCGCCG ACCATTTCCC CTCGATCATT
CAGTCACAGA CCGTCAACGG CAAGTTGGTG GCCTTGCCCT TCTATACCGA TGCGCCGGCG
CTCTATTACC GCAAGGACCT GCTCGACAAA TACGGCAAGG CGCCGCCGAA GACCTGGGAT
GAAATGGCAG CGACGGCCAA GGAAATTCAG GAAAAGGAGC GTGCTGCCGG CAATGCCGAT
ATCTGGGGCT TCGTTTTCCA GGGCAATGCC TATGAAGGGC TCACCTGCAA CGCACTCGAG
TGGATCAAGT CCTCGGGCGG TGGCCAGATC GTCGAGCCCG ATGGCACGAT CTCCGTCAAT
AATGAGAAGG CGGCCGCGGC CGTGGAACGT GTCAAGGAAT GGATCGGCAC GATCGCGCCC
AAGGGCGTGC TTGCCTATCA GGAAGAGGAA TCGCGCGGGG TCTGGCAGAC CGGCAATGCG
GTCTTCATGC GTAACTGGCC CTATGCCTAT GCGCTCGGTA ACGGCGACGA CAGTGCCGTC
AAGGGCAAAT TCGAAGTGGC CCCGTTGCCG GCCGCCGCCG ATGGCGAGAA GCCATCTTCC
ACCCTCGGTG GATGGAATCT CGCGGTCTCG AAATATTCCG ACGAGCAGGA GGCGGCGATT
GCGTTTGTCA AATTCCTCGG GTCAGCCGAG ACGCAGAAGG TGCGCGCGAT CGAGCTCTCG
AACCTGCCGA CGATCGCTGC ACTTTACGAT GATCCGGAAA TCGCGGCCGC TCAGCCGTTC
ATGCCGCACT GGAAGCCTAT CTTCGAGAGC GCCGTGCCGC GCCCCTCGGC AGTGGCCAAG
GTGAAGTATA ACGAGGTTTC GTCCAAGTTC TGGAGCGCCG TGCACAACAC GCTTTCGGGC
AACGGAACGG CCGCGGAGAA CCTGGAACTT CTCGAAGTCG AACTGACCGA ACTCAAGGGT
GACTCCTGGT AA
 
Protein sequence
MDVKPFIRTL ISCAAIVGAI DVAAATELSM AANSTGKNLS FLREQIARFE KETGHKVNLV 
TMPASSSEQF SQYRLWLAAG NKDVDVYQTD VIWAPQLAEQ FVDLTEATKD VVADHFPSII
QSQTVNGKLV ALPFYTDAPA LYYRKDLLDK YGKAPPKTWD EMAATAKEIQ EKERAAGNAD
IWGFVFQGNA YEGLTCNALE WIKSSGGGQI VEPDGTISVN NEKAAAAVER VKEWIGTIAP
KGVLAYQEEE SRGVWQTGNA VFMRNWPYAY ALGNGDDSAV KGKFEVAPLP AAADGEKPSS
TLGGWNLAVS KYSDEQEAAI AFVKFLGSAE TQKVRAIELS NLPTIAALYD DPEIAAAQPF
MPHWKPIFES AVPRPSAVAK VKYNEVSSKF WSAVHNTLSG NGTAAENLEL LEVELTELKG
DSW