Gene Smed_3329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3329 
Symbol 
ID5324213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3526276 
End bp3527562 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content65% 
IMG OID640792280 
Productextracellular solute-binding protein 
Protein accessionYP_001328985 
Protein GI150398518 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.955635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.757839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATA AAATCAGCAG GCGAAGTGTA CTCGCGGGTG GGGCGGCACT TCTTTCGATG 
TCGGCGATGG CAAGGAGCGC CCTTGCGCAG GAGGCGCGGC TGCGCGTGCT CTGGTGGGGC
TCTCAGGCCC GGGCCGACCG GACCAACAAG GTCAACCAGC TCTTCCAGGA GCAGAATGCG
GGTGTCGCCA TCAACGGCGA ATTTCTCGGC TGGAGCGATT ACTGGCCTCG GCTCGCGACG
CAGGTCGCCG GCCGTAACGC ACCCGACATC ATACAGATGG ACTATCGCTA CATCGTCGAA
TATGCCCGGC GCGGCGCGCT CGCGCCGCTC GACGACTATC TCGGCTCCGT GCTCAAGGTC
GAGGATTTCG ACCAGGTGCA GATCAAGGGC GGCAGCGTCG ACGGCAAGCT CTACGGCATC
AGCCTCGGCG CCAACTCGGC AGCGATGATG GTCAACGCCG CCGCTTTCGA GGAAGCCGGA
GTCGACCTGC CGAGCCCATC CACCACCTGG GAAGAGATGG CAAAGATCGG CGCTGAGATC
ACCCAGGCGG GCAAGCGCAA GGGGTTCTAC GGCCTTTCCG ATGGCAGTGC CGTCGAGCCG
CTGCTCGAAA ACTGGCTTCG CCAGCGCGGC AAGGCGCTCT TCACCGCCGA AGGCAAGATC
GGTTACGATG CCAATGATGC GGCAGAATGG TTTACCATGT GGCAGAACAT GCGCGAGGCC
AAGGCGTGCG TTCCGCCGGA CGTGCAGGCG CTCGACCAGT ACACCGTGGA GACGAGCCCG
CTGTCGCTCG GCAAGTCGGC CGCCTCCTTT GCGCATTCCA ACCAGTTCGT CGCCTATCAG
GGCGTCAGCA AGGACAAGCT GGCGCTCCGC AGCCACCCGT TGATCAGCAA GGATTCGAAG
GGCGGCCATT ACCGCAAGCC GTCGATGTTC TTCTCGGTCG CGGCTCAGAC GAAGGACCCG
GAACTCGGCG CCAAATATGT CAACTTCTTC GTCAAGGATC CGAAGGCCGC AGAAATCCTC
GGCGTCGAGC GCGGCGTACC GGAATCGTCC GCCGTGCGCG AGGCTCTCGC GCCGACGCTC
GACGAGCTCG GCCGCGCCAT GCTCGACTAT GTCTCCGGCC TCGGTCCCCT TGCCGGCGAA
CTGCCGCCAC CCCCGCCGAG CGGTGCCGGC GAGGCGGAAT TCGCACTGCG CAACGTCGCC
GAACAGGTGG GCTTCGGTCA GCTCGATGCC AAGCAGGGCG GCGAGACGCT GGTGAACGAA
GTCAGCCAAA TTCTCGCGCG GGGTTAG
 
Protein sequence
MTHKISRRSV LAGGAALLSM SAMARSALAQ EARLRVLWWG SQARADRTNK VNQLFQEQNA 
GVAINGEFLG WSDYWPRLAT QVAGRNAPDI IQMDYRYIVE YARRGALAPL DDYLGSVLKV
EDFDQVQIKG GSVDGKLYGI SLGANSAAMM VNAAAFEEAG VDLPSPSTTW EEMAKIGAEI
TQAGKRKGFY GLSDGSAVEP LLENWLRQRG KALFTAEGKI GYDANDAAEW FTMWQNMREA
KACVPPDVQA LDQYTVETSP LSLGKSAASF AHSNQFVAYQ GVSKDKLALR SHPLISKDSK
GGHYRKPSMF FSVAAQTKDP ELGAKYVNFF VKDPKAAEIL GVERGVPESS AVREALAPTL
DELGRAMLDY VSGLGPLAGE LPPPPPSGAG EAEFALRNVA EQVGFGQLDA KQGGETLVNE
VSQILARG