Gene Smed_4017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4017 
Symbol 
ID5318826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp473294 
End bp474601 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content61% 
IMG OID640775825 
Productextracellular solute-binding protein 
Protein accessionYP_001312758 
Protein GI150376162 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGCATC ACTTCATGGG CGGCAGTGCT TGGCCGCAGG GAAATGGGGA GGACATCGTG 
AGGCTGAAAC GTCTTCTGAT CGCCGGCGCC GCTGCCGCTC TGGCAGCAAT GCCGGCTTCC
GCCGGAGAGC TTTCGTTCTG GCACGCTTAT GCGGGTCAGC AGGACAAGGT CGAATTCATC
GACTTCGCTC TCGGCGAATT CGCAAAGGCC CATCCCGAGG TCAAGCTCGA AGTGGTCGCT
GCCGAGCAAT CGGCCTACAA GACGAAGCTC AACACCGCCA TGGCCTCGGG CAATCCTCCG
GATGTCTTCT ACACGCTGCC TGGCGGCTTC CTGAACGCCT TCGTCAAAGG CGGGCAGATG
TATGCCCTGG ACGAGGAACT CGCCAGGGAC GGGTGGCGTG ACAGTTTCCT TGAAAGCGCA
ATCTCCCAGA CCAGCAAGGA CGGTCACACC TATGCCGTCC CCGTAGACGT GGATTCGGTG
GTGTTCTGGT ACGACAAGGC CCGTTTCGCC GAGAACGGCT GGACGGTGCC GAAGACCTAT
GAAGAGCTGC TCGCCCTCGC CGAGAAGGTG AAAGGTGAAG GGCTTGTCCC CTTTTCGCTC
GGCAACAAGG ATTCCTGGCC GGCAACGTTC TGGTTCCAGT ATCTCGAAAT GCGGCTCAAG
GGCTCGGGCG TCGTCTCGGC TTTCGTGAAC GGGGATCCGG ATGCGACGCT GGGCGCCGAG
GCGACGAAGG CGATGGAGAA GCTCGCCGAA CTCGCGAAGC AAGAGTATTT CCCGATCGGA
TTCAACGGCA TGAGCGATCA GGAAGCCAAT ATGCTCTTCC TCAACGGTCA GGCTGCAATG
ATGCTGAACG GCACGTGGCA GATAGGCGCA TCGGCGGACG CGCCCGAAGG CTTCGAGCTT
GGCTATTTCG CCTTCCCCGC TGTCGCAGGC GGGGCCGGGG ACCAGTCCGA CGTGCTGGCC
GGCGTCGCTG CGAGTTTCGG CGTTTCGCAG AAGGCGGAGA ATAAGGCAGA CGCGGTCACC
CTGCTGAAGT TCCTGACCTC GCGCGAGGTG ATGACCAAAT ATGTCGAGTT GCGCAAGACG
ATGGTGACCG TCAAGGACGC CACCACCGAA ACGGCCGCCG GGCCAGTCCT CTACGATATC
AGCAACAAGC TGATGAAGGC TGCCGGCCAC CTCGATCCTT TCTACGACAC CGCCATGCCG
CCCGCAGCGA CGAACATCTA TTACACCTCG CTGCAAGGAG TGCTCGATGG CTCGCTGCCG
CCCGCGGATG CGGCCAAGCG CATCGAAGAC GCATTGCGGG CGAAGTAA
 
Protein sequence
MWHHFMGGSA WPQGNGEDIV RLKRLLIAGA AAALAAMPAS AGELSFWHAY AGQQDKVEFI 
DFALGEFAKA HPEVKLEVVA AEQSAYKTKL NTAMASGNPP DVFYTLPGGF LNAFVKGGQM
YALDEELARD GWRDSFLESA ISQTSKDGHT YAVPVDVDSV VFWYDKARFA ENGWTVPKTY
EELLALAEKV KGEGLVPFSL GNKDSWPATF WFQYLEMRLK GSGVVSAFVN GDPDATLGAE
ATKAMEKLAE LAKQEYFPIG FNGMSDQEAN MLFLNGQAAM MLNGTWQIGA SADAPEGFEL
GYFAFPAVAG GAGDQSDVLA GVAASFGVSQ KAENKADAVT LLKFLTSREV MTKYVELRKT
MVTVKDATTE TAAGPVLYDI SNKLMKAAGH LDPFYDTAMP PAATNIYYTS LQGVLDGSLP
PADAAKRIED ALRAK