Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4017 |
Symbol | |
ID | 5318826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 473294 |
End bp | 474601 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640775825 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312758 |
Protein GI | 150376162 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCATC ACTTCATGGG CGGCAGTGCT TGGCCGCAGG GAAATGGGGA GGACATCGTG AGGCTGAAAC GTCTTCTGAT CGCCGGCGCC GCTGCCGCTC TGGCAGCAAT GCCGGCTTCC GCCGGAGAGC TTTCGTTCTG GCACGCTTAT GCGGGTCAGC AGGACAAGGT CGAATTCATC GACTTCGCTC TCGGCGAATT CGCAAAGGCC CATCCCGAGG TCAAGCTCGA AGTGGTCGCT GCCGAGCAAT CGGCCTACAA GACGAAGCTC AACACCGCCA TGGCCTCGGG CAATCCTCCG GATGTCTTCT ACACGCTGCC TGGCGGCTTC CTGAACGCCT TCGTCAAAGG CGGGCAGATG TATGCCCTGG ACGAGGAACT CGCCAGGGAC GGGTGGCGTG ACAGTTTCCT TGAAAGCGCA ATCTCCCAGA CCAGCAAGGA CGGTCACACC TATGCCGTCC CCGTAGACGT GGATTCGGTG GTGTTCTGGT ACGACAAGGC CCGTTTCGCC GAGAACGGCT GGACGGTGCC GAAGACCTAT GAAGAGCTGC TCGCCCTCGC CGAGAAGGTG AAAGGTGAAG GGCTTGTCCC CTTTTCGCTC GGCAACAAGG ATTCCTGGCC GGCAACGTTC TGGTTCCAGT ATCTCGAAAT GCGGCTCAAG GGCTCGGGCG TCGTCTCGGC TTTCGTGAAC GGGGATCCGG ATGCGACGCT GGGCGCCGAG GCGACGAAGG CGATGGAGAA GCTCGCCGAA CTCGCGAAGC AAGAGTATTT CCCGATCGGA TTCAACGGCA TGAGCGATCA GGAAGCCAAT ATGCTCTTCC TCAACGGTCA GGCTGCAATG ATGCTGAACG GCACGTGGCA GATAGGCGCA TCGGCGGACG CGCCCGAAGG CTTCGAGCTT GGCTATTTCG CCTTCCCCGC TGTCGCAGGC GGGGCCGGGG ACCAGTCCGA CGTGCTGGCC GGCGTCGCTG CGAGTTTCGG CGTTTCGCAG AAGGCGGAGA ATAAGGCAGA CGCGGTCACC CTGCTGAAGT TCCTGACCTC GCGCGAGGTG ATGACCAAAT ATGTCGAGTT GCGCAAGACG ATGGTGACCG TCAAGGACGC CACCACCGAA ACGGCCGCCG GGCCAGTCCT CTACGATATC AGCAACAAGC TGATGAAGGC TGCCGGCCAC CTCGATCCTT TCTACGACAC CGCCATGCCG CCCGCAGCGA CGAACATCTA TTACACCTCG CTGCAAGGAG TGCTCGATGG CTCGCTGCCG CCCGCGGATG CGGCCAAGCG CATCGAAGAC GCATTGCGGG CGAAGTAA
|
Protein sequence | MWHHFMGGSA WPQGNGEDIV RLKRLLIAGA AAALAAMPAS AGELSFWHAY AGQQDKVEFI DFALGEFAKA HPEVKLEVVA AEQSAYKTKL NTAMASGNPP DVFYTLPGGF LNAFVKGGQM YALDEELARD GWRDSFLESA ISQTSKDGHT YAVPVDVDSV VFWYDKARFA ENGWTVPKTY EELLALAEKV KGEGLVPFSL GNKDSWPATF WFQYLEMRLK GSGVVSAFVN GDPDATLGAE ATKAMEKLAE LAKQEYFPIG FNGMSDQEAN MLFLNGQAAM MLNGTWQIGA SADAPEGFEL GYFAFPAVAG GAGDQSDVLA GVAASFGVSQ KAENKADAVT LLKFLTSREV MTKYVELRKT MVTVKDATTE TAAGPVLYDI SNKLMKAAGH LDPFYDTAMP PAATNIYYTS LQGVLDGSLP PADAAKRIED ALRAK
|
| |