Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3329 |
Symbol | |
ID | 5324213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3526276 |
End bp | 3527562 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640792280 |
Product | extracellular solute-binding protein |
Protein accession | YP_001328985 |
Protein GI | 150398518 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.955635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.757839 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCATA AAATCAGCAG GCGAAGTGTA CTCGCGGGTG GGGCGGCACT TCTTTCGATG TCGGCGATGG CAAGGAGCGC CCTTGCGCAG GAGGCGCGGC TGCGCGTGCT CTGGTGGGGC TCTCAGGCCC GGGCCGACCG GACCAACAAG GTCAACCAGC TCTTCCAGGA GCAGAATGCG GGTGTCGCCA TCAACGGCGA ATTTCTCGGC TGGAGCGATT ACTGGCCTCG GCTCGCGACG CAGGTCGCCG GCCGTAACGC ACCCGACATC ATACAGATGG ACTATCGCTA CATCGTCGAA TATGCCCGGC GCGGCGCGCT CGCGCCGCTC GACGACTATC TCGGCTCCGT GCTCAAGGTC GAGGATTTCG ACCAGGTGCA GATCAAGGGC GGCAGCGTCG ACGGCAAGCT CTACGGCATC AGCCTCGGCG CCAACTCGGC AGCGATGATG GTCAACGCCG CCGCTTTCGA GGAAGCCGGA GTCGACCTGC CGAGCCCATC CACCACCTGG GAAGAGATGG CAAAGATCGG CGCTGAGATC ACCCAGGCGG GCAAGCGCAA GGGGTTCTAC GGCCTTTCCG ATGGCAGTGC CGTCGAGCCG CTGCTCGAAA ACTGGCTTCG CCAGCGCGGC AAGGCGCTCT TCACCGCCGA AGGCAAGATC GGTTACGATG CCAATGATGC GGCAGAATGG TTTACCATGT GGCAGAACAT GCGCGAGGCC AAGGCGTGCG TTCCGCCGGA CGTGCAGGCG CTCGACCAGT ACACCGTGGA GACGAGCCCG CTGTCGCTCG GCAAGTCGGC CGCCTCCTTT GCGCATTCCA ACCAGTTCGT CGCCTATCAG GGCGTCAGCA AGGACAAGCT GGCGCTCCGC AGCCACCCGT TGATCAGCAA GGATTCGAAG GGCGGCCATT ACCGCAAGCC GTCGATGTTC TTCTCGGTCG CGGCTCAGAC GAAGGACCCG GAACTCGGCG CCAAATATGT CAACTTCTTC GTCAAGGATC CGAAGGCCGC AGAAATCCTC GGCGTCGAGC GCGGCGTACC GGAATCGTCC GCCGTGCGCG AGGCTCTCGC GCCGACGCTC GACGAGCTCG GCCGCGCCAT GCTCGACTAT GTCTCCGGCC TCGGTCCCCT TGCCGGCGAA CTGCCGCCAC CCCCGCCGAG CGGTGCCGGC GAGGCGGAAT TCGCACTGCG CAACGTCGCC GAACAGGTGG GCTTCGGTCA GCTCGATGCC AAGCAGGGCG GCGAGACGCT GGTGAACGAA GTCAGCCAAA TTCTCGCGCG GGGTTAG
|
Protein sequence | MTHKISRRSV LAGGAALLSM SAMARSALAQ EARLRVLWWG SQARADRTNK VNQLFQEQNA GVAINGEFLG WSDYWPRLAT QVAGRNAPDI IQMDYRYIVE YARRGALAPL DDYLGSVLKV EDFDQVQIKG GSVDGKLYGI SLGANSAAMM VNAAAFEEAG VDLPSPSTTW EEMAKIGAEI TQAGKRKGFY GLSDGSAVEP LLENWLRQRG KALFTAEGKI GYDANDAAEW FTMWQNMREA KACVPPDVQA LDQYTVETSP LSLGKSAASF AHSNQFVAYQ GVSKDKLALR SHPLISKDSK GGHYRKPSMF FSVAAQTKDP ELGAKYVNFF VKDPKAAEIL GVERGVPESS AVREALAPTL DELGRAMLDY VSGLGPLAGE LPPPPPSGAG EAEFALRNVA EQVGFGQLDA KQGGETLVNE VSQILARG
|
| |