Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5858 |
Symbol | |
ID | 5320160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 820037 |
End bp | 821344 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640777553 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314485 |
Protein GI | 150377890 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.437643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGA CAGTGGCCGG ACTAATGACC GGCATCGGTT TCATGTTTGC CTGCGGAACA TCGGCGCAGT CGCAAGAACT GACGATCTTC TGGGCGGAAT GGGATCCAGC CAACTACCTC CAGGAGCTTG TAAACGAGTA CGAAGCCGAG ACCGGTGTGA CGATCACCGT GGAGACGACA CCTTGGGCGG ATTTCCAGAC GAAGGCCTTC ACCGAGTTCA ATGCCAAGGG ATCAGCCTAT GATATGGTCG TCGGCGACTC TCAATGGATC GGGGCCGCGT CGGAGGCCGG CCACTACGTG GATCTCACGG AGTTCTTCAA CAAGCACAAG CTAAAGGAGG TAATGGCCCC GGCGACCGTG AAGTACTACG CGGAGTACCC CGCAAACTCC GGTAAATACT GGTCGATACC GGCCGAAGGC GACGCTGTCG GTTGGTCCTA TCGTAAGGAT TGGTTTGAAG ATCCAAAGGA AATGGAGGCC TTCAAGGCGA AGTATGGCTA TGACCTTGCT CCTCCGAAGG ATTGGAAACA ACTGCGTGAC ATCGCCGAAT TCTTCCATCG TCCGGACCAG AAGCGCTACG GCATCGCAAT CTACACCGAC AACTCCTATG ACGGATTGGT CATGGGCGTA GAGAACGCCA TCTTCTCTTT CGGTGGAGAA CTCGGCGACT ACAGCACCTA CAAGGTGGAC AGCATCATCA ACTCGGAGAA GAACGTCAAG GCTCTGGAAA CTTACCGCGA GCTCTATGGT TTCACGCCTC CGGGCTGGGC CAAGTCTTTC TTTGTCGAGA ACAACCAGGC TATCACCGAG AACCTGGCAG CGATGAGCAT GAACTACTTC GCCTTCTTCC CGGCGCTGGT CAATGAAGCT TCGAACCCGA ACGCGAAGGT CACCGGCTTC TTCGCCAATC CCGCCGGTCC AGACGGGGAC CAGTATGCCG CTCTTGGCGG CCAGGGCATT TCGATTGTCT CCTATTCCCA GAACAAGGAA GAGGCGATGA AATTTCTCGA ATGGTTCATC AAGGACGAGA CACAGAAGCG CTGGGCCGAG CTCGGCGGCT ATACGGCAAG CGCCAAGGTT CTGGAGTCGG AAGAGTTCCA GAACGCGACC CCATACAACA AGGCATTTTA CGAGACCATG TTCCGGGTGA AGGACTTCTG GGCAACACCG GAATATGCCG AGTTGCTGAT CCAGATGAAC CAGCGCATCT ATCCTTATGT CACCGCGGGC CAGGGCACGG CAAAGGAGGC GCTCGACGCA CTTGCTGAGG ACTGGAATGC AACCTTCAAG AAGTACGGCC GCCACTAA
|
Protein sequence | MRKTVAGLMT GIGFMFACGT SAQSQELTIF WAEWDPANYL QELVNEYEAE TGVTITVETT PWADFQTKAF TEFNAKGSAY DMVVGDSQWI GAASEAGHYV DLTEFFNKHK LKEVMAPATV KYYAEYPANS GKYWSIPAEG DAVGWSYRKD WFEDPKEMEA FKAKYGYDLA PPKDWKQLRD IAEFFHRPDQ KRYGIAIYTD NSYDGLVMGV ENAIFSFGGE LGDYSTYKVD SIINSEKNVK ALETYRELYG FTPPGWAKSF FVENNQAITE NLAAMSMNYF AFFPALVNEA SNPNAKVTGF FANPAGPDGD QYAALGGQGI SIVSYSQNKE EAMKFLEWFI KDETQKRWAE LGGYTASAKV LESEEFQNAT PYNKAFYETM FRVKDFWATP EYAELLIQMN QRIYPYVTAG QGTAKEALDA LAEDWNATFK KYGRH
|
| |