Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4644 |
Symbol | |
ID | 5319289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1153481 |
End bp | 1154710 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640776442 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313374 |
Protein GI | 150376778 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0265502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0904039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAATCG CTTACAAAGC TACCTGTTTC ACGCTTGCCT TGCTCGGATC GACCGCCCTC GGCGGCTTTG CCGCACAGGC TGCCGATCAG GAGATCAGCT GGATCTATTG CGGCGACAAG ATTGATCCGA TCCACGAGAA ATACATCAAG GAATGGGAGG GCAAGAATCC CGGTTGGAAG GTCACGCCTG AAGTCGTCGG TTGGGCCCAG TGCCAGGACA AGGCGACGAC GCTCGCCGTG GCCGGCACGC CGGTCGGCAT GGCCTATGTG GGTTCCCGCA CCCTCAAGGA ATTCGCCGAG AACGAGCTGA TCATCCCGGT TCCAATGACC GAGGAAGAGA AAAAGAGCTA TTACCCGAAC ATCGTCGAGA CGGTGACCTT CAACGACAAT CAGTGGGGCG TGCCGATCGC CTTTTCGACC AAGGCGCTCT ACTGGAACAA GGACCTATTC AAGCAGGCGG GCCTCGACCC GGAGACGCCG CCGAAGAGCT GGGCCGAAGA GATCGCCTTT GCGAAGCAAA TCAAGGAAAA GACCGGAATT GCAGGTTACG GCCTTCCCGC CAAGACATTC GACAACACGA TGCATCAATT CATGCATTGG GTTTACACCA ATAACGGCAA GGTTATCGAT GGCGACGAGA TCGTCATGGA CAGCCCGGAA GTGCTCGCGG CGCTGCAGGC CTACAAGGAC CTTGTGCCCT ATTCTGTCGA AGGGGCGACG GCCTACGAGC AGAACGAGAT TCGCGCCATT TTCCTCGACG GCAAGGTGGG CATGATCCAG TCCGGCTCAG GCGCCGCCGC ATTGCTGAAG GATACCAAGA TCAACTGGGG CATCGCGCCG CTGCCGCTCG GCCCTTCGGC CAAGGGCGAA GGCACGCTCC TCATCACCGA CAGCCTGGCG ATCTTCCAGG GCACGGGCGT CGAGGAAAAG GCGATCGAGT TTGCCAAGTT CATCACATCT CCCGGTCCGC AGGGCGAGTA TGAACTGCAG GGAGGCGCAG GGCTGACACC GCTGCGCCCG TCGCCGATGG TCGATGAGTT CGTAAAGGCC GACCCGTCGT GGAAGCCATT CATCGACGGG ATCGCCTATG GAGGTCCGGA GCCGCTCTTC AAGGACTACA AGGGCTTCCA GAACGCTATC ATCGACATGG TCCAGTCGGT CGTGACCGGC AAGTCCGAGC CGGTGGACGC TTTGAAGAAG GCGGCAGCTG ACATCGAACA GTATAAGTAA
|
Protein sequence | MTIAYKATCF TLALLGSTAL GGFAAQAADQ EISWIYCGDK IDPIHEKYIK EWEGKNPGWK VTPEVVGWAQ CQDKATTLAV AGTPVGMAYV GSRTLKEFAE NELIIPVPMT EEEKKSYYPN IVETVTFNDN QWGVPIAFST KALYWNKDLF KQAGLDPETP PKSWAEEIAF AKQIKEKTGI AGYGLPAKTF DNTMHQFMHW VYTNNGKVID GDEIVMDSPE VLAALQAYKD LVPYSVEGAT AYEQNEIRAI FLDGKVGMIQ SGSGAAALLK DTKINWGIAP LPLGPSAKGE GTLLITDSLA IFQGTGVEEK AIEFAKFITS PGPQGEYELQ GGAGLTPLRP SPMVDEFVKA DPSWKPFIDG IAYGGPEPLF KDYKGFQNAI IDMVQSVVTG KSEPVDALKK AAADIEQYK
|
| |