Gene Information |
Locus tag | Rleg_5385 |
Symbol | |
ID | 8007343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 797285 |
End bp | 798559 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644822289 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973549 |
Protein GI | 241113714 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
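The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record, although both are consistent with its values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is part of the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(797285, 798559)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```

Both results agree with the Gene Length and Protein Length fields of the record.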
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.832628 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATA TCATGTCATT TGGAATTCTG GCTTCCACGG TGCTGGCCTT CGCTTCGCCG GTGCTGGCCC AGACCGTTTT CGTATCCACC CAGCTTCGTC CGATCGAGGA GGCGACCGTC GTTCGCGAGG AACTCCTGAA GGACGTCGGC TCGGTTGACT ACGTGGTCGA GGAGCCGCCG CAGTTTGCGG TGCGCATGGA AGCCGAGCGT CAGGCCGGCA AGCACACCGT CAGCCTCGTC GGTGCACTGC ATGGCGAGCT CTCGCCGCTC GCCGACAAGG ACACGCTCGA GCCGCTTGAC GATCTCGCCA AGAAGTTGGC AGCGAGCGGC ATGCCGCAGT CATTGCTCGA CCTCGGCAAG CTCGGCAAGT CGACCCAGCA GTACATCCCG TGGATGCAGG CGACCTATGT GATGGCCGCC AAGAAGGAAG CGCTGCAATA CCTGCCTGCC GGCGCCGACG TGAACGCGCT GAACTACGAT CAGTTGATCG AGTGGGGCAA GAATATGCAG GACGCAACCG GCCAGCCGCA GATCGGCTTT CCCGCCGGCC CGAAAGGCCT GATGGCGCGT TATTTCCAGG GCTATTTCTA TCCCTCCTTC ACAGGCGGCG TCGTGCGTAC TTTCCAGAGT GCCGATGCGG CCGCCGGCTG GGAGAAGCTG AAGGTGCTGT GGGCCTATGT GACACCGAAC TCGACCAGCT ACGACTTCAT GCAGGAGCCG CTCGCGGCCG GCGAAGTCAT GGTTGCCTGG GACCACATCG CCCGGCTGAA GAATGCTATT TCCGCAGCAC CGGATGATTA TGTGGTCTTC CCCGCTCCCG CCGGTCCGAA GGGCCGCGGC TATATGCCGG TCGTTGCCGG TCTCGCCATT CCGAAGGGCG CGCCTGACAA GGCCGGCGCA GAAAAAATCA TCGAGCACCT GTCCATGCCG GACACGCAGC TCCTGACCGC CTCCAAGGTC GGCTTCTTCC CGACCCTCAA CGTCAAGCTG CCGCCGGATC TCGATGCCGG CGTCGCCCTG CTCGCCGGTG CCGTCACCGC CACCCAGGCC TCCAAGGACG CGGTCATCTC GCTGCTCCCG GTCGGCCTCG GCGACAAGGG CGGCGAGTTC AACAAGGTCT ACATGGACAG TTTCCAGCGC ATCGTACTGC AGAACGAGCC CGTCGCAGAC GTCCTGAAGG CCCAAGGCGC GACAATGGCC AAGCTGATGG CCGATACGAA AGCTGCCTGC TGGGCGCCCG ATGCCAAGAG CGACGGCCCC TGCCCAGTCG AATAA
|
Protein sequence | MKHIMSFGIL ASTVLAFASP VLAQTVFVST QLRPIEEATV VREELLKDVG SVDYVVEEPP QFAVRMEAER QAGKHTVSLV GALHGELSPL ADKDTLEPLD DLAKKLAASG MPQSLLDLGK LGKSTQQYIP WMQATYVMAA KKEALQYLPA GADVNALNYD QLIEWGKNMQ DATGQPQIGF PAGPKGLMAR YFQGYFYPSF TGGVVRTFQS ADAAAGWEKL KVLWAYVTPN STSYDFMQEP LAAGEVMVAW DHIARLKNAI SAAPDDYVVF PAPAGPKGRG YMPVVAGLAI PKGAPDKAGA EKIIEHLSMP DTQLLTASKV GFFPTLNVKL PPDLDAGVAL LAGAVTATQA SKDAVISLLP VGLGDKGGEF NKVYMDSFQR IVLQNEPVAD VLKAQGATMA KLMADTKAAC WAPDAKSDGP CPVE
|
| |
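The protein sequence and the GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard compact amino-acid string (translation table 11 uses the same codon assignments as the standard code and differs only in which codons may act as start codons; this gene starts with ATG, so the difference does not matter here), strips the 10-base spacing used in the record, and translates up to the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only short fragments of the gene are used as input; to check the full record, paste the complete gene sequence into the same functions.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, in the record's block layout.
head = "ATGAAACATA TCATGTCATT TGGAATTCTG"
print(translate(head))  # MKHIMSFGIL

# Last 9 bases of the gene sequence (two codons plus the TAA stop).
tail = "GTCGAATAA"
print(translate(tail))  # VE
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above. Calling `gc_fraction` on the complete gene sequence gives the value to compare with the GC content field.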