Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3119 |
Symbol | |
ID | 6981864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3187665 |
End bp | 3189020 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643397829 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002282612 |
Protein GI | 209550695 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCCA AGCTTCTCGG CGCCGTTGGC GCCCTTCTCG CTACAGCGTT TCTTGCCGGT CCCGCTGCCG CCGCCGATAA GACCAAGATT GACTTCTGGT TCGGCAATTC CGGTGACATC GCAAAGCGTG TCCAGGAACA GTGCGATCGC TTCAACCAGT CGCAGGCCGA TTACGAAGTC GTCTGCACCA GCCAGGGCAG CTACGACGCG TCCTTGCAGA ACACCATTGC CGCCTTCCGC GCCGGCAAGC AGCCGACCAT CGCTCAGGTT TCCGACGCCG GCACGCTCGA CATCATGCTT TCCGGCGCCT ACTACCCGGC CAATCAGCTG ATGACCGACA TGGGCTACAC GGTCGACTGG AAAGATTACT TCTCCGGCAT CGCCAACTAT TATGCGACGT CCAAGGGCGA GATGTATTCC TTCCCCTTCA ACTCCTCGAC CGCATTGCTC TACTGGAACA AGGATGCCTT CGCCAAGATC GGCAAGGACC ATGCTCCGGC CACCTGGCAG GAAGCCGGCG AAGATTTCAA GGCTCTGAAG GATGCAGGTT ATGCTTGCCC GCTCGCCTTC GACATCTCCA ACAACGAAGT CTGGCAATAT GTCGAGCAGT TCGAAGCCGT TAACGGCGAA GCGATCGCGA CGAAGAAGAA CGGCTTTGAA GGCCTCGACG CCGAGCTGAC CTACAACAAG AACCCGCTGC TCGTCAGCTA CATCAAGGAC CTCAAGTCCT GGTACGACAA CAAGCTGGCT TTCATCAAGA ACAAGGCCGT CGGCCAGACC TTCGTCGAAG CCTTCGCCGC CGGCGATTGC CAGGTTATCC TCACCTCGGT CGGCGACCAC GGCAATATCG GCCGCACCGC CAAGCAGGGC ATGAACTGGG GCGTTGCCAT GCTCCCGACC TACGGCACTG CAACCCGCCA CAGCTCCTAT GTCGGCGGCG CTTCGCTCTG GGTTCTGAAG GGTCACACCG ACGCCGAATA CAAGGCTGCC GCTGCCTTTT TCAACTTCAT CGCAAAGCCG GAAGAAGCCC TGACCTGGTC GACGGTCACC GGCTACATCC CGGTTCGTAA CTCCGGCTTC GAATATCTCA AGAAGCAGGG CTTCTACGAC AAGGCTCCTT ATGCCGGCCG TGAACTCGCC ATTCAGAGCC TGACGGCATC GCCTGCCGAC GACACGGCGC CGCACGGCAT CCGTCTCGGC GGCCTGCTTC AGGTCCGCAC CGAAATCGCC AACGGCCTGC AGGCGATCTT CGTCAACAAT GCCGACGTCC AGGCTTCGCT CGACGGCGCT GCCGAACGCG GCAACCAGCT GCTGCGCCGC TTCCAGCAGA CCTACAAGAA CGTTCAGCTT CCTTGA
|
Protein sequence | MQAKLLGAVG ALLATAFLAG PAAAADKTKI DFWFGNSGDI AKRVQEQCDR FNQSQADYEV VCTSQGSYDA SLQNTIAAFR AGKQPTIAQV SDAGTLDIML SGAYYPANQL MTDMGYTVDW KDYFSGIANY YATSKGEMYS FPFNSSTALL YWNKDAFAKI GKDHAPATWQ EAGEDFKALK DAGYACPLAF DISNNEVWQY VEQFEAVNGE AIATKKNGFE GLDAELTYNK NPLLVSYIKD LKSWYDNKLA FIKNKAVGQT FVEAFAAGDC QVILTSVGDH GNIGRTAKQG MNWGVAMLPT YGTATRHSSY VGGASLWVLK GHTDAEYKAA AAFFNFIAKP EEALTWSTVT GYIPVRNSGF EYLKKQGFYD KAPYAGRELA IQSLTASPAD DTAPHGIRLG GLLQVRTEIA NGLQAIFVNN ADVQASLDGA AERGNQLLRR FQQTYKNVQL P
|
| |