Gene Information |
Locus tag | Rleg_3391 |
Symbol | |
ID | 8014268 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3406846 |
End bp | 3408201 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825949 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977176 |
Protein GI | 241206080 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
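
Note: the coordinate, gene length, and protein length fields above are mutually consistent. The sketch below shows the arithmetic. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3406846, 3408201)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the Gene Length (1356 bp) and Protein Length (451 aa) fields.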
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.297931 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCCA AGCTTCTCGG CGCCGTTGGC GCCCTTCTCG CTACAGCGTT TCTTGCTGGT CCCGCTGCCG CCGCCGATAA GACCAAGATC GACTTCTGGT TCGGCAATTC CGGGGACATC GCAAAGCGTG TCCAGGAGCA GTGCGATCGC TTCAACCAGT CGCAGGCCGA CTACGAAGTC GTCTGCACCA GCCAGGGCAG CTATGACGCC TCCCTGCAGA ACACCATCGC CGCCTTCCGC GCCGGCAAGC AGCCGACCAT CGCCCAGGTC TCCGACGCCG GCACCCTCGA CATCATGCTC TCCGGCGCCT ACTACCCGGC AAACAAGCTG ATGACCGACA TGGGCTATAC CGTCGACTGG AAGGACTATT TCTCCGGTAT CTCCGGCTAT TACGCGACAT CGAAGGGCGA GATGTACTCC TTCCCCTTCA ACTCCTCGAC CGCTCTTCTC TACTGGAACA AGGACGCCTT CGCCAAGATC GGCAAGGATC ATGCTCCGGC AACCTGGAAG GAAGCAGGCG AGGACCTCAA GGCTCTGAAG GATGCAGGCT ATGCTTGCCC ACTCGGCTTT GACATCTCCA ACAATGAAGT CTGGCAGTAC ATCGAGCAGT TCGAAGCCGT CAACGGCGAA GCGATCGCCA CGAAGAAGAA CGGCTTTGAA GGTCTGGACG CCGAGCTGGT GTTCAACAAG AACCCGCTTC TCGTCAGCTA CGTCAAGGAT CTCAAGTCCT GGTACGACGA CAAGCTTGTC GTCATCAAGA ACAAGGCTGT CGGCCAGACC TTCGTCGAAG CCTTTGCCGC CGGCGATTGC CAGGTCATCC TGACCTCGGT CGGCGACCAC GGCAATGTCG GCCGCACCGC CAAGCAAGGC ATGAACTGGG ACGTTGCCAT GCTCCCGACC TACGGCGACG CAGCCCGTCA CAGCTCTTAC GTCGGCGGCG CTTCGCTCTG GGTTCTGCAG GGTCACTCCG ACGCCGAATA CAAGGCTGCC GCTGCCTTCT TCAACTTCAT CGCAAAGCCG GAAGAAGCTC TTACCTGGTC GACCGTTACC GGCTACATCC CGGTTCGCAA CTCCGGTTTC GAATATCTGA ATAAGCAGGG CTTCTACGGC AAGGCGCCTT ATGCCGGCCG CGAACTCGCC ATTCAGAGCC TGACCGCTTC TCCGGCTGGC GATGCGGCTC CGCAGGGCAT CCGCCTCGGT GGCCTGCTGC AGGTCCGCAC CGAAATCGCC AATGGTCTGC AGGCAATCTT CGTTAACAAT GCCGATGTCC AGGCTTCGCT CGACAGTGCT GCCGAACGCG GCAATACGCT GCTCCGTCGC TTCCAGCAGA CCTACAAGAA CGTTCAGCTC CCCTGA
|
Protein sequence | MQAKLLGAVG ALLATAFLAG PAAAADKTKI DFWFGNSGDI AKRVQEQCDR FNQSQADYEV VCTSQGSYDA SLQNTIAAFR AGKQPTIAQV SDAGTLDIML SGAYYPANKL MTDMGYTVDW KDYFSGISGY YATSKGEMYS FPFNSSTALL YWNKDAFAKI GKDHAPATWK EAGEDLKALK DAGYACPLGF DISNNEVWQY IEQFEAVNGE AIATKKNGFE GLDAELVFNK NPLLVSYVKD LKSWYDDKLV VIKNKAVGQT FVEAFAAGDC QVILTSVGDH GNVGRTAKQG MNWDVAMLPT YGDAARHSSY VGGASLWVLQ GHSDAEYKAA AAFFNFIAKP EEALTWSTVT GYIPVRNSGF EYLNKQGFYG KAPYAGRELA IQSLTASPAG DAAPQGIRLG GLLQVRTEIA NGLQAIFVNN ADVQASLDSA AERGNTLLRR FQQTYKNVQL P
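
Note: the sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs only in which codons may act as starts). The helper names are hypothetical, and the snippet only handles uppercase A/C/G/T.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order for first, second, and third base.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the display spacing (and any other whitespace) from a sequence."""
    return "".join(seq.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons, stopping before the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        a, b, c = (BASES.index(x) for x in s[i:i + 3])
        aa = AMINO[16 * a + 4 * b + c]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = "ATGCAAGCCA AGCTTCTCGG CGCCGTTGGC"
print(translate(prefix))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the protein sequence (MQAKLLGAVG). Running it on the full gene sequence is a quick check against the listed protein.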