Gene Information |
Locus tag | Rleg_2426 |
Symbol | |
ID | 8013408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2428728 |
End bp | 2430053 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825007 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002976237 |
Protein GI | 241205141 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
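The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2428728
END_BP = 2430053
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the reported Gene Length and Protein Length fields.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2430053 - 2428728 + 1 gives 1326 bp, and 1326 / 3 - 1 gives 441 aa, matching the table.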
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.741734 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAA CATCGAAAGC CCTTTTCGGG CTGGCCACGG CATTTGTCAT GTCGTCGGCA TTGCCCAATC TCGCCAAAGC CGATGAACTG ACCCTGTGCT GGGCCGCATG GGACCCCGCC AATGCGCTGG TCGAGCTCTC GAAGGATTTC ACCGCCAAGA CCGGCACGCA GATGAAGTTC GAATTCGTTC CCTGGACGAG TTATGCCGAT CGCTTCCTCA ACGAGCTGAA TTCGCACGGC AAGCTCTGCG ACCTGATCAT CGGCGACAGC CAGTGGATCG GCGGCTCGGC AGAGAACGGC CATTACGTCA AGCTCAACGA CTTCTTCGAC AAGGAAGGCA TCAAGATGGA TGACTTCGTG CCGGCGACGG TCGTCGGCTA CTCGGAATGG CCGAAGAACA CCCCGAACTA CTGGGCGCTG CCCGCCATGG GCGACGTCGT CGGCTGGACC TACCGCAAGG ACTGGTTCGA GAAGCCGGAA CTGCAGAAGG AATTCAAGGA GAAATACGGC CACGATCTCG CAGCGCCGAA GACCTACGAC GAACTGAAGC AGATCGCCGA GTTCTTCCAG AAGCGTGAGA TCGACGGCAA GACCGTCTAC GGCGCCTCGA TCTATACCGA GCGCGGCTCC GAAGGCATCA CCATGGGCGT CACCAACGTG CTCTACGACT GGGGCTTCCA GTACGAGAAC CCGAAGAAGC CCTATGACAT GGAAGGCTTC GTCAACTCGG CCGACGCGGT CAAGGGCCTC GAATTCTACA AGTCGCTCTA TGATTGCTGC ACCCCGCCCG GCAGCTCCAA CGTCTACATG GTCGAATCCG CCGACGCCTT CAAATCCGGC CAGGTCGCCA TGCAGATGAA CTTCGCCTTC ACTTGGCCCG GCCTTTACAA GGACGAGAAG GTCGGCGGCG ACAGGATCGG CTTCTTCCCC AATCCGGCTG AAAAGGCGCA TTTCGCCCAG CTCGGCGGCC AGGGCATCTC GGTGGTCTCC TATTCCGACA AACGCGATGC CGCCCTGCAA TACATCAAGT GGTTCGCACA GCCCGATGTA CAGGCCAAAT GGTGGGAACT CGGCGGTTTT TCCTGCCTGA ACTCCGTCGT CAATGCGCCA GGCTTTGCCA AGAGCCAGCC CTATGCCCAG GCCTTCCTGG ACTCGATGGC GATCGTCAAG GATTTCTGGG CCGAGCCGAG CTACGCCTCG CTGCTGCAGG CCATGCAGAA GCGCGTCCAT AATTACGTGG TCGCCGGCAA CGGCACTGCC AAGGAAGCGC TCGACGGTCT GGTGAAAGAC TGGAGCGACG TCTTCAAGGA CGACGGCAAG ATCTGA
|
Protein sequence | MQKTSKALFG LATAFVMSSA LPNLAKADEL TLCWAAWDPA NALVELSKDF TAKTGTQMKF EFVPWTSYAD RFLNELNSHG KLCDLIIGDS QWIGGSAENG HYVKLNDFFD KEGIKMDDFV PATVVGYSEW PKNTPNYWAL PAMGDVVGWT YRKDWFEKPE LQKEFKEKYG HDLAAPKTYD ELKQIAEFFQ KREIDGKTVY GASIYTERGS EGITMGVTNV LYDWGFQYEN PKKPYDMEGF VNSADAVKGL EFYKSLYDCC TPPGSSNVYM VESADAFKSG QVAMQMNFAF TWPGLYKDEK VGGDRIGFFP NPAEKAHFAQ LGGQGISVVS YSDKRDAALQ YIKWFAQPDV QAKWWELGGF SCLNSVVNAP GFAKSQPYAQ AFLDSMAIVK DFWAEPSYAS LLQAMQKRVH NYVVAGNGTA KEALDGLVKD WSDVFKDDGK I
|
| |
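The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below is one possible way to strip the spacing, compute a GC fraction, and translate a coding sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as starts), and it does not handle alternative start codons or ambiguous bases. All function names are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(raw):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw):
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence listed above.
head = "ATGCAGAAAA CATCGAAAGC CCTTTTCGGG"
tail = "GACGACGGCA AGATCTGA"
print(translate(head))  # MQKTSKALFG
print(translate(tail))  # DDGKI
```

The two printed fragments correspond to the first ten and last five residues of the protein sequence in the record. Passing the full gene sequence string to `clean` and `len` would allow the same kind of check against the reported gene length.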