Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6112 |
Symbol | |
ID | 8016069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | + |
Start bp | 151585 |
End bp | 152859 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644827418 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002978618 |
Protein GI | 241258734 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.521338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTCA AACTTCTCGC TGCGACCGCA GCTGTCGCAG TGCTTGCTTC CGGCTCCGCA TACGCGCAGT CGGCCAATCT TACCATCTGG AGCTGGAATG TCGCCGCGTC GGCGTTGAAG TCCACGCTTC CAGGCTTCAA CAAACAGTTT CCCGATATCA AGATCACCGT CGAGGACCTC GGCAACAGCC AGGTCTTCGA TAAGACGCTC GCTGCCTGCG CCGCCGGCGG CGACGGTTTG CCCGACATCG TCAGCATCGA GAATTTCGAG GCTGAAATCT TCTGGAGCCG TTTCCCGGAT TGCTTCGCCA ATCTGAAGGA GCTCGGCTAC ACAGCCGATA TCCAGGCGAA ATTCCCTGAT TTCAAGCGCA CCGAGCTTGA AGTCGGCGAT GTCGCCTACG CCATGCCGTG GGATTCCGGT CCTGTCGCCG TCTTCTACCG CCGCGATCTC TACGAAAAGG CCGGCGTCGA TCCGAGCACG ATCAGCACCT GGGACGATTT CATCGCTGCC GGCAAGAAGA TTTCCGCCGC CAATCCCGGC GTCGTCATGG CCCAGGCCGA CTTCAACGGC GACAGCGAAT GGTTTCGCAT GATCGCCAAC GAACAGGGTT GCGGCTATTA CTCGACCGAC GGTCAGAATA TCACCATCAA CCAGCCGGCC TGCGTCGCCA CGCTGCAAAA GGTGAAGGAG ATGAAGGATG CCGGCACGCT GACGGCGGCC AACTGGGAAG AAAAAATCCA GGCCGATACC GCCGGCAAGG CCGCAAGCCA GCTTTATGGC GGCTGGTATG AGGGCACGGT GCGCTCGACC TCTCCCGATC TCAAGGGCAA ATGGGGTGTC TACAGAATGC CGAGCCTGAC GGCAGATGGT CCGCATGCGG CCAATCTCGG CGGTTCGTCG CTCGCCATTT CGGCGACATC CGCGAATAAG GAAGCCGCCT GGAAATTCGT CAACTACGCC CTCGGCACGG ATGAGGGCCA GATCACCATG CTGAAGGAAT TCGGTCTGGT CCCGTCGCTG CTTTCGGCTG AGAAGGATCC CTTCGTCAAT GAGCCGCAGC CCTATTGGGG CGGCCAGAAG GTCTGGGCGG ATATTCTGGC GACACTGCCG AAGATCGTAC CGAGCCGCGG CACCGCCTTC CAGAGCGATG CGGAAGCCAT CTTCAAGGCG ACGCAGACGA AGTTCTTCGC TGGCGGCTAT CCCGATGCGA AGGCGGCTCT CGACGATGCC GCCAACCAGA TCGCTTCGGC GACCGGCCTT CCGATCGCGC AATGA
|
Protein sequence | MRFKLLAATA AVAVLASGSA YAQSANLTIW SWNVAASALK STLPGFNKQF PDIKITVEDL GNSQVFDKTL AACAAGGDGL PDIVSIENFE AEIFWSRFPD CFANLKELGY TADIQAKFPD FKRTELEVGD VAYAMPWDSG PVAVFYRRDL YEKAGVDPST ISTWDDFIAA GKKISAANPG VVMAQADFNG DSEWFRMIAN EQGCGYYSTD GQNITINQPA CVATLQKVKE MKDAGTLTAA NWEEKIQADT AGKAASQLYG GWYEGTVRST SPDLKGKWGV YRMPSLTADG PHAANLGGSS LAISATSANK EAAWKFVNYA LGTDEGQITM LKEFGLVPSL LSAEKDPFVN EPQPYWGGQK VWADILATLP KIVPSRGTAF QSDAEAIFKA TQTKFFAGGY PDAKAALDDA ANQIASATGL PIAQ
|
| |