Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0151 |
Symbol | |
ID | 6978861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 147731 |
End bp | 148720 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643394862 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_002279679 |
Protein GI | 209547762 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00856305 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0288368 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCG TGAAATCCCT TCTGTCGCGC CGCGCCTTTA CCGCGCTCGC CGGCGCCGCC GTCATCGCCA CGGCGATGCC GGTCACGTCC TTTGCCGCCG ACGTGACGAT CCCGATCATC GTCAAGGACA CGACGTCCTT CTACTGGCAG ATCGTTCTGG CCGGCGCCCG CAAGGCCGGC AAGGATCTCG GCGTCAACGT GCCGGAACTC GGCGCCCAGG CGGAATCCGA CATCAACGGC CAGATCAGCA TTCTGGAGAA CGCCGTTGCC GGCAAACCGG CCGCCGTCGT CATTTCGCCG ACCGAATTCA AGGCGCTCGG CAAGCCGATC GATGAAGCCG CCAAGTCGGT TCCGATCATC GGCATCGATT CAGGCGCCGA CTCCAAGGCG TTCAAGTCGT TCCTGACGAC CGACAACGTC CAGGGCGGTC GCATCGCCGC CGACGGTCTT GCCGCCGCCA TCAAGGAGAT GACGGGCAAG GAAGAAGGCG AAATCGTCAT CCTCACCAAT CTTCCCGGCG TCGGCTCGCT GGAGCAGCGC CGCGAAGGCT TCCTGGATCA GATCAAGACC AAGCATCCGG GCCTCAAGGT CATCGCCGAC AAGTATGGCG ACGGCCAGGC AACGACCGGC CTCAACATGA TGACCGACCT GATTACGGCC AATCCGAAGC TCGTCGGCGT CTTCGCCTCG AACCTGATCC TGGCGCAGGG TGTTGGTCAG GCGATCGCCG AAAACAAGCT CGGCGACAAG ATCAAGGTCA TCGGCTTCGA CAGCGACGAC AAGACGGTCG GCTTCCTCAA GGACGGCGCC ATTGCCGGTC TCGTCGTCCA GGACCCCTAC CGCATGGGTT ATGACGGCAT CAAGACCGCG CTTGCCGTCT CCAAGGGCGA GAAGGTCGAA GCCAATGTCG ATACCGGCGC CAACCTCGTC ACCAAGGCGA ATATGGCCGA TCCGAAGATC GACGCCCTGC TGAACCCGAA GATCAAGTAA
|
Protein sequence | MTIVKSLLSR RAFTALAGAA VIATAMPVTS FAADVTIPII VKDTTSFYWQ IVLAGARKAG KDLGVNVPEL GAQAESDING QISILENAVA GKPAAVVISP TEFKALGKPI DEAAKSVPII GIDSGADSKA FKSFLTTDNV QGGRIAADGL AAAIKEMTGK EEGEIVILTN LPGVGSLEQR REGFLDQIKT KHPGLKVIAD KYGDGQATTG LNMMTDLITA NPKLVGVFAS NLILAQGVGQ AIAENKLGDK IKVIGFDSDD KTVGFLKDGA IAGLVVQDPY RMGYDGIKTA LAVSKGEKVE ANVDTGANLV TKANMADPKI DALLNPKIK
|
| |