Gene Information |
Locus tag | Rleg2_6524 |
Symbol | |
ID | 6983594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | + |
Start bp | 195018 |
End bp | 196292 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643399520 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002284276 |
Protein GI | 209552361 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
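The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
START_BP = 195018
END_BP = 196292
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming an unspliced CDS
    that ends with exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1275
print(coding_codons(GENE_LENGTH_BP))   # 424
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.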
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.514278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.451088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTCA AACTTCTCGC TGCGACCGCC GCTGTCGCAC TGCTTGCTTC CGGCTCTGCA TTCGCTGAGT CGGCCAATCT GACCATCTGG AGCTGGAATG TCGCCGCGTC GGCATTGAAG TCCACGCTTC CGGGCTTCAA CAAACAGTTC CCCGATATCA AGATCACCGT CGAGGACCTC GGCAACAGCC AGGTCTTCGA CAAGACGCTG GCTGCCTGCG CCGCCGGCGG CGACGGCTTG CCCGATATTG TCAGCATCGA GAATTTCGAG GCTGAAATCT TCTGGAGCCG TTTCCCGGAT TGCTTTGCCA ATCTGAAGGA GCTTGGCTAC ACCCCCGAGA TCCAGGCGAA ATTTCCTGAC TTCAAGCGCA CAGAGCTCGA AGTCGGCGAT GTCGCCTATG CCATGCCGTG GGATTCCGGC CCGGTCGCGG TCTTTTACCG CCGCGACATG TACGAAAAGG CCGGTGTCGA TCCGAGCACG ATCAGCACCT GGGACGATTT CATCGCCGCC GGCAAGAAGA TTTCCGCCGC CAATCCCGGC GTCGTCATGG CCCAGGCCGA TTTCAACGGC GACAGCGAAT GGTTTCGCAT GATCGCCAAC GAACAGGGCT GCGGCTATTA TTCGACCGAC GGCCAGAATA TCACCATCAA CCAGCCGGCC TGCGTCGCCT CGCTGCAAAA GGTGAAGGAA ATGAAGGACG CCGGCACGCT GACGGCGGCC AACTGGGACG AAAAAATCCA GGCCAATACC GCCGGAAAGG CCGCCAGCCA GCTCTATGGG GGCTGGTACG AGGGCACCGT GCGCTCGACC TCTCCCGATC TCAAGGGCAA GTGGGGCGTC TACAGGATGC CGAGCCTGAC GGCTGACGGC CCGCATGCGG CCAATCTCGG CGGTTCGTCG CTCGCCATTT CGGCGACTTC GGCCAACAAG GAAGCCGCCT GGAAGTTCGT CAACTACGCG CTAGGCACCA ATGAAGGCCA GATCACTATG CTGAAGGAAT TCGGCCTGGT GCCGTCGCTG CTTTCGGCAG AAAAGGACCC CTTCATCAGC GAACCGCAGC CCTATTGGGG CGGCCAGAAG GTCTGGGCCG ATATCCTGGC GACGCTGCCG AAGATCGTGC CGAGCCGCGG TACCGCTTTC CAGAGCGATG CCGAAGCGAT CTTCAAGGCA ACGCAGACGA AGTTCTTCGC CGGCGGTTAT CCCGACGCGA AGGCAGCCCT CGACGATGCC GCCAAGCAGA TCGCTTCGGC GACCGGGCTT CCAGTGGCGC AATGA
|
Protein sequence | MRFKLLAATA AVALLASGSA FAESANLTIW SWNVAASALK STLPGFNKQF PDIKITVEDL GNSQVFDKTL AACAAGGDGL PDIVSIENFE AEIFWSRFPD CFANLKELGY TPEIQAKFPD FKRTELEVGD VAYAMPWDSG PVAVFYRRDM YEKAGVDPST ISTWDDFIAA GKKISAANPG VVMAQADFNG DSEWFRMIAN EQGCGYYSTD GQNITINQPA CVASLQKVKE MKDAGTLTAA NWDEKIQANT AGKAASQLYG GWYEGTVRST SPDLKGKWGV YRMPSLTADG PHAANLGGSS LAISATSANK EAAWKFVNYA LGTNEGQITM LKEFGLVPSL LSAEKDPFIS EPQPYWGGQK VWADILATLP KIVPSRGTAF QSDAEAIFKA TQTKFFAGGY PDAKAALDDA AKQIASATGL PVAQ
|
| |
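The sequences in this record are laid out in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of that clean-up plus a GC-fraction calculation of the kind that could reproduce the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as a toy input.
example = "ATGCGCTTCA AACTTCTCGC"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value to compare against the reported GC content; the result is not quoted here because it was not computed for this sketch.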