Gene Information |
Locus tag | Rleg2_5713 |
Symbol | |
ID | 6977104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 113404 |
End bp | 114708 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393170 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002277988 |
Protein GI | 209546098 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
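
The coordinate and length fields above are consistent with one another, which a short check can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 113404
end_bp = 114708

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1305 434
```

Under these assumptions the computed values match the Gene Length (1305 bp) and Protein Length (434 aa) fields.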
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.285791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGATCA AGAGACGTGA ATTTCTTGCT GCCTCGGCAG CCGTTGCCGG CGCTGCCGGC TTCGGCATCA AGCCATCTTT TGCGCAGGCC GAACCGACCT ACACGCCGGA AAGCGGTGCC AGCCTTCGCC TGCTTCGCTG GACGCCCTTC GTCAAGGGCG ACGAGGAGGC CTGGCTTGCC AACACCAAGA AATTCACCGA AGCGACCGGC GTCGAGGTGC GCATCGACAA GGAGAGCTGG GAAGACATCC GCCCGAAGGC TGCAGTCGCC GCGAATGTCG GCTCCGGCCC GGATCTCATC ATGTGCTGGT TCGACGACGC GCATCAGTAT CCGGACAAAC TGGTCGATCT CACCGAACTC GGCAATTATC TCGGCAACAA GTATGAGGGC TGGTACGACG GCGTGAAGGG TTATGCCACC CGCGGTGACA CCTTCATCGC CATGCCGCTG ACGGCGATCG GCAATGCGGT GGTCTATCGC GACAGCCATG TGAAGGCCGC CGGTTTCAAC GAATTCCCGA ACGATACGGC AGGCTTCCTC GAGCTTTGCA AGGCGATGAA GGCAAAGGGT ACACCGGCCG GCTTCCCGCA CGGCAAGGCG GTCGGCGACG GCAACAATTA CGCCCATTGG CTGCTGTGGA GCCATAACGG CATGATGGTC GACGAGGGCG GCAAGGTGAC GATCAACAGC CCGGAGACGC TCGCTTCGAT CAACTATGCC AAGGAGCTCT ACGCGACCTT CATTCCGGGC ACGGAAAGCT GGCAGGACGT CAACAACAAC CGCGCCTTCC TGGCCGGCCA GGTCTCGCTG ATCGCCAATG GCGTCTCGGT CTATTACACG GCCAAGAACG ATCCGAAGCT CGCCGAGATC GCCAAGGATA TCCGCACGAC GAACTTCCCG ATCGGCCCTG TCGGCAAGAG CGTCGAGCTT TGCCAGACGA GCTCGCTGCT TCTCTTCAAG CACAGCAAAT ATCCGGAAGC GGCCAAGGCT TACATCAAGT TCATGATGGA GGCCGACCAG ATGAACGCCT GGATCCAGGG CTCCAGCGCC TATTGCTGCC AGCCGCTCAA GGCCTTCGCC AAGAACCCGA TCTGGACGGC CGATCCGGTC CATGCGCCTT ATGCGCGCGC TTCGGAAAAA CTGCGCCCGA ACGGCTATGC CGGCCCGCTC GGCTACGCCT CGGCGGCGAC CATGGCCGAC TATGTTCTGG TCGACATGTA TGCCGCCGCC GTTACCGGCC AGATGTCGCC GGAGGATGCG ATGAAGGAAG CTGAACGCCG GGCAAACCGT TACTATCGCG TCTGA
Protein sequence | MTIKRREFLA ASAAVAGAAG FGIKPSFAQA EPTYTPESGA SLRLLRWTPF VKGDEEAWLA NTKKFTEATG VEVRIDKESW EDIRPKAAVA ANVGSGPDLI MCWFDDAHQY PDKLVDLTEL GNYLGNKYEG WYDGVKGYAT RGDTFIAMPL TAIGNAVVYR DSHVKAAGFN EFPNDTAGFL ELCKAMKAKG TPAGFPHGKA VGDGNNYAHW LLWSHNGMMV DEGGKVTINS PETLASINYA KELYATFIPG TESWQDVNNN RAFLAGQVSL IANGVSVYYT AKNDPKLAEI AKDIRTTNFP IGPVGKSVEL CQTSSLLLFK HSKYPEAAKA YIKFMMEADQ MNAWIQGSSA YCCQPLKAFA KNPIWTADPV HAPYARASEK LRPNGYAGPL GYASAATMAD YVLVDMYAAA VTGQMSPEDA MKEAERRANR YYRV
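
The start of the protein sequence can be reproduced from the gene sequence with a codon lookup. The sketch below assumes the amino acid assignments of NCBI translation table 11, which match the standard code (the two tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper name `translate` is hypothetical, and only the first three 10-base groups of the gene sequence are translated.

```python
from itertools import product

# Codon order (bases T, C, A, G) and amino acid string as laid out in the
# NCBI genetic code tables. Assumption: table 11 shares these assignments
# with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base groups of the gene sequence in the record.
print(translate("ATGACGATCA AGAGACGTGA ATTTCTTGCT"))  # MTIKRREFLA
```

The output matches the first 10-residue group of the protein sequence above. The function does not handle ambiguous bases such as N and would raise a `KeyError` on them.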