Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5476 |
Symbol | |
ID | 6978570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1123428 |
End bp | 1124675 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643394576 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002279394 |
Protein GI | 209547476 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.487316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCA GAAAATTCAG CGTGACCATG CGGGCTGCCG TGGCGGTGTG GGCGCTTTGC GCCACCTCGG CATTCGCCGA CACCACCATC GAATTCATTC AATGGTGGGA ACCGGAAATG CCCTCCGGCG CACTGCGCGG CATCATGAAC GATTTCGAGG CGGCAAATCC TGGCATCAAG GTGACGCTCG TCAGCGGTCC CTATGCCACG ACCCGTGACC AGATCGTCGT CGGCGCCGCC TCCGGGACCT TAAGCGACGT CGTCGGCCTG GATGGCGCCT GGGTCAACGG TCTCTCCAAG CAGGGCGCGA TCGCCTCCAT GGACGAACTG ATGGAGAAGG CGAAATACGA CAAGAGCCAG ATCACCGATA TCGTCAAAGT CGACGGCAAG AGCGTGATGT TCCCGCTGGC TTCCTTCGTC TACCCGGTCT TCGTCAACCT CGATATCGCC AAGGCGTCGG GCGTCGACAA GCTGCCGACC ACGCGCACGG AATTTGCCGA AGCCGCCAAG AAGATGACCG ACGCGTCAAA GAACCAGTAT GGCTGGGTTC TGCCGCTATC GCTGCAGTCT CCGAGCGGAA TCCAGAACGA CGTGATGTCC TGGGTCTGGG CCTCCGGCGC CTCGATGATG AAGGACGGCA AGCCTGACCT CGAAAATGAG GCGGTCGTCG GCACGCTCGA TTATCTGGCT TCCCTCAACA AGGAAGGCGT TATTTCTCCC GGCATTTTCG CCAAGAAGGA ACAGGACAAG GTCGAGGAAT TCGTCAACGG CCGCGTCGGC ATGATGGTCG ATTCCCTCGC CCATGTGAAT CTCATCCGCG AACGCAATCC GAAGCTCAAG TTCGGCATCT CCGCCTTGCC GGCCACCGAC GGCTATACCG GCAAACGCGG CATGCCCTAT GCGTCCTGGG GCATCGGCAT CAGCGAGGGC AGCAAGCATA AGGAAGAAGC CTGGAAGCTG GTGGAATATC TGATGAGCCC ACACGTCAAC GGCCGCCTGG TCTCGATTGC CAACGCCTTC CCCGGCAACG TCCACGCCAA GCCGGACTTC GTGGCATCGG ACCCGATCTT CGCCGAAGCT TTCAAAATCT TCCAGAGCGG CTATCCTGCC AACGAATTCG TCGGCCTTCC GGTTGCCGAA GAGCTGATGC GCGACATGAA CGTCGAAGTT CAGAAGATGT TCGACGGCGG CCAGTCGGCC AAGGACGCGG CTGCTAATAC CGAGAAAGCC TGGCTCGCGA AGTTCTGA
|
Protein sequence | MNIRKFSVTM RAAVAVWALC ATSAFADTTI EFIQWWEPEM PSGALRGIMN DFEAANPGIK VTLVSGPYAT TRDQIVVGAA SGTLSDVVGL DGAWVNGLSK QGAIASMDEL MEKAKYDKSQ ITDIVKVDGK SVMFPLASFV YPVFVNLDIA KASGVDKLPT TRTEFAEAAK KMTDASKNQY GWVLPLSLQS PSGIQNDVMS WVWASGASMM KDGKPDLENE AVVGTLDYLA SLNKEGVISP GIFAKKEQDK VEEFVNGRVG MMVDSLAHVN LIRERNPKLK FGISALPATD GYTGKRGMPY ASWGIGISEG SKHKEEAWKL VEYLMSPHVN GRLVSIANAF PGNVHAKPDF VASDPIFAEA FKIFQSGYPA NEFVGLPVAE ELMRDMNVEV QKMFDGGQSA KDAAANTEKA WLAKF
|
| |