Gene Information |
Locus tag | Rleg_5213 |
Symbol | |
ID | 8007108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 623753 |
End bp | 625000 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644822122 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973382 |
Protein GI | 241113547 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
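The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the record uses 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 623753
end_bp = 625000
gene_length_bp = 1248
protein_length_aa = 415


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 625000 - 623753 + 1 gives 1248 bp, and 1248 / 3 - 1 gives 415 aa, matching the Gene Length and Protein Length fields.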
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.506772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCA GAAAATTCGG CGTGACGATG CGGGCTGCCG TAGCGGTATG GGCGCTCTGC GCCACATCCG CGTTTGCCGA CACCACCATC GAGTTCATTC AATGGTGGGA ACCCGAAATG CCGTCCGGCG CCTTGCGCGG CATTATGAAC GATTTCGAAG CCAAAAATCC CGGCATCAAG GTGACGCTTG TCAGCGGTCC CTATGCCACG ACGCGTGACC AGATCGTCGT CGGCGCCGCC TCGGGAACGC TCAGCGACGT GGTCGGCCTC GATGGTGCCT GGGTAAACGG GCTCGCCAAG CAGGGCGCGA TCGCCTCCAT GGACGAACTG ATGGAGAAGG CGAAATACGA CAAGAGCCAG ATCACCGATA TCGTCAAGGT CGATGGCAAG AGCGTGATGT TCCCGCTGGC ATCCTTCGTC TACCCGGTCT TCGTGAACCT GGATATCTCC AAGGCCGCCG GCGTCGACAA GTTGCCGACC ACGCGCACGG AATTCGCTGA AGCCGCGAAG AAGATGACCG ACGCCTCGAA GAACCAGTAC GGCTGGGTTC TGCCGCTATC GCTGCAGTCT CCGAGCGGAA TCCAGAACGA CGTCATGTCC TGGGTCTGGG CCTCCGGCGC CTCGATGCTG AAGGACGGCA AGCCTGACCT CGAAAATGAC GCCGTCGTCG GCACGCTCGA TTATCTTGCC TCGCTCAACA AGGAAGGCGT GATTTCTCCC GGCATCTTCG CCAAGAAGGA ACAGGACAAG GTCGAAGAAT TCGTCAACGG CCGCGTCGGC ATGATGGTGG ATTCCCTCGC CCATGTGAAC CTCATCCGCG AGCGCAATCC GAAGCTCAAC TTCGGCATCT CCGCCTTGCC GGCCACTGAC GGCTACACCG GCAAGCGCGG CATGCCCTAT GCCTCCTGGG GCATCGGCAT CAGCGAAGGC AGCCAGCACA AGGAAGAAGC CTGGAAGCTG GTCGAATACC TGATGAGCCC TGACGTTAAC GGCCGTCTCG TCTCGATTGC CAATGCCTTC CCCGGCAACG TCCATGCCAA GCCGGACTTC GTGGCCTCGG ACCCGATTTT CGCGGAAGCT TTCAAAATCT TCCAGAGCGG CTATCCTGCC AACGAATTCG TCGGTCTTCC GGTTGCCGAA GAGCTGATGC GCGACATGAA CGTCGAAGTC CAGAAGATGT TCGACGGCGG GCAGTCGGCT AAGGACGCAG CTGCCAATAC CCAGAAGGCG TGGCTCGCGA AGTTCTAA
Protein sequence | MNIRKFGVTM RAAVAVWALC ATSAFADTTI EFIQWWEPEM PSGALRGIMN DFEAKNPGIK VTLVSGPYAT TRDQIVVGAA SGTLSDVVGL DGAWVNGLAK QGAIASMDEL MEKAKYDKSQ ITDIVKVDGK SVMFPLASFV YPVFVNLDIS KAAGVDKLPT TRTEFAEAAK KMTDASKNQY GWVLPLSLQS PSGIQNDVMS WVWASGASML KDGKPDLEND AVVGTLDYLA SLNKEGVISP GIFAKKEQDK VEEFVNGRVG MMVDSLAHVN LIRERNPKLN FGISALPATD GYTGKRGMPY ASWGIGISEG SQHKEEAWKL VEYLMSPDVN GRLVSIANAF PGNVHAKPDF VASDPIFAEA FKIFQSGYPA NEFVGLPVAE ELMRDMNVEV QKMFDGGQSA KDAAANTQKA WLAKF
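The sequence fields are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, to compute GC content, and to translate the gene sequence. It makes several assumptions: the gene sequence is already given in coding orientation (it begins with ATG and ends with TAA, so no reverse complement is applied despite the minus strand), and the codon-to-amino-acid assignments of translation table 11 match the standard genetic code, with the tables differing only in permitted start codons. The helper names are hypothetical, and only the first 30 bases of the gene are used as sample input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (standard assignments,
# assumed here to be the same as those of translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_field):
    """Join the space-separated blocks of a sequence field into one string."""
    return "".join(seq_field.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
gene_prefix = "ATGAATATCA GAAAATTCGG CGTGACGATG"
print(translate(gene_prefix))  # MNIRKFGVTM, the first block of the protein sequence
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is a quick way to check that the two fields agree; `gc_percent` on the full field can likewise be compared with the rounded GC content value reported in the record.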