Gene Information |
Locus tag | Rleg2_2834 |
Symbol | |
ID | 6981578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2884190 |
End bp | 2885770 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643397546 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002282330 |
Protein GI | 209550413 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
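The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2884190
END_BP = 2885770
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # True
print(computed_protein_length == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the record is internally consistent: 2885770 - 2884190 + 1 = 1581 bp, and 1581 / 3 - 1 = 526 aa.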
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.360597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGT TCACGAAAAA ATTTCTCGCC TCCGCAATGT TTGGCACATT GCTGGCGTTT TCGGCGCATG CGGCCACGCT GAATATTCAC AATGGCGGCG ACCCGCAATC GCTCGATCCG CAGAAGCTGT CCGGCGATTG GGAGAATCGT ATCGCCGGCG ACATTTTCGA AGGCTTGGTC ACGGAAGACG CCAAGGACAA TCCGGTCCCC GGCCAGGCCG AAAGCTGGAC GATTTCACCT GATGGCAAGG TTTACACCTT CAAGCTTCGC GACGGCATCA AATGGTCCGA TGGCCAGCCG GTAACGGCAG GAGACTTCGT CTTCGCCTTC CAGCGCCTCG TCGACCCGAA GAACGCCGCC GACTATGCCT ATCTGCAGTT CACCATCAAA AATGCCGAAA AGATCAACAA GGGTGAGATT ACCGATCTCA ATCAGCTCGG CGTCAAGGCA ATCGACGACA AGACGCTTGA AATCACCCTC GAAAACCCGA CCCCCTATTT CCTCAACGCT CTGATGCACT ACACCGCCTA TCCGCTGCCC AAGCACGTGG TCGAGGCGAA GGGCCAGGAT TGGGTCAAGA TCGGCAATAT CGTCACCAAC GGACCTTACA AGCCGGTCGA GTGGGTTCCG GGCTCGCATG TCACCACGGT CAAAAACGAC CAGTGGTACG GCACCAAGGA CCTGAAGATC GACGGTGCCA AGTTCTTCGT GCTCGAGGAT CAGGAAGCGG CACTGAAACG TTACCGCGCC GGCGAATTCG ATATCCTCAC CGATTTCCCC ACCGACCAGT ATGAGTGGAT GAAGAAGAAC CTGCCGGGCC AGGCACATGT CGCTCCCTTC TCCGGCCTCT ATTACTACGT CATCAATTCG ACCAAGCCGC CCTTCGCCGA CAAGCGCGTG CGCCAGGCTC TCTCCATGGC GATCAACCGC GAAGTCATCG GCCCGCAGAT TCTCGGCACC GGCGAACTGC CGGCCTATTC CTGGGTCCCG CCAGGCACGG CAAACTACGG CGAACCGGCC TACGTCTCCT GGAAGGATCT TCCCTACAAG GACAAGGTCG AAGAAGCCAA GAAGCTGCTG AAGGAAGCCG GTTTCGGTCC GGATCATCCG CTGACAGCCG AGCTCAAATA CAACACCAAC GACAATCACA AGCGCATCGC CGTGGCGATC GCCTCGATGT GGAAGCCGCT TGGCGTCAAT GTCGAACTCG TCAATGCCGA GACCAAGGTC CATTATGACC AGTTGCAGCG CGGCGAAGTG CAGATCGGCC GCGCCGGCTG GCTGGCCGAC TATAACGACC CAGACAACTT CCTGAACCTC CTGGTCACAG GCGTCCAGAT GAATTACGGC CGCTGGTCCA ATCCTGACTA CGACAAGATG ATCAAGGACG GCAACGCCGA GACCGATCTC GCCAAGCGCG CCGCAATCTT CAAGAAAGCC GAACAGCTGG CACTGGATGA TTCCGCCGCC CTGCCGATCT ACTATTATGT TTCGAAGAAC GTCGTCTCAC CGAAGATCGA AGGCTTCGTC GATAACATCC AGGACATCCA CCGCACCCGC TGGCTGTCGA TGAAAGAGTA A
|
Protein sequence | MNQFTKKFLA SAMFGTLLAF SAHAATLNIH NGGDPQSLDP QKLSGDWENR IAGDIFEGLV TEDAKDNPVP GQAESWTISP DGKVYTFKLR DGIKWSDGQP VTAGDFVFAF QRLVDPKNAA DYAYLQFTIK NAEKINKGEI TDLNQLGVKA IDDKTLEITL ENPTPYFLNA LMHYTAYPLP KHVVEAKGQD WVKIGNIVTN GPYKPVEWVP GSHVTTVKND QWYGTKDLKI DGAKFFVLED QEAALKRYRA GEFDILTDFP TDQYEWMKKN LPGQAHVAPF SGLYYYVINS TKPPFADKRV RQALSMAINR EVIGPQILGT GELPAYSWVP PGTANYGEPA YVSWKDLPYK DKVEEAKKLL KEAGFGPDHP LTAELKYNTN DNHKRIAVAI ASMWKPLGVN VELVNAETKV HYDQLQRGEV QIGRAGWLAD YNDPDNFLNL LVTGVQMNYG RWSNPDYDKM IKDGNAETDL AKRAAIFKKA EQLALDDSAA LPIYYYVSKN VVSPKIEGFV DNIQDIHRTR WLSMKE
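The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be reproduced from the gene sequence. The helper names are hypothetical. The codon table is the standard one, whose amino acid assignments are shared by translation table 11 (the tables differ in their permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Compact codon table in TCAG order (standard code; same amino acid
# assignments as translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean("ATGAACCAGT TCACGAAAAA ATTTCTCGCC")
print(len(prefix))        # 30
print(translate(prefix))  # MNQFTKKFLA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through `clean`, `gc_percent`, and `translate` in the same way would allow the reported GC content and protein sequence to be checked against the record.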