Gene Information |
Locus tag | Rleg_3103 |
Symbol | |
ID | 8014011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3102504 |
End bp | 3104084 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644825670 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002976898 |
Protein GI | 241205802 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
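The length fields above are consistent with the coordinates, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate interval."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3102504, 3104084)
print(length)                  # 1581
print(protein_length(length))  # 526
```

Both results match the Gene Length and Protein Length fields of this record.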
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.529212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.177978 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCATT TCGCGAAAAA ATTTCTCGCC TCTGCAATGC TTGGCACATT GCTGGCGTTT TCGGCCCACG CGGCCACGCT CAACATTCAC AATGGTGGCG ACCCGCAGTC GCTCGATCCG CAGAAGCTTT CCGGCGACTG GGAGAACCGC ATCGCCGGCG ACATTTTTGA AGGCCTCGTC ACTGAGGACG CCAAGGATAA TCCGATCCCC GGCCAGGCTG AAAGCTGGAC AATTTCGCCT GACGGCAAGG TCTACACCTT CAAGCTTCGC GACGGCATCA AGTGGTCCGA TGGCCAGCCG GTAACGGCAG GAGACTTCGT CTTCGCCTTC CAGCGCCTCG TCGACCCTAA GAACGCCGCC GACTACGCTT ATCTCCAGTT CACCATCAAG AACGCGGAAA AGATCAACAA GGGTGAGATC ACCGATCTCA ACCAGCTCGG CGTCAAGGCG ATCGACGACA AGACGCTCGA AATCACCCTC GAAAACTCCA CCCCTTATTT CCTCAATGCC CTGATGCACT ACACCGCCTA TCCGCTGCCG AAGCATGTCG TCGAGGCGAA GGGGCAGGAT TGGGTCAAGA TCGGCAACAT CGTCACCAAC GGCCCCTACA AGCCCGTCGA ATGGGTTCCG GGCTCGCATG TCACGACAGT CAAGAACGAT CAGTGGTATG ACACCAAAGA CCTGAAGATC GACGGAGCGA AGTTCTTCGT GCTCGAAGAC CAGGAAGCCG CGCTGAAACG CTACCGCGCC GGCGAATTCG ACATCCTCAC CGACTTCCCA ACCGACCAAT ACGAGTGGAT GAAGAAAAAC CTGCCGGGCC AGGCGCATGT CGCCCCCTTC TCTGGCCTCT ATTATTACGT CGTCAACTCG CAGAAACCGC CCTTCAGCGA CAAGCGCGTC CGCCAGGCTC TCTCCATGGC GATCAACCGC GAGGTCATCG GCCCGCAGAT CCTCGGCACC GGCGAACTGC CGGCCTATTC CTGGGTTCCG CCGGGCACGG CGAATTACGG CGAACCGGCC TATGTTAGCT GGAAGGACCT GCCCTACAGC GAGAAGGTCG CCGAAGCCAA GAAGCTCCTG ACCGAAGCCG GTTTCGGCCC CGACAAGCCG CTTCACGCCG TGCTGAGCTA CAACACCAAC GACAACCACA AGCGCATCGC CGTCGCCATC GCATCCATGT GGAAGCCGCT TGGCGTCGAT GTCGAACTCG TCAATGCCGA AACCAAAGTG CATTACGACC AGATGCAGCG TGGTCAAGTC GAAATCGGCC GCGCCGGCTG GCTCGCCGAC TACAACGACC CTGATAATTT CCTGAACCTC CTGGTGACAG GCGTGCAGAT GAACTACGGC CGCTGGTCGA ATCCCGAGTA CGACAAGATG ATCAAGGAAG GCAACGCCGA GACGGATCTC ACCAAGCGTG CCGCGATCTT CAAGAAGGCC GAACAGCTGG CGCTGGATGA ATCCGCCGCC CTGCCGATCT ACTACTATGT CTCGAAGAAC GTCGTTTCGC CGAAGATCGA AGGCTTCGTC GACAACATCC AAGACATCCA CCGCACCCGC TGGCTGTCGA TGAAAGAGTA A
Protein sequence | MNHFAKKFLA SAMLGTLLAF SAHAATLNIH NGGDPQSLDP QKLSGDWENR IAGDIFEGLV TEDAKDNPIP GQAESWTISP DGKVYTFKLR DGIKWSDGQP VTAGDFVFAF QRLVDPKNAA DYAYLQFTIK NAEKINKGEI TDLNQLGVKA IDDKTLEITL ENSTPYFLNA LMHYTAYPLP KHVVEAKGQD WVKIGNIVTN GPYKPVEWVP GSHVTTVKND QWYDTKDLKI DGAKFFVLED QEAALKRYRA GEFDILTDFP TDQYEWMKKN LPGQAHVAPF SGLYYYVVNS QKPPFSDKRV RQALSMAINR EVIGPQILGT GELPAYSWVP PGTANYGEPA YVSWKDLPYS EKVAEAKKLL TEAGFGPDKP LHAVLSYNTN DNHKRIAVAI ASMWKPLGVD VELVNAETKV HYDQMQRGQV EIGRAGWLAD YNDPDNFLNL LVTGVQMNYG RWSNPEYDKM IKEGNAETDL TKRAAIFKKA EQLALDESAA LPIYYYVSKN VVSPKIEGFV DNIQDIHRTR WLSMKE
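The relationship between the two sequence fields can be sketched in Python. This is a minimal illustration, not the database's own method: it strips the spaces that separate the 10-character blocks, computes the GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11 (identical to the standard code for elongation codons). It does not handle table 11's alternative start codons such as GTG or TTG, which would need to be read as M in the first position. All names here are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with
# the amino acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGAACCATT TCGCGAAAAA ATTTCTCGCC"
print(translate(prefix))  # MNHFAKKFLA
```

The output matches the first block of the protein sequence. Passing the full Gene sequence value to `translate` and `gc_fraction` should reproduce the Protein sequence and GC content fields in the same way.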