Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5780 |
Symbol | |
ID | 6977169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 190798 |
End bp | 192444 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643393235 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278053 |
Protein GI | 209546163 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0222518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0795178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT ATCTTCTTGC CGCCGCCGCA CTGACGCTGC TTTCGGGCTC CGCCATGGCG CAGACGGTCC TGACGGCGAA TATCGAGCCG GCGACGACCT GGGTTCGCAA CTTCAATCCG TTCAACCAAA CCTCGTCGCG CCAGTCGACG CTCGACTTCA TCTACGAGCC GCTGGTTATT TTCAACCGCT TCGACAGCAA CAAGCCGGTC TATCGCCTGG CCGAAAGCTT CACGCTCTCC GATGATCTGA AGAGCATCGA TTTCAAGCTG CGCCCGAACC TGAAATGGTC TGATGGTAAG CCGCTGACAT CAGCCGACGT CAAGTTCACC TATGATTATC TGAAGAAATT CCCGGCGCTC GACTTCGTCA GCATCTGGAG CTTCATCACC GATGTCAAAG CCGTCGACGG CCAGACGGTG CGCTTCACGC TCGCCAATCC GAGTTCGCTG GCTGCCGAGC AGATCTCGCA GCTGCCGATC GTTCCGGAAC ATGTCTGGAA GGACGTCGCC GATCCGGTCA CTTTCGCCAA CGAGAACCCG GTCGGCAGCG GCCCGCTGAC CGAGGTTCCG CGCTTCACCG GCCAGACCTA CGACCAGTGC CGCAACCCGA ACTATTGGGA CAATGCGCAT CTGAAGGTCG ATTGCATGCG CTTCCCGCAG CTTGCCGACA ACAACCAGAT GCTGACGGCA ACAGCCGACG GCACGCTCGA CTGGGGCGTC TCCTTCATTC CCGATATCGA CAATGTCTAT GTGTCCAAGG ATCCGGCGCA TTTCCACTAT TGGTATTCGC CGAGCAGCAT GGTCGCCTTC CTGTTCAACC TGGAAACGGC GAACGAGAAC AATAAGAAGG CCTTCATCGA CCTGAAATTC CGCCGTGCCG TCTCCATGGC GCTCGACCGC AAGACGATGA TCGATGTCGC CGGCTACGGC TATCCGACGC TGAACGAAGA CCCCGGCCTG ATGGGCGAGC TTTACAAGAG CTGGGCAGAC CCCTCCGTCA AATCAGACTT CGGCAAGTTC GCGACCTATG ACGCCGACGC TGCCAAGGCC CTGCTCGACG AGGCGGGTTA CAAGGACAAG GACGGCGACG GCTTCCGCGA CAACCCCGAC GGCAGCAAGA TCTCTTTCTC GATCATCGTC CCGAGCGCCT GGACCGACTG GATCGACACC GTCAATCTCG CCGTCGAAGG CATGCAGGCG GTCGGCATCG ACGCCAAGAT CGAAACGCCT GAAGAAGCCG TCTGGACCGG CAACCTCATC AACGGCACCT TCGATGCGGC GATCAACAGC CTGCCGGCAT CGGCTTCGCC CTATTATCCC TACAAGCGCG CCTTCAGCGC TTCGGACAAG GGCAAGACCC GCTTCACCGC GCAGCGCTGG TTCAATCCCG AGGTCGAGAA GCTCGTCACC GAGTTCACCC AGACGGCGGA TCTTGCCAAG CAGAAGGACG CGATGAACAA GGCGCAACGC ATCGTCGCCG AAAACATGCC GATGATTCCG GTGTTCAACA ATCCGAACTG GTATCAGTAC AACACCAAGC GCTTCACCGG CTGGTCGACC AAGGAAAACC CCTTCGTCAA TCCGTCGATC TCGCGGACCA ACCCGGCACG CCTCTTGAAC CTGCTCGCGC TCGAGCCGGT CAAGTAA
|
Protein sequence | MKKYLLAAAA LTLLSGSAMA QTVLTANIEP ATTWVRNFNP FNQTSSRQST LDFIYEPLVI FNRFDSNKPV YRLAESFTLS DDLKSIDFKL RPNLKWSDGK PLTSADVKFT YDYLKKFPAL DFVSIWSFIT DVKAVDGQTV RFTLANPSSL AAEQISQLPI VPEHVWKDVA DPVTFANENP VGSGPLTEVP RFTGQTYDQC RNPNYWDNAH LKVDCMRFPQ LADNNQMLTA TADGTLDWGV SFIPDIDNVY VSKDPAHFHY WYSPSSMVAF LFNLETANEN NKKAFIDLKF RRAVSMALDR KTMIDVAGYG YPTLNEDPGL MGELYKSWAD PSVKSDFGKF ATYDADAAKA LLDEAGYKDK DGDGFRDNPD GSKISFSIIV PSAWTDWIDT VNLAVEGMQA VGIDAKIETP EEAVWTGNLI NGTFDAAINS LPASASPYYP YKRAFSASDK GKTRFTAQRW FNPEVEKLVT EFTQTADLAK QKDAMNKAQR IVAENMPMIP VFNNPNWYQY NTKRFTGWST KENPFVNPSI SRTNPARLLN LLALEPVK
|
| |