Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6684 |
Symbol | |
ID | 8022594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 115007 |
End bp | 116653 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644833551 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002984685 |
Protein GI | 241666601 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.146405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.692475 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT ATCTTCTTGC CGCCGCCGCA CTAACGCTGC TTTCGGGATC TGCCATGGCG CAAACGATCC TCACAGTGAA TATCGAACCG GCGACGACCT GGGTCCGCAA CTTCAACCCG TTCAACCAGA CCTCGTCGCG TCAATCGACA CTCGACTTCA TCTACGAGCC GCTGGTCGTC TTCAATCGCT TCGACAGCAA CAAGCCGGTC TATCGCCTGG CGGAAAGCTT CAAACTCTCC GACGATCTGA AGAGCATCGA TTTCAAGCTG CGCCCGAACC TGAAATGGTC GGACGGTAAG CCGCTGACCG CAGCCGACGT CAAGTTCACC TATGATTACC TGAAGAAATT TCCGGCGCTC GACTTCGTCA GCATCTGGAC CTTCATCACC GATGTGCAGG CCGTCGACGG CCAGACGGTG CGCTTCACGC TCGCCAATCC GAGCTCGCTC GCCGCCGAGC AGATCTCGCA ACTGCCGATC GTTCCGGAAC ATGTCTGGAA GGACGTTGCC GATCCCGTCA CCTTCGCCAA CGAGACACCT GTTGGCAGCG GCCCGCTGAC GGAAGTGCCG CGCTTCACCG GCCAGACTTA CGACCAGTGC CGCAACCCGA ACTACTGGGA CAACGAGCAC CTGAAAGTCG ATTGCATGCG CTTCCCGCAG CTCGCCGACA ACAATCAGAT GCTGACGGCA ACGGCCGACG GCACGCTCGA CTGGGGCGTC TCCTTCATCC CCGATATCGA CAATGTCTAT GTTTCCAAGG ACCCGGCGCA TTTCCACTAC TGGTATTCGC CAAGCAGCAT GGTCGCCTTC CTGTTCAACC TGGAAACGGC GAACGAGAAC AACAAGAAGG CCTTCAACGA CCTGAAGTTC CGCCGTGCCG TCTCGATGGC ACTCGACCGC AAGACGATGA TCGACGTCGC AGGCTACGGC TATCCGACGC TGAACGAAGA CCCCGGCCTG ATGGGCGAGC TCTACAAGAG CTGGGCGGAC CCGTCCGTCA AGGCCGACTT CGGCAAGTTC GCGACCTATG ATGCCGATGC TGCCAAGGCC TTGCTCGACG AGGCGGGCTA CAACGACAAG GACGGCGACG GCTTCCGCGA CAATCCCGAC GGCACCAAGA TCTCCTTCTC GATCATCGTC CCCAGCGCCT GGACGGACTG GATCGATACC GTCAACCTCG CGGTCGAGGG CATGCAGGCG GTCGGGATCG ACGCCAAGAT CGAAACGCCG GAGGAAGCCG TCTGGACCGG AAACCTCATC AACGGCACCT TCGATGCGGC GATCAACAGC CTGCCGGCAT CGGCCTCGCC CTATTACCCC TACAAGCGCG CTTTCAGTGC TTCGGATAAG GGCAAGACCC GCTTCACCGC GCAGCGCTGG TTCAATCCGG AGGTCGAAAA ACTCGTCACC GAGTTCACCC ATACCGCCGA CCTTGCCAAG CAGAAGGATG CGATGAACAA GGCGCAGCGC ATCGTCGCCG AAAACATGCC TGTGATTCCG GTGTTCAACA ATCCGAACTG GTATCAGTAC AACACCAAGC GCTTCACCGG CTGGTCGACC AAGGAAAACC CCTTCGTCAA TCCGTCGATC TCGCGGACCA ATCCGGCACG CCTGCTGAAC CTGCTGGCCC TCGAGCCGGT CAAGTAA
|
Protein sequence | MKKYLLAAAA LTLLSGSAMA QTILTVNIEP ATTWVRNFNP FNQTSSRQST LDFIYEPLVV FNRFDSNKPV YRLAESFKLS DDLKSIDFKL RPNLKWSDGK PLTAADVKFT YDYLKKFPAL DFVSIWTFIT DVQAVDGQTV RFTLANPSSL AAEQISQLPI VPEHVWKDVA DPVTFANETP VGSGPLTEVP RFTGQTYDQC RNPNYWDNEH LKVDCMRFPQ LADNNQMLTA TADGTLDWGV SFIPDIDNVY VSKDPAHFHY WYSPSSMVAF LFNLETANEN NKKAFNDLKF RRAVSMALDR KTMIDVAGYG YPTLNEDPGL MGELYKSWAD PSVKADFGKF ATYDADAAKA LLDEAGYNDK DGDGFRDNPD GTKISFSIIV PSAWTDWIDT VNLAVEGMQA VGIDAKIETP EEAVWTGNLI NGTFDAAINS LPASASPYYP YKRAFSASDK GKTRFTAQRW FNPEVEKLVT EFTHTADLAK QKDAMNKAQR IVAENMPVIP VFNNPNWYQY NTKRFTGWST KENPFVNPSI SRTNPARLLN LLALEPVK
|
| |