Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6643 |
Symbol | |
ID | 8022893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 73169 |
End bp | 74713 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644833510 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002984644 |
Protein GI | 241666560 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.19211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAAGC TTCTTCTTGC AGGCACCATG TTCATGGCGA TGACCGGCGT GATCCACGCC CGCGACATCG TCGTGGCTCA AAGTTCCGAT CTGCGCAGCA ACAATCCGGG CGTCAATCGC GACGGCAATA CCGATGGCGT CATCCTGCAT ATCGTCGAAG GGCTCGTCGG CTATGCCAAC AACGGCGAGG TCAAGCCGCT GCTGGCAAAG AGCTTCGAAG TCTCGGCCGA TGGGCTGACC TACAGCTTCA AACTGCGTGA CGACGTCAAA TTCCACAACG GCAAGACATT GACCGCCGAT GACGTCGTTT GGAACTGGAA CCGCTATCTC AAGCCCGAAA CGAAATGGAC CTGCCTTCCT GACTTCGACG GCAACGGCAG CGTGCATGTT ACGGGGGTCA AGGCAGTCGA TGCGTCGACC GTCACCATCA CGCTGGAAAA GCCATCCGCG GTTTTCCTTG GCCTGATGTC GCGCCCCGAA TGCGGTTACA CCGGGATCAT TTCCCCGGAA TCGGTCGGCG CAGACGGAAA TTTCGTCAAG CCGATCGGCA CCGGTCCCTT CAAATGGGAT GAATGGAAAA AGGGCGAGTA TATCCATCTC GCCAAGTTCG ACGATTATGT CTCGCCAGAG AATGACGGCA AGCCCGACGG CATGGTCGGC TCCAAACGCC CTCTCGTCGA TGGCATCAAG TTCATGGTGA TTCCCGATGC TTCGACCGTA AAGGCCGGCC TTCAGTCCGG GGTGCTCGAT ACCGCGGAGA TTTCGCCGGA TCTCATTCCC GAATTCAAGA CGAGCGACAC GATGCAATTG ATCGTGGCGC GCAACAACGG CAAAAACCTC TTCTACATCC AGACGCGCGA CAAGGTTCTG AGCAATCCCG GCGTGCGCCG CGCCATGGCA ATGGCGCTCG ATCTCGACCA ACTCGTCGAG GCCGCCTCCA ATGGCACCGG CGCAGCCAAC GGTTCGATGG TTTCGCAAGA CTCGCTCTAT TTCGACGATG TCCAGAAGGA GCGTCTGCCC TACGACGTCG AGGCCGCGAA GAAAGAACTT GCGACGGCCG GCTATAAGGG CGAGCCGATT ACCATCATCG CCAACAAGCG CAGCAACGTG CCAAGCTTCC CGGCCGCGGT GATGGCGCAG GCCATGATGC AGCAGGCAGG TCTCAATGTG CAGATCGAGG TGCTCGACTA TGCAACGCAG GTCGATCGCC GCCGGTCCGG CAACTACCAG ATCATCTCGC AATCGGTCGC GCCGCGGCTC GATCCGGCGC TGATGTACGG CTTCTATGTC GGCAACAAGG ACAAGAATGC GTCGTTGATG TGGGATGATC CAAAGGCCGT CGAGTTGATG AAGGCCGCCT ATGCGGAACC CGACCAGACG AAGCGTCAAG CGATCTTCGA CGAGTTTCAC ACGCTGATGC TCAAGGAAAT GCCGGGCATC TTCCTCTATG ACATGGTCGA TGTCTGGGGC GCGACCAAGA AGCTGAAGGG CCAGCCCGTC TGGCAATCGA ATGCCCGTCT TTGGGAAGTT TCGCTCGACA ACTGA
|
Protein sequence | MHKLLLAGTM FMAMTGVIHA RDIVVAQSSD LRSNNPGVNR DGNTDGVILH IVEGLVGYAN NGEVKPLLAK SFEVSADGLT YSFKLRDDVK FHNGKTLTAD DVVWNWNRYL KPETKWTCLP DFDGNGSVHV TGVKAVDAST VTITLEKPSA VFLGLMSRPE CGYTGIISPE SVGADGNFVK PIGTGPFKWD EWKKGEYIHL AKFDDYVSPE NDGKPDGMVG SKRPLVDGIK FMVIPDASTV KAGLQSGVLD TAEISPDLIP EFKTSDTMQL IVARNNGKNL FYIQTRDKVL SNPGVRRAMA MALDLDQLVE AASNGTGAAN GSMVSQDSLY FDDVQKERLP YDVEAAKKEL ATAGYKGEPI TIIANKRSNV PSFPAAVMAQ AMMQQAGLNV QIEVLDYATQ VDRRRSGNYQ IISQSVAPRL DPALMYGFYV GNKDKNASLM WDDPKAVELM KAAYAEPDQT KRQAIFDEFH TLMLKEMPGI FLYDMVDVWG ATKKLKGQPV WQSNARLWEV SLDN
|
| |