Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4682 |
Symbol | |
ID | 8007158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 46020 |
End bp | 47618 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644821616 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002972876 |
Protein GI | 241113041 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC TGTCTAGATT ATCCGTGATT GCGCTTGGCG CCCTGCTGTC GACGGCTGCC GTTCCAGCTC TTGTCGTTTC GGGCGTTGCA ATAGAGGCCC AGGCAGCCAC GCTATCGGGC GGCTTCGATG TCGGTCCCGG AGGTTTCCAG GGCAACTTCA ATCCGCTCGC CGCGACCGCC GGCTTCACCT GGCTCAGCAT CTATTACGAA CCGCTGATCA CTTATGACGA GAAGCTGCAG AAGGTCGTCG GCGCGCTGGC AAGCTCCTAC GAGGTCAGCT CCGACCAGAT GACCTACACG TTCAAGCTGG TGGACGCCAA ATGGCATGAC GGCAAACCGT TCACTGCCAA GGACGCAAAG TTCACCATGG CCCTTGCGAT GGACGCGAAA ACCGGCTCGG TGCTCGCCGC CCGGCTGAAG GGCATATCGT CCGTCGAGAC GCCGGATGAG CACACTGTTG TCATCAAGCT CAGCGCCCCC AGCAGCAGTT TTCCCGACAC GATGACCAAA GTGATGATGC TGCCCGAGCA TGCGCTCTCC TCGATCCCGG CCGACCAGCT GACGAAGAAC ACCTGGTGGT CCACAGCTCC GATCGGCACC GGTCCGTTCA AATTCACCAA ATACGTCTCG GATCAATATG TCGAACTTGC CGCAAACACC GATTATCGCG GTGGCAAACC CGCACTGGAA CGCGTCATCA ATCGCTATTT CGCCAACCCG GCCGCAGCAA TCGCTGCGCT GAGATCCGGC GAAATCCAGT TCACCTATGT CGATTCCAAC GACGTGCCGA CCTTCAAGGA CAACAAGGAC TTCCAGGTCA TAGAAGGCAA CTCTTTCGTC GTCAACTACC TGGGCTTCAA CCACGAATCC CCGCTCTGGA AGGACGTGCG CGTCCGCCAG GCGGTGATGT ACGCGATCAA TCGCGATGCC ATCATCCAGA GCCTTTATGG CGGTGCGGCC AAGCCTGCCA ACTGCGCCTA TGTCGCCGAA CAGCTGATAC CCCCTGATAT CGACAGCTAT GCCTATGATC CCGAGAAGGC CAAGCAGTTG TTGACGGAAG CCGGCTGGGA CCAGATCAAC GGCGGCAAGC AGATCACCCT TCTGACCTAT TACACCACGC CGCTGGCGAC CAACGTGCTT GCCGCAGTCC AGGCGATGCT TGCCCAGGTC GGCATCAACA TCGTCCCGCG CGCCGTCGAT GCGCCGACCT ATAACAGCAT CGTGCTCAAT GCGACGCCGG ATATCGCCCA GTTCCAGTTG GTTTATGCCG GGCTGCAGAA CGGGCCGGAT GCCGGAAGCA TCAATGTCGG CCTCAACGAG AAGCAGATCC CTCCGGCCGG GCCGAATGTC GCCAGGGTTC GCATGCCTGA CCTCACCAAG GCGCTCGATA GCGCCTTGGC CGAGCCTGAT AGCACCAAGC GGGATGCCGC CTACCAGGAC GTCTGCAAGG TGATGAACAC CAACCTGCCC TGGGCGACGC TCTGGGTGGC AAACCGCTAT GGCATCGTCT CGACAAAGGT GAAGGATTTC GTCTGGACGC CAGCGCCGGG CGGCGGCCCC TACCAGGCCA ATCCGCAGAA ATGGTCGATC GCCGAATAG
|
Protein sequence | MKRLSRLSVI ALGALLSTAA VPALVVSGVA IEAQAATLSG GFDVGPGGFQ GNFNPLAATA GFTWLSIYYE PLITYDEKLQ KVVGALASSY EVSSDQMTYT FKLVDAKWHD GKPFTAKDAK FTMALAMDAK TGSVLAARLK GISSVETPDE HTVVIKLSAP SSSFPDTMTK VMMLPEHALS SIPADQLTKN TWWSTAPIGT GPFKFTKYVS DQYVELAANT DYRGGKPALE RVINRYFANP AAAIAALRSG EIQFTYVDSN DVPTFKDNKD FQVIEGNSFV VNYLGFNHES PLWKDVRVRQ AVMYAINRDA IIQSLYGGAA KPANCAYVAE QLIPPDIDSY AYDPEKAKQL LTEAGWDQIN GGKQITLLTY YTTPLATNVL AAVQAMLAQV GINIVPRAVD APTYNSIVLN ATPDIAQFQL VYAGLQNGPD AGSINVGLNE KQIPPAGPNV ARVRMPDLTK ALDSALAEPD STKRDAAYQD VCKVMNTNLP WATLWVANRY GIVSTKVKDF VWTPAPGGGP YQANPQKWSI AE
|
| |