Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6426 |
Symbol | |
ID | 8016925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012854 |
Strand | - |
Start bp | 141495 |
End bp | 143084 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644828221 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002979421 |
Protein GI | 241554208 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0312111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACT ACACGAAGTA TCTCGCAAGC CGTGTCACCG CCGGCGGGCT CAGCCGCCGC GAATTCATGG GACGCGCCAT GGCCGCGGGC ATCACACTTG CCGTCGCCGA CAAGCTCTTC ACCGAAAGTG CGCAAGCTGC CGAACCGAAG CGCGGCGGTC ACTTGAAGCT CGGCCTCGAA GGCGGTGCCG CTACCGATTC CAACGACCCG GCGAAGTTCC TGTCGCAGGT CATGTTCTGC ATCGGCCGCT GCTGGGGCGA CATGCTGGTC GAGTCTGATC CGCTGACCGG TGCCGCCGTG CCGGCGCTCG CCGAATCCTG GGAACCGTCG AAGGACGCGG CGACCTGGAC CTTCAAGATC CGCAAGGGCG TCAAGTTTCA CGATGGCAAG GAACTGACGA TCGATGACGT TGTTGCGACG CTGAAGCGCC ACACCGACGC CAAGTCGGAA TCCGGCGCGC TCGGTGTTCT CGGCTCGATC AAGGAGATCA AGGCCGACGG CGGCAATCTC GTGCTGACGC TCAGCGAAGG CAATGCCGAC ATGCCGCTGC TGCTGTCGGA CTACCATCTG GTCATCCAGC CGAACGGCGG CGTCGACGAT CCTCTCGCCT CGATCGGCAC CGGCCCCTAC AAGATGACAA GCTTCGAGCC CGGCGTCCGC GCCACCTTCG AAAGGAACAA AGACGACTGG CGCACCGACC GCGGTTACGT CGATTCGATC GAAATCATCG GCATGAACGA TGCGACCGCC CGCATCGCGG CCCTGTCGTC CGGCCAGGTG CACTACATCA ACCGGGTTGA CCCGAAGACC GTCAACCTCT TGAAACGCGC ACCCAACGTC GAGATTCTCT CGACCGCCGG CCGTGGCCAT TACGTCTTCA TCATGCATTG TGACAAGGCG CCGTTCGACA ACAACGACCT GCGCCTGGCC CTCAAATATG CCATGGACCG TGAGGCCATG GTGCAGAAGA TCCTCGGCGG TTACGGCAAG GTCGGCAACG ACTTCCCGAT CAACAGCACC TATGCGCTGT TTCCCGAGGG CATCGAGCAG CGCGTTTACG ATCCTGACAA GGCTGCCTTC CACTATAAGA AGTCAGGTCA TAGCGGCTCG GTCCTCCTGC GCACCTCCGA AGTCGCCTTC CCCGGCGGTG TCGACGCAGC CGTCCTCTAT CAGGAAAGCT GCAAGAAGGC CGGCATCGAG ATCGAGGTCA AGCGCGAACC GGGCGACGGC TACTGGACCA ACGTCTGGAA CGTCCAGCCC TTCTCGACCT CCTATTGGGG TGGCCGCCCG ACGCAGGACC AGATGTATTC AACCGCCTAT CTCTCGACGG CGGATTGGAA CGACACCCGT TTCAAGCGTC CTGACTTCGA TAAGCTGCTG CTGCAGGCCC GTTCCGAACT TGATGAAGTC AAGCGCAAGG ACATGTATCG CACCATGGCG ATGACGGTGC GCGACGAGGG CGGGGTGATC TTGCCGATGT TCAACGATTT CGTGAATGCC TCCACCAAGC AGGTGAAGGG TTATGTCCAC GACATCGGCA ACGACATGTC GAACGGCTAC GTTGCGACCC GCGTCTGGCT GGACGCTTGA
|
Protein sequence | MNDYTKYLAS RVTAGGLSRR EFMGRAMAAG ITLAVADKLF TESAQAAEPK RGGHLKLGLE GGAATDSNDP AKFLSQVMFC IGRCWGDMLV ESDPLTGAAV PALAESWEPS KDAATWTFKI RKGVKFHDGK ELTIDDVVAT LKRHTDAKSE SGALGVLGSI KEIKADGGNL VLTLSEGNAD MPLLLSDYHL VIQPNGGVDD PLASIGTGPY KMTSFEPGVR ATFERNKDDW RTDRGYVDSI EIIGMNDATA RIAALSSGQV HYINRVDPKT VNLLKRAPNV EILSTAGRGH YVFIMHCDKA PFDNNDLRLA LKYAMDREAM VQKILGGYGK VGNDFPINST YALFPEGIEQ RVYDPDKAAF HYKKSGHSGS VLLRTSEVAF PGGVDAAVLY QESCKKAGIE IEVKREPGDG YWTNVWNVQP FSTSYWGGRP TQDQMYSTAY LSTADWNDTR FKRPDFDKLL LQARSELDEV KRKDMYRTMA MTVRDEGGVI LPMFNDFVNA STKQVKGYVH DIGNDMSNGY VATRVWLDA
|
| |