Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5304 |
Symbol | |
ID | 6978398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 933377 |
End bp | 934987 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643394408 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002279226 |
Protein GI | 209547308 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.85757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCA AGATTACCAA CTGGACCCGC TCTGACGACG CCATGATCGA AACCGCCATC CGTCGTGGCG CGACCCGCCG CGAGTTGCTG CATATGATGC TGGCGGGCGG CGTGGCCATG TCCGCCGGCG GGCTCGTGCT CGGCCGCGCC GGCAAGGCGC TTGCGGCGAC GCCGGTTTCC AGCGGTTCGC TCAAGGCAGC CGGCTGGTCG TCCTCGACGG CCGATACGCT CGACCCCGCC AAGGCGTCGC TCTCCACCGA TTATGTCCGG TGCTGCTCCT TCTATAACCG CCTCACCATC CTCGACCAGA GCGGCAAGCC GCAGATGGAG CTTGCCGACG CGATCGAGTC CAAGGATGCG AAGACCTGGA CGGTCAAGCT GAAGAGCGGC GTCACATTTC ACGACGGCAA ACCGCTGACA TCAGACGACG TGGTCTTCTC ACTGAAGCGC CATCTCGACC CCGCGGTCGG CTCGAAGGTC GCCAAGATTG CCGCCCAGAT GACTGGCTTC AAGGCAGTCG ATAAGCAGAC CGTCGAGATT ACCCTTGCCG ATGCAAACGC GGACTTGCCG ACCATCCTGT CGTTACACCA CTTCATGATT GTCGCAGATG GCACCACCGA CTTTTCGAAG GCGAACGGCA CCGGTGCTTT CGTCAAGGAG GTCTTCGAGC CAGGTGTGCG CTCGGTCGGC ATCAAGAACA AGAACTACTG GAAATCGGGC CCGAACGTCG ATTCCTTTGA ATATTTCGCG ATCAGCGACG ACAATGCCCG CGTCAACGCA CTGCTGGCCG GCGATATCCA CCTCGCCGCC TCGATCAATC CGCGCTCGAT GCGCCTCATC GAGGCCCAGG GCGATGGGTT TACCTTGTCG AAGACGACCT CCGGCAACTA TACCAACCTC AATATGCGGA TGGATATGGA ACCCGGCAAT AAGCAAGATT TCATCGAAGG CATGAAGTCC CTTGTCAACC GCGAACAGAT CGTCAAGTCG GCGCTGCGCG GTCTCGGCGA GGTTGGCAAC GACCAACCCA TTTCTCCGGC GAACTTCTAT CATGATGCGG ACTTGAAAGC GCGGGGCTTC GATCCCGAAA AGGCGAAGTT CCACTTCGAA AAAGCAGGCG TCCTCGGCCA ATCGATTCCG ATTATCGCTT CGGATGCGGC GAACTCGTCG ATCGACATGG CCATGATCAT CCAGGCGGCC GGCGCCGAGA TAGGGATGAA GCTCGATGTC CAGCGCGTCC CCGCCGACGG CTACTGGGAC AATTATTGGC TTAAGGCGCC TATTCACTTC GGCAATGTCA ATCCTCGTCC AACACCGGAC ATTTTGTTCT CGTTGTTCTA CACCTCTCAA GCGCCCTGGA ATGAAAGCCG CTACAAGTCT GAAAAATTCG ACAAGATGCT GATCGAGGCG CGCGGTTCGC TCGACCAGGA GAAGCGCAAG ACGATCTACA ACGAGATGCA GGTTATGGTC GCTCAGGAAG CCGGCACCAT TATTCCAGCC TATCTATCGA ATGTCGATGC CACCACTGCC AAGCTCAAGG GCTTGCTACC CAGCCCCCTT GGCGGCCAGA TGGGATACGC GTTTGCCGAA TATGTCTGGC TCGAAGCCTG A
|
Protein sequence | MNSKITNWTR SDDAMIETAI RRGATRRELL HMMLAGGVAM SAGGLVLGRA GKALAATPVS SGSLKAAGWS SSTADTLDPA KASLSTDYVR CCSFYNRLTI LDQSGKPQME LADAIESKDA KTWTVKLKSG VTFHDGKPLT SDDVVFSLKR HLDPAVGSKV AKIAAQMTGF KAVDKQTVEI TLADANADLP TILSLHHFMI VADGTTDFSK ANGTGAFVKE VFEPGVRSVG IKNKNYWKSG PNVDSFEYFA ISDDNARVNA LLAGDIHLAA SINPRSMRLI EAQGDGFTLS KTTSGNYTNL NMRMDMEPGN KQDFIEGMKS LVNREQIVKS ALRGLGEVGN DQPISPANFY HDADLKARGF DPEKAKFHFE KAGVLGQSIP IIASDAANSS IDMAMIIQAA GAEIGMKLDV QRVPADGYWD NYWLKAPIHF GNVNPRPTPD ILFSLFYTSQ APWNESRYKS EKFDKMLIEA RGSLDQEKRK TIYNEMQVMV AQEAGTIIPA YLSNVDATTA KLKGLLPSPL GGQMGYAFAE YVWLEA
|
| |