Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5566 |
Symbol | |
ID | 6978660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1215093 |
End bp | 1216028 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643394664 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002279482 |
Protein GI | 209547564 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.306318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGCCT TTCCGGCATT GCGGACCCTG CTGATCGCCG CCATGGCGTC CGGCCTTTCC TTCGGCGCAA CTCGTGCGGC CGACGATTTC GACCTGAGCC CGCAGCAGCC GGGCAGGCTG CATGCCGCCA GGAACGAGGC GGCAATCGCT GCGATCCCCA AGGAGTTCAA GTTCGTCACG CCAGGCAAGT TCACCATCGC CGTCAGTCCG GGCGGTCCGC CGCTTGCGAC CTATGCCACC GACGCCAAGA CCGTCGTCGG GGCGGATCCC GATTATGCCT ATGCCATCGC CGACAGCCTC GGCCTGACGC TGGAGATTGT GCCCGTGGCC TGGATCGACT GGCCGCTCGG CCTCGCCTCC GGCAAGTATG ATGCCGTCAT TTCCAATGTC GGGGTCACCG AACAGCGCAA GGAGAAGTTC GATTTCTCCA CCTATCGTCA GGGCCTGCAT GGCTTCTTCG TGAAATCCGA CAGCCCCATC ACCTCGATCA AGCAGCCGAA GGATGCGGCG GGTTTGAGGA TCATCGTCGG GGCCGGAACC AACCAGGAGC GCATCCTGGT GAAGTGGAGC GACGAGGATG TCGCCGCCGG CCTGAAGCCG ATCGAGCTGC AATATTACGA CGACGAGGCG GCAAGCCTCC TCGCGCTCCG TTCCGGTCGG GCCGATGTCA TCGTGCAGCC GCATGCGCAG CTCGTCTTCA TCGCGGCGCG CGACAAGAAC ATCAAGCGTG TGGGCACGCT GAGCGCCGGC TGGCCCGATC GTTCCGACGT TGCGATCACC ACCCGCAAGG GAAGCGGGCT CGCCGATGCG CTGACCGTCG CCACCAACGG CCTGATCAAG GACGGCAGCT ACGCGAAGAT CCTCGATCAC TGGCATCTGT CCGAGGAAGC CTTGCCGGCA TCCGAGACCA ATCCGCCCGG CCTGCCGAAA TACTGA
|
Protein sequence | MIAFPALRTL LIAAMASGLS FGATRAADDF DLSPQQPGRL HAARNEAAIA AIPKEFKFVT PGKFTIAVSP GGPPLATYAT DAKTVVGADP DYAYAIADSL GLTLEIVPVA WIDWPLGLAS GKYDAVISNV GVTEQRKEKF DFSTYRQGLH GFFVKSDSPI TSIKQPKDAA GLRIIVGAGT NQERILVKWS DEDVAAGLKP IELQYYDDEA ASLLALRSGR ADVIVQPHAQ LVFIAARDKN IKRVGTLSAG WPDRSDVAIT TRKGSGLADA LTVATNGLIK DGSYAKILDH WHLSEEALPA SETNPPGLPK Y
|
| |