Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4751 |
Symbol | |
ID | 8007004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 119746 |
End bp | 120999 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644821681 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002972941 |
Protein GI | 241113106 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.137656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.378465 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACCC TTTCTGCGAA ATTGAAGACT GCGAGCATCG TTGCGATCGC CGTGGCGTCA CTATCGGCCA CGCCGGTTCT TGCGGAAGAC ATCACGCTTT GGACCCTCAA CTTCGACAAC AATGCTGCCA ACACGGCTCT GAAAAAGGTG GCGACGGACT TCGAAGCGGC AAACCCCGGA ACGCATGTCG AGATCGTTCA GCGCGCCGTC GACGAGCATA AGACCGCCTT GCGCGTCGCT GCTGGCTCCG ACAAGGGACC TGACATTTAT TTCAGCTGGG CGGGCCTCGG CCTCGGCGGC GAGTATGTGA AGGCCGGTCT GTCCCTGCCC CTCGACAAAT ACTATGCCGA GTATAAGTGG AGCGACGAAT TGCTGCCCTC GGCAGCGGCT TTTGCCGACC TCTATCCCGG CGGCAAGCAC GGCGTCCCCT TCACCTTCAA GGGTGAGGCC GTCTATTACA ACAAGAAGCT TTTCGAACAG GCCGGCATCA AGGAAGAGCC GAAGACCTAC GAGGAATTCC TTGCAGCGGC CGATAAGCTG AAGGCTGCCG GCATTCCCGC CTTCACCTTC GGCGGCACGG TCAACTGGCA CGTCATGCGT CTCATGGACG TCATCCTTGA AACGAAGTGC GGTGCTGAAA AGCACGATGC GCTGAAGGCG ATGACGCTGG ATTGGACCAA GGAACCCTGC GCGACGGATT CATTCGCGGA GTTTGCGAAG TGGACGAAGG ACTATACGCT GCAGCCGTTC ATGGGCATCG ACAACAAACA GTCCTACAGC CTCTTCACCG CGGGTCGTGC AGCGATGATG CTCGAAGGCG ACTGGCTGGT CAGCCAGCTT AACGGCTCCG GCGCCAATCT CGACGACTAC GGGATTTTCC CCTTCCCGAC CAACACCGAT CGTCTCTACG GTTTCGCCGA GTACAACTAC ATCAGCACCA AGAGCAAGAG CCCTGATGTA GCGGCGAAGT TCCTCGACTA CTTCCTCTCG ACGAAGGTCC AGCAGGACCT GCTCGGCCAG CTGAGTTCAA CCTCCGTCAA CAAGAACGTC CAATATGCCA ACCAGAAGCC GCTCGAGGCG GAATGGCTGG GGATCTTCCA GAAATACGGC AAGGTCTACA TGAACGGCGA CCAGGCGTTC CCGCTCGACG TCACGACGGA GTACTTCCGG GTCATCAACG ATGTTGCTTC CGGCAACACC GAGCCGGCCG ATGCGGCCAA GCAGTTGCAG AGCTTTATCG CAAGCCGAAC CTGA
|
Protein sequence | MLTLSAKLKT ASIVAIAVAS LSATPVLAED ITLWTLNFDN NAANTALKKV ATDFEAANPG THVEIVQRAV DEHKTALRVA AGSDKGPDIY FSWAGLGLGG EYVKAGLSLP LDKYYAEYKW SDELLPSAAA FADLYPGGKH GVPFTFKGEA VYYNKKLFEQ AGIKEEPKTY EEFLAAADKL KAAGIPAFTF GGTVNWHVMR LMDVILETKC GAEKHDALKA MTLDWTKEPC ATDSFAEFAK WTKDYTLQPF MGIDNKQSYS LFTAGRAAMM LEGDWLVSQL NGSGANLDDY GIFPFPTNTD RLYGFAEYNY ISTKSKSPDV AAKFLDYFLS TKVQQDLLGQ LSSTSVNKNV QYANQKPLEA EWLGIFQKYG KVYMNGDQAF PLDVTTEYFR VINDVASGNT EPADAAKQLQ SFIASRT
|
| |