Gene Information |
Locus tag | Rleg_3779 |
Symbol | |
ID | 8014608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3833921 |
End bp | 3835213 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644826342 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977561 |
Protein GI | 241206465 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
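The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 3833921
END_BP = 3835213


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1293 430"
```

Both values agree with the Gene Length and Protein Length fields of the record.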
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.292836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGAAAA TGGTTCTTGG CGGCCTATGC GCCATTCTGC TGAGCGCGGT TTCGACGCTG GCACAGGCAG AAACAATCCG AATTGCAAAT CATGGTCAGG CCGGCATCGA CGCGATGAAG TCGACGGTCG AGCGGATCGA AAAGAAATAT GGCGTCACCG TCGAGGTCAT CGAATATCCG GCGCCCGACA AGGACTACCT GACGAAACTC CTGACCGAGC TCGGCGCCGG CAACGCGCCC GACCTGTTCT CCATCCCGAC GACGGCTGCG GTCGCCGACA TGGTGGAAGC CGGTTACCTC GCGCCTGTTA CCAAGGAGTT CAAGGCCTGG GACGGCTACG CCAACCTCTA TGACGTCGCC AAGGAGCTTG CCGTCAGCCC GGATGGTGAA ACCTATGTGA TGCCGTTCAT GCTCGGCATT CAGGAAATCT ATTACCGCAA GGACGTTCTC GAAAAAGCCG GCATCTCCAC CGAACAGCCG AAGACCTGGC AGGAACTTCT CGACCGCGCG GCCGAGATCA AGCAGAAGAC CGGCGCCTAC GGCCTACTCT TTCCGGCCGG CGTTTCCTGG GGAAGCGGTG CCTTCGACGA AGGTTTCCAG CATCTGCTCG TCGGTTCCAA GACGCCGCAG TTGGTCGATG CCGACGGCAA GCTCGATCTG AATGGCGAAG GCATCAAGGA TGTCTTCAAC GTCTACAAGG AACTGATCGA CAAGGATCTG ATGCCGACGC AGCCGCTGCT CGGACCCGAG CCCTGGGTCG TGCCGAAGTA CCAGATGTTC CCGGAAGGCA AGCTCGCCGC GACTACCTGC GGCTCCTGGT GCTATATTTT CGACTGGGGT CGCGAAAGCA AGAACCCGAT CCCTGACGTG ACGAAGGTGG TCGGCACCTG GACGGTTCCC GGCCAGAGCA GCGGCCAGTA TGTGCTTGCC AATCTCGCTG CGCCGTGGGC CGTCAATTCC AAGTCGGCCA ATACCGAACT GGCAATCAAG GCGCTGATGG AGATCGGCTC GATCGAGACG CAGGTTTCCT ACGCCGCCCG CATCGGCAAC ATCCCCGCCA GCAAGGATGC GGCTGACAAT GCCGAGTTCC AGAAGCTGAC GGAACTGGTT CCGATCCATG CTGCGGCCGG GAACGGCGTG TTCCTGAAGC AGGCCTCTGG TTTCGGTACG GTCTCGGAAG GGGTCGCGCG GGCCACCGAA GCGCTGCTGC GCAAGGAAAC CGATGCCGCC GGCGCCCAGA AGATCCTTGT CGACTACGTC AAGGAAACGC TCGGCGACGA CATGGTCAAG TAA
|
Protein sequence | MKKMVLGGLC AILLSAVSTL AQAETIRIAN HGQAGIDAMK STVERIEKKY GVTVEVIEYP APDKDYLTKL LTELGAGNAP DLFSIPTTAA VADMVEAGYL APVTKEFKAW DGYANLYDVA KELAVSPDGE TYVMPFMLGI QEIYYRKDVL EKAGISTEQP KTWQELLDRA AEIKQKTGAY GLLFPAGVSW GSGAFDEGFQ HLLVGSKTPQ LVDADGKLDL NGEGIKDVFN VYKELIDKDL MPTQPLLGPE PWVVPKYQMF PEGKLAATTC GSWCYIFDWG RESKNPIPDV TKVVGTWTVP GQSSGQYVLA NLAAPWAVNS KSANTELAIK ALMEIGSIET QVSYAARIGN IPASKDAADN AEFQKLTELV PIHAAAGNGV FLKQASGFGT VSEGVARATE ALLRKETDAA GAQKILVDYV KETLGDDMVK
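The relationship between the two sequences can be illustrated with a minimal translation sketch. The record lists the gene on the minus strand, but the gene sequence as shown starts with TTG and ends with TAA, so the sketch assumes it is already given in coding orientation. It also assumes that the first codon is a valid initiator and is read as Met, as is usual for alternative start codons such as TTG under translation table 11, whose codon-to-amino-acid assignments match the standard code. The names `clean` and `translate_cds` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon.

    Simplification: the first codon is always reported as M, assuming it is
    a valid initiator.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("TTGAAGAAAA TGGTTCTTGG CGGCCTATGC"))  # prints "MKKMVLGGLC"
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate_cds` should reproduce the remaining blocks in the same way.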