Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5611 |
Symbol | |
ID | 8016837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 195475 |
End bp | 198714 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644827776 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002978976 |
Protein GI | 241518348 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.115035 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGGA GGCCGCACAT CCTACTGGCG ACCGACAGTG CAGAACCGTC CGGGATGGGC GAACACATGC TTGCTCTCGG GCGGGCTTTG AGCGAACGGT GGGATGTCAC AATAGCCCTG TTGACCGAGG ACAGGACCGG TTTGTTGCCG CGGGCTGCGC GCCACGGAAT TGGCATCAAG CTAAGCGAAG ATGCGGGCGT ATTCCAGCAG TGGTTGGCGC GATCCAGTAT TGATCTTCTG CACGTTCATG CGGGAATCGG ATGGGAGGGA CACCGCCTAG CCGCCGCCGC GGATGCCCTG GCAATCCCGA TCATCCGCAC CGATCATTTG CCATATCTGC TCACCGATCC TGATCAGATC GAGCTCTACC GGCGCGAGAC CGGGAGGCTG TCGCATCACA TCGTCGTTTC CGAAGCCTCG CGTGAAAGCT TCTGCGATGC AAACGTCGCG CCGTCGCGTT TGACAGTCGT GCGCAACGGC ATCTTTCCAC TTTCGCCTGC GCGAGCAGCG GCTGAACTAA GGCGAGCGCT CGACCTGGTG GATAAAACCG TGCTGCTGAC GGTCGCCCGC TTCACCGAGC AGAAGGATCA CGTGTCGCTT GTCCGCGCCT TGCCGAACAT ACTTGGAACT CACCCCAAAG CGGTGTTGCT TTTTGTCGGA TCGGGTCCAG AAGAAGATCG TGTCCAAACC CTGGCGGAAG ATTTAAAAGC CTTTGGTCAC GTTCGCTTTC TGGGGCACCG GGCGGATGTT GCGGAGATCA TGGCGATCGC CGATCTCTTC GTGCTTCCCT CGCTCTTCGA AGGCCTTCCA CTCGCGGTTC TCGAGGCGAT GTCGCTTGCC GTCCCCGTGG TCGCGACGAG GATCGGCGGA ACCGTCGAGG CACTGGGCGG TGATCATCCC TATTTCGCGG AGCCCGGCAA CCCAGCGTCG ATCACCGCTG TCGTCAATCA GGCATTGAGC GATCCCCATC TAAAGGCCAC CGGCAGAGTC GGCCACGCGC GTTTCGAGCG CAACTTTTCC GCCCGCCGAA TGGCTGATGA GACCGGCGCC GTCTACGAGC GGTTCCTCAC CCCACATGCG GATCAAAAAC AAAAGGATAC CGAGATGCGA AAAACGCGGC TGGGCTTCAT CGGCGTGGGT GGCATCGCCC ATCGGCACCT TGATATCCTA ACCGGCTTCG ACGACGTCGA ACTCGTCGCC TTTGCCGACC TGGACCTTCC CCGTGCGGAT GCGGCGGCAA TGCGCTTTGG CGCCCGCTCT TTCAGCCATC ACCGTGAAAT GCTCGACACG CAACGGCTCG ACGCGGTCTA CATCTGCATT CCGCCTTTCG CCCATGGCGA GCCGGAACAG GACCTCATTG ACCGCAACAT TCCATTCTTT GTCGAAAAGC CAGTCACGCT CGATCTGGCG CTGGCGGAAG AGATCGCAGC CGGCGTGTCG AAGGCCAGCT TGATTACTGG CGTTGGCTAT CACTGGCGTT ATCTCGATAT CGTCGATGAA GCCCGAAGCC TGCTGGCGGA TAACCCAGCG CAACTGCTGT CGGGATACTG GCTGGATTCC ACGCCTCCCC CGCAGTGGTG GTGGAAGAAG GATCAATCCG GCGGCCAGAT GGTTGAGCAG GCGACGCATC TCCTTGATCT CGCGCGCTTC CTCGTGGGCG AGGTCACCGA CGTATACGGC CGGGCGGGAT ACAAGGAAAG GGTGGAATTT CCCGGCCTCG ATGTGCCGAC GGTCACGACA GCTAGCCTGA CCTTCGAAAC GGGCGTGGTC GCCAATATCG CCGCGAGCTG CCTTCTCGGC TGGAGCCACC GCGTCGGCTT GCACATCTTC GCCGACCGAC TGGCGATCGA ATTGACCGAT CGCGACATCA TGGTGGATGT TGGAAGAGGA CGCCCAGTGC GAAGCGCCGA TGGCGATCCG GTATGGCGCG AAGACCGCGA TTTCATCGAC GCCGTCCGGG GTGGTGAAAA CCGCATTCGC TGCCCGTATG GTGATGCTCT AGCCACCCAC CGGCTTGCGG TGGCCGTCGT ATCTTCGGCA CGCAGCGGGG AGCCGGTACA CCTCGACGCC CCGGCTCTAA AGCGAAGCGA GCTATCGCCC TTGCTGGCGC AACCACGTGC CGAGATGTCT CAAGAACCGC GGCCTGGGCA CCGCAAAATC CGCTCGCTCG GCATCGAGGC GCCGGGCCGA GCCTATTTCT TGGAATACGA GGAAGGCCCG CCCGCCGATG GACATGTGCG GCTCGACACG CTCTATACCG GTCTTTCCGC TGGGACGGAG CTGACGTTCT TGAAGAACAC CAACCCCTAT TTCCGCGCGC GCTTTGATGG CGAACGCGGC GTCTTCATCG AAAACGAGCC GGACCTTAAC TATCCAGTGC CTTTCCTCGG GTACATGGAG GTCGCACGCG TTTCGCAATC GCGCGCGCGC GGCTTTGCCG ATGGCGAGTT GCTCGCGGCA AGCTACGCGC ACAAGACTGG GCACACTGCC GACCCCTCAC ACGATCTTCT TGTCATGCTG CCGCCGGACC TCGATCCGCT GCTCGGCGTC TTTGTCGCGC AGATGGGACC GATCGCCGCC AACGGCATCC TCCATGCCGA CGCGGAAGCA TTTGGCGCAA CCGTGCCCGC ACTCGGCGCG GGCGTTTGCG GGCGAACGGT AATCGTCCTC GGCGCGGGTA CCGTGGGTTT GATGACGGCG CTTTTTGCCC GTTCGCTCGG CGCATCGGAC ATCGTGATCA CAGATCCCTC CGAATTTCGC CGCGGCAAGG CGGAGGCGAT GGGATTGATG GCGATGGCCG AGGATCAAGC CTGGCAGCAT GCCAAGGCGC GCTGGCACGA CGGCACCATG GGACGCGGCG CGGATGTAGC GTTTCAGACG CGCGCTCATG CGGGTAGCCT GCACACGGCG CTCAAGGCCT TGCGCCCACA AGGCACCGTG ATCGATCTTG CCTTTTACCA GAACGGGGCC AGCTCATTGC GGCTGGGGGA AGAATTCCAT CACAACGGCC TGAACATCCG CTGCGCGCAA ATCAACCGCG TCCCCCGCGG GCTCGCCCCA CGGTGGGACC GTCGAAGGCT TGCAGGCGTA ACGCTCGACC TCTTGAAGAC GGAAGGTGCG GCGATCCGCG AGCACATGAT CACTCACATC GTGCCCATCG ACGAAGCGCC TGCTTTCCTC GTCGATCTGA TCGAGAACCG GCCGGAATTC CTCCAGGTTA TATTCAAGGT CGGCGAATGA
|
Protein sequence | MNRRPHILLA TDSAEPSGMG EHMLALGRAL SERWDVTIAL LTEDRTGLLP RAARHGIGIK LSEDAGVFQQ WLARSSIDLL HVHAGIGWEG HRLAAAADAL AIPIIRTDHL PYLLTDPDQI ELYRRETGRL SHHIVVSEAS RESFCDANVA PSRLTVVRNG IFPLSPARAA AELRRALDLV DKTVLLTVAR FTEQKDHVSL VRALPNILGT HPKAVLLFVG SGPEEDRVQT LAEDLKAFGH VRFLGHRADV AEIMAIADLF VLPSLFEGLP LAVLEAMSLA VPVVATRIGG TVEALGGDHP YFAEPGNPAS ITAVVNQALS DPHLKATGRV GHARFERNFS ARRMADETGA VYERFLTPHA DQKQKDTEMR KTRLGFIGVG GIAHRHLDIL TGFDDVELVA FADLDLPRAD AAAMRFGARS FSHHREMLDT QRLDAVYICI PPFAHGEPEQ DLIDRNIPFF VEKPVTLDLA LAEEIAAGVS KASLITGVGY HWRYLDIVDE ARSLLADNPA QLLSGYWLDS TPPPQWWWKK DQSGGQMVEQ ATHLLDLARF LVGEVTDVYG RAGYKERVEF PGLDVPTVTT ASLTFETGVV ANIAASCLLG WSHRVGLHIF ADRLAIELTD RDIMVDVGRG RPVRSADGDP VWREDRDFID AVRGGENRIR CPYGDALATH RLAVAVVSSA RSGEPVHLDA PALKRSELSP LLAQPRAEMS QEPRPGHRKI RSLGIEAPGR AYFLEYEEGP PADGHVRLDT LYTGLSAGTE LTFLKNTNPY FRARFDGERG VFIENEPDLN YPVPFLGYME VARVSQSRAR GFADGELLAA SYAHKTGHTA DPSHDLLVML PPDLDPLLGV FVAQMGPIAA NGILHADAEA FGATVPALGA GVCGRTVIVL GAGTVGLMTA LFARSLGASD IVITDPSEFR RGKAEAMGLM AMAEDQAWQH AKARWHDGTM GRGADVAFQT RAHAGSLHTA LKALRPQGTV IDLAFYQNGA SSLRLGEEFH HNGLNIRCAQ INRVPRGLAP RWDRRRLAGV TLDLLKTEGA AIREHMITHI VPIDEAPAFL VDLIENRPEF LQVIFKVGE
|
| |