Gene Rleg_5611 details


Gene Information

Locus tag: Rleg_5611
Symbol:
ID: 8016837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 195475
End bp: 198714
Gene Length: 3240 bp
Protein Length: 1079 aa
Translation table: 11
GC content: 62%
IMG OID: 644827776
Product: glycosyl transferase group 1
Protein accession: YP_002978976
Protein GI: 241518348
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
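
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(195475, 198714))  # (3240, 1079)
```

Under these assumptions the result agrees with the Gene Length (3240 bp) and Protein Length (1079 aa) fields.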


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.115035
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGGA GGCCGCACAT CCTACTGGCG ACCGACAGTG CAGAACCGTC CGGGATGGGC 
GAACACATGC TTGCTCTCGG GCGGGCTTTG AGCGAACGGT GGGATGTCAC AATAGCCCTG
TTGACCGAGG ACAGGACCGG TTTGTTGCCG CGGGCTGCGC GCCACGGAAT TGGCATCAAG
CTAAGCGAAG ATGCGGGCGT ATTCCAGCAG TGGTTGGCGC GATCCAGTAT TGATCTTCTG
CACGTTCATG CGGGAATCGG ATGGGAGGGA CACCGCCTAG CCGCCGCCGC GGATGCCCTG
GCAATCCCGA TCATCCGCAC CGATCATTTG CCATATCTGC TCACCGATCC TGATCAGATC
GAGCTCTACC GGCGCGAGAC CGGGAGGCTG TCGCATCACA TCGTCGTTTC CGAAGCCTCG
CGTGAAAGCT TCTGCGATGC AAACGTCGCG CCGTCGCGTT TGACAGTCGT GCGCAACGGC
ATCTTTCCAC TTTCGCCTGC GCGAGCAGCG GCTGAACTAA GGCGAGCGCT CGACCTGGTG
GATAAAACCG TGCTGCTGAC GGTCGCCCGC TTCACCGAGC AGAAGGATCA CGTGTCGCTT
GTCCGCGCCT TGCCGAACAT ACTTGGAACT CACCCCAAAG CGGTGTTGCT TTTTGTCGGA
TCGGGTCCAG AAGAAGATCG TGTCCAAACC CTGGCGGAAG ATTTAAAAGC CTTTGGTCAC
GTTCGCTTTC TGGGGCACCG GGCGGATGTT GCGGAGATCA TGGCGATCGC CGATCTCTTC
GTGCTTCCCT CGCTCTTCGA AGGCCTTCCA CTCGCGGTTC TCGAGGCGAT GTCGCTTGCC
GTCCCCGTGG TCGCGACGAG GATCGGCGGA ACCGTCGAGG CACTGGGCGG TGATCATCCC
TATTTCGCGG AGCCCGGCAA CCCAGCGTCG ATCACCGCTG TCGTCAATCA GGCATTGAGC
GATCCCCATC TAAAGGCCAC CGGCAGAGTC GGCCACGCGC GTTTCGAGCG CAACTTTTCC
GCCCGCCGAA TGGCTGATGA GACCGGCGCC GTCTACGAGC GGTTCCTCAC CCCACATGCG
GATCAAAAAC AAAAGGATAC CGAGATGCGA AAAACGCGGC TGGGCTTCAT CGGCGTGGGT
GGCATCGCCC ATCGGCACCT TGATATCCTA ACCGGCTTCG ACGACGTCGA ACTCGTCGCC
TTTGCCGACC TGGACCTTCC CCGTGCGGAT GCGGCGGCAA TGCGCTTTGG CGCCCGCTCT
TTCAGCCATC ACCGTGAAAT GCTCGACACG CAACGGCTCG ACGCGGTCTA CATCTGCATT
CCGCCTTTCG CCCATGGCGA GCCGGAACAG GACCTCATTG ACCGCAACAT TCCATTCTTT
GTCGAAAAGC CAGTCACGCT CGATCTGGCG CTGGCGGAAG AGATCGCAGC CGGCGTGTCG
AAGGCCAGCT TGATTACTGG CGTTGGCTAT CACTGGCGTT ATCTCGATAT CGTCGATGAA
GCCCGAAGCC TGCTGGCGGA TAACCCAGCG CAACTGCTGT CGGGATACTG GCTGGATTCC
ACGCCTCCCC CGCAGTGGTG GTGGAAGAAG GATCAATCCG GCGGCCAGAT GGTTGAGCAG
GCGACGCATC TCCTTGATCT CGCGCGCTTC CTCGTGGGCG AGGTCACCGA CGTATACGGC
CGGGCGGGAT ACAAGGAAAG GGTGGAATTT CCCGGCCTCG ATGTGCCGAC GGTCACGACA
GCTAGCCTGA CCTTCGAAAC GGGCGTGGTC GCCAATATCG CCGCGAGCTG CCTTCTCGGC
TGGAGCCACC GCGTCGGCTT GCACATCTTC GCCGACCGAC TGGCGATCGA ATTGACCGAT
CGCGACATCA TGGTGGATGT TGGAAGAGGA CGCCCAGTGC GAAGCGCCGA TGGCGATCCG
GTATGGCGCG AAGACCGCGA TTTCATCGAC GCCGTCCGGG GTGGTGAAAA CCGCATTCGC
TGCCCGTATG GTGATGCTCT AGCCACCCAC CGGCTTGCGG TGGCCGTCGT ATCTTCGGCA
CGCAGCGGGG AGCCGGTACA CCTCGACGCC CCGGCTCTAA AGCGAAGCGA GCTATCGCCC
TTGCTGGCGC AACCACGTGC CGAGATGTCT CAAGAACCGC GGCCTGGGCA CCGCAAAATC
CGCTCGCTCG GCATCGAGGC GCCGGGCCGA GCCTATTTCT TGGAATACGA GGAAGGCCCG
CCCGCCGATG GACATGTGCG GCTCGACACG CTCTATACCG GTCTTTCCGC TGGGACGGAG
CTGACGTTCT TGAAGAACAC CAACCCCTAT TTCCGCGCGC GCTTTGATGG CGAACGCGGC
GTCTTCATCG AAAACGAGCC GGACCTTAAC TATCCAGTGC CTTTCCTCGG GTACATGGAG
GTCGCACGCG TTTCGCAATC GCGCGCGCGC GGCTTTGCCG ATGGCGAGTT GCTCGCGGCA
AGCTACGCGC ACAAGACTGG GCACACTGCC GACCCCTCAC ACGATCTTCT TGTCATGCTG
CCGCCGGACC TCGATCCGCT GCTCGGCGTC TTTGTCGCGC AGATGGGACC GATCGCCGCC
AACGGCATCC TCCATGCCGA CGCGGAAGCA TTTGGCGCAA CCGTGCCCGC ACTCGGCGCG
GGCGTTTGCG GGCGAACGGT AATCGTCCTC GGCGCGGGTA CCGTGGGTTT GATGACGGCG
CTTTTTGCCC GTTCGCTCGG CGCATCGGAC ATCGTGATCA CAGATCCCTC CGAATTTCGC
CGCGGCAAGG CGGAGGCGAT GGGATTGATG GCGATGGCCG AGGATCAAGC CTGGCAGCAT
GCCAAGGCGC GCTGGCACGA CGGCACCATG GGACGCGGCG CGGATGTAGC GTTTCAGACG
CGCGCTCATG CGGGTAGCCT GCACACGGCG CTCAAGGCCT TGCGCCCACA AGGCACCGTG
ATCGATCTTG CCTTTTACCA GAACGGGGCC AGCTCATTGC GGCTGGGGGA AGAATTCCAT
CACAACGGCC TGAACATCCG CTGCGCGCAA ATCAACCGCG TCCCCCGCGG GCTCGCCCCA
CGGTGGGACC GTCGAAGGCT TGCAGGCGTA ACGCTCGACC TCTTGAAGAC GGAAGGTGCG
GCGATCCGCG AGCACATGAT CACTCACATC GTGCCCATCG ACGAAGCGCC TGCTTTCCTC
GTCGATCTGA TCGAGAACCG GCCGGAATTC CTCCAGGTTA TATTCAAGGT CGGCGAATGA
 
Protein sequence
MNRRPHILLA TDSAEPSGMG EHMLALGRAL SERWDVTIAL LTEDRTGLLP RAARHGIGIK 
LSEDAGVFQQ WLARSSIDLL HVHAGIGWEG HRLAAAADAL AIPIIRTDHL PYLLTDPDQI
ELYRRETGRL SHHIVVSEAS RESFCDANVA PSRLTVVRNG IFPLSPARAA AELRRALDLV
DKTVLLTVAR FTEQKDHVSL VRALPNILGT HPKAVLLFVG SGPEEDRVQT LAEDLKAFGH
VRFLGHRADV AEIMAIADLF VLPSLFEGLP LAVLEAMSLA VPVVATRIGG TVEALGGDHP
YFAEPGNPAS ITAVVNQALS DPHLKATGRV GHARFERNFS ARRMADETGA VYERFLTPHA
DQKQKDTEMR KTRLGFIGVG GIAHRHLDIL TGFDDVELVA FADLDLPRAD AAAMRFGARS
FSHHREMLDT QRLDAVYICI PPFAHGEPEQ DLIDRNIPFF VEKPVTLDLA LAEEIAAGVS
KASLITGVGY HWRYLDIVDE ARSLLADNPA QLLSGYWLDS TPPPQWWWKK DQSGGQMVEQ
ATHLLDLARF LVGEVTDVYG RAGYKERVEF PGLDVPTVTT ASLTFETGVV ANIAASCLLG
WSHRVGLHIF ADRLAIELTD RDIMVDVGRG RPVRSADGDP VWREDRDFID AVRGGENRIR
CPYGDALATH RLAVAVVSSA RSGEPVHLDA PALKRSELSP LLAQPRAEMS QEPRPGHRKI
RSLGIEAPGR AYFLEYEEGP PADGHVRLDT LYTGLSAGTE LTFLKNTNPY FRARFDGERG
VFIENEPDLN YPVPFLGYME VARVSQSRAR GFADGELLAA SYAHKTGHTA DPSHDLLVML
PPDLDPLLGV FVAQMGPIAA NGILHADAEA FGATVPALGA GVCGRTVIVL GAGTVGLMTA
LFARSLGASD IVITDPSEFR RGKAEAMGLM AMAEDQAWQH AKARWHDGTM GRGADVAFQT
RAHAGSLHTA LKALRPQGTV IDLAFYQNGA SSLRLGEEFH HNGLNIRCAQ INRVPRGLAP
RWDRRRLAGV TLDLLKTEGA AIREHMITHI VPIDEAPAFL VDLIENRPEF LQVIFKVGE
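
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the elongation codon assignments of translation table 11, which are the same as those of the standard genetic code. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first position varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record)
    is ignored. Codons containing characters other than A, C, G, T
    raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAATAGGA GGCCGCACAT CCTACTGGCG"))  # MNRRPHILLA
```

The output matches the first ten residues of the protein sequence listed above.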