Gene Rleg_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4035 
Symbol 
ID8014840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4112005 
End bp4114845 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content60% 
IMG OID644826604 
Producttranscriptional regulator, winged helix family 
Protein accessionYP_002977815 
Protein GI241206719 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATAG GCGGCGACGT TAAAGCGGAG GAGTTATCCT TCGGTCCGTT CCGTTTGAGT 
GTCGGCCAAC GGCTTCTCGC GAAGGACGGG GTTCCGATAA ACCTCGGGGC GCGTGCACTG
GACTTGCTGG TCGCTCTCAC CCTCGCTCCC AATGTCATCG TCAGCAAGCA AGACCTGATA
TCCCGCGTCT GGCCTGATGT CATCGTCGAT GAAGGCAGCC TGCGTTTCCA CATGACAGGC
CTGAGGAAGG CGCTGGGCGA CGGTCACGAT GGAGCACGGT ACATCACCAC TATTGCCGGA
CGGGGCTATT GTTTCGTCGC GCCAATCTCA CGATCCGGTC TCCTTCGACA AGTCTCCGCC
GAACTCGATT TCCGTTACGC TATTCTGCCG GGCCGACCGG ATCGCATGGT CGGCCGAGAG
CAGGATGTTC TCGCGCTGAC CGAAAAGCTG ATGGCCTCGC GAATGGTCAC CATTGTCGGC
GTCGGCGGTG TCGGAAAGAC GACCGTCGCC ACGGCAGTTG CGCATCACCT CGCTCCCACA
TTCAAGGGAG CAGTCCTGTT TGCCGATTAT GGCATGTTGA GCGATCCGGC TCTAGTCGCG
GCCGGGATCG CGTCGATGCT TGGTTTGTCC GTCAGCTCCA GCGATGTACG TCCCAGTTTG
ATTGCCTACC TCCGCGACAA GCAAATCATG CTGATACTCG ATACCTGCGA GCATCTGATT
GATGCCATTG CCGATCTCGT GGCCGCCATT GTCGAAGCCG CCCCTCAGGT CTTTCTGCTG
GCAACAAGTC GCGAGGCGTT AAGGATCGAG GCCGAAAGCG TCTACCGGCT AGATACCCTC
GCTTTCCCCC CGGATGACCT GGAACTTACG ACCGACACGA TACTTGCCTT TCCAGCGACT
CGGCTGTTCG TCGAACGGGC TGCGGCGAGC GGTGCCAATC TCAATCTCAG CGACCAGGAT
GCCCGCGTCG TCGCGAGTAT TTGCCGAAAA CTCGATGGTA TGGCGCTTGC GTTAGAACTC
GCCGCCAGAC GCGTCGAGAG CTATGGCCTC CTTCAGACCG CCAAGCTGCT CGATCAGCAT
CTGACATTGG GATGGGCGGG ATCGCGAAAC GCACCGCCGC GACAAAGGAC CCTACAGGCG
ACGCTCGACT GGAGCTTTGG ACTTCTAACA GACCTGGAGC GCATGGTGCT CCGGCGGCTT
GCCGTTTTTG TCGGCGACTT CACCCTCGAC GCCGCGCTGG AGGTCATTTC CACAGCGGAC
ATGGCGCCCT CGGCCATATT CGAGGCACTG GACAATCTGG TCGCCAAGTC TCTGCTCGCA
ACGCGGCCGG CAGGGGCGAC GATGCGCTAC CGCTTGCTGG ATACGACACG TGCCTACGCG
CTCCATGCGC AGACGGACGA GGACCGCGCC GGATTGAATG CGCGTCACGC GACCTATTGT
CAGCGTTGGC TGGAGCAATT CGGACCGGAT TGGCCGACGC TTTCTACGGG GCCTGAGCGA
TTACCGTATT TCGTGAGCAT CAACAACGTG CGGGCGGCAC TGGAATGGGC GTTCGGGGAG
CATGGCGATA TCGACGTCGG CATCAGGCTT GCGGCTGCTG CTGTACCTGT GTTCCAGGTG
ATGTCACTTT TCCCGGAGTG CCAGCGATGG TCGAAACGGG CGGTTCTTGC ACTGGATGAG
GCGTTCAGGG GAGGCGTCGA GGAAATGCAC CTACAAGCCG GTCTGGGAAT TTCCCGGATG
TATCTGCAGG GAGGCCGCGA GACACCGCAG ATCGCTCTTG GTCGCGCCCT CCATATTGCC
GAGGACCGGG GGAACACGCT CGACCAGTTG CGCATACTAG GTCCGTTGCA CATGTTCAGC
CTTCGGATAG GAGATTTTAA CGCCGCTCTT GATTATGCCC GGCGGTGTTC GGCAATCGCC
GCCACTCTTG ATGACGCCGC CACTGTCGAA CTGGCGCACT TCTTCCTCGG CAATTCGCTG
CACTTTACCG GTGATCTTCG CCACGCCCGC ACCGAACTCG AAGCGGCTAC AAGAAGCGAA
CATCAGCCAC AACGAACACC TGCAAGCTAC GTCGGCTTCG AGGGAAAGCA TCTCGCTGGC
GGAATTCTTG CACGGAACCT TTGGCTACAA GGCTACCCGG AGCAGGCCGA CGTCCAAGCG
CGACAGGCGA TCAGCGATGC CGCTAAGTTG GATCACTCGC TGACACTCTG CATCGCACTG
CTTGGAGGCA TTGCAGTCTT CCTATGGCGG GGCGACGTGC CGAGTGCCGA GGAACACATT
GAATGGCTGG TCTCCCGTGC GGGGCTACAC AACCTGTCTC CCTACGTGTC GGTGGCTCGG
GGTTTCGAAG GCGAACTGGC CATCCGTCGG GGACAGGTGA AGCTGGGGAT CGAGACACTC
AGACGATGCA TCGAGAAGCT TCATGCATCC ACCTACGAGG TGTTCACGAC GATGCTGGAG
CTGTCCCTTG CAAAAGGACT GGCACTGATC GGAGAGCGCG AGGAAGGGAT GGCCCGGATC
AACAAGACCA TCGAACTTGT CGAGAGGAAC GGCGACCTTT GCTATATGCC GGAGCTGCTG
CGCGTGAAGG CAGGTTTGCT GTCGATCAAT TCCGCAACCG ACGCCGAGGC TTGCTTGGTC
TCGTCTTTGG AAAGAAGTGC CAGCATGGGT GCACATGCCT GGGAGCTGCG GGCGGCAACC
GATCTGGCCG CACTGATGGC CAGCGACGGT AGGTTACGCG AGGCTCGGGT ACTGCTGACG
CCAGTGTGTG AGAGATTTGA AGAGGGTATG GACACGACAG ACGTGATGGC CGCAGACACA
CTACTACAAA ACCTGTCATA A
 
Protein sequence
MNIGGDVKAE ELSFGPFRLS VGQRLLAKDG VPINLGARAL DLLVALTLAP NVIVSKQDLI 
SRVWPDVIVD EGSLRFHMTG LRKALGDGHD GARYITTIAG RGYCFVAPIS RSGLLRQVSA
ELDFRYAILP GRPDRMVGRE QDVLALTEKL MASRMVTIVG VGGVGKTTVA TAVAHHLAPT
FKGAVLFADY GMLSDPALVA AGIASMLGLS VSSSDVRPSL IAYLRDKQIM LILDTCEHLI
DAIADLVAAI VEAAPQVFLL ATSREALRIE AESVYRLDTL AFPPDDLELT TDTILAFPAT
RLFVERAAAS GANLNLSDQD ARVVASICRK LDGMALALEL AARRVESYGL LQTAKLLDQH
LTLGWAGSRN APPRQRTLQA TLDWSFGLLT DLERMVLRRL AVFVGDFTLD AALEVISTAD
MAPSAIFEAL DNLVAKSLLA TRPAGATMRY RLLDTTRAYA LHAQTDEDRA GLNARHATYC
QRWLEQFGPD WPTLSTGPER LPYFVSINNV RAALEWAFGE HGDIDVGIRL AAAAVPVFQV
MSLFPECQRW SKRAVLALDE AFRGGVEEMH LQAGLGISRM YLQGGRETPQ IALGRALHIA
EDRGNTLDQL RILGPLHMFS LRIGDFNAAL DYARRCSAIA ATLDDAATVE LAHFFLGNSL
HFTGDLRHAR TELEAATRSE HQPQRTPASY VGFEGKHLAG GILARNLWLQ GYPEQADVQA
RQAISDAAKL DHSLTLCIAL LGGIAVFLWR GDVPSAEEHI EWLVSRAGLH NLSPYVSVAR
GFEGELAIRR GQVKLGIETL RRCIEKLHAS TYEVFTTMLE LSLAKGLALI GEREEGMARI
NKTIELVERN GDLCYMPELL RVKAGLLSIN SATDAEACLV SSLERSASMG AHAWELRAAT
DLAALMASDG RLREARVLLT PVCERFEEGM DTTDVMAADT LLQNLS