Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4035 |
Symbol | |
ID | 8014840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4112005 |
End bp | 4114845 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644826604 |
Product | transcriptional regulator, winged helix family |
Protein accession | YP_002977815 |
Protein GI | 241206719 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATAG GCGGCGACGT TAAAGCGGAG GAGTTATCCT TCGGTCCGTT CCGTTTGAGT GTCGGCCAAC GGCTTCTCGC GAAGGACGGG GTTCCGATAA ACCTCGGGGC GCGTGCACTG GACTTGCTGG TCGCTCTCAC CCTCGCTCCC AATGTCATCG TCAGCAAGCA AGACCTGATA TCCCGCGTCT GGCCTGATGT CATCGTCGAT GAAGGCAGCC TGCGTTTCCA CATGACAGGC CTGAGGAAGG CGCTGGGCGA CGGTCACGAT GGAGCACGGT ACATCACCAC TATTGCCGGA CGGGGCTATT GTTTCGTCGC GCCAATCTCA CGATCCGGTC TCCTTCGACA AGTCTCCGCC GAACTCGATT TCCGTTACGC TATTCTGCCG GGCCGACCGG ATCGCATGGT CGGCCGAGAG CAGGATGTTC TCGCGCTGAC CGAAAAGCTG ATGGCCTCGC GAATGGTCAC CATTGTCGGC GTCGGCGGTG TCGGAAAGAC GACCGTCGCC ACGGCAGTTG CGCATCACCT CGCTCCCACA TTCAAGGGAG CAGTCCTGTT TGCCGATTAT GGCATGTTGA GCGATCCGGC TCTAGTCGCG GCCGGGATCG CGTCGATGCT TGGTTTGTCC GTCAGCTCCA GCGATGTACG TCCCAGTTTG ATTGCCTACC TCCGCGACAA GCAAATCATG CTGATACTCG ATACCTGCGA GCATCTGATT GATGCCATTG CCGATCTCGT GGCCGCCATT GTCGAAGCCG CCCCTCAGGT CTTTCTGCTG GCAACAAGTC GCGAGGCGTT AAGGATCGAG GCCGAAAGCG TCTACCGGCT AGATACCCTC GCTTTCCCCC CGGATGACCT GGAACTTACG ACCGACACGA TACTTGCCTT TCCAGCGACT CGGCTGTTCG TCGAACGGGC TGCGGCGAGC GGTGCCAATC TCAATCTCAG CGACCAGGAT GCCCGCGTCG TCGCGAGTAT TTGCCGAAAA CTCGATGGTA TGGCGCTTGC GTTAGAACTC GCCGCCAGAC GCGTCGAGAG CTATGGCCTC CTTCAGACCG CCAAGCTGCT CGATCAGCAT CTGACATTGG GATGGGCGGG ATCGCGAAAC GCACCGCCGC GACAAAGGAC CCTACAGGCG ACGCTCGACT GGAGCTTTGG ACTTCTAACA GACCTGGAGC GCATGGTGCT CCGGCGGCTT GCCGTTTTTG TCGGCGACTT CACCCTCGAC GCCGCGCTGG AGGTCATTTC CACAGCGGAC ATGGCGCCCT CGGCCATATT CGAGGCACTG GACAATCTGG TCGCCAAGTC TCTGCTCGCA ACGCGGCCGG CAGGGGCGAC GATGCGCTAC CGCTTGCTGG ATACGACACG TGCCTACGCG CTCCATGCGC AGACGGACGA GGACCGCGCC GGATTGAATG CGCGTCACGC GACCTATTGT CAGCGTTGGC TGGAGCAATT CGGACCGGAT TGGCCGACGC TTTCTACGGG GCCTGAGCGA TTACCGTATT TCGTGAGCAT CAACAACGTG CGGGCGGCAC TGGAATGGGC GTTCGGGGAG CATGGCGATA TCGACGTCGG CATCAGGCTT GCGGCTGCTG CTGTACCTGT GTTCCAGGTG ATGTCACTTT TCCCGGAGTG CCAGCGATGG TCGAAACGGG CGGTTCTTGC ACTGGATGAG GCGTTCAGGG GAGGCGTCGA GGAAATGCAC CTACAAGCCG GTCTGGGAAT TTCCCGGATG TATCTGCAGG GAGGCCGCGA GACACCGCAG ATCGCTCTTG GTCGCGCCCT CCATATTGCC GAGGACCGGG GGAACACGCT CGACCAGTTG CGCATACTAG GTCCGTTGCA CATGTTCAGC CTTCGGATAG GAGATTTTAA CGCCGCTCTT GATTATGCCC GGCGGTGTTC GGCAATCGCC GCCACTCTTG ATGACGCCGC CACTGTCGAA CTGGCGCACT TCTTCCTCGG CAATTCGCTG CACTTTACCG GTGATCTTCG CCACGCCCGC ACCGAACTCG AAGCGGCTAC AAGAAGCGAA CATCAGCCAC AACGAACACC TGCAAGCTAC GTCGGCTTCG AGGGAAAGCA TCTCGCTGGC GGAATTCTTG CACGGAACCT TTGGCTACAA GGCTACCCGG AGCAGGCCGA CGTCCAAGCG CGACAGGCGA TCAGCGATGC CGCTAAGTTG GATCACTCGC TGACACTCTG CATCGCACTG CTTGGAGGCA TTGCAGTCTT CCTATGGCGG GGCGACGTGC CGAGTGCCGA GGAACACATT GAATGGCTGG TCTCCCGTGC GGGGCTACAC AACCTGTCTC CCTACGTGTC GGTGGCTCGG GGTTTCGAAG GCGAACTGGC CATCCGTCGG GGACAGGTGA AGCTGGGGAT CGAGACACTC AGACGATGCA TCGAGAAGCT TCATGCATCC ACCTACGAGG TGTTCACGAC GATGCTGGAG CTGTCCCTTG CAAAAGGACT GGCACTGATC GGAGAGCGCG AGGAAGGGAT GGCCCGGATC AACAAGACCA TCGAACTTGT CGAGAGGAAC GGCGACCTTT GCTATATGCC GGAGCTGCTG CGCGTGAAGG CAGGTTTGCT GTCGATCAAT TCCGCAACCG ACGCCGAGGC TTGCTTGGTC TCGTCTTTGG AAAGAAGTGC CAGCATGGGT GCACATGCCT GGGAGCTGCG GGCGGCAACC GATCTGGCCG CACTGATGGC CAGCGACGGT AGGTTACGCG AGGCTCGGGT ACTGCTGACG CCAGTGTGTG AGAGATTTGA AGAGGGTATG GACACGACAG ACGTGATGGC CGCAGACACA CTACTACAAA ACCTGTCATA A
|
Protein sequence | MNIGGDVKAE ELSFGPFRLS VGQRLLAKDG VPINLGARAL DLLVALTLAP NVIVSKQDLI SRVWPDVIVD EGSLRFHMTG LRKALGDGHD GARYITTIAG RGYCFVAPIS RSGLLRQVSA ELDFRYAILP GRPDRMVGRE QDVLALTEKL MASRMVTIVG VGGVGKTTVA TAVAHHLAPT FKGAVLFADY GMLSDPALVA AGIASMLGLS VSSSDVRPSL IAYLRDKQIM LILDTCEHLI DAIADLVAAI VEAAPQVFLL ATSREALRIE AESVYRLDTL AFPPDDLELT TDTILAFPAT RLFVERAAAS GANLNLSDQD ARVVASICRK LDGMALALEL AARRVESYGL LQTAKLLDQH LTLGWAGSRN APPRQRTLQA TLDWSFGLLT DLERMVLRRL AVFVGDFTLD AALEVISTAD MAPSAIFEAL DNLVAKSLLA TRPAGATMRY RLLDTTRAYA LHAQTDEDRA GLNARHATYC QRWLEQFGPD WPTLSTGPER LPYFVSINNV RAALEWAFGE HGDIDVGIRL AAAAVPVFQV MSLFPECQRW SKRAVLALDE AFRGGVEEMH LQAGLGISRM YLQGGRETPQ IALGRALHIA EDRGNTLDQL RILGPLHMFS LRIGDFNAAL DYARRCSAIA ATLDDAATVE LAHFFLGNSL HFTGDLRHAR TELEAATRSE HQPQRTPASY VGFEGKHLAG GILARNLWLQ GYPEQADVQA RQAISDAAKL DHSLTLCIAL LGGIAVFLWR GDVPSAEEHI EWLVSRAGLH NLSPYVSVAR GFEGELAIRR GQVKLGIETL RRCIEKLHAS TYEVFTTMLE LSLAKGLALI GEREEGMARI NKTIELVERN GDLCYMPELL RVKAGLLSIN SATDAEACLV SSLERSASMG AHAWELRAAT DLAALMASDG RLREARVLLT PVCERFEEGM DTTDVMAADT LLQNLS
|
| |