Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4714 |
Symbol | |
ID | 8007189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 83261 |
End bp | 84400 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644821647 |
Product | putative transcriptional regulator protein, ROK family |
Protein accession | YP_002972907 |
Protein GI | 241113072 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTATC TTCGGTCAGG GCCGCGTCGG CTTCAAACCC CGGACATGGG CGCCCCGCGC ACCAAAATGG TTGGTCTGCG GTCTGGTGAA ATCGCCGACC GGAATATTCG CGTCATTCTG GAGGCGATCC GCCGTCATGG CCCGCTGACC CGGATGGAGC TTGGGCGTCA TAGCGGCCTG ACCGGCCCTG GCATTACCAA TATTCTCCGC CGATTGGCCG AGGAAAAGCT TATCACCTCC AACCGTCGCA ATGGCCTTGG CGGAGGGGCC ACCGCCACCG AGTTTGCCTT GCGTCCGGAA GGTGCTTTCT CGATCGGCGT CAAGCTTCGC CAAAGGCGTG GCGAGGCCGT ACTCATCGAT TTGAGTGGTC AGGTCCATGA CCGGGTCTAT ATCGAGCTGG ACCCCGCTGA CAAAGTCGGT CTGGTGCATG CGGCCGTCAG GGATATGGTC GATCGCCACG CCGCGTTGCC GATCATCGGG CTCGGCATCG CCGCCAACGA CTGGACGGAG GATCAAAGCG ATCAGATCGC CGCGATGTCG ACGATCGCGC GTCCCTATGT CGAGAACGAG TGTACGGCGA GCCTTCTCGC CGAGCGCACG ATCGGAAGCT CCGGCAGGGA AGGCGGACTT GCGATGATCA TCATCGACGA CGACGTTCAG GCCGGCTTTC TCATTCGCGG TATTCCCTAT TCCGGCGTGC ATGGCCGGGC GGGCAGCATC GGCGAAATGT TGACCGGTCC CGACAATGTC CAGCTCAATA CCGTCGTCGG TTTCGAATCC CTGCGATCAC GGATCGGCGA CCAGGCGTTC ACTAGCCTGC TGAAGGGCGA GGAAATCTCC TCGCCGCTAC TGTCGCAATG GATACGGGAA GCCGCTGGCC ATCTGCTCGA CCCGATCATC GCCATGGCCG GTTTCCTCGC CCCGAGCGTC GTCATGATCG GCAGCGATCT GCCGCAGGGC GTGATCGAAG CGCTGATCCA TCAGCTTTCC ATCGAGCGGC TCGACACCTC GACGAGACCG TTACTGACGC CCTGGATTTC TCCGATGAAA CCTGCGAGCT TCAGCGGCGG CGGCGTTGCG CTCGGTGCCG CTCTCCTCCC CTTCCTCAAC ACTTTGCTGC TGCCGCCTGC CTCGGCTTGA
|
Protein sequence | MRYLRSGPRR LQTPDMGAPR TKMVGLRSGE IADRNIRVIL EAIRRHGPLT RMELGRHSGL TGPGITNILR RLAEEKLITS NRRNGLGGGA TATEFALRPE GAFSIGVKLR QRRGEAVLID LSGQVHDRVY IELDPADKVG LVHAAVRDMV DRHAALPIIG LGIAANDWTE DQSDQIAAMS TIARPYVENE CTASLLAERT IGSSGREGGL AMIIIDDDVQ AGFLIRGIPY SGVHGRAGSI GEMLTGPDNV QLNTVVGFES LRSRIGDQAF TSLLKGEEIS SPLLSQWIRE AAGHLLDPII AMAGFLAPSV VMIGSDLPQG VIEALIHQLS IERLDTSTRP LLTPWISPMK PASFSGGGVA LGAALLPFLN TLLLPPASA
|
| |