Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3184 |
Symbol | |
ID | 8014081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3185562 |
End bp | 3186812 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644825748 |
Product | ROK family protein |
Protein accession | YP_002976976 |
Protein GI | 241205880 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00402315 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACCA AATCGAGCAC GGAGCTGGTC CGGCAGAGAA ACAGCGTGCT CGTGCTGTCC GTGCTGCGTC GCCACGGTGC GCTCGCGCAT ACCGAAATAT CCGATTTCAC CGGGCTGTCG TCGGCCACCA TTTCGGCGAT CACCACTGAG CTGGAAAAAG CCCAGATCAT CGAAAAGTCG GAACATCAGC CGGCAAGCGG CCGCGGCCGG CCACGCGTGC TGCTGCGCCA GCGGCGCGAT TGCGGCTATC TTATCGTCGT CATCATCTCC TCAGATGCGG TGCAATATTC GCTGGTCGAT TATGCCGGCA AGCTGATCGA CCGCTTCAGT GAGGAACGTT CGCATGATCC TGCAGGCGCT GCCCGCTTCG TCGCTGCCGT GCGGGCCGGG CTTTTGCGTA TTCTCGATCG TTCGAAGATC AGCCAAGAAA AGGTGCTGCT GATCTCGATC AGCAGCAAGG GGCTGGTCAA TTCGACGGAG CCGGTTCTGG TATGGTCGCC GATCTTCGGC AGCGACCAGA TCGATTTCGA ATTGGCACTC CGGCCGGAAT GGCAGGCCAA GGTGATCCTC GACAACGAGA CGCTGCTGGT CGCAGCCGCG CTCGGCGCGC GTGAGGAGAT GGTGAAGGGC GCCGATTTCC GTTCGCTCGC CGCCCTTTCG CTCGGCCACA GCGTCGGGCT TGGCATCGTC AGGCGCGGCA ACCAGACGGG CCAGGAGATA TCGGCGCCGA ATTTCGGGCA CATGCTGCAC ATGGCCAATG GTGGGCTCTG CCGCTGCGGC ACCCGCGGCT GCATCGAGGC CTATGCCGGT TTCTACGCGA TCCTGCGCAG CGCCTTCGAA GTGCCGCTCG ATACGATCCC GGCAAAGTTC GTGCCGGTGG CGGAACTGGA CAAGATCGCC GCAAAGGCGC GCCAGGGCCA CCGCGTCCCC GCCTTTGCCT TCCGCCAGGC GGGGCTGGCG CTCGGCAACG GGCTGTCGCG CATGCTGAGC TTGACGGAGC GCATGCCGAT CGCCATCACC GGGCCGGGCA CGCGTTATTA CGACCTTCTT CGGCAAGGGA TCGAAGAGGG TCTCGGGCAG TCGCATATTG TGCGCATGGA AGGCATGCCC GAGATCAGGG TGGTGGCCGA CGAGCAGATC CTCGTCTTCG AAGGACATCT GAACCGGGCG CTGTCTGTCA TCGACGAGGA TATCGTTCTC TCGGGCGTTC AGGGAATCCA GGCATCGGCG ATTATTCAGG AATCGGGTTG A
|
Protein sequence | MLTKSSTELV RQRNSVLVLS VLRRHGALAH TEISDFTGLS SATISAITTE LEKAQIIEKS EHQPASGRGR PRVLLRQRRD CGYLIVVIIS SDAVQYSLVD YAGKLIDRFS EERSHDPAGA ARFVAAVRAG LLRILDRSKI SQEKVLLISI SSKGLVNSTE PVLVWSPIFG SDQIDFELAL RPEWQAKVIL DNETLLVAAA LGAREEMVKG ADFRSLAALS LGHSVGLGIV RRGNQTGQEI SAPNFGHMLH MANGGLCRCG TRGCIEAYAG FYAILRSAFE VPLDTIPAKF VPVAELDKIA AKARQGHRVP AFAFRQAGLA LGNGLSRMLS LTERMPIAIT GPGTRYYDLL RQGIEEGLGQ SHIVRMEGMP EIRVVADEQI LVFEGHLNRA LSVIDEDIVL SGVQGIQASA IIQESG
|
| |