Gene Rleg_3184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3184 
Symbol 
ID8014081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3185562 
End bp3186812 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID644825748 
ProductROK family protein 
Protein accessionYP_002976976 
Protein GI241205880 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00402315 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACCA AATCGAGCAC GGAGCTGGTC CGGCAGAGAA ACAGCGTGCT CGTGCTGTCC 
GTGCTGCGTC GCCACGGTGC GCTCGCGCAT ACCGAAATAT CCGATTTCAC CGGGCTGTCG
TCGGCCACCA TTTCGGCGAT CACCACTGAG CTGGAAAAAG CCCAGATCAT CGAAAAGTCG
GAACATCAGC CGGCAAGCGG CCGCGGCCGG CCACGCGTGC TGCTGCGCCA GCGGCGCGAT
TGCGGCTATC TTATCGTCGT CATCATCTCC TCAGATGCGG TGCAATATTC GCTGGTCGAT
TATGCCGGCA AGCTGATCGA CCGCTTCAGT GAGGAACGTT CGCATGATCC TGCAGGCGCT
GCCCGCTTCG TCGCTGCCGT GCGGGCCGGG CTTTTGCGTA TTCTCGATCG TTCGAAGATC
AGCCAAGAAA AGGTGCTGCT GATCTCGATC AGCAGCAAGG GGCTGGTCAA TTCGACGGAG
CCGGTTCTGG TATGGTCGCC GATCTTCGGC AGCGACCAGA TCGATTTCGA ATTGGCACTC
CGGCCGGAAT GGCAGGCCAA GGTGATCCTC GACAACGAGA CGCTGCTGGT CGCAGCCGCG
CTCGGCGCGC GTGAGGAGAT GGTGAAGGGC GCCGATTTCC GTTCGCTCGC CGCCCTTTCG
CTCGGCCACA GCGTCGGGCT TGGCATCGTC AGGCGCGGCA ACCAGACGGG CCAGGAGATA
TCGGCGCCGA ATTTCGGGCA CATGCTGCAC ATGGCCAATG GTGGGCTCTG CCGCTGCGGC
ACCCGCGGCT GCATCGAGGC CTATGCCGGT TTCTACGCGA TCCTGCGCAG CGCCTTCGAA
GTGCCGCTCG ATACGATCCC GGCAAAGTTC GTGCCGGTGG CGGAACTGGA CAAGATCGCC
GCAAAGGCGC GCCAGGGCCA CCGCGTCCCC GCCTTTGCCT TCCGCCAGGC GGGGCTGGCG
CTCGGCAACG GGCTGTCGCG CATGCTGAGC TTGACGGAGC GCATGCCGAT CGCCATCACC
GGGCCGGGCA CGCGTTATTA CGACCTTCTT CGGCAAGGGA TCGAAGAGGG TCTCGGGCAG
TCGCATATTG TGCGCATGGA AGGCATGCCC GAGATCAGGG TGGTGGCCGA CGAGCAGATC
CTCGTCTTCG AAGGACATCT GAACCGGGCG CTGTCTGTCA TCGACGAGGA TATCGTTCTC
TCGGGCGTTC AGGGAATCCA GGCATCGGCG ATTATTCAGG AATCGGGTTG A
 
Protein sequence
MLTKSSTELV RQRNSVLVLS VLRRHGALAH TEISDFTGLS SATISAITTE LEKAQIIEKS 
EHQPASGRGR PRVLLRQRRD CGYLIVVIIS SDAVQYSLVD YAGKLIDRFS EERSHDPAGA
ARFVAAVRAG LLRILDRSKI SQEKVLLISI SSKGLVNSTE PVLVWSPIFG SDQIDFELAL
RPEWQAKVIL DNETLLVAAA LGAREEMVKG ADFRSLAALS LGHSVGLGIV RRGNQTGQEI
SAPNFGHMLH MANGGLCRCG TRGCIEAYAG FYAILRSAFE VPLDTIPAKF VPVAELDKIA
AKARQGHRVP AFAFRQAGLA LGNGLSRMLS LTERMPIAIT GPGTRYYDLL RQGIEEGLGQ
SHIVRMEGMP EIRVVADEQI LVFEGHLNRA LSVIDEDIVL SGVQGIQASA IIQESG