Gene Rleg2_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1684 
Symbol 
ID6980421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1713611 
End bp1714786 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content65% 
IMG OID643396408 
ProductROK family protein 
Protein accessionYP_002281198 
Protein GI209549281 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.805279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGA TCAATCGCTT GAACGTGCTC GATACCATCC GGCGCCATGG TCCGATCTCG 
CGCATCGAGA TCAGCGAACG CACCGAGCTT TCGACCACCA CCGTTTCCGC CATCACCGCC
TCGCTGCTCG ACGACGGGCT GATCCTGCCG CGCCATGAAG GCGATATCCG CAACGAGGCG
GTGCGCGGCA GGCCGCGCGT GGCGCTGGAG CTCAACCCGG ACGCCGCCCG CGTCGTCGGC
GGCAAGATCG CCGCCAATAG GATGGTCTTC GTCGTAACCA ATTTTCGTGG TGACGTGCTG
TCGAAACTCT CCCTGCCGAT CCGCATCGAC CGGCAGCCGA TCGGCGTCAT CGCCGATCTC
GTCGAGGATG GTGTGCGGCG CTGCGTCGTC GATGCCGGGC TGTCGCTTGA AGATGTCGAC
AGCGTCTGCC TCGGCTTTCC CGGCGTCATC GAGCACCGCA CCGGTTACAT AAGAAGCAGC
CCGATCTTTC GCGACACCAA CGTCGATTTC GCCGCCGAAA TGTCGACGCG GCTGTCGACG
CCGACGATCG TCGAGAGCGA CGCGCATGCC ATCACGCTGG GCCATCACTG GTTCGGGAAG
GCGCGCGATC TCGAGGATAT GGTGTTGATT TCGCTGGAAC AAACGCTGGG GCTCGGCGTG
CTGCACGGCA ACAGCCTGTT TCGCGGCGCC GGCGGCCTCA GCCACAATCT CGGCGACCTG
GTGCTCGGCA TGGGACCAAA CGGCGTGATC CGGCTTTTCA GCCAGGCCGG CGAAAGCGCC
ATCCTCGGCG AACAGCCGAC CGACGGGCGT TTTGCCGAAG CGATCCGGCT CGGCCGCGGC
ATGACCCATG CCCAGGCGCT GATCAAAGCG GATGACGACC GGCTGATCGG TGCGGCCATC
CGCGCCGGCG AAGCGGTGGG GCTGACCATC GCCAATATCG TCACGCTGTT TGCGCCGCCC
CGCGTCATTC TTGTGGGGTC GAGCCTGGCG CTCGGCGAAC CCTTTCTGAA CAGCCTGCGC
GATGCCTATG CGCTCGCCAT TCCGCCCTCG CTGAAGGGGG TGAGCGAACT CGTCTTCGAC
GATTCGAGCG ATGATTTCTG GGCGCAGGGC GCGGCTGCCG TGGCGCTATA CGAGCTTTAC
GAATCGCCCT GGAGCACCAC CGGGCCGGCG CTCTGA
 
Protein sequence
MRAINRLNVL DTIRRHGPIS RIEISERTEL STTTVSAITA SLLDDGLILP RHEGDIRNEA 
VRGRPRVALE LNPDAARVVG GKIAANRMVF VVTNFRGDVL SKLSLPIRID RQPIGVIADL
VEDGVRRCVV DAGLSLEDVD SVCLGFPGVI EHRTGYIRSS PIFRDTNVDF AAEMSTRLST
PTIVESDAHA ITLGHHWFGK ARDLEDMVLI SLEQTLGLGV LHGNSLFRGA GGLSHNLGDL
VLGMGPNGVI RLFSQAGESA ILGEQPTDGR FAEAIRLGRG MTHAQALIKA DDDRLIGAAI
RAGEAVGLTI ANIVTLFAPP RVILVGSSLA LGEPFLNSLR DAYALAIPPS LKGVSELVFD
DSSDDFWAQG AAAVALYELY ESPWSTTGPA L