Gene Rleg2_5286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5286 
Symbol 
ID6978380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp908844 
End bp909857 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content62% 
IMG OID643394390 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002279208 
Protein GI209547290 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.371656 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.135191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACCGG AGCAGCATCG ACAGGGAGAC GGCGTCACGA AATCCGGTGC CCGGCTGAAG 
GTGGGGTTCG TGCTGTCGCG GTCGTTTACG CTGTCGGCCT TCGCACTCTT CGTCGACACG
CTGCGGCTTG CCAGTGACGA GCAGGATCGG TCCGGAAGGG TGCTTGCCGA CTGGCAGGTC
ATCGGCAGCA CGCGGCATCT GATCACCTCA AGCTGCGGCG TCCAGGTTGC TCCGACCTCC
GATTTCGTCG ATCCGCTGAA ATTCGACTAT ATCGTCGTCG TCGGCGGGCT TCTGACCGTG
GAAAACCCTG TCGACCAGCA GACCATCAAT TTTCTCAGGC AGGCGGATGC CAAGAAGGTG
CCGCTGATCG GCGTCTGCAC CGGGTCGTTC ATTCTTGCGG CCGCGGACCT GATGAAGCGG
CACGAGTCCT GCGTGAGCTG GCTGCATTAC AAGGAATTTC GCGAGCGGTT TCCCGACCTC
GGCGTTCGGT CCGACCGGCT TTTCAATCTC GACCGCCAGC GCGGATCCTG CGCCGGCGGC
AGCAGTTCGG CCGACATGGC GGCGCTGCTG GTCAGGAAAT ATATCAGCCG GGATGCCGAG
CGAAATGCGC TTGAGGTGCT TCAGATCGAG AAGGCCCGGG CGCCGGCGGA CATCCAGCCC
CGCCGCCCGC TGTATGACGA CTATGACGAC GCCCGCGTCA AGGCGGCGAT GATTACGATG
GAACAGTTCG TCGACGGCAG CATATCGATC CAGAAGCTTG CCGGCATGGT TGGGCTGTCA
CGGCGGCAGC TGGAGAGAAT TTTCATCGAC AAGACGGGAA TGTCTCCCGC CAAGGCCTAT
AATCGGGTCC GCATGGAGCG GGCAAAATCG ATCCTGGTCC AGTCGAAGGC GCCGCTTATC
GAGATCGCGC TCGATGTCGG TTTCGAAAAC GCCTCGCAGT TCACGCGAAC GTTCAAGCGG
ACCTTCGGGC AGACCCCGTC GCAGCATCGC GCGGCAGCTT TAAGAGCACA CTGA
 
Protein sequence
MRPEQHRQGD GVTKSGARLK VGFVLSRSFT LSAFALFVDT LRLASDEQDR SGRVLADWQV 
IGSTRHLITS SCGVQVAPTS DFVDPLKFDY IVVVGGLLTV ENPVDQQTIN FLRQADAKKV
PLIGVCTGSF ILAAADLMKR HESCVSWLHY KEFRERFPDL GVRSDRLFNL DRQRGSCAGG
SSSADMAALL VRKYISRDAE RNALEVLQIE KARAPADIQP RRPLYDDYDD ARVKAAMITM
EQFVDGSISI QKLAGMVGLS RRQLERIFID KTGMSPAKAY NRVRMERAKS ILVQSKAPLI
EIALDVGFEN ASQFTRTFKR TFGQTPSQHR AAALRAH