Gene Rleg2_5364 details

Gene Information

Locus tag: Rleg2_5364
Symbol:
ID: 6978458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 993911
End bp: 994930
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 59%
IMG OID: 643394466
Product: transcriptional regulator, AraC family
Protein accession: YP_002279284
Protein GI: 209547366
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
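
The start, end, and length fields above agree with each other if the coordinates are 1-based and inclusive, and if the CDS ends in a stop codon that is not counted in the protein length. Both conventions are assumptions here, not statements made by the record. A minimal sketch of the arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(993911, 994930)
print(length)                     # 1020
print(protein_length_aa(length))  # 339
```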


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0980026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTACG AGAACACTGC GCGCGCTCCT GAGCGCGAGG ATTATGAGTT CGCACCGACG 
CATCAGGGCC ACGACTTTGA GTCGATGGTT AAGACATTGT CGACCGGTTA TGGGGTTTTT
GCCGCACAGC CGCTAGGCAA CGATCGAAAC TTCCGCTGGG CGGCGGACTT GAGAAGAGGC
GATGGGTTTA CGGTCCTTCA CTCCGTCTTT CGAAGTTCCT GGACAGTTCG AACGCTCAAT
GAGACGCCCC AACACCTTGC CTTCTACCTG CCGCAGACCG GGTCCTTTCG AGTGTCCATC
GGGAAAAGGG CGGTTGAAAG TGGAGCGGGT TATCTCCTTA TGGCCAACAA CCATGAGGCC
GGCGATCGCT TCGTCCAGGG CCCGCACACT TCGGATGTCC TCTTCCTCGA CTGGAATATC
GTAAAGCGTG TGCTCATTTC ACTTGCGGAA ACGCCACTTT CCGAGTCGCT CAACCTTGAG
CCCATCCTGG ATCTTTCGAC ACAGTCGGGC CAGCTTATCG GCAACCTGGT GCAGACGATC
GTGCAGGGCA CGCGCAACAG CGGTCCGCTT CTGTCTTCCC CTCTCGCCAT GGCGACGATG
AGTGAAACGC TCGCTCATCT TGTCATCCGA TTTGGCCACC ACCGCCTGTC CGGCCATCTG
GAGAAAAAGA AAGTGTCGTT GGTTGCGCCT TGGCATGTCC GGCGTGCCAT CGACTATATG
CATGCCAATA TCGCAGAGCC TCTCACCATG ACGATGGTTG CGGACGATGT CGGCGTTTCA
CTTCGCGCGC TGCAGACGGG TTTCAAGGCC TTCAGAGGAA CCTCGCCGGC TGGTTACCTG
CGTACGATCC GGCTCCAGGC GGCCCGTGAG CAACTGCGGG ATCCAACGAA CCAGCGATCC
GTCCGCGAAA TCTGCGCGAT ATGGGGCTTC GCACATGCGG GCAGGTTCTC CATCATCTAT
CGCAGCACCT TCGGCGAAAG CCCGCGCGAT ACGCGCCTAC GGGCAGAGCG CTTGCGTTAG
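
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence is measured or analysed. A minimal sketch of that step and of a GC-fraction calculation follows. The function names are hypothetical, and the example input is only the first line of the sequence, so the value it yields is not the 59% reported for the whole gene.

```python
def collapse_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the characters of seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only, used as a short illustrative input.
first_line = "ATGAGGTACG AGAACACTGC GCGCGCTCCT GAGCGCGAGG ATTATGAGTT CGCACCGACG"
seq = collapse_blocks(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```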
 
Protein sequence
MRYENTARAP EREDYEFAPT HQGHDFESMV KTLSTGYGVF AAQPLGNDRN FRWAADLRRG 
DGFTVLHSVF RSSWTVRTLN ETPQHLAFYL PQTGSFRVSI GKRAVESGAG YLLMANNHEA
GDRFVQGPHT SDVLFLDWNI VKRVLISLAE TPLSESLNLE PILDLSTQSG QLIGNLVQTI
VQGTRNSGPL LSSPLAMATM SETLAHLVIR FGHHRLSGHL EKKKVSLVAP WHVRRAIDYM
HANIAEPLTM TMVADDVGVS LRALQTGFKA FRGTSPAGYL RTIRLQAARE QLRDPTNQRS
VREICAIWGF AHAGRFSIIY RSTFGESPRD TRLRAERLR
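
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as starts), and the names `CODON_TABLE` and `translate` are hypothetical. Only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "ATGAGGTACG AGAACACTGC GCGCGCTCCT GAGCGCGAGG ATTATGAGTT CGCACCGACG"
print(translate(first_line))  # MRYENTARAPEREDYEFAPT
```

The output matches the first 20 residues of the protein sequence listed above.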