Gene Rleg_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0049 
Symbol 
ID8011296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp49014 
End bp50003 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content59% 
IMG OID644822639 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002973899 
Protein GI241202803 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.80049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAA ATGCATCCGC CAATTTCACC CCCCGATTTC AGGGGGCAAC TTTCGACGAT 
ATGGTCGGGG CTCTGACCAA GGGGTTTGGT TCGTTCGACG CATGGCGCGA TGGTCGAGAC
AAACCGCTCG ACTGGAAGGT CGGGTTTTGG GGTGATGAAA GCTTGTCGCT GGTCAGCAAC
CAGGATTCCG GCGGCTGGGG TGCAAGAACG GCCCATGGAA CGCCGGAAAC TCTGGCAATC
ATCGTTCCAC GCACTGGTGC TCTCGATGTA ACGCTTGGTC GGTCTGTGAT CGAAGGGACA
CCTGGGCGGC TGCTGTTGGC GAACAATCTT GAGCCGGAAC GGATTTCCGT GCGGGCAGCG
CCGCACCGGT CAGACACACT GAGTCTGAGC TGGACAATCA TCGCTCAGAC CGTTGCCTCC
GTATTGGAGA CCCCCCTGAT CGGGGCAATG GACTTGGCAC CGGTCATTGA TCTTTCCACG
GCGGCGGGCC GGTTGATCGG CAGTCTCGCC CAAACGATCA TCATCGGCAT GCGCAACAAC
GGGCCGCTCC TTGCCTCGCC AATCGCCATG TTGAACCTGA CCCAGGCGTT TGCCGATCTG
CTGGTGCGCT CGGTTCCTCA TCGGCTCTCG CATCTTCTCG ACAGGAAAAT CCATCTGATC
GCACCCCGGC ATGTCCGTCG GGCAATTGAA TTCATGCATG CAAACATCGC AGAGCCGTTG
ACGATGCAGA GTGTCGCGGA GGCGGCAGGC ATTTCCATTC GCGCTCTTGA AAGCGGTTTT
CGTGCCTTCA AGGGAACCAC ACCGGCTGCC TACCTCAGGA CGATCCGCTT GCAAGCGGTC
CGGGAGGATC TGCGCGATCC TTCAAACCGG CAGCCGTTGC GGGACATCTG CTTGAAGTGG
GGATTCTTTC ACTTTGGCCG CTTTGCGGCA ACCTATCGGG CGGCGTATGG AGAGAACCCG
TCGGACACCA GAAGGCGATC CGATCACTAG
 
Protein sequence
MPENASANFT PRFQGATFDD MVGALTKGFG SFDAWRDGRD KPLDWKVGFW GDESLSLVSN 
QDSGGWGART AHGTPETLAI IVPRTGALDV TLGRSVIEGT PGRLLLANNL EPERISVRAA
PHRSDTLSLS WTIIAQTVAS VLETPLIGAM DLAPVIDLST AAGRLIGSLA QTIIIGMRNN
GPLLASPIAM LNLTQAFADL LVRSVPHRLS HLLDRKIHLI APRHVRRAIE FMHANIAEPL
TMQSVAEAAG ISIRALESGF RAFKGTTPAA YLRTIRLQAV REDLRDPSNR QPLRDICLKW
GFFHFGRFAA TYRAAYGENP SDTRRRSDH