Gene Rleg_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1052 
Symbol 
ID8012181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1027627 
End bp1028571 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content61% 
IMG OID644823635 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002974886 
Protein GI241203790 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.2685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.950103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCGT TATCCGATGT TCTCGCATTG CTCAAACCGC GCAGCTATGT TTCCGCGGGG 
CTTGACGCTG GCGGTGCTTG GGCGATCGAT TTTCCTCCCC CTGACGGCAT CAAGTTCAAC
GCAGTGATTT CAGGCGCGTG CTGGCTGAGC GTCGATGGCG TCCCCGAAGC TGTCCGCCTG
GAGGAAAGCG ACTGCTTTCT GCTGACGAGC CGCCGAGCCT TTCGTCTCGC CAGCGATCCG
GCTCTCGAAG CGATCCCGTC CGATGCGATC TATTCGATCG CCCGCGACGG CATTGCGACT
TGCAATGGCG GCGGCGATTT CTTCCTGATC GGCAGCCGTT TTTCCTTTTC GGGAGGAAAT
TCGGACATCC TTCTCGGAAT CCTGCCGCCG ATCGTCCACG TGAAGAGGGA TTCCGATCAC
GCCGCCGTGC TGCGCTGGTG TCTCGATCGG ATGACGCGCG AATTGCGCGA CCAGCAGCCG
GGTGGCTTTC TGATGGCGGA GCATTTTGCT CATGTTATGC TCATGCAGGT GTTGCGCCTC
CATATCGCAT CGCCGAATGC GCGCGGCGTC GGCTGGCTTT TCGCGCTTAC CGACCGGCGG
ATCGGTGCGG CTATCGGTGC CCTGCATGCC GATCCAGCCC GCAAGTGGAC GCTGCAGTCA
CTGGCTGAAC GTGCCTCGAT GTCGCGGTCC AGTTTTGCTC TCCACTTCAA GGAAAAGGTC
GGGCTTGCGC CGATGGATTA TCTGACGCGC TGGCGCATGC TTCTCGCCGG TGACCGATTG
ACAAACTCGG CCGAAGCAAT TGCCGGTGTC GCCCTGTCGC TCGGCTATGA ATCCGAAAGC
GCATTCAGCA CCGCTTTCAA GAGAGTGATG GGATGCTCGC CGCGGCAATA TGGTCGCGCT
CATCCGCCCG CCGGTGCCAT GCGAGATAGT CTCGGCGCTC AGTGA
 
Protein sequence
MDPLSDVLAL LKPRSYVSAG LDAGGAWAID FPPPDGIKFN AVISGACWLS VDGVPEAVRL 
EESDCFLLTS RRAFRLASDP ALEAIPSDAI YSIARDGIAT CNGGGDFFLI GSRFSFSGGN
SDILLGILPP IVHVKRDSDH AAVLRWCLDR MTRELRDQQP GGFLMAEHFA HVMLMQVLRL
HIASPNARGV GWLFALTDRR IGAAIGALHA DPARKWTLQS LAERASMSRS SFALHFKEKV
GLAPMDYLTR WRMLLAGDRL TNSAEAIAGV ALSLGYESES AFSTAFKRVM GCSPRQYGRA
HPPAGAMRDS LGAQ