Gene Rleg2_0116 details


Gene Information

Locus tag: Rleg2_0116
Symbol:
ID: 6978826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 108739
End bp: 109794
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 63%
IMG OID: 643394827
Product: transcriptional regulator, AraC family
Protein accession: YP_002279644
Protein GI: 209547727
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
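
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record, but the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


start_bp, end_bp = 108739, 109794  # values from the record above
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1056 351
```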


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.615337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCAG ACGCCAACCT TTCAGATGCA CCTGCCATCG AGCCGGACTA TGTCCGCTGG 
CTCGATGGTT TGTGGGCCGA ACGGCTTTTG GCCGCCCGAG ACCGCAAAGC CGATCTCTCC
ATCGGCATTC TGCTCTGGCC GAGCTTCCCG ATGATGTCGC TGACCGGGAT CGTCGAGCCC
TTGCGGCATG CGGCCGATTT CGCCGACAAC TCCCGGCCTC TGCATTGCCG CTGGTCGATC
ATGGGAGCGC CGGGCCACGC CGCGGTGGCG AGCTGCGGCA TCCGCGTGCA GGCAGATGCG
CCCTATATCA ACCCGACGGA CTTCGACTAT ATCGCCGTCA TCGGCGGCCT CCTGCCGCAT
CTGCGCGCGG CACCCAGCAA GCATCGCGAC TACCTGCGGG TCGCCGCATC CGCCGGAGTC
GCGGTGATCG GTGTCTGCAC CGGCGTCTTC GTACTCGCCC AAGAAGGTCT GCTGACCGGG
CGCAAAGCCT CCGTTCATCC CTTCCATGCC GAGGATTTCA AGATCGCCTT TCCGCGCCAG
GCATTCTCGA CGCGCGACGA CTTCCTGATC GAGAACGGCC GCATCACCGT TCCCGGCGGC
GTCTCGATCC TGTCGCTGAT GACGGAACTC ATCGGCACCC ATTGCGGTCC CGACCGGGCG
GCCAAGGCAG TTCACCAGCT GTCTCTGACC GAGCACAAGG GCCTGAGCGC TTTCGACCAA
GGCCGGGTTT CCAGCTTTCG ACATGTCGAG GATTCCCGCA TTCAGCGCGC GGTCGTTTTG
ATCGAGAGCC GCAAGGGCCG TGACGTATCA CCCGAGCAGG CGGCCAGCAT GATCGGGCTG
TCGCCGCGTC AGTTCGGGCG CCTGTTCCAG CAAGGCATCG GCATGACTCC GAAGCGGTTC
ATCATCGAGA CCCGCCTGCG TTACGCCCGC TTTCTGGTAG AAAACAGCAC GCTCTCGATG
ACGCAGATCG CCTTCGAGAC CGGCTTTTCG GATGCGGCAC ATTTCGCAAC CGCTTTCCGC
CAAAAATTCA ACCAGTCGCC GCGACAATTG AGATAA
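
The gene sequence is printed in blocks of 10 bases, 60 per line, so it has to be joined before any analysis. As a minimal sketch (the helper names are hypothetical, and only the first line of the sequence is used here for brevity), the following strips the layout whitespace and computes a GC fraction that could be compared with the GC content field in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as shown above.
first_line = "ATGAACGCAG ACGCCAACCT TTCAGATGCA CCTGCCATCG AGCCGGACTA TGTCCGCTGG"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(round(gc_fraction(seq), 3))    # GC fraction of this line only
```

Passing the full gene sequence through `clean_sequence` should give a string of 1056 bases if the record is internally consistent.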
 
Protein sequence
MNADANLSDA PAIEPDYVRW LDGLWAERLL AARDRKADLS IGILLWPSFP MMSLTGIVEP 
LRHAADFADN SRPLHCRWSI MGAPGHAAVA SCGIRVQADA PYINPTDFDY IAVIGGLLPH
LRAAPSKHRD YLRVAASAGV AVIGVCTGVF VLAQEGLLTG RKASVHPFHA EDFKIAFPRQ
AFSTRDDFLI ENGRITVPGG VSILSLMTEL IGTHCGPDRA AKAVHQLSLT EHKGLSAFDQ
GRVSSFRHVE DSRIQRAVVL IESRKGRDVS PEQAASMIGL SPRQFGRLFQ QGIGMTPKRF
IIETRLRYAR FLVENSTLSM TQIAFETGFS DAAHFATAFR QKFNQSPRQL R
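
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a plain-Python illustration rather than the method used by the source database; it builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons). The function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate(clean_sequence("ATGAACGCAG ACGCCAACCT TTCAGATGCA")))  # MNADANLSDA
```

The output matches the first ten residues of the protein sequence listed in this record.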