Gene Rleg_0210 details

Gene Information

Locus tag: Rleg_0210
Symbol:
ID: 8011437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 221486
End bp: 222634
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 61%
IMG OID: 644822803
Product: transcriptional regulator, AraC family
Protein accession: YP_002974060
Protein GI: 241202964
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.278356
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.347271
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACAG GGTCTGAATC CGACGATGAT CCTGCCGCGG AACAGGAACG GCCGGCGCAT 
ATGGAGCGGC GGCGCTGGCC GCGTGAACCG GCCATGCGCA AGGAACGCCC GCCCCATGCC
TCCCCTGCAC TCGTGCCGCT GCGATTCTCG ACGCAGGACC TACCACCGGC AGAACAATTC
CAGGCCTGGC GGGCACATAT GGCGCCGCTC GTGGATGTTC ATCTGCCGGA GGGAAAATCA
CCGGAAGACG GGTTCCTCGC GGAGCAGATC GGCTGGCATC TCGGCGATAT CCTGATCGTC
CAGCAGCGCG CGCATGCCCA CAGATATGTC CGCGATCAGG CCATGCTTCG ATCGAGCCCT
ATCGACCATT GGAACGTCGG CCTGCAGCGC AGCGGCCAGG CCTGGACCGA GGTCAACCGT
CGTGTCACCG AGACCGGTGC CGGCGAGATA TTTTTCATGT CGCTCGGCAG CCCCTATCGC
GGACGGACGA CCGATACCGA AGCTTTGCTC GTGTTCCTGC CGTATGAGAT GCTGGCTCGC
GATGCGAGCC TTCTCCAGAG CGCCGGCAAC ACGGTTCTTT CGGGCAGCCA CGCCGAGTTG
CTCACGGGCT ATCTCACGGG CCTCGAAACG AACCTCGGGA ATCTGACGAT AGAGGAAGTG
CCCCGGATCA TTCAAACCAT CGGCGATATG GTCGTTGCGG GCGCCGCATC GTCCACAAGG
ACTGATACCG GTCAAAACCA GGCCAACATG GGACTGATGG AGCGGGCGCA CCGCTACATT
CACGTCAATC TCCATTCGGA AAACTTGACA CCGGACGTGA TGTGCCGCGC GTTGGGGATT
TCGCGAACGC GGCTTTATCA ACTGTTTGAG GCGAGCGGAG GCGTGCTCAA TTATATTCGC
AAGCGGCGAT TGCTGCAGGC CTATGCGGAT CTCAGCAATT CGGCCGACCA CAGGCCGATT
TCAGAAATCG CAGAAGCCGC CGGTTTCGAG GTCGCCGCCA ATTTTACGCG CGCCTTCATC
CACGAATTCG GGCTGAGCCC GCGTGAAATC CGAAGGACGA TGGCAACCCA ACAGCGGCCG
GCTCCGGCCA TCCGCTCCAC GCGGCGTTAC GAAAAAACGA TCGGTGATTG GCTGGCCCTG
ACGGGTTGA
 
Protein sequence
MATGSESDDD PAAEQERPAH MERRRWPREP AMRKERPPHA SPALVPLRFS TQDLPPAEQF 
QAWRAHMAPL VDVHLPEGKS PEDGFLAEQI GWHLGDILIV QQRAHAHRYV RDQAMLRSSP
IDHWNVGLQR SGQAWTEVNR RVTETGAGEI FFMSLGSPYR GRTTDTEALL VFLPYEMLAR
DASLLQSAGN TVLSGSHAEL LTGYLTGLET NLGNLTIEEV PRIIQTIGDM VVAGAASSTR
TDTGQNQANM GLMERAHRYI HVNLHSENLT PDVMCRALGI SRTRLYQLFE ASGGVLNYIR
KRRLLQAYAD LSNSADHRPI SEIAEAAGFE VAANFTRAFI HEFGLSPREI RRTMATQQRP
APAIRSTRRY EKTIGDWLAL TG
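
The listed lengths are mutually consistent: 1149 bp / 3 = 383 codons, and dropping the final stop codon (TGA) leaves 382 aa. A minimal Python sketch for checking the length and GC figures against a pasted sequence is given below. The function names are illustrative and not part of the source database, and the sketch assumes the sequence is a complete CDS ending in a single stop codon.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(dna: str) -> int:
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return len(dna) // 3 - 1
```

Passing the full gene sequence above to `clean` and taking `len` of the result should reproduce the Gene Length field, and `gc_percent` can be compared with the rounded GC content field.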