Gene Rleg_6507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6507 
Symbol 
ID8017169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012854 
Strand
Start bp221200 
End bp222234 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content67% 
IMG OID644828296 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002979496 
Protein GI241554283 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.739448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.944873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA TCGAGCGACG CATGATCGCG CCGGGCTTCG TGGAGGAGGC ACTCGACAGC 
CTGCGGCGGC TCGGCAAGCC GACGGCGCCG GTCCTTGCCC GCGTCGGCCT GGCATCCCCG
GTCGATCAGC CGGTTTCGGC AGAGACCTAT GGCGCGCTCT GGCTGGCGAT CGCTGTCGAG
CTCGACGATG AATTCTTCGG CATGGGGGGT CGACCGATGC GTAGCGGCAG CTTCACGCTG
CTTTGCCACT CTGTGCTGCA TGCGCCGACG CTGGGTCACG CTCTGCGCCG GGCGCTGCGC
TTCCTCGATG TCGTGCTCGA CGACCCCCGG GGACGGCTCG TCATCCGCGA CGGTCTGGCC
GAGATCGAAC TGAGCGATGC CGGCGGCCCG CGTTCGGCCT TCGCCTACCG CACCTACTGG
ATCATCCTGC ACGGCCTCAT CTGCTGGCTG GTCGGCCGGC GCATTCCGAT CCGGCTCGTC
GATTTCCGCT GCGCGGAACC CAAACAAGGG GCCGACTACC GGCTATTCTT CGGCGCCCCT
GTGCGTTTCT CGCAAGCTAT CAGCCGGCTC GGCTTCGACA GTGCCCTGCT CGACCTGCCG
GTGGGGCGCA GCGAACAGGC GCTCAAACAG TTCCTGCGCG GCGCGCCCGC CAATATTCTA
GTGCGCTACC GTTATGATGC GGGCATCGCC GCCGGCGTCC GCCGTCGCCT GAACCAGGCG
ACGCCTGCCA TGTGGCGGAG TTTTGCCGAG CTTGCCGCCG ATATGCGCAT GCCGCCCTCC
ACACTCCGCC ACCGCCTGCG CGACGAGGGA CAAAGCTATG CCGCCATCAA GGACGACATC
CGTCGCGACC TCGCCGTCGA ACTGCTGCTG AACACGACGA TGACCATCGG TGAGATTGCC
GTGCAGCTTG GCTATTCCGA GCCGAGCGCC TTCTTCCGGG CCTTCCGCAA ATGGGTGGGC
AAGAGCCCCG AAGCCTTTCG GCGGGACGGA GCGGAAATCG AGGGCATGTC AGTCGAATCT
GTTGACCCGC CCTGA
 
Protein sequence
MAEIERRMIA PGFVEEALDS LRRLGKPTAP VLARVGLASP VDQPVSAETY GALWLAIAVE 
LDDEFFGMGG RPMRSGSFTL LCHSVLHAPT LGHALRRALR FLDVVLDDPR GRLVIRDGLA
EIELSDAGGP RSAFAYRTYW IILHGLICWL VGRRIPIRLV DFRCAEPKQG ADYRLFFGAP
VRFSQAISRL GFDSALLDLP VGRSEQALKQ FLRGAPANIL VRYRYDAGIA AGVRRRLNQA
TPAMWRSFAE LAADMRMPPS TLRHRLRDEG QSYAAIKDDI RRDLAVELLL NTTMTIGEIA
VQLGYSEPSA FFRAFRKWVG KSPEAFRRDG AEIEGMSVES VDPP