Gene Rleg_3207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3207 
Symbol 
ID8014101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3209393 
End bp3211240 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content60% 
IMG OID644825768 
Producttranscriptional regulator, SARP family 
Protein accessionYP_002976995 
Protein GI241205899 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTCATC TGCAGACATT TGGCGATCTG CGGTTGATCG AGACGAATGG CGACCAGGTT 
CGCTATCCGA TCAAGGGCCT GTTGATGATG GCCCATCTCT ATGCCGGCGC GAGCCATGAG
CTCAGCCGGT ACGAATTGGC CCAGTTTCTG TGGAATGATG TCGAACCAGA GCTGGCGCGG
CTCAATCTGC GCAAACTGCT TTCCCGCATC CGCGAGACAG ACGGCGGGCG CGCCGAATTG
CCCTTCGATT TCACCGCCAC CACGGTTCGT CTCAACACGC AGGCAGTTTC CAGCGATCTG
GATATTTTTC GCGCCGGCGG TTCGCCGCTG GAGCGGCTGA ACGCAATCGC CGAATTAACC
CAGCGCGGGT TCATCGGAAA TATCAAACCA GCGACAAAGC TGATAGACGC CTGGATCAGG
GCCCAACGGG ACGCTCAGGC GCTGCAGTTG CGGCAGGCAC TGCTGGACGC GTTGCCGGAT
GCGCAAAAGC CCGGCGCTAC GCGCTCCATT TCCGGCGCCG CTCTGCAGAT CCTCGAACGG
GATCCGAATG ACGAGCAGGT CAGAGCCTTA CTGCACCGGT TATCCGGCGG TTCGTCCCTC
AGCGAAAGAT TGCCGAACGA TGACGGCCAT GCCAGGGTAC AGGTGAAGCG TTCCGAGGCC
GGCGGTGAGG CAACTGCCGA CATCGAACGC ATGTCGCCAG CGCCGCTGAT CCTGCCTCGA
CTGGTGCTGC TTCCCCCGAC ATCGAAACAT GCCGATGCCG GGCTTGCGCT TGCCAATGCC
CTGATCGAAG ACGTCACGAT CGAACTCTGC GCGCTCAGAA ACATTTCCAT CGTAGCGCCC
CATACGGCCG GCCAGATCCG CCGCGATTCC GAGAAGGCTG CTGTGGTCGC CCGTCACTCG
ATCGCCTATC TTCTGGATAC ACGGCTTTCC GAAGAGGGAT TGTTCGCCCA GCTGGTTTAT
TTTCCCACGG ACGAGATCAT CTGGGCCAAT CGCTTTACGA TGACGCCCGA TATATTGCCG
CGTCAGAGAC GGCTGATTGC GCAGCAGTTG ACTATGTCGG TGGCCCGCGA GCTGGCTGAA
AACGAGGAAG AGCGCTTACG CTTCGAAGCC AATCCTGAGG CCTATCACGC CTATCTGGTC
GGCTCGAGCC TGATGAGCAA ATTGACATTG CCGCATATCC GCCGGGCACG AAAAGCTTTC
AAGCAGTCGC TGTCGCACAA GGCGGATTTC TCTCCGTCAT TTACCGGGCT GGCGAGAACC
TTCACCAGCG AGTGGCTCGT GACGGCGCAG GGTAACAATG AGCTGCTGCA TCTGGCGGAG
CAGAACGCGC TTCGGGCCAT CGAGCGGGAT CCGGCCTCGG CGGCGGGCCA TCGCGAGCTT
GGCGTCACCA AACTCTATCT CGGCGATGTC GATGCCAGTG TCGCCGCACT TGATCTGGCG
GAACAACTCA GCCCGCATTT TGCCGATGTC ATCTACAGCC ATGCCGACAC GCTCGTGCAT
GCATCCCGCC CTGGCGACGC GCTCGCCAAG ATCAGAAGAG CGATTTCGCT CAATCCGATC
GCGCCTGACG CCTACCTCTG GTGCGCTGCG GGAGCGAGCT TCTTCCTGGA ACAGTACGAG
GAGGCCATCG CCTATGTCGA GGCGATGAAA GACAAGGCGC CGGCCCATCG CATCGCCGCG
GCCAGTTGCG CAATGATCGG CGATCGAAAA CGGGCGCTGT TCCATCGGCA ACGAGCCGAA
AGCATCAACC CGGTCTTCGA CGTCGAGAAG TGGCTCACCA TCGTTCCCTT CAAGGAAGAT
TGGCAAAAAG AGCTTTATCG GGAGGGCCTG CTGAAGGCCG GTTTTTAA
 
Protein sequence
MLHLQTFGDL RLIETNGDQV RYPIKGLLMM AHLYAGASHE LSRYELAQFL WNDVEPELAR 
LNLRKLLSRI RETDGGRAEL PFDFTATTVR LNTQAVSSDL DIFRAGGSPL ERLNAIAELT
QRGFIGNIKP ATKLIDAWIR AQRDAQALQL RQALLDALPD AQKPGATRSI SGAALQILER
DPNDEQVRAL LHRLSGGSSL SERLPNDDGH ARVQVKRSEA GGEATADIER MSPAPLILPR
LVLLPPTSKH ADAGLALANA LIEDVTIELC ALRNISIVAP HTAGQIRRDS EKAAVVARHS
IAYLLDTRLS EEGLFAQLVY FPTDEIIWAN RFTMTPDILP RQRRLIAQQL TMSVARELAE
NEEERLRFEA NPEAYHAYLV GSSLMSKLTL PHIRRARKAF KQSLSHKADF SPSFTGLART
FTSEWLVTAQ GNNELLHLAE QNALRAIERD PASAAGHREL GVTKLYLGDV DASVAALDLA
EQLSPHFADV IYSHADTLVH ASRPGDALAK IRRAISLNPI APDAYLWCAA GASFFLEQYE
EAIAYVEAMK DKAPAHRIAA ASCAMIGDRK RALFHRQRAE SINPVFDVEK WLTIVPFKED
WQKELYREGL LKAGF