Gene Rleg_1549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1549 
Symbol 
ID8012630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1528342 
End bp1529292 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content61% 
IMG OID644824135 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002975377 
Protein GI241204281 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0143415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00302933 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGACCG ATATCGACAT CAAGGTGAGC TTGGGCAGGA TACGTGACGT CCTCGAAACC 
ATCGACGACT GCCTGCACGT CGAGCACTAC AGCGTCGCCG ACCCGAACAT CTCACTTGCA
AAGCTGACAC GCGAGAGCGG AAATCGGGCG ATCACCACCT CGCTGGGTTC CGGCGTCGGG
ACGTTCGTCG TTGCGGCCCG CGATTCCGAG CTGCGCCTTG AGCAAAAGGG GCGTGTCCTG
TTTCGGGGAC TGATGGTCGA GGACAGCGTC ACCTTCATTG AAGGCCCCGA CCCCGTCGCC
GTCGAGCTGC AGTCCGACGC CTGTCTCTAT GTCCTCACTT TTCTACGAGA GCCTCACAGG
GGGCAAAGCC CCGTTGTGGG AGGACGATTC AACATCCGCG ACATCCGCCT CGCGGAGCTG
ATCGGTGAGA TCTCAGAAAG TTCGCCCGAG CGACGATGCG AACTCACCCG AGACTTCTCG
CGGCGGGCGT TCGAACACTA TCGATCGCTG CCACGCGTTC ATCCCCTCTC GGCATGGCGG
CTCAGTCGCG TCAGACGATA TGCGGAATCC CGTCTCGATC AGCGGATGTC GCTGGAGGAA
CTGGCGGACG TCGCAGGGTT CTCGCGGGCG CATTTCGCGG CAACGTTCAA GGCGGCCACC
TCGATGAGCG CCCATGAATT CCTTCTGCGG ATACGAATTC AGGCCGCAGC CAGGCTGCTG
ACTTGCACCG ATCTTGCCCT CGTCGAAATC GCCCTGCAGA CGGGGTTCCA GAACCAGCCG
CATTTTACGA CGGTCTTCAA GAAGATCACG GCGATCACGC CGCGAAAATG GCGGGATTCT
CGTCGGTCCA CGGCAATGGC GCACAGCCTC AAGCAAGTGG TCGACGGTGC CATCGTCATC
GAACATCTGC CGGGAAAGCC GGAAGGATTC CTTCCGCACG ATCTGCATTG A
 
Protein sequence
MQTDIDIKVS LGRIRDVLET IDDCLHVEHY SVADPNISLA KLTRESGNRA ITTSLGSGVG 
TFVVAARDSE LRLEQKGRVL FRGLMVEDSV TFIEGPDPVA VELQSDACLY VLTFLREPHR
GQSPVVGGRF NIRDIRLAEL IGEISESSPE RRCELTRDFS RRAFEHYRSL PRVHPLSAWR
LSRVRRYAES RLDQRMSLEE LADVAGFSRA HFAATFKAAT SMSAHEFLLR IRIQAAARLL
TCTDLALVEI ALQTGFQNQP HFTTVFKKIT AITPRKWRDS RRSTAMAHSL KQVVDGAIVI
EHLPGKPEGF LPHDLH