Gene Rleg_3583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3583 
Symbol 
ID8015821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3617952 
End bp3618896 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content64% 
IMG OID644826148 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002977368 
Protein GI241206272 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.612075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.105088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAAA TCGGCTTTCT GCTCTATCCC GGCTTTCAGA TAATGAGCCT GGCGGCCGTC 
TCCGCCTTCG AATTCGCCAA TCTCGAACTC GAGGAAAAGG TTTACGAGAT CCGCTACCTC
TCCGAAAATG GCGGGCCGGT TGCCAACTCG CTTGGCATGA TGATGGAGAC GGAGGCCTTC
GGCACCCCGG CCCTCGACAC CCTGATCGTG GCGGGCGCGC CCGATATCAG GCTGCCGAAT
GCCGCAGAAG CCGACTTCAT TCGTGCCGCC CTGCCCGCCA CACGTCGCTT GGCGTCGATC
TGCACCGGCG CCTTCTTTCT TGCCGAAGCC GGCATCCTCG ACGGCCGCCG CGCGACGACG
CACTGGTATG TCTCTCGCGA GCTGCAAAGC CGCTATCCCA AGATAAAGAT GGAAGAGGAC
CGGATCTTCA TCATCGACGG TTCCATCTGG ACCTCGGCAG GCATGACGGC CGGTCTCGAC
CTGGCTTTGG CGATGGTCGA GAAGGATCAT GGTTTCGAGG TGGCGCGCGC CGTTTCCCGC
AAGCTCGTCG TCTATCATCG CCGCGCCGGC GGCCAGTCGC AATTCTCAGC CCTTCTGGAA
CTGGAGCCAA AATCGGACCG GATCCAAAAG GCGCTCGCCC ACGCCCGCAG CAATCTGAAA
TCGGCGCTTT CGGTGGAGGA GTTGGCGGAA GTCGCCCATC TCAGCCCCCG TCAGTTCAGC
CGCGCCTTCC GCGACGAAAC CGGGCAGTCG CCGGCGAAGG CCGTCGAGAA CCTGAGGCTG
GAAGCGGCCC GGCTGATGAT GGAGCAGGGC CGCCACCCGA TCGATGTCGT TGCCCGCGAA
ACCGGCTTTG CCGATCGCGA GCGCATGCGC CGCGCCTTCC TTCGCGCTTT CGGTCAGCCG
CCGCAGGCAA TTCGGCGCGC CGCCCTGCAG GAACTGCAGA TGTGA
 
Protein sequence
MQQIGFLLYP GFQIMSLAAV SAFEFANLEL EEKVYEIRYL SENGGPVANS LGMMMETEAF 
GTPALDTLIV AGAPDIRLPN AAEADFIRAA LPATRRLASI CTGAFFLAEA GILDGRRATT
HWYVSRELQS RYPKIKMEED RIFIIDGSIW TSAGMTAGLD LALAMVEKDH GFEVARAVSR
KLVVYHRRAG GQSQFSALLE LEPKSDRIQK ALAHARSNLK SALSVEELAE VAHLSPRQFS
RAFRDETGQS PAKAVENLRL EAARLMMEQG RHPIDVVARE TGFADRERMR RAFLRAFGQP
PQAIRRAALQ ELQM