Gene Rleg_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1012 
Symbol 
ID8012146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp996352 
End bp997389 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content65% 
IMG OID644823595 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002974846 
Protein GI241203750 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGCAA ATCCGAATTT CCTGCTTCAT CGTTCAAAAA TGATCAGTGA CCCACTCTCC 
GAAATGCTCA ATCTACTCGA TGCGCGCTGC CTTGTGTCGG GTGGGCTGAT CGCCGGCGGC
GCCTGGGCGC TGCGCTTCCC CCGGCCGAAC CGAATCAAGA TCAGCGCGGT CGCCAAAGGG
CGCTGCTGGC TTTGCCTCGA CAATGGCAGT GAACCCATTC TGCTCGAAGC AGGCGACGTT
GCGCTTCTGA ACGGCCGGCA TTCGTTCGTC CTGGCGAGCG ACCTTGCCGT TACGCCCACT
GACGCGGTCG GCGCCTTCAA GGAGAAGGTC GACGGCCTGG CACGCCACGG CGTTGGCGAG
GATTTCCATT ATCTCGGCGG CCACATCGCG CTCGGACCGC AGGGCATGGA GCTGCTGTCC
GACGTACTGC CGCCGATCAT TCATGTGCGT GCCGCCCTTG CCGAAGCCGG CGTGCTGCGC
TGGCTACTCG ACCAGTTGGT CCGGGAGATG GCGGCAAAGC GGCCAGGCGC CCTGCTCGCC
TCGACCCAGC TTGCGCAGTT GATGTTCGTG CAGGTGCTGA GGGCGCACAT CATGAGCTCG
GCGCCACTGA CGGTCGGGTG GCTCCGCGCC TTCGGCGACG ATCGCATTGC GCCGGCGCTG
CGGCTGATGC ATGGCGATCC CGGCCGGTCC TGGCAGCTTG GCGAACTGGC AAAGGCGGCC
GGCATGTCGC GCACGAGCTT TGCCCTGCGG TTCAAGACCG TGGCGGGGGT GGCGCCGCTC
ACCTATCTCA CCGGCTGGCG CATGCGCCTT GCCGAACGCG AGCTGCGGGA GGGCAGCATG
CCGGTTTCGG CGCTGGCGCT TTCGCTCGGC TACACATCCG AAAGCGCCTT CAGCAACGCT
TTCAAGCGGA TGACGGGTAT GGCGCCCAGG CGCTATCGCG TGGCGATGGC CCGCGAGGCC
GGGCCGATCG AAGAGGTGGT GGATGTCGAG GGCCAGGCGA TGACGACACA TTACCGGCTG
TTGAAGGCGG CATCCTAG
 
Protein sequence
MLANPNFLLH RSKMISDPLS EMLNLLDARC LVSGGLIAGG AWALRFPRPN RIKISAVAKG 
RCWLCLDNGS EPILLEAGDV ALLNGRHSFV LASDLAVTPT DAVGAFKEKV DGLARHGVGE
DFHYLGGHIA LGPQGMELLS DVLPPIIHVR AALAEAGVLR WLLDQLVREM AAKRPGALLA
STQLAQLMFV QVLRAHIMSS APLTVGWLRA FGDDRIAPAL RLMHGDPGRS WQLGELAKAA
GMSRTSFALR FKTVAGVAPL TYLTGWRMRL AERELREGSM PVSALALSLG YTSESAFSNA
FKRMTGMAPR RYRVAMAREA GPIEEVVDVE GQAMTTHYRL LKAAS