Gene Rleg_6986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6986 
Symbol 
ID8023014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp423874 
End bp424944 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content60% 
IMG OID644833839 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002984973 
Protein GI241666889 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.667581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATTG TCATCCTCGC GCCGCCAGGC GTGCAGTCGC TGGACATCGT CGGCCCTGCT 
GAAGTTTTCT GGGAGGCTGC GCGAAGGCTG GGCGACATGA GCGCCTACGA TATACAGGTC
ATGTCAACCG GAGCGCGCTC GATCGCCGGA ACCGGTCAGC TGAGGTTCAT GGCGGATCGC
ACCATCTTCG ACGAAGATGA GGAGATCGAC ACGTTGCTGG TCGCCGGAGA TCCTGCTTTT
CTCGAGATCG ATCCAGAAGT CACTGCATGG CTGCGGCGCC GCGTTCCAGG CGTTCGGCGG
TTCGGCTCGA TCTGCACCGG GGTTTTCCTG CTTGCCGAAG CCGGGCTTCT CGATGGGAAG
CGGGTGACGA CACACTGGGA ATGCGCGGCG AAGTTTAGCC GCGAGTATCC GGCGATCGAT
CTCGACGCCG ATGCCATCTA CGTACGGGAC GGGTCGCTTA TCACCGCCGC CGGTGTCACC
GCCGGCATCG ATCTCGCCCT TTCGCTTGTT GAAGAGGATC ACGGCAAGGA CGTAGCAATG
ATCGTCGCCC GTTACATGGT CATGTTCATG AAAAGACCTG GCGGCCAATC GCAGTTCAGC
GCGCACCTTG TCGGGCAGAT GTCCGAGACG ACGTTGATAC AGAAGGCTCA GGAGTTCGTG
CTCGCAAATC TGAACGGCAA CCTCGACGTC GAGAGTTTGG CGCAAGAGAT TGGAATGAGC
ATCCGTAATT TTGCACGCGT CTTTCGCAAG GAGCTTGGAA TCACGCCGGC CGATTTCGTC
GCGGCCGCTC GGACGGACGC CGCACGGCGG CTGCTCGAGG ACACCGTCCA TCCGCTACAG
AGGATCGCCA CCATCTGCGG GTTCGCAGAC GTCAACGCGA TGAGGCGGGT CTTTACCAAG
ACGATCGGCG TGAGCCCGAA CGATTATCGA AGCCGTTTCC AGGTATCATC CAAAACCATT
TCCCAGCCGG CCTCCGACCG GCGCCCTGCT CGACAAACGA TAGCCATGGA CCTCGCACTA
GCAATCTCGC ATCCTCAAGA GGCATCAACG AGCGCAAGCC GACACACTTG A
 
Protein sequence
MRIVILAPPG VQSLDIVGPA EVFWEAARRL GDMSAYDIQV MSTGARSIAG TGQLRFMADR 
TIFDEDEEID TLLVAGDPAF LEIDPEVTAW LRRRVPGVRR FGSICTGVFL LAEAGLLDGK
RVTTHWECAA KFSREYPAID LDADAIYVRD GSLITAAGVT AGIDLALSLV EEDHGKDVAM
IVARYMVMFM KRPGGQSQFS AHLVGQMSET TLIQKAQEFV LANLNGNLDV ESLAQEIGMS
IRNFARVFRK ELGITPADFV AAARTDAARR LLEDTVHPLQ RIATICGFAD VNAMRRVFTK
TIGVSPNDYR SRFQVSSKTI SQPASDRRPA RQTIAMDLAL AISHPQEAST SASRHT