Gene Rleg2_5085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5085 
Symbol 
ID6978179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp731183 
End bp732154 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content61% 
IMG OID643394222 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002279040 
Protein GI209547122 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0266874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCCC TTTCTGAAGT TCTCAGCTTG CTCAAGCCGA GCAGCACCAT CTCGTCGGGC 
TTTGATGCCG CTGGCGAGTG GTCGATCCAG TTCGGCGACC AGCACCGCCA GATCAAATGC
TATGTAATCG TCACCGGTGG GTGCTGGTTG GCGGTCGATG GCGTCGACGA AGCGGTTCGT
CTCGAACAGG GCGACTGCTT CGTCCTGCCG CGCGGGCTGC CATTTCGGCT CGCAAGTGAT
CTGGGCCTCT CTCCGGTCCC TGCGCCAACG CTCTTTCCTC CAGCACGTGC GGGTGGTGTG
GTCACCCTCA ACGGAGGCGG GACCTTTTCT CTTGCCGGCG CGCGCTTCGC CGTCGGCGGC
AATAGCGCCG ACATGCTTCT GGAAATGCTA CCGCCCATCG TCCACCTCAG CCGGGAAACA
GAACGGGACG CCTTGCGCTG GTCGATCGAA CGGATGATGC AGGAGCTCAG TTCTCATCAA
CCGGGCGGAC ATCTGATGGC GCAACATCTG TCGCATATGA TGCTGCTCCA GGCACTTCGC
ATTCATCTGT CAGATGGTCA TCAACGAAAG GGCTGGTTCT ATGCCCTTGC CGACAGGAAC
CTGAGCGGCG CAATTCGCGC GATGCATGCC AACCCGGCGA GAAACTGGAC TCTGGCGGAA
TTGGGTGAGA CAGCCGGAAT GTCACGCTCC GTATTCGCTG AGCGCTTCAA GGCGACGGTT
GGAGAGACCC CAATCGAGTA CCTGTCAAGG TGGCGAATGC TTCTCGCCTG CAGCCGGTTT
GAAAGCGGCG ACGACCCTGT TTCGGTTGTC GCGCCGGCGC TGGGCTACCA GTCCGAGAGC
GCCTTCAGCA AAGCGTTCAA GCGAGTCGTC GGATGCTCGC CGCGCCAATA TAGGGCCCAG
GAAATCCTTC CTTCTGAGAC TCCCGCGCGC CTTCACGCCA GACGTTCGCA CGCGGTCTCC
GTTTCGAGAT AG
 
Protein sequence
MDPLSEVLSL LKPSSTISSG FDAAGEWSIQ FGDQHRQIKC YVIVTGGCWL AVDGVDEAVR 
LEQGDCFVLP RGLPFRLASD LGLSPVPAPT LFPPARAGGV VTLNGGGTFS LAGARFAVGG
NSADMLLEML PPIVHLSRET ERDALRWSIE RMMQELSSHQ PGGHLMAQHL SHMMLLQALR
IHLSDGHQRK GWFYALADRN LSGAIRAMHA NPARNWTLAE LGETAGMSRS VFAERFKATV
GETPIEYLSR WRMLLACSRF ESGDDPVSVV APALGYQSES AFSKAFKRVV GCSPRQYRAQ
EILPSETPAR LHARRSHAVS VSR