Gene Rleg_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1659 
Symbol 
ID8012728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1651205 
End bp1652251 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content63% 
IMG OID644824244 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002975485 
Protein GI241204389 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.278996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGAGA TTGACGGGAT CATCTATTAC GTGGAGTATG CATTTTATAA CCTATCGTCC 
GAGATTCCCA TGGACCCCTT CGATTCCGTC CTCAGCGCCA TGCAGCTCCA AAGCTCGCTC
TTCGTCCGCA TGCGTGCTCA TGCACCATGG GCGATGTCGT TCGATAGCGG CGGTCAGGCG
CGGCTGATCG TCATCGCTAA GGGCCGGGGC TGGTTCACCC AAGTCGGCCA CCCCCCGGTC
GTTGTCGAGG AAGGCGACTG CCTCATCATC AAGCAGGGGG TCATGGGCAT ATTGGGCGAC
GCTCCGGACC GGGTCGCAGT GCCCTGCTGG CAGATTGCCG ACCATGTGAC GGGCGAGACG
GTGTCCTTCG GTGGAGACGG CGAAGCGTGC GAGTTCTTCT CGACCCTGTT TACGTTCGAC
CACGCTGCGG GCGAGCCCTT ATCGGCGCTG TTGCCCGATG TTGTTCATGT CGCCATGGCG
AAGTCCGACG CAGGGCGGAT GGTCTCGATC CTCGAACAGA TCGGAAAAGA GGAGGCGCAG
GCGTCGCTTG GCGGCTCCTA TGTCGTCGGC AGGCTGCTCG ACGTGCTGTT CATCCAAGCG
ATCCGAAGCT GGGCCAGTTC GGAGGGGAAT ATGCCCGAGG GCTGGCTCGC CGGACTGACC
CATCGCCAGT TGGCGCAAAC GCTGCACCGG ATTCATGCCG ATCTGGCGCA CCCGTGGACG
CTGGAGCAGC TCGCCCGCGA TGTGGGGATG TCGCGCTCCA CCTTTGCAGT GCTGTTCAAG
TCGGTCGTCG GAGTGCCGCC GCTGACCTAC ATCACGACTT GGCGCATTTA TCGTGCGAAA
CTCATTCTCG CCGCCGGCCA CTCAATCTCA GCGGCGGCCG CGCAGACCGG CTATGGCACC
GACATCGCCC TCAGCCGCGC TTTTAAAGCT GCGACCGGCG TGGCGCCGGG GCAGTGGCGG
CGCGAGCGAC GTGGCGTCGA CCGTCCCGTT CCCAGTGGAG ATCGATCAAG GGCGCCGGTC
AGGCACCCTG TTCCGGCTGA TTTGTAA
 
Protein sequence
MVEIDGIIYY VEYAFYNLSS EIPMDPFDSV LSAMQLQSSL FVRMRAHAPW AMSFDSGGQA 
RLIVIAKGRG WFTQVGHPPV VVEEGDCLII KQGVMGILGD APDRVAVPCW QIADHVTGET
VSFGGDGEAC EFFSTLFTFD HAAGEPLSAL LPDVVHVAMA KSDAGRMVSI LEQIGKEEAQ
ASLGGSYVVG RLLDVLFIQA IRSWASSEGN MPEGWLAGLT HRQLAQTLHR IHADLAHPWT
LEQLARDVGM SRSTFAVLFK SVVGVPPLTY ITTWRIYRAK LILAAGHSIS AAAAQTGYGT
DIALSRAFKA ATGVAPGQWR RERRGVDRPV PSGDRSRAPV RHPVPADL