Gene Rleg2_4381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4381 
Symbol 
ID6977475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp10400 
End bp11425 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID643393561 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002278379 
Protein GI209546461 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.165128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA TCGAACGACG CATGATCGCG CCGGGTTTCG TCGAGGAGGC GCTCGACAGC 
CTGCGGCGGC TAGGCAAGCC GACGGAACCG ATCCTTGCCC GCCTCGGCCT GCCGCCCGTC
ATCGATCAGC CGGTTTCGGC CGATACCTAT GGCGCGCTCT GGCTCGCAAT CGCCGCCGAG
CTCGACGACG AATTCTTCGG CATGGGCGCG CGGCCGATGC GCAGCGGCAG CTTCACGCTG
CTCTGCCATT GCGTGCTGCA CGCGCCGACC CTCGGTCATG CGCTGCGCCG GGCGCTGCGC
TTCCTCGATA TCGTGCTCGA CGATCCCCGC GGGCGGCTCG TCGTCCGCGA CGGTCTTGCC
GAGGTCGAAC TCAGGGATGC CGGCGGTCCG CGTTCGGCCT TCGCCTACCG CACCTACTGG
ATCATCCTGC ACGGCATCAC CTGCTGGCTG GTCGGCCGGC GCATCCCGAT CCGCCTCGTC
GATTTCCGCT GCGCCGAGCC CGGGCAAGGC GCCGACTATC GGCTCTTCTT CGGCGCACCG
GTGCGCTTCT CGCAACCCAT CAGCCGGCTC GGCTTCGACA GCGCCTTGCT CGACCTGCCG
GTGGCGCGCA GTGAACAGGC GCTCAAACAA TTCCTGCGCG GCGCGCCCGC CAATATTCTG
GTGCGCTACC GTTACGATGC CGGCATCGCT GCGGCCGTCC GCCGGCGCTT GAGCCAGGCC
ACACCCAATG CCTGGACAAA CTTCGCCGCC CTTGCCGCCG ATATGCGCAT GCCACCCTCG
ACACTCCGCC ACCGCCTGCA TGACGAGGGG CAAAGCTATG CCGCGATCAA GGACGATATC
CGCCGGGATC TCGCCATCGA CCTGCTGCTG AACACATCAA AGACCATCGG TGAGATCGCC
GTGCAGCTCG GCTATTCCGA ACCCAGCGCC TTCTTCCGGG CCTTCCGGAA ATGGATGGGC
AAGAGTCCGG AGTCGTTCCG GCGGGAGGAA GCGGAAAACC AGACCTATGT CAGTCGAACC
GCTTGA
 
Protein sequence
MAEIERRMIA PGFVEEALDS LRRLGKPTEP ILARLGLPPV IDQPVSADTY GALWLAIAAE 
LDDEFFGMGA RPMRSGSFTL LCHCVLHAPT LGHALRRALR FLDIVLDDPR GRLVVRDGLA
EVELRDAGGP RSAFAYRTYW IILHGITCWL VGRRIPIRLV DFRCAEPGQG ADYRLFFGAP
VRFSQPISRL GFDSALLDLP VARSEQALKQ FLRGAPANIL VRYRYDAGIA AAVRRRLSQA
TPNAWTNFAA LAADMRMPPS TLRHRLHDEG QSYAAIKDDI RRDLAIDLLL NTSKTIGEIA
VQLGYSEPSA FFRAFRKWMG KSPESFRREE AENQTYVSRT A