Gene Rleg2_2897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2897 
Symbol 
ID6981641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2949312 
End bp2950301 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content60% 
IMG OID643397607 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002282391 
Protein GI209550474 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.825187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.500009 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGACC ATCCGTCCCG GCCGTTGTCG CCGTCAACTA TCCCAATGGA TGCCCTGAGC 
GAAGTCCTGC AAGACTTTCG CTTGAGCGGG GTCAACTATG GCCGCTGCGA GCTCAGGCAT
CCATGGAGCA TCGCCTTTCC GCAACAACAG CTGCTTCGTT TCCACTTCGT CAGCCAAGGT
CCGTGCTGGA TCCATACCGA AGTCGAAGGA TGGCAGGAGT TGAATGATGG CGATCTGGTT
CTGCTGCCTC AAGGCATCGC ACATCGGTTG GCCAGCGCGC CGGATGTTGA AGGAGATTCA
CTTAAAGGCT GTCAGATAAC AAGAGTGGGA AGCAATGTCT GCGATGTCGT GCGGGAGGGA
ATGGGGGCGA ATAGCACCCT CTTCTGCGGC TCCATGGCTT TGGGCGCGCA TGCGCTTCAC
CCCTTGATCG CTCTGATGCC GCCAATCATC AAGGGCTGCG ATGTGGCCGG CAATGACCCG
ATCGTTGGCC CCCTTCTGGC CGCCATGTCG GCGGAAGCGA CACAGCCCCA AATGGGAAGC
GCGACCGTGC TATCGCGAAT GGCGGACTTG CTCGCGGCGC GGCTTATCCG CTGCTGGGTC
AATTGCAGCG GAGCTTCGAC CACCGGCTGG CTCGCCGCCA TCCGGGATCC TCATATCGGT
CGTGTATTGG CGGCCATGCA CCGGGACCCC GGCCATAACT GGACCCTCGA AAGCCTCGCT
GGTGTGGCTG GCCAGTCGCG CTCGATCTTC GCCGAGCGTT TCAGCGCTAT CTTGGGTGAA
GGCGCGGCAC ATTACCTCGT CCGTCTGCGT ATGCAGCTTG CCCGCGATTT GTTGGGTCAA
AGCGGCATGT CGATCGCGGA AGTTGCTTCC CGGCTGGGCT ATGAGTCCGA GGCGTCTTTC
GCGCGCGCCT TCAAACGCGT CACCAACGTC TCACCGGGGA TTGTGCGCCG CACAAGTTCC
GGACGAATGG ATATAGATTT CGGATTTTAA
 
Protein sequence
MLDHPSRPLS PSTIPMDALS EVLQDFRLSG VNYGRCELRH PWSIAFPQQQ LLRFHFVSQG 
PCWIHTEVEG WQELNDGDLV LLPQGIAHRL ASAPDVEGDS LKGCQITRVG SNVCDVVREG
MGANSTLFCG SMALGAHALH PLIALMPPII KGCDVAGNDP IVGPLLAAMS AEATQPQMGS
ATVLSRMADL LAARLIRCWV NCSGASTTGW LAAIRDPHIG RVLAAMHRDP GHNWTLESLA
GVAGQSRSIF AERFSAILGE GAAHYLVRLR MQLARDLLGQ SGMSIAEVAS RLGYESEASF
ARAFKRVTNV SPGIVRRTSS GRMDIDFGF