Gene Rleg_2406 details


Gene Information

Locus tag: Rleg_2406
Symbol:
ID: 8013391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2410666
End bp: 2411685
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 63%
IMG OID: 644824987
Product: transcriptional regulator, AraC family
Protein accession: YP_002976217
Protein GI: 241205121
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
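
The Start bp, End bp, Gene Length and Protein Length fields above are linked by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length (the record does not state either convention explicitly). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(2410666, 2411685)
    print(length, protein_length(length))
```

With the values listed for Rleg_2406, the sketch reproduces the reported 1020 bp and 339 aa.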


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0277312
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGGA TGCAGATCAA GAAGCGTTCG GTAGTCTTCT TTATGGTACC GCAATTCACC 
ATGCTGCCCT TTTCGGCGGC CGTGGACACC TTGCGCATCG CCAATCGCAT GCTCGGCTAT
CAGGCCTATA CCTGGCGGCT GACCTCCGTC GATGGAGAAA AGGTCTATTC CTCCTGCGGC
ATCGGCGTCG AGGCAAATTC CTCGCTTGCC GAGGAGCGCC GTCATCTCGG CGGCGAAAAC
CGGCCGGGCA TGGTCCTCGT CTGTTCCGGC ATCGATGTCG AGCAGTTCAA CAACAAGTCG
GTCAATGCCT GGCTGCGTGA ATGCTACAAT CGCGGTGTCG CCGTCGGCAG CCTCTGTACG
GGCGCGCATG TGCTTGCCCA GGCCGGCCTC CTGAATGGCA AGCGCTGCGC CATCCACTGG
GAAAACCTCC CGGGCTTTTC GGAAGCCTTC CCGCAGGCCG AGGTCTATGC CGATCTCTAC
GAGATCGACG GCAATCTCTA TACCTGCGCC GGTGGCACCG CCTCGCTCGA CATGATGCTG
AACCTCGTCG GCGAGGACTT TGGCGAAAGC CTTGTCAACC GCATCTGCGA GCAGCACCTG
ACTGACCGCG TGCGCAACCC GCACGACCGC CAGCGTCTGC CGCTGCGCGC CCGTCTCGGC
GTGCAGAATG CCAAGGTTCT GTCGATCATC GAGCTGATGG AAGGCAATCT CGCCGAGCCG
CTGTCGTTGA TCGAAATCGC CGACGGCGCC GGCCTCTCCC GCCGCCAGAT CGAACGGCTG
TTCCGCCAGG AAATGGGACG CTCGCCGGCC CGCTACTATC TGGAAATCCG TCTCGACCGT
GCGCGCCACC TCCTGGTGCA GTCCTCGATG CCGGTCGTCG AGGTCGCCGT TGCCTGCGGC
TTCGTCTCCG CCTCGCATTT CTCCAAGTGT TATCGCGAAC TCTACCATCG CTCGCCGCAA
CAGGAGCGCG CCGAGCGCAA GATGACCATG GCGACCGCTC GCCAGGCCGT CGCCGCCTGA
 
Protein sequence
MNRMQIKKRS VVFFMVPQFT MLPFSAAVDT LRIANRMLGY QAYTWRLTSV DGEKVYSSCG 
IGVEANSSLA EERRHLGGEN RPGMVLVCSG IDVEQFNNKS VNAWLRECYN RGVAVGSLCT
GAHVLAQAGL LNGKRCAIHW ENLPGFSEAF PQAEVYADLY EIDGNLYTCA GGTASLDMML
NLVGEDFGES LVNRICEQHL TDRVRNPHDR QRLPLRARLG VQNAKVLSII ELMEGNLAEP
LSLIEIADGA GLSRRQIERL FRQEMGRSPA RYYLEIRLDR ARHLLVQSSM PVVEVAVACG
FVSASHFSKC YRELYHRSPQ QERAERKMTM ATARQAVAA
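
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the whitespace, compute a GC percentage, and translate a CDS. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). All function names are hypothetical, and this is an illustration rather than the method the database itself uses.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGAACAGGA TGCAGATCAA GAAGCGTTCG"))
```

Applied to the full gene sequence, the output of these functions can be compared with the Gene Length, GC content and Protein sequence fields of this record.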