Gene Rleg2_5225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5225 
Symbol 
ID6978319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp856819 
End bp858144 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content63% 
IMG OID643394339 
Producttranscriptional regulator, GntR family with aminotransferase domain 
Protein accessionYP_002279157 
Protein GI209547239 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.260155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAAT TTCGGGACGC AGACTGGTTT GCAGACAAAT TGAAAGATCG TACGATTCGC 
GGAATAGCGA TCGAAACCAG CGCCTTGATC CGGGCCGGAG CACTGCCGGT GGGCACGAAA
CTGCCGGCCA TCCGCGACCT CGCCTTTGCG CTCGGAATCA GCCCGGCAAC CATTTCGGAA
GCGTGGAGCG AACTGCGGCG GCAGAAAATC ATCAGCGGCC GCGGCCGAAA CGGCACCTGG
GTGAGCGGCG ACCGCTTCAT TGCCAAGCCG GCGCGTCTGG CGAGCGTCGG TGATTACGGC
GATGGCGTTC TGGACCTGAC GGCAGCAGTA CCGGATGTCC GGCTGCTGCC GAAGCTTGCC
GAGGCGATGT CCTATGGCGC GAATGCTGAG AATGTAAACA GCTACCAGCG CAGCCGCATT
CTCCCCGAGC TTGAAGAAGC CGTCAGACGC TCATGGCCTT ATCAGCCGGA AGCATTTCTC
GCGACAAATG GCGGCTACAA TGCCGTCTAT ACGCTGATCC ATGCCCTCGT AACGCCAGGT
GCTTCCGTTG CGATCGAAAT GCCGACGGGC ATGCGGCTTC TCGATATTCT CGAAGATCGC
GGGGTCCGTA TCCTGCCGGT CCAGTGCGAT GACGAGGGAC CTGTGCCGGC GTCGTTGGAA
AACGCGATGA ACTACCGGCC GGCCGCCTTC ATCTTCCAGC CGCGCGTTCA CTCCGTCACC
GGCCAAAGCG TCAGTGCCAC GCGAATGGAA AAACTTGGCG CCATCCTCAA GGATAGCGAT
ACGCTTGTCG TCGAGGATGA TGGGGTGGCC GACATCTCGT CGGCGCCGCG CCACTCGCTC
GGCACACTTT ATCCCGACCG TGTCATTCAC GTCCTGTCCT TCTCGAAGAC GCACGGGCCG
GACCTCAGGC TCGCCGTCTT GTCCGGTTCG CGGGCAATGA TCGAGCAGAT CCAGTCCTAC
CGCGCCTTCA GCGCCGGATG GACGAGCCGC ATCCTGCAAT CGGCGGCGGC CTGGCTGCTG
CGCGATCCGG CAACGGACGA GACGCTCTTC CGGGCGCGCG ACCTCTACGC CAAACGCCGT
GCGGCACTGA TCGATGCGCT CGCCGAGCGC GGCGTCGAGG CTGCCCATGG AGGTGGTCTT
TGCGCCTGGG TCCCGGTGTC CTCGGAACCG TTTGCGCTGG TCACGCTGGC GGCGAGGGGC
ATTGCCGTTC ATCCCGGAGC GAAATTCTCG GTTCTGCCGA GCAGCCACCT CCGTGTCGCG
ACCGCCAATT TGACCGATCG CTGCGAGGAA GTCGCCGACG GCATTGCGCT TGCCGCCGTG
CACTGA
 
Protein sequence
MNEFRDADWF ADKLKDRTIR GIAIETSALI RAGALPVGTK LPAIRDLAFA LGISPATISE 
AWSELRRQKI ISGRGRNGTW VSGDRFIAKP ARLASVGDYG DGVLDLTAAV PDVRLLPKLA
EAMSYGANAE NVNSYQRSRI LPELEEAVRR SWPYQPEAFL ATNGGYNAVY TLIHALVTPG
ASVAIEMPTG MRLLDILEDR GVRILPVQCD DEGPVPASLE NAMNYRPAAF IFQPRVHSVT
GQSVSATRME KLGAILKDSD TLVVEDDGVA DISSAPRHSL GTLYPDRVIH VLSFSKTHGP
DLRLAVLSGS RAMIEQIQSY RAFSAGWTSR ILQSAAAWLL RDPATDETLF RARDLYAKRR
AALIDALAER GVEAAHGGGL CAWVPVSSEP FALVTLAARG IAVHPGAKFS VLPSSHLRVA
TANLTDRCEE VADGIALAAV H