Gene Rleg_2141 details


Gene Information

Locus tag: Rleg_2141
Symbol:
ID: 8013159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2130543
End bp: 2131961
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 65%
IMG OID: 644824727
Product: transcriptional regulator, GntR family with aminotransferase domain
Protein accession: YP_002975957
Protein GI: 241204861
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
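The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, though they agree with the figures in this record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information table of this record.
length_bp = gene_length(2130543, 2131961)
print(length_bp)                  # 1419
print(protein_length(length_bp))  # 472
```

Both results match the Gene Length (1419 bp) and Protein Length (472 aa) fields.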


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.196836
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.113478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAATT GGCTTCCCGA TATTTCCCGC GGTTCCGGGC CGGTCTATCT CCGGCTTGCC 
GACAGCATCG AATCCGCCAT ATCAAGCGGC GCCCTGCCCG CCGGCAGCAA GCTGCCGCCG
CAACGCAACC TCGCCTATGA TATTGGCGTG ACGATCGGCA CGATCGGCCG CGCCTATGCG
CTGGTGCATG AGCGCGGCCT GGTCGCCGGC GAAGTGGGGC GCGGCACCTA TGTGCTGAAC
CGCTCCGAAA CGCCGCCCGG CGAACAGATC GATCCGCTAA CCGTCTCGCT CGGCGGCACC
CGCGTCCAGG ATGCGCCGGC GAACAAGATC CGCTTCGACA CGACAGCCGC TCCCGATCTC
GGCCAGGGCA AGATCATAGC AGGCATCCTC GCCGAGATCG GCGAGCAGCA TCTTGCCGAA
ATTTCCTCCT ATTCCCGGAG CTTCCCGCGC AACTGGTTCG AGGCCGGCCG CCTGTGGCTT
GCCCGCAGCG GCTGGACGCC GGAGGTCGAA AACATCGTGC CGACGCTCGG CGCTCATGCA
GCGGCGATAT CAGTCATCGC CGCTGTCTCG GCGCCGGGCG ACAAGATCGT CTTCGAGGAT
CTCACCTATA CCCAGGTCAG CCGCAGCGCC CGCCTGCTCG GCCGCCGCAC GCTGACGGTC
GATTCCGATG AACTCGGCGT GATCCCGGAG GATTTCGAGC GGCTCTGTCA GCAGCAGCAT
CCGAAGATCG CCTTCCTGAT GCCGACCGTC CACAATCCGA CGCTGGCGAT CATGCCCTAT
GAGCGGCGCG CGGCCATCGC CGCAATCGCC AGGAAACATA GTGTCTGGCT GATCGAGGAC
GACCTCTACG GCGGCATGGC CGACGACGAT ACGCCGCTGC TCGCCTCGAT TGCGCCCGAT
CGCACCTTCC TCGTCAACGG CCTGTCGAAA TCGGTCGCCG CCGGCGTGCG CGGTGGCTGG
GTCGCCTGCC CGCCGCATTT TGCCCAGCGC ATCAAGGTGA CGCACAGGAT GATCACCGGC
GGTCTGCCGT TCATTCTGGC GGAGACCTGT GCGCGCCTCG TCGAAAGCGG CATGGCGCAC
GAGATCCGCA AGGCAAGTGT CGAGGAACTT TCCCGGCGGG TCCGGCTCGC CCGCGAGCAG
CTGCAGGGCT TCGATTTCGA ATCGCACGTA CACGCGCCCT TCCTCTGGCT GAAACTGCCG
GAACCCTGGA TGTCCGGCAC CTTCAAGAAT GCCGCCTTCC GCGACGGCGT GCTCGTCGAC
GACGAGGACG AGTTCAAGTC GGCGCGCGGA GAGAGGCCCT ATCATCGCGT TCGCATCGGT
TTTTCCTCGC CGAAGACCGG GCAGGAACTG ATCTCGGGCC TGATGATCCT GCGCCGTCTG
CTGGAAAACG GCGGCTCCGC CTATGATGGC GAAATATGA
 
Protein sequence
MTNWLPDISR GSGPVYLRLA DSIESAISSG ALPAGSKLPP QRNLAYDIGV TIGTIGRAYA 
LVHERGLVAG EVGRGTYVLN RSETPPGEQI DPLTVSLGGT RVQDAPANKI RFDTTAAPDL
GQGKIIAGIL AEIGEQHLAE ISSYSRSFPR NWFEAGRLWL ARSGWTPEVE NIVPTLGAHA
AAISVIAAVS APGDKIVFED LTYTQVSRSA RLLGRRTLTV DSDELGVIPE DFERLCQQQH
PKIAFLMPTV HNPTLAIMPY ERRAAIAAIA RKHSVWLIED DLYGGMADDD TPLLASIAPD
RTFLVNGLSK SVAAGVRGGW VACPPHFAQR IKVTHRMITG GLPFILAETC ARLVESGMAH
EIRKASVEEL SRRVRLAREQ LQGFDFESHV HAPFLWLKLP EPWMSGTFKN AAFRDGVLVD
DEDEFKSARG ERPYHRVRIG FSSPKTGQEL ISGLMILRRL LENGGSAYDG EI
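The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, plus a GC-content calculation and a codon-by-codon translation. The helper names (clean_sequence, gc_content, translate) are hypothetical. The codon table is the standard one written in NCBI order; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons, so this is sufficient for a CDS that starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0; unknown codons become 'X'."""
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First display line of the gene sequence from this record.
first_line = "ATGACAAATT GGCTTCCCGA TATTTCCCGC GGTTCCGGGC CGGTCTATCT CCGGCTTGCC"
dna = clean_sequence(first_line)
print(len(dna))
print(translate(dna))
print(round(gc_content(dna), 3))
```

Translating these first 60 bases gives MTNWLPDISRGSGPVYLRLA, which matches the first 20 residues of the protein sequence listed above. The final nine bases of the gene, GAAATATGA, translate to EI followed by a stop codon, consistent with the protein ending in EI.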