Gene Rleg_5157 details


Gene Information

Locus tag: Rleg_5157
Symbol:
ID: 8007053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 560333
End bp: 561337
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 62%
IMG OID: 644822067
Product: transcriptional regulator, DeoR family
Protein accession: YP_002973327
Protein GI: 241113492
COG category: [K] Transcription
COG ID: [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain
TIGRFAM ID:
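
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 560333
end_bp = 561337

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1005 334
```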


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATAG CCAAGCTCAA ACCCGCAAAT GCGCCGCGCG AGGAAATCGT CATCGCCCGG 
CAGATGCACC AGGCTCTGGT GCTGCATTTC CTCGAAGGGC TGACGCAGGC GCAGATTGCC
GATCAGCTTG GCATCTCGCA CGCCACCGTC AACCGGCTGA TCAAGCGCGG CCGCCAGCTC
GGTCTGGTCG AGATCAAGAT CAAGTCGCCG GTCGAGCCGC TGGTCGATAT GGAAGAACGG
CTACAAGCGC TTGGCGGTAT CGGCCGCGCC GTGGTGGTGC CGACAGTGTC CGACAATCCG
CAGACGGCAC TTCAAGCCGT CGGCGAAGCG GCAGCAAGGT TGCTGCTGGA AGAGATCACC
GACGGCGATA CGATCTGCAT CACCGGCGGC AAAGGGGTGA GCGCCGTCGT TGCCGGTCTG
CAGCCGCCGC GTCGTTTTGA TGTCGAGGTC ATTCCAGCAA CCGGCTGCGT GCAGGGTAAA
CACTATACCG ACGTCAATCA TGTCTCGACC TTGATGGCCG ACCGGCTTGG GGGGCGTTCC
TACCAGATCC ATGCGCCCCT CTTTGCCGAT GATGCCGAGC AGCGGGCGAT GCTGATCAAC
ATGCGTTCCG TTGCAGACGT TTTCAAACGG GCGCGTGAGG CGAAGGTGGC GGTGGTCGGC
ATCGGCTCGA TCCTCTCGGA CGATTCGAGC TATTACGACC TCCATCCATC CTCGAGTACC
GACCGCGCCG CGATCGAGCG GTCCGGCGCC TCCTGCGAAT TGCTGGCGCA TCTGCTCGAT
GATCATGGCC AGGTCTGCGA CTACAGCCTC AACCGTTCGC TGGTGTCGCT GACGCTGGCG
GAATTCGCCT CGATCCCCAC CAAGATCGGT GTGGCGAGCG GGCCGAACAA GGCAGGTCCC
ATCCTCAGTG TTCTGCGCGG CAATCATCTC GATACGTTGG TGACCGATGA GGCGACGGGT
GCCCGCGTGC TGGCACTGGC GAATGGTGAA GGAAAATGGG CATGA
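
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted in the space-separated blocks shown above; `gc_percent` is a hypothetical helper name, not part of any tool used to generate this record.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, used here only as sample input.
sample = "ATGCCGATAG CCAAGCTCAA ACCCGCAAAT GCGCCGCGCG AGGAAATCGT CATCGCCCGG"
print(round(gc_percent(sample)))
```

Passing the complete gene sequence instead of the sample line gives the value for the whole gene. The reported percentage refers to the full 1005 bp, so a single line will generally differ from it.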
 
Protein sequence
MPIAKLKPAN APREEIVIAR QMHQALVLHF LEGLTQAQIA DQLGISHATV NRLIKRGRQL 
GLVEIKIKSP VEPLVDMEER LQALGGIGRA VVVPTVSDNP QTALQAVGEA AARLLLEEIT
DGDTICITGG KGVSAVVAGL QPPRRFDVEV IPATGCVQGK HYTDVNHVST LMADRLGGRS
YQIHAPLFAD DAEQRAMLIN MRSVADVFKR AREAKVAVVG IGSILSDDSS YYDLHPSSST
DRAAIERSGA SCELLAHLLD DHGQVCDYSL NRSLVSLTLA EFASIPTKIG VASGPNKAGP
ILSVLRGNHL DTLVTDEATG ARVLALANGE GKWA
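
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It assumes the standard codon assignments, which translation table 11 shares for elongation. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCCGATAG CCAAGCTCAA ACCCGCAAAT"))  # MPIAKLKPAN
```

The output matches the first ten residues of the protein sequence above.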