Gene Rleg_5559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5559 
Symbol 
ID8016450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp144225 
End bp145253 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content59% 
IMG OID644827726 
Producthypothetical protein 
Protein accessionYP_002978926 
Protein GI241518298 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.472868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0109335 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGGCC ATTCGAGAGA AAAACTGCAT GGGGATTTGC CCTCCGCATC TTCAAAAGCA 
GCTTCCGCCG ATCGGAAAGA GCTGGCAGCG ATCGCATTTG AACGGACCAG GATGCCAATG
GTCGTCACCG ACGCGCGTAA GTCCGATCAG CCGATCGTGC TTGCAAACAA GGCATTCCTC
GAGCTGACAG GTTACGAGGC CGAAGAAGTG TTAGGCCACA ACTGTCGTTT CCTGCAGGGC
CCCGCGACGT CTCCGATCGC CGCCGCTGAA ATTCGTGCCG CAATCGCCGG GGAGCGTGAG
ATCAGCATCG AGATCCTCAA TTATAAGAAG AGCGGAGAGC AGTTCTGGAA CCGCTTGCAT
CTCAGTCCCG TCCATGGGGA TGACGGAAGG ATCCTGTATT TCTTTGGATC TCAAATCGAC
ATGACGGAAT ACCGGCGGAT CGAGGCACTG GAGGCCTCCG AACATCGCCT GCTGATGGAA
GTCGACCACC GATCCAAGAA TGTCCTGGCG ATCGTCGACA GCATCGTTCG CCTGAGCAAC
GCCGATGACC CCGCCCTTTA CGCCGCCGCC ATTCAACACC GCGTGCAGGC GCTCGCCCGT
GCCCATACCT TGCTTGCCGC ACGAAGATGG ACAAGCATTT CTCTTGAAGA ACTCATTCGC
CAACAGGTAT CGCCGTTCGC GGCCACCCGC ACCTTTTTTA GCGGACCGGA TATCGACATG
CCTGCGCCGG CCGTCCAGCC CCTTGCGCTC GTGCTCCACG AGCTCGCCGT CAACGCAGCC
CATCACGGTG CGCTTGCCGC TGCGCAAGGT AGGCTTTCGA TCAGTTGGAA GCCCGGACCG
TCCGGAGCCG GCTTCAGGAT CCGGTGGCAG GAGGTGGGCG TCGACACTCC GCCTAGATCA
GCAAAGCGGG GTTTTGGCAC GGTGATTGTC GGCGCAATGG TTGAAAAACA GCTTAACGGA
CGTCTTGAGA AAACCTGGTC GGATGAAGGG CTGCTTATCG ACATTGAGGT CCCGTCTGCC
GGCTCGTGA
 
Protein sequence
MTGHSREKLH GDLPSASSKA ASADRKELAA IAFERTRMPM VVTDARKSDQ PIVLANKAFL 
ELTGYEAEEV LGHNCRFLQG PATSPIAAAE IRAAIAGERE ISIEILNYKK SGEQFWNRLH
LSPVHGDDGR ILYFFGSQID MTEYRRIEAL EASEHRLLME VDHRSKNVLA IVDSIVRLSN
ADDPALYAAA IQHRVQALAR AHTLLAARRW TSISLEELIR QQVSPFAATR TFFSGPDIDM
PAPAVQPLAL VLHELAVNAA HHGALAAAQG RLSISWKPGP SGAGFRIRWQ EVGVDTPPRS
AKRGFGTVIV GAMVEKQLNG RLEKTWSDEG LLIDIEVPSA GS