Gene Rleg_5060 details


Gene Information

Locus tag: Rleg_5060
Symbol:
ID: 8007653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 442648
End bp: 444051
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 65%
IMG OID: 644821975
Product: histidine kinase
Protein accession: YP_002973235
Protein GI: 241113400
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
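
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 442648
end_bp = 444051
protein_length_aa = 467

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon of the CDS is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1404
print(gene_length_bp % 3)       # 0, the CDS is a whole number of codons
print(expected_protein_length)  # 467
```

Under these assumptions the computed values match the listed Gene Length (1404 bp) and Protein Length (467 aa).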


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0718016
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.845094
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGTA GCCTCGTTAC AAAGGTCGGC GCGGCGATCG CGCGGCTCAG CCACAGCTTG 
AGGGTACAAT TGCTCTGCTG GGTGCTGATG ACCCTGTTCG GCGCGATCGG CTTCAATCTT
TACGACAGCT TCTGGACGGC GGATGCGACG GCAAAGCTGG TGACGGATCG AACACTTCTG
GCCTCGGCCC GCGTCATTGC CGAGGCCGTC CGCGTCGACG AGGGCGGCAA TGTCCAGGTG
GACGTGCCGC CTTCCGCGCT GGAGATGTTT GATACAGGCT TCGGCGACCG GGTATCTTAC
CAGGTGATCA CCGCGTGGGG CAGTCTGGTC AGCGGCTTTC CCGACTTGCC GTTGCCGGCC
GTCCAGCGGG CAGGCGCCGA TCGCGCGTTC CATGGTGCCG ATGTGCGTGT CATGATGCTC
GACCATCCTG TCGTCGGCCT GCCCGATGAC GGCACGATCT CCGTGACCGT CGCCGTTACG
CATAACAGCC AGTATGCGAT GCGACGGCAA TTGTGGCTTT CGGACTTTTC GAAGCAGTTC
GTGCTCGTCT TCGTCGCGAG CCTGGTGACC ATCCTCGGGC TTCAACGGGG TCTGGCGCCC
GCTCTGAGAC TGCGGGACGC CGTGCGCCAG CGCGGCCGCC ATCGTCTTGA TCCGCTGCCG
TCGGAAATGG TGCAGAGCGA ACTGCAGCCG CTCGTCCATG CCCTGAACGA CCATATGGAG
CGCGTCCAGA ACCAGATGGC CGCGCAGCGA CGGTTCGTAT CGAATGCCGC GCATCAGCTC
CGAACGCCGC TCGCGCTGAT TTCGACGCAG GCGAGCGTGG CGGCCCGGGA AGCTGATCCG
GCTCGCCGTG ACGAGGCGCT TGTCGCCCTT CGCACCAGCA CGAAGCAGAT TTCGCGTCTC
GCCAGCCAGC TTCTTACCTT GTCGCGGGCC GAGCCCGGAA GCCGGCGCCC GCGCAGCGAT
GCGACAGACC TCAGCAAGGC TGCCCGCGAG ATCCTGGAAG CGCATGCCGA AGAGGCGCTC
AGGCGTAACA TCGACGTCGG TCTGGAAGCG GTCCGCCCGG TCATTGTCGA CGGCGACGCG
ACGATGTTGC GCGAGATGTT GGTCAACCTC ATAGACAACG CGATCCGCTA TACCCGCCCG
AATGGACGGG TGACCGTCGC CGTCGGGCAG GCGGACGGCA ATGCCGTCGT GACCGTCGAG
GACAACGGGC CGGGTATTCC GAGCGGGGAG CGCGAGCAGG TTTTCGAACG GTTCTACCGG
ATCATGGGGA CCGAAGCTGA GGGGAGCGGT CTGGGGTTGT CGATCGTTCG GGAGGTTGTC
GAAGGTGCAG GAGGTTCAGT CTCGCTCGAT GATGCGGAAG GCGGCGGCGG GCTCATCGTG
ACGGTACGGC TTCCGCTCGC TTAA
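
The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helper `gc_content` below computes the GC percentage of such a block-formatted string; it is shown on the first two blocks of the sequence only, and the same function can be applied to the full sequence for comparison with the GC content field.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence in this record.
fragment = "ATGTCCAGTA GCCTCGTTAC"
print(gc_content(fragment))  # 50.0
```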
 
Protein sequence
MSSSLVTKVG AAIARLSHSL RVQLLCWVLM TLFGAIGFNL YDSFWTADAT AKLVTDRTLL 
ASARVIAEAV RVDEGGNVQV DVPPSALEMF DTGFGDRVSY QVITAWGSLV SGFPDLPLPA
VQRAGADRAF HGADVRVMML DHPVVGLPDD GTISVTVAVT HNSQYAMRRQ LWLSDFSKQF
VLVFVASLVT ILGLQRGLAP ALRLRDAVRQ RGRHRLDPLP SEMVQSELQP LVHALNDHME
RVQNQMAAQR RFVSNAAHQL RTPLALISTQ ASVAAREADP ARRDEALVAL RTSTKQISRL
ASQLLTLSRA EPGSRRPRSD ATDLSKAARE ILEAHAEEAL RRNIDVGLEA VRPVIVDGDA
TMLREMLVNL IDNAIRYTRP NGRVTVAVGQ ADGNAVVTVE DNGPGIPSGE REQVFERFYR
IMGTEAEGSG LGLSIVREVV EGAGGSVSLD DAEGGGGLIV TVRLPLA
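
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino acid assignments, which NCBI translation table 11 shares with the standard code (the two tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative; the function does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence, then its last three codons.
print(translate("ATGTCCAGTA GCCTCGTTAC A"))  # MSSSLVT
print(translate("CTCGCTTAA"))                # LA
```

Both fragments agree with the start (MSSSLVT) and end (LA) of the protein sequence listed above.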