Gene Rleg_5110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5110 
Symbol 
ID8007702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp510040 
End bp511716 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content63% 
IMG OID644822024 
Producthistidine kinase 
Protein accessionYP_002973284 
Protein GI241113449 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.967716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0814675 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCG TCAACATCCT CCTCGTCGAC GACCAACCGG CAAAGCTCCT GAGTTACGAG 
GTCATTCTCG AAGAGCTCGA GGAAAACCTC ATCAAGGCGC AATCCGCTCG CGAAGCCTTC
GAGCACCTGT TGCGCACTGA AATCGCGGTG ATCCTCGTCG ATGTCTGCAT GCCCGAACAG
GATGGGTTCG AGCTCGTCAG CATGATCCGG CAGCATCCGC GCTATCAGAA CACGCCGATC
ATCTTTGTTT CCGCCGTGAT GCTGGCGGAA CCCGACCGGC TGCGCGGCTA TGCTGTGGGC
GCGGTCGACT ACGTCTCTGT TCCGATCGTC CCCGAGGTGC TGAGAGCCAA GGTGCGGGTC
TTTGCCGAGC TCTACAGGAA GACGAGGGAA CTCGAACGCC TGAACGTCGA GCTGGAGGCC
CGCGTTCAGC AGCGCACCGC CGAGCTCGAG GCTTCTGCAG CGCAGTTGCG TGAGCTCAAC
GAGGAACTCG AGCACCGGAT CGATCAACGG ACGCGGGAGC GCGAAGAGGC GCTCGCACAG
CTGTTCGAGG CGCAGAAGCT CGACACGATT GGCCACCTGA CGGGTGGCGT GGCCCACGAC
TTCAATAACC TCCTGATGGC AGTTCTCGGC AGCCTGAATC TTCTCAAGAA GCGGCTTCCG
GCCGATGAAC GCAGTGAACG CCTGGTGACG AACGCGATCC AGGCGGCCGA ACGCGGCACG
GCGCTCACCC AGCGCCTGCT TGCTTTCGCA CGCCGCCAGG AGCTTAAGCC GCAGGCGGTC
GACTTCTTCA GGCTGTTCGA AAACATCGAG GATCTTCTCG CCAAGGCGGT GGGGCCGCGC
ATCGAAATCC GCAAAAGCAT CCCGGCGGAT CTGGCACCCC TCCTGGTCGA CAGCAACCAG
TTGGAACTGG CGTTGCTCAA CCTGTTCGTC AATGCGCGGG ATGCGCTCGA AAGCGGCGGA
GCCGTGACGG TTGCCGCGGC GGCAGCCGAA GAAGCCCGGC CGGCCAGCCT TGCAGGCGGA
AATTACATCA GGATATCGGT GTCGGACGAT GGCGAGGGGA TGGACGAGGC AACGGTCTCG
CGTGCCGCCG AACCGTTTTT CACCACCAAG GGGGTCGGCA AGGGCACCGG TCTCGGCCTG
TCGATGGTGC ATGGCCTGGC GGCGCAATCC GGTGGCTCGA TCCAGATATC AAGCGTTAGG
GGCAAAGGCA CGACGGTTTC GCTTTGGCTG CCCGTTGCCG AGGCATTCGT CAAGGTGCAG
CCTCCCGTCG AGCTGCCGGC GACGGAGCCT TTGAAGCCGG CGTCGCGGCC GCTTGCCATT
CTCGTAGTTG ATGACGATGC CCTTGTCAGG ACCGGGACCG TGGCGATGCT GGAGGATCTC
GGGCACCTGC CGCAGGAAGC GTCTTCCGCT TCCCAGGCCT TGGAATTCTT TGCCCACGGG
CAGGATTGCG ATCTCGTCAT CACCGATCAT GCCATGCCGG GCATGACGGG CGCCGAGCTT
GCGCGTCACC TTCGCTCCTC CTTTCCAGGC CTGCCCATCA TCCTTGCCTC AGGCTATGCC
GAGTTTTCCG AGGACCATGG CCTCGGCCGG ATGCTGCGGA TGAAGAAGCC ATTCACACAG
GAACAGCTTC AGGCGGCGAT GGATCAGGCG CTCTCGGGCA AAGTCGCGGC GGCCTGA
 
Protein sequence
MNPVNILLVD DQPAKLLSYE VILEELEENL IKAQSAREAF EHLLRTEIAV ILVDVCMPEQ 
DGFELVSMIR QHPRYQNTPI IFVSAVMLAE PDRLRGYAVG AVDYVSVPIV PEVLRAKVRV
FAELYRKTRE LERLNVELEA RVQQRTAELE ASAAQLRELN EELEHRIDQR TREREEALAQ
LFEAQKLDTI GHLTGGVAHD FNNLLMAVLG SLNLLKKRLP ADERSERLVT NAIQAAERGT
ALTQRLLAFA RRQELKPQAV DFFRLFENIE DLLAKAVGPR IEIRKSIPAD LAPLLVDSNQ
LELALLNLFV NARDALESGG AVTVAAAAAE EARPASLAGG NYIRISVSDD GEGMDEATVS
RAAEPFFTTK GVGKGTGLGL SMVHGLAAQS GGSIQISSVR GKGTTVSLWL PVAEAFVKVQ
PPVELPATEP LKPASRPLAI LVVDDDALVR TGTVAMLEDL GHLPQEASSA SQALEFFAHG
QDCDLVITDH AMPGMTGAEL ARHLRSSFPG LPIILASGYA EFSEDHGLGR MLRMKKPFTQ
EQLQAAMDQA LSGKVAAA