Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5110 |
Symbol | |
ID | 8007702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 510040 |
End bp | 511716 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644822024 |
Product | histidine kinase |
Protein accession | YP_002973284 |
Protein GI | 241113449 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.967716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0814675 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCCG TCAACATCCT CCTCGTCGAC GACCAACCGG CAAAGCTCCT GAGTTACGAG GTCATTCTCG AAGAGCTCGA GGAAAACCTC ATCAAGGCGC AATCCGCTCG CGAAGCCTTC GAGCACCTGT TGCGCACTGA AATCGCGGTG ATCCTCGTCG ATGTCTGCAT GCCCGAACAG GATGGGTTCG AGCTCGTCAG CATGATCCGG CAGCATCCGC GCTATCAGAA CACGCCGATC ATCTTTGTTT CCGCCGTGAT GCTGGCGGAA CCCGACCGGC TGCGCGGCTA TGCTGTGGGC GCGGTCGACT ACGTCTCTGT TCCGATCGTC CCCGAGGTGC TGAGAGCCAA GGTGCGGGTC TTTGCCGAGC TCTACAGGAA GACGAGGGAA CTCGAACGCC TGAACGTCGA GCTGGAGGCC CGCGTTCAGC AGCGCACCGC CGAGCTCGAG GCTTCTGCAG CGCAGTTGCG TGAGCTCAAC GAGGAACTCG AGCACCGGAT CGATCAACGG ACGCGGGAGC GCGAAGAGGC GCTCGCACAG CTGTTCGAGG CGCAGAAGCT CGACACGATT GGCCACCTGA CGGGTGGCGT GGCCCACGAC TTCAATAACC TCCTGATGGC AGTTCTCGGC AGCCTGAATC TTCTCAAGAA GCGGCTTCCG GCCGATGAAC GCAGTGAACG CCTGGTGACG AACGCGATCC AGGCGGCCGA ACGCGGCACG GCGCTCACCC AGCGCCTGCT TGCTTTCGCA CGCCGCCAGG AGCTTAAGCC GCAGGCGGTC GACTTCTTCA GGCTGTTCGA AAACATCGAG GATCTTCTCG CCAAGGCGGT GGGGCCGCGC ATCGAAATCC GCAAAAGCAT CCCGGCGGAT CTGGCACCCC TCCTGGTCGA CAGCAACCAG TTGGAACTGG CGTTGCTCAA CCTGTTCGTC AATGCGCGGG ATGCGCTCGA AAGCGGCGGA GCCGTGACGG TTGCCGCGGC GGCAGCCGAA GAAGCCCGGC CGGCCAGCCT TGCAGGCGGA AATTACATCA GGATATCGGT GTCGGACGAT GGCGAGGGGA TGGACGAGGC AACGGTCTCG CGTGCCGCCG AACCGTTTTT CACCACCAAG GGGGTCGGCA AGGGCACCGG TCTCGGCCTG TCGATGGTGC ATGGCCTGGC GGCGCAATCC GGTGGCTCGA TCCAGATATC AAGCGTTAGG GGCAAAGGCA CGACGGTTTC GCTTTGGCTG CCCGTTGCCG AGGCATTCGT CAAGGTGCAG CCTCCCGTCG AGCTGCCGGC GACGGAGCCT TTGAAGCCGG CGTCGCGGCC GCTTGCCATT CTCGTAGTTG ATGACGATGC CCTTGTCAGG ACCGGGACCG TGGCGATGCT GGAGGATCTC GGGCACCTGC CGCAGGAAGC GTCTTCCGCT TCCCAGGCCT TGGAATTCTT TGCCCACGGG CAGGATTGCG ATCTCGTCAT CACCGATCAT GCCATGCCGG GCATGACGGG CGCCGAGCTT GCGCGTCACC TTCGCTCCTC CTTTCCAGGC CTGCCCATCA TCCTTGCCTC AGGCTATGCC GAGTTTTCCG AGGACCATGG CCTCGGCCGG ATGCTGCGGA TGAAGAAGCC ATTCACACAG GAACAGCTTC AGGCGGCGAT GGATCAGGCG CTCTCGGGCA AAGTCGCGGC GGCCTGA
|
Protein sequence | MNPVNILLVD DQPAKLLSYE VILEELEENL IKAQSAREAF EHLLRTEIAV ILVDVCMPEQ DGFELVSMIR QHPRYQNTPI IFVSAVMLAE PDRLRGYAVG AVDYVSVPIV PEVLRAKVRV FAELYRKTRE LERLNVELEA RVQQRTAELE ASAAQLRELN EELEHRIDQR TREREEALAQ LFEAQKLDTI GHLTGGVAHD FNNLLMAVLG SLNLLKKRLP ADERSERLVT NAIQAAERGT ALTQRLLAFA RRQELKPQAV DFFRLFENIE DLLAKAVGPR IEIRKSIPAD LAPLLVDSNQ LELALLNLFV NARDALESGG AVTVAAAAAE EARPASLAGG NYIRISVSDD GEGMDEATVS RAAEPFFTTK GVGKGTGLGL SMVHGLAAQS GGSIQISSVR GKGTTVSLWL PVAEAFVKVQ PPVELPATEP LKPASRPLAI LVVDDDALVR TGTVAMLEDL GHLPQEASSA SQALEFFAHG QDCDLVITDH AMPGMTGAEL ARHLRSSFPG LPIILASGYA EFSEDHGLGR MLRMKKPFTQ EQLQAAMDQA LSGKVAAA
|
| |