Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5151 |
Symbol | |
ID | 6978245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 789081 |
End bp | 790991 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643394279 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_002279097 |
Protein GI | 209547179 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.300471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.605559 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATAGAG AGCCGGAAAC CGATCCCATT ATCGATCTGA TCCCCGCAAT GGTGTGGTCG GCAACATCGG ATGGCATGCT GGATTTCGCT AACCGGCACT TTCTCGACTT TATCGGGGCC CCGCTCGAGC AAATCTCCGG CGAGAGATTT TATGGAATAT TCCATCCCGA CGACACGTCG CGACTCGCTT CCGAGTGGCA TGAGATCATG GCTTCTAAAA ATGCGAGGGA GGTAGAGGGC AGGCTCAGGC GCGCGGATGG CCACTATCGT TGGTGCACGC TTCGTCAAAA GCCTCGGTTC GATCCCGACG GAAATGTCGT CAGATGGTAT GGTGTTGTCC TTGATATCGA AGACCGCAAG CAGGCCGAAA ATGCGCTGAA GGAGACGAAG ACGGCACTTG CCGCCAGCGA AGAGAATCTA AGCCTCATCA TCAATTCATT GCCTGTGCTC GTATGGTCGG CACGGCCAGA TGGCAGCGCC GATTTTGTCA ATCAAAGCTG GTTGGATTAT GCGGGTCGCC CGGCCGACCA GATCCTCGAA TGGGGCTTCC TCGACCTTTA CCACCCGGAT GATATTCCCG GTATGGTCGC CATCTGGAAG AGAGATCTCG AACATTCCGA CCATACCGTT CTGAAGGGCC GTATTCGCGG AGCGGACGGT AACTACCGCT GGTTCTATTT CTCGGGCCGC AAGCTGGTCG ATGCCAACGG CGTCGTGCGT TGGTTCGGCT GCAATATCGA TATCGAGGAT CTGCAAGCGG CAGAGAATGC GCTGAGAGAA AGCGAAACCG CGCTCAGGGA AAGCGAACAC AAGCTCAGTC TCATCATCAA TACGATACCG GCGATGGCTT GGTCCTGTAC ACCAGATGGC CGGCTTGAAT ACTTTAACCG AAACCTGATC GATTATGTCG GCCTACCCGT CGAGGAAATC GTCGGGTTCG GCTTTTATCG CATGTTTCAC CCGGATGATG TCGAGCCGAT GCGTTTCGCC TGGGACGACA TCGTCGCCAC GAAAAAAAGT CGTCCGGTGG ACGCGCGCAT CAGACGGGCG GATGGCGAAT ATCGATGGTT CAACCTCCGG CAAAGCCCCC TTCTGGACAG AGATGGGAAT GTCGTGCGGT GGTACGGCGT CGTCGTCGAT ATCGAGGATC GCAAGCGGGC GGAAGAGTCC TTGAGACAAA GCCAGAGCAA TCTGGCGCAT GTGACGCGTA TGACAGCCAC GGGCGAGCTT GCCGTTTCGA TCGCCCACGA GGTCAACCAG CCGCTCATGG CAATCGTCAC GAATGCTGGC ACGTGCCTGC GCTGGCTGCA GCCGGGACAC ACCGATATCG AGCAGGCCCG GCTGGCGGCC GAGAGGATCG TCAAGGATGG TCATCGCGCC GGCGATATCA TCGCGAGTAT CAGGGCAATG GCTCAAAAGT CACCAGCCCG GATGGAACAA ACTGATTTGA AAGGCGCCTT GCATCACGTG CTGGATCTTT TGAGAGGGGA ACTTCGTCAC CGAGACATCG AACTCGATCT TGATCTTCCG CAACGTTCTC TTGAGGTCAT TGGAGATCGA ACCCAGTTGC AGCAAGTCGT GCTGAATTTG GTCATGAATA GCGCCGAAGA GATGGCCAGA TCTTCAGGTG AAAGGCGTAT CGGCATCAGA TGCGCCGAGG ACGAGCACCA ATTCGTGAAG GTGAGCGTCT CGGATACCGG ACGCGGCGTT TCTTCCGACG AGTTGGATCG GGTTTTCGAA ACGTTCTACA GCACGAAGGC AGATGGCATC GGAATGGGAC TTTCGATATG CCGTTCCATT GTCGAAGCTC ATGGCGGCCG GATCTGGGCT TCGGCGGCAG ACAATCAGAT TGGCGCGGTG TTCACGTTTA CGCTGCCGAT GGCGGAGGTT GCGGCCGCAG ATGATCGATG A
|
Protein sequence | MDREPETDPI IDLIPAMVWS ATSDGMLDFA NRHFLDFIGA PLEQISGERF YGIFHPDDTS RLASEWHEIM ASKNAREVEG RLRRADGHYR WCTLRQKPRF DPDGNVVRWY GVVLDIEDRK QAENALKETK TALAASEENL SLIINSLPVL VWSARPDGSA DFVNQSWLDY AGRPADQILE WGFLDLYHPD DIPGMVAIWK RDLEHSDHTV LKGRIRGADG NYRWFYFSGR KLVDANGVVR WFGCNIDIED LQAAENALRE SETALRESEH KLSLIINTIP AMAWSCTPDG RLEYFNRNLI DYVGLPVEEI VGFGFYRMFH PDDVEPMRFA WDDIVATKKS RPVDARIRRA DGEYRWFNLR QSPLLDRDGN VVRWYGVVVD IEDRKRAEES LRQSQSNLAH VTRMTATGEL AVSIAHEVNQ PLMAIVTNAG TCLRWLQPGH TDIEQARLAA ERIVKDGHRA GDIIASIRAM AQKSPARMEQ TDLKGALHHV LDLLRGELRH RDIELDLDLP QRSLEVIGDR TQLQQVVLNL VMNSAEEMAR SSGERRIGIR CAEDEHQFVK VSVSDTGRGV SSDELDRVFE TFYSTKADGI GMGLSICRSI VEAHGGRIWA SAADNQIGAV FTFTLPMAEV AAADDR
|
| |