Gene Information |
Locus tag | Rleg_6856 |
Symbol | |
ID | 8022439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 302468 |
End bp | 305470 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644833722 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_002984856 |
Protein GI | 241666772 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
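The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 302468
END_BP = 305470
GENE_LENGTH_BP = 3003
PROTEIN_LENGTH_AA = 1000


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 305470 - 302468 + 1 gives 3003 bp, and 3003 / 3 - 1 gives 1000 aa, which matches the listed gene and protein lengths.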
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGGATAA ACGTCTTTCA ATTCTTCGAT GCGCTGCCCG GCATGGCATG GACCATGCTG GAGGATGGCC AAGTCGATTT CGTCAACCGA AGCTGGGTAG AGTTCACGGG GCTGCGACCG GCGTCCGGAG AGTCTTGGCG CTGGGAGGAT GCGGTGCACC CGGCCGACGT TGGCGCGGTC ACCGCCCATC TGCAGAGCGT TAGGCACGCA GAGACACAGG GAACGATCGA AGCGCGGCTT CGGAACGCTG CAGGCGAATT TCACCATTTC CTTATCCAGT GGGCACCTCT GGAAGACGCT CCCCACGGCG TAGTGAATTG GTGCGCGGTC GCGACCAATA TTGAACCCGT CGTTCAAAAA CGGGAAAGGT CACAGGCCGC ACTCGATTTC CAGCTCGTCG TCGATAGCAT CCCGATACCC GTGGCCGTGA CAACACCTGC TGGGGAAGTC GAAGGCCTCA ACCAACTGAC CCTCTCCTAT TTTGGGCTAA GCCTCTCGGA TCTAAAGGAC TGGAAGGCTT CGGAGGTCGT GCACCCGGAT GACTTGAAGG AAACCATCGA AGCCCAGATC GCCGCTCACA TGGCGGGCAC GTCTTATAAC GTTGAGAGCC GCCATCTTCG CGCCGACGGA ATTTATCGGT GGCACAATGT ACTTGGTCTA CCGCTGCGGG ACCAGTCGGG TGCGATTCAG CGGTGGCTGC ATCTTTTGAT AGACATCGAT GACCGAAAAC GTGCCGAGGT AGCTCTTGCA AACAGTGAAC GAGAGTCTCG CTTGATTGTC GGCACTATCG CAGGAATGGT CGCGCTCTTT ACGCCCGAAG GTCAGTTGAA CGGCGCAAAT CAACAGCTTC TCGACTACTT CCAACTGCCC CTGGAGGAGG TTGTGAACTG GGCGACCAAC GGTATCACGC ACCCCGACGA CCTGCAGCAC TGTGTCGAGA CATTTACGGC GTCGCTCAAA ACGGGAGAGC CGTATGACTT CGAGACGCGC TTCCGTCGCC ATGATGGCGA ATTTCGCTGG TTCCAGGTCC GCGGCCACCC GGTCAAAGAC GATAATGGCG GGATCGTCCG CTGGTATGGT CTGCTGACTG ATATCGATGA TGGGCGCAGG GCGGTGGAGG CGTTGCGCGA GCGCGAGATC GAGCTGCAAT TGATCGTAAA CTCGATCCCA GGCCTCATCA TTGTGCTCCG GCCCGACGGG GCTGTGGAGA GCGTAAACGA TCAGTCCTTG CGATATTTCG GCTATGATTT CAACGAGCAT CAAAAGTGGA AGACCAACGA TATCATTCAT CCTGACGATC GTGACCGGGG CGTGGCCAGG TTCGCCGAGG CAGTCGCCGC TGGTCAATCG TACGAGGTAG TGGAAAGGCT CCGTCGGCAC GACGGCGTCT ATCGTTGGTT CCAGGTGCGG GGAACTCCCG TGCGCGACTA CGAGGGCCTC GTTGTGCGAT GGTACTTCCT GCTGAATGAT ATCGACGACC GGAAGCACGC CGAAGTGGCG CTTGCTAACA GCGAGAGGGA ATTCCGCCAT ATCGTCAACA TGGTTCCGGG CATGATCATA CTTTCGCAGC CTGATGGAAC GCTCGACGGG AGCAACCAAC AGCTACTGGA CTACTTCGGA ATCTCTCTCG ACGAGGTGCA AGACTGGTCA ACGAACGGCA TAACTCACCC CGATGACGTG CAAGTCAATA TCGATACCTT CCTGGGCGCA CTCAAGAGCG GCAATCCTTA CGACTACCAA AGCCGTTATC GAAGACACGA CGGCGTGTTC CGATGGTTCC AGGTGCGAGG ACAGCCACTT 
CGTGACGCAG AGGGCAAGAT CGTCCGATGG TATGGACTAC TGACTGACAT CGACGATCGC AAGCAGGCCG AGGATGAACT GCGTAGAAGT CAGGCCCTTC TCGTTGCAGG CCAGCGCTTG ATCAGGACGG GTACGTTCTC CTGGCATGTC GAAACCGACG AGCTGATCTT GTCCGACGAA TGGCTTCGAA TCCTCGAATT CGAAAAGGAC GAAGTCGTGA CGTTTGACCG GATAACGGAG CGGATACACC CGGACGACGT TGCGCTCTTT GCCGGGAAGA TCGGCGCCGT ACGCGAAGGC GACGAGGACT CGGAATACGA GGTCCGCGTT CTCGCCCGCA ACGGCGACAT AAAGTACGTT CGCGTCATCG GCGAGGTTAT AATTCACCGG AATGGAAATC GGGAATGCCT TGGAGCGATC CAGGACGTGA CACAAAGGCG CTTGACGGAG GAAGCGCGCG ATCGGTTGCG CACAGAGCTT GCGCGAGTGA CCAGCATCCT CAGCCTCGGC CAGATGTCGG CGGCGATCGC GCACGAGGTG AACCAGCCAC TGTCAGGCAT CATCACGAAC GCGAACACCT GTCTGCGGAT GCTTGCAGCC ACACCGCCCG ACATAGAAAC GGCCCTTGAG ACGGCGCGGC GCACGATCAG GGACGGCAAT CGTGCCACCG AGGTAATCGC CAGACTGCGG GCGTTATTCA GTAAGCGCAA CATCGAATTC GAAGACGTCG ATATAAACGA GGCGGTGAGT GAGGTAGTCG CCTTGTCTGC AGGTGACCAG AGACGCAACG GCGTCGCTAT TCGAACGCAC TTCGCCACCT CCCTTCCCCC CGTCAACGGC GACCGAGTCC AACTTCAACA AGTGATCAAC AACTTGCTTC GCAACGCCAT TGACGCGGTA TCTGGCGTGA AGGATCGGCT GAGATTAGTC GAGATCCGAA CCCAGCTCGG CGGTGATGGG CAGATAAGCG TCGCGGTTAG CGACAATGGA ATTGGTCTCG ACCCAGACGG AGGGACGCGG ATCTTCGAAG CCTTCTACAC GACGAAGAAC AATGGGATGG GGATCGGTCT TTCAGTCTGT CGTTCCATCA TCGAGAGTCA TGGTGGTCGC CTTTGGGCTG AGCCTAACCA GGGGCCGGGC GTCACCATGC ACTTCTCCGT ACCGTCGGCC GAAGAGGCAA GCATCACGGC AGCCTCGCAC TGA
|
Protein sequence | MGINVFQFFD ALPGMAWTML EDGQVDFVNR SWVEFTGLRP ASGESWRWED AVHPADVGAV TAHLQSVRHA ETQGTIEARL RNAAGEFHHF LIQWAPLEDA PHGVVNWCAV ATNIEPVVQK RERSQAALDF QLVVDSIPIP VAVTTPAGEV EGLNQLTLSY FGLSLSDLKD WKASEVVHPD DLKETIEAQI AAHMAGTSYN VESRHLRADG IYRWHNVLGL PLRDQSGAIQ RWLHLLIDID DRKRAEVALA NSERESRLIV GTIAGMVALF TPEGQLNGAN QQLLDYFQLP LEEVVNWATN GITHPDDLQH CVETFTASLK TGEPYDFETR FRRHDGEFRW FQVRGHPVKD DNGGIVRWYG LLTDIDDGRR AVEALREREI ELQLIVNSIP GLIIVLRPDG AVESVNDQSL RYFGYDFNEH QKWKTNDIIH PDDRDRGVAR FAEAVAAGQS YEVVERLRRH DGVYRWFQVR GTPVRDYEGL VVRWYFLLND IDDRKHAEVA LANSEREFRH IVNMVPGMII LSQPDGTLDG SNQQLLDYFG ISLDEVQDWS TNGITHPDDV QVNIDTFLGA LKSGNPYDYQ SRYRRHDGVF RWFQVRGQPL RDAEGKIVRW YGLLTDIDDR KQAEDELRRS QALLVAGQRL IRTGTFSWHV ETDELILSDE WLRILEFEKD EVVTFDRITE RIHPDDVALF AGKIGAVREG DEDSEYEVRV LARNGDIKYV RVIGEVIIHR NGNRECLGAI QDVTQRRLTE EARDRLRTEL ARVTSILSLG QMSAAIAHEV NQPLSGIITN ANTCLRMLAA TPPDIETALE TARRTIRDGN RATEVIARLR ALFSKRNIEF EDVDINEAVS EVVALSAGDQ RRNGVAIRTH FATSLPPVNG DRVQLQQVIN NLLRNAIDAV SGVKDRLRLV EIRTQLGGDG QISVAVSDNG IGLDPDGGTR IFEAFYTTKN NGMGIGLSVC RSIIESHGGR LWAEPNQGPG VTMHFSVPSA EEASITAASH
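The sequences in this record are displayed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation of the kind that could be compared with the "GC content" field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example = clean_sequence("ATGGGGATAA ACGTCTTTCA")
print(len(example), gc_percent(example))
```

Passing the full "Gene sequence" value through `clean_sequence` first should give a 3003-character string whose GC percentage can then be compared with the value listed in the record.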
|