Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2152 |
Symbol | |
ID | 4568647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2490830 |
End bp | 2493787 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 639766727 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_912581 |
Protein GI | 119357937 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0146455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAAGC CGGAATACCA TGATCTTCAG CAACGGATAC TGGAACTCGA AACAGATTCG TTACGAAGCA GACGAATCGA GCAGGAGCTG CTTGAGAAGC AGGCTGTTCT CGGGCATCAG AATATCAAGC TGATACGAAA ATCTATTGAG CTTTCAGACG TTAAAAGGCA GCTCGAAGAT AATAACTATG AGCTTGAGAT ATCGCAGGCG AAACTCCAGA ATGCATTGAA CTCTTTGCGT GAAAGCGAAA ATACGCTCAG TTCCGTGCTT GTCAACAGTC CTGATACGAT CATTGCCGTT GACAGAGACC ATCGCATTAT TTATGTCAAC AGGGCCATGC CTGGGCATAA AAAAACGCTG GCAGTAGGGG ATCACCTCTG CGATCATATT ATGCAACCCT ATCATGACCG GTATCATCAT ACTATAGAGC GGGTTATTGT AACCGGTAAA ACTGCGGTTC TCGAATGTGA ACTTGTTGTT TCTCCTGAGG AAACCATTGA GCTTGAGTCC CGTTTTGCGC CCTGTTTTCT CAATGGCGAG GTTACTTCCG TCGTTATGCT TTCAGTGGAT ATAACCGAGC GAAAGGGAAT GGAGCTGGAG CTTAAAAAAA CATTGATCGA TCTTGAACGC TTTAACAGGG TTATGGTCGG TCGTGAACTG CGTAATATCG AGCTTAAAAA ACGCATAGCC AGCCTCCAGA ACGATCTTGT TCTTTCTTCA GGTAACGCTC AGGCTTCTGA TGATTCCAGG CAGGACTTTG CCACAGAGGA TGAGGATTGC GCCGGAATAC ACCACGGTGA TGGTTTCGAA GAGGCCTGTG ACGAGGTGCA ATACAGGAAG CAACAGCGGA TGGCTCTTCT CAATCTTATC GAAGATGCCA ATCTTGCACG CAATGAGCTG CTCGAAACAA ATCGGAAGCT TGAAGAGTCT GTTATCCGAA CACAGGAAAT GGCCAGAGCT GCAAGTAAAG CCAATGAGGC AAAGAGCCAG TTTCTTGCCA ATATGAGCCA TGAAGTCCGT ACCCCGATGA ATGGCGTTAT CGGCATGTCC GACCTTCTTC TTGATACCAG TCTCGATCCT GAACAGCGGA AATATGTTGA AACCATTATC AGCAGCGGCA AGAATCTGCT GAGTATCATC AACGATATTC TTGATTTTTC TAAAATAGAA GCCAATCGTC TTGACCTTGA TGTCGTTGAT TTTGATTTAC TCGAACTGCT TGAAGATGTT TGCGGCATAC TTGGCTTGCA GGCACAGCAA AAAGGTCTTG AATTAACGCT TGTAACCGGT TCTTTCCTGC CCCGTTTTCT CAGAGGTGAT CAGGCAAGAA TTCGCCAGAT TCTTGTTAAT CTTGTTGGCA ATGCGGTAAA ATTTACTCAT TCCGGAGAGG TCGTTGTATG CGCAATGGCT CAAGAGGAGC GTGATTCGCA GGTTACCATA AGGCTGTTGG TAAGAGATAC TGGTATTGGT ATACCCCGGG AGATGATGAA GGCTGTTTTT GAGCCTTTTA TACAGGCGGA TGGTTCGACA AGAAGAAAAT ATGGCGGCAC AGGGCTTGGA CTTGCGATTT CAAACCAGCT TGCTAAAAAG ATGGGAAGCA CCATCATTCT CGAAAGCACT AATGGCGAAG GGTCTGTTTT CTGGTTTGAC GTTGTGCTTG AAAAACAGTG TCAATCATCA GAGCTGCTTC CGGAAAGCGG CGCGGGTTTG GCTGGAAAAA GAGTGCTGGT TGTCAACAGG AATGCTTCGA TGCGTTTTAT GCTCAAGGGC GTTCTTGAAT CCTGCAGCGT TGACTGTACC GTATTCGGAG GTATTGAAGA GGCTTTGGCC GCAGTTACAT CCTCATCGGT TCATCTCGAA TCAGTTCCAG TATGGAATGT CGCTATTCTT GATACGAATG TCGCGGAACA CTCATTAGAG GAGTTTCAAC GTTTGATCGG TACGATTACG GAGGTTCATC GGTGTCCGAT TATTCTGCTC GTGTCATTCG GGCAGTATGA GGAGATGAAA AAACTGTTCA GTACAGGGGT ATTCAGGCTG CTTTTGAAGC CGGTTCGCCA GGCAGAGGTG GTCGTCGCTG TTGTTGACGC ACTGAATAAT GAATCGGCAG AATATCAGGA ACCTGTTAAA CAGGTTGGTG CAGGCTGGCA AGATTCGGAA ACCGAAACGT ATCATATTCT TCTTGTTGAA GACAGTCCTG TGAATCAGCA GGTTGCGGTT GCCATGCTCA GGAAAATCGG TTACTCTCCT GATGTGGTTG CAAGTGGAAA AGCTGCGATT GATGCCATGC GCTGTAAGGT GTATGATCTT GTTCTGATGG ATTGCCAGAT GCCGGAAATG GATGGTTATG AAGCAACCAG AATAATCAGA ACCGACAGGA CGCTTTGTGG AACTCCGGAC ATTCCTGTTG TCGCCATGAC GGCACATGCC ATGATTGGAG ATCGGGAAAA GTGTCTCAGT GCAGGGATGG ACGATTATCT TCCGAAACCG GTTTGTAAAT CTGATCTTAA CGCTGTTCTT CTGAAATATC TTCAACGAAA AAAAAAACCG GAAAAAATCG TGGAACAAAA GACATGTATT GCTGAGGTCA AGAGCGAGCT GATAACTGCC GATGAAGTTT TTCTTATTGA TGATCTTCTC TGGAGAATGC AGAATGATCG TGAATTTGTA CGTATGATTC TGGGGCAGTT CATCGGCGAG GTTCCAAAAC GGATCGTTGA GATGGAGACC GCTCTTGATC GTCATGATAC GGATTTGGCA AGTATGATAG CCCATACCAT AAAAGGCGAG GCCGTAACCG TTGGTGGCAA GGTGCTGGGC CTGCATGCCT CATCGATAGA GATGGCGGCG AAATCCGGCG ATATAAAAAA ATCACGGGAG TTTCTCCGGG ATTTGAAAGA GCAGTTCAGG ATTTTCATCG AGCGAGTTTC TGCTACGGGA TGGTATGCCG CTGAGTGA
|
Protein sequence | MQKPEYHDLQ QRILELETDS LRSRRIEQEL LEKQAVLGHQ NIKLIRKSIE LSDVKRQLED NNYELEISQA KLQNALNSLR ESENTLSSVL VNSPDTIIAV DRDHRIIYVN RAMPGHKKTL AVGDHLCDHI MQPYHDRYHH TIERVIVTGK TAVLECELVV SPEETIELES RFAPCFLNGE VTSVVMLSVD ITERKGMELE LKKTLIDLER FNRVMVGREL RNIELKKRIA SLQNDLVLSS GNAQASDDSR QDFATEDEDC AGIHHGDGFE EACDEVQYRK QQRMALLNLI EDANLARNEL LETNRKLEES VIRTQEMARA ASKANEAKSQ FLANMSHEVR TPMNGVIGMS DLLLDTSLDP EQRKYVETII SSGKNLLSII NDILDFSKIE ANRLDLDVVD FDLLELLEDV CGILGLQAQQ KGLELTLVTG SFLPRFLRGD QARIRQILVN LVGNAVKFTH SGEVVVCAMA QEERDSQVTI RLLVRDTGIG IPREMMKAVF EPFIQADGST RRKYGGTGLG LAISNQLAKK MGSTIILEST NGEGSVFWFD VVLEKQCQSS ELLPESGAGL AGKRVLVVNR NASMRFMLKG VLESCSVDCT VFGGIEEALA AVTSSSVHLE SVPVWNVAIL DTNVAEHSLE EFQRLIGTIT EVHRCPIILL VSFGQYEEMK KLFSTGVFRL LLKPVRQAEV VVAVVDALNN ESAEYQEPVK QVGAGWQDSE TETYHILLVE DSPVNQQVAV AMLRKIGYSP DVVASGKAAI DAMRCKVYDL VLMDCQMPEM DGYEATRIIR TDRTLCGTPD IPVVAMTAHA MIGDREKCLS AGMDDYLPKP VCKSDLNAVL LKYLQRKKKP EKIVEQKTCI AEVKSELITA DEVFLIDDLL WRMQNDREFV RMILGQFIGE VPKRIVEMET ALDRHDTDLA SMIAHTIKGE AVTVGGKVLG LHASSIEMAA KSGDIKKSRE FLRDLKEQFR IFIERVSATG WYAAE
|
| |