Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0859 |
Symbol | |
ID | 3774036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 860118 |
End bp | 862976 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637799275 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_399876 |
Protein GI | 81299668 |
COG category | [K] Transcription [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.571784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0418696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATC CTCAGGCGAT TATTCGACTG CAATTTCTGG AAGAAGCTGA AGAATACCTT GGCACGATTG AAAAGGAACT GGTCGATTTA AGCCAAGCGA GCGATCGCCG ATCGCAAGTT GATCTCATCC TCCGGGCCGC CCATTCAATC AAGGGTGGTG CGGCCATGAT GGGCTTCAAC ACCCTCAGCG AACTAGCGCA TCAACTTGAA GACTTCTTTA AGGTGATGCG CAGCGGCAAA AACTTTGACC AGTCTCTAGA ACGTCAGCTA CTTCAGGGAG TTGATCGGCT CCATCAGGTT CTGCAAGTTA ATCGCCAGGG GACAGACCCT TCACCTCAAT GGCTTGAGAG TCAAGTCCAA CCCTTATTTG CTGTCCTCCG AGAGCAGTTT GGGGACCCTC AACCTGACGC TGATTTGGCA ATGCTCTCGG AAGAAGCGGG CGATGATATG CGATCGCTGC TGTTTGAATC GGAGGTTGAA GGGTGCTTGC AACGTTTGGA GTCGGTGCTG CAAACCCCAG AGCATCCCTG CTTATTGGAA GAATTTGCAA TCGTTGCCCA AGAGCTAGCG GGGTTGGGGC AGATGTTAGA ACTGTCCCAG TTCAGTCAGT TCTGTACCTC GATCGCTGAA CTTCTTGAAC AGCAGCCGAC TGATTTAAGG GCGACCGCTA CCAAGATCCT TAAAGACTTA AGACATTGCC AGGCTCTCGT TTTAACCCAG CAAATCAATG CTCTGCCCAG TCAATTTGTT GCTCAAGATC AGTCTCCAGA GGGGATAACT CTAGAAGCAA ATTCTGGCTT AGCAGAAACT TTCCCTCAAA CTAAGGAAGT CATTCACCAA AATCAAATTA CAGCAACTCG GAATTCGGAA AATATACAGT CCAGCTTGAC AGTTGCTGAA GCTCGTCAGT TGGACTTTAC GCCGGAGACA ATCACAACCC AAACGCTTCA TCCCCCAATA AATGCAACAG ATTTAACGTC AGAGATTAAC CAAGCACGAA CTGTACGGGT TGGGATTCAA CAACTAGAAG ACCTGAGCGA ATTGTTTGGT GAATTGAATA TTGAGCGGAA TGGGCTCAGT TTGCAACTGA AGCGCATGCG GCAGCTTCTG CAAACACTCA AGACAAAAGT TCGTCGACTC GAGCAGTCGA GCTTTGAGTT ACGGTCTACG ACTGATCGCG CGGCGACAGC AACAACAGCT TTCAGTTTGG TTGGTAGTGG CGGCAGTTCT TGGCACCAAA ACTTTGATAT TTTGGAGTTC GATCGCTACA GCCCTCTGCA TTTGGTTTCC CAAGACGTGA TGGAAACCAT TGTTCAGATT CAAGAAATTA CTAGCGATAT CGAAACGAAC CTTGAAGAGA CGGAACAGAC AGAACAGTCT TTGCGGCGAA CTGCGAAGCA GATGCAGGGT CGCCTTACCC AAGTAAGAAT GCGCCCCTTT GCTGACTTGG TCAATCGCTA TCCTCGGATG ATTCGCCAAC TAGGGCAAGA GTATGGAAAA GACGTTCGGC TCATTATCAA CGGCGAGAAT ACACTAGTTG ACCGCGCCAT TCTAGAGCTT CTAGCAGATC CGCTATTACA CCTTGTTCGC AACGCCTTTG ACCATGGTGT TGAATCCCCT CAGCTGCGGC AATCTCGGGA TAAATCCCCA ACAGCAACCA TCGAAGTGTC AGCAGCCTAT CGAGGGAATC AAACAGTCAT TACAATTCGG GATGATGGGT GTGGCGTTGA TCTCCAGAAA ATTCGCCAGC GTGCCCAACG GATGGGGCTA GATCAATCAA GCTTAGAAGC TGCCAGCGAT CGCGAGCTTT TGGATTTGAT TTTCGAGCCT GGATTTACAA CTGCGGAGCA AGTGACCGAA CTCTCAGGCC GAGGGGTTGG CATGGATGTT GTCCGCACGA ATCTTCAGCA GATTCGGGGA GATATCCAGA TTGAAACAAA ACCTAACCAA GGCACAACTT TTACGATTAC AGTGCCTTTT TCTCTCTCGG TTGCACGGGT TCTCCTCATC GAGGTCAGCA ATATTCTAGT TGCCATCCCG ACTGATGTAA TTGAGGAAAT TATTGAACTC AATCCAAGCT GGATTCTAAA TAGTGCTGGT CAAACAGTTC TTAATCTCGA TGAGATACTG GTTCCGCTGC TCCAGCTCGA TCAGTGGTTT CAGTTTAGTC GGCCCTGCCC GCCCATTAGT TTAGATGGTG TCCCGACAAT CAATCAGCCT ACTGTTCTTC TTGTTAACCA GGGCAATAGT TTCGTCGGCA TTCAAGTCGA TCGCTACTGG GGAGAACAGG AGGTTACAAT CCGTCAAGTA AATAGCACGA TTCCTTTGCC GCCGGGATTT AGTAGTTGCA CTATTCTGGG TGACGGGCGA ATTGTTCCTT TAGTCGATAC CTTTTCGCTC TTGCAGTGGA TTCAGAATTC GAATGTTCCT AGTCGTCTTT CCTTGAAAGC GATACAGCCA GCAACCTATC AACTTCCCTC TCAGAAGAGC ATCTTAATTA TCGATGACTC GATTAATGTC AGACGGTTTT TAGCCAATAT TTTAGAGAAA GCAGGCTATC GAGTTGAACA AGCTAAGGAT GGCCAAGAAG CCATTGATAA ACTAAAAGAT GGCTTAAAGG TTGAAGCCGC CATTTGTGAT GTGGAGATGC CTCGGTTGGA TGGTTATGGA TTCCTGACTC AAGTCAAAAA TATCGTTGCT GGCGCAAACC TACCGATCGC TATGTTGACC TCACGCAGCG GGGATAAGCA CCGGCGCTTA GCGCTCAATC TGGGAGCCGC CGCCTACTTC ACCAAACCTT TCCGCGAGCC AGAATTGCTC CAGACCCTAC AAGAACTAAT TCAAGTGAAA CAGTCTTAG
|
Protein sequence | MSDPQAIIRL QFLEEAEEYL GTIEKELVDL SQASDRRSQV DLILRAAHSI KGGAAMMGFN TLSELAHQLE DFFKVMRSGK NFDQSLERQL LQGVDRLHQV LQVNRQGTDP SPQWLESQVQ PLFAVLREQF GDPQPDADLA MLSEEAGDDM RSLLFESEVE GCLQRLESVL QTPEHPCLLE EFAIVAQELA GLGQMLELSQ FSQFCTSIAE LLEQQPTDLR ATATKILKDL RHCQALVLTQ QINALPSQFV AQDQSPEGIT LEANSGLAET FPQTKEVIHQ NQITATRNSE NIQSSLTVAE ARQLDFTPET ITTQTLHPPI NATDLTSEIN QARTVRVGIQ QLEDLSELFG ELNIERNGLS LQLKRMRQLL QTLKTKVRRL EQSSFELRST TDRAATATTA FSLVGSGGSS WHQNFDILEF DRYSPLHLVS QDVMETIVQI QEITSDIETN LEETEQTEQS LRRTAKQMQG RLTQVRMRPF ADLVNRYPRM IRQLGQEYGK DVRLIINGEN TLVDRAILEL LADPLLHLVR NAFDHGVESP QLRQSRDKSP TATIEVSAAY RGNQTVITIR DDGCGVDLQK IRQRAQRMGL DQSSLEAASD RELLDLIFEP GFTTAEQVTE LSGRGVGMDV VRTNLQQIRG DIQIETKPNQ GTTFTITVPF SLSVARVLLI EVSNILVAIP TDVIEEIIEL NPSWILNSAG QTVLNLDEIL VPLLQLDQWF QFSRPCPPIS LDGVPTINQP TVLLVNQGNS FVGIQVDRYW GEQEVTIRQV NSTIPLPPGF SSCTILGDGR IVPLVDTFSL LQWIQNSNVP SRLSLKAIQP ATYQLPSQKS ILIIDDSINV RRFLANILEK AGYRVEQAKD GQEAIDKLKD GLKVEAAICD VEMPRLDGYG FLTQVKNIVA GANLPIAMLT SRSGDKHRRL ALNLGAAAYF TKPFREPELL QTLQELIQVK QS
|
| |