Gene Synpcc7942_0859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0859 
Symbol 
ID3774036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp860118 
End bp862976 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content48% 
IMG OID637799275 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_399876 
Protein GI81299668 
COG category[K] Transcription
[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.571784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0418696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC CTCAGGCGAT TATTCGACTG CAATTTCTGG AAGAAGCTGA AGAATACCTT 
GGCACGATTG AAAAGGAACT GGTCGATTTA AGCCAAGCGA GCGATCGCCG ATCGCAAGTT
GATCTCATCC TCCGGGCCGC CCATTCAATC AAGGGTGGTG CGGCCATGAT GGGCTTCAAC
ACCCTCAGCG AACTAGCGCA TCAACTTGAA GACTTCTTTA AGGTGATGCG CAGCGGCAAA
AACTTTGACC AGTCTCTAGA ACGTCAGCTA CTTCAGGGAG TTGATCGGCT CCATCAGGTT
CTGCAAGTTA ATCGCCAGGG GACAGACCCT TCACCTCAAT GGCTTGAGAG TCAAGTCCAA
CCCTTATTTG CTGTCCTCCG AGAGCAGTTT GGGGACCCTC AACCTGACGC TGATTTGGCA
ATGCTCTCGG AAGAAGCGGG CGATGATATG CGATCGCTGC TGTTTGAATC GGAGGTTGAA
GGGTGCTTGC AACGTTTGGA GTCGGTGCTG CAAACCCCAG AGCATCCCTG CTTATTGGAA
GAATTTGCAA TCGTTGCCCA AGAGCTAGCG GGGTTGGGGC AGATGTTAGA ACTGTCCCAG
TTCAGTCAGT TCTGTACCTC GATCGCTGAA CTTCTTGAAC AGCAGCCGAC TGATTTAAGG
GCGACCGCTA CCAAGATCCT TAAAGACTTA AGACATTGCC AGGCTCTCGT TTTAACCCAG
CAAATCAATG CTCTGCCCAG TCAATTTGTT GCTCAAGATC AGTCTCCAGA GGGGATAACT
CTAGAAGCAA ATTCTGGCTT AGCAGAAACT TTCCCTCAAA CTAAGGAAGT CATTCACCAA
AATCAAATTA CAGCAACTCG GAATTCGGAA AATATACAGT CCAGCTTGAC AGTTGCTGAA
GCTCGTCAGT TGGACTTTAC GCCGGAGACA ATCACAACCC AAACGCTTCA TCCCCCAATA
AATGCAACAG ATTTAACGTC AGAGATTAAC CAAGCACGAA CTGTACGGGT TGGGATTCAA
CAACTAGAAG ACCTGAGCGA ATTGTTTGGT GAATTGAATA TTGAGCGGAA TGGGCTCAGT
TTGCAACTGA AGCGCATGCG GCAGCTTCTG CAAACACTCA AGACAAAAGT TCGTCGACTC
GAGCAGTCGA GCTTTGAGTT ACGGTCTACG ACTGATCGCG CGGCGACAGC AACAACAGCT
TTCAGTTTGG TTGGTAGTGG CGGCAGTTCT TGGCACCAAA ACTTTGATAT TTTGGAGTTC
GATCGCTACA GCCCTCTGCA TTTGGTTTCC CAAGACGTGA TGGAAACCAT TGTTCAGATT
CAAGAAATTA CTAGCGATAT CGAAACGAAC CTTGAAGAGA CGGAACAGAC AGAACAGTCT
TTGCGGCGAA CTGCGAAGCA GATGCAGGGT CGCCTTACCC AAGTAAGAAT GCGCCCCTTT
GCTGACTTGG TCAATCGCTA TCCTCGGATG ATTCGCCAAC TAGGGCAAGA GTATGGAAAA
GACGTTCGGC TCATTATCAA CGGCGAGAAT ACACTAGTTG ACCGCGCCAT TCTAGAGCTT
CTAGCAGATC CGCTATTACA CCTTGTTCGC AACGCCTTTG ACCATGGTGT TGAATCCCCT
CAGCTGCGGC AATCTCGGGA TAAATCCCCA ACAGCAACCA TCGAAGTGTC AGCAGCCTAT
CGAGGGAATC AAACAGTCAT TACAATTCGG GATGATGGGT GTGGCGTTGA TCTCCAGAAA
ATTCGCCAGC GTGCCCAACG GATGGGGCTA GATCAATCAA GCTTAGAAGC TGCCAGCGAT
CGCGAGCTTT TGGATTTGAT TTTCGAGCCT GGATTTACAA CTGCGGAGCA AGTGACCGAA
CTCTCAGGCC GAGGGGTTGG CATGGATGTT GTCCGCACGA ATCTTCAGCA GATTCGGGGA
GATATCCAGA TTGAAACAAA ACCTAACCAA GGCACAACTT TTACGATTAC AGTGCCTTTT
TCTCTCTCGG TTGCACGGGT TCTCCTCATC GAGGTCAGCA ATATTCTAGT TGCCATCCCG
ACTGATGTAA TTGAGGAAAT TATTGAACTC AATCCAAGCT GGATTCTAAA TAGTGCTGGT
CAAACAGTTC TTAATCTCGA TGAGATACTG GTTCCGCTGC TCCAGCTCGA TCAGTGGTTT
CAGTTTAGTC GGCCCTGCCC GCCCATTAGT TTAGATGGTG TCCCGACAAT CAATCAGCCT
ACTGTTCTTC TTGTTAACCA GGGCAATAGT TTCGTCGGCA TTCAAGTCGA TCGCTACTGG
GGAGAACAGG AGGTTACAAT CCGTCAAGTA AATAGCACGA TTCCTTTGCC GCCGGGATTT
AGTAGTTGCA CTATTCTGGG TGACGGGCGA ATTGTTCCTT TAGTCGATAC CTTTTCGCTC
TTGCAGTGGA TTCAGAATTC GAATGTTCCT AGTCGTCTTT CCTTGAAAGC GATACAGCCA
GCAACCTATC AACTTCCCTC TCAGAAGAGC ATCTTAATTA TCGATGACTC GATTAATGTC
AGACGGTTTT TAGCCAATAT TTTAGAGAAA GCAGGCTATC GAGTTGAACA AGCTAAGGAT
GGCCAAGAAG CCATTGATAA ACTAAAAGAT GGCTTAAAGG TTGAAGCCGC CATTTGTGAT
GTGGAGATGC CTCGGTTGGA TGGTTATGGA TTCCTGACTC AAGTCAAAAA TATCGTTGCT
GGCGCAAACC TACCGATCGC TATGTTGACC TCACGCAGCG GGGATAAGCA CCGGCGCTTA
GCGCTCAATC TGGGAGCCGC CGCCTACTTC ACCAAACCTT TCCGCGAGCC AGAATTGCTC
CAGACCCTAC AAGAACTAAT TCAAGTGAAA CAGTCTTAG
 
Protein sequence
MSDPQAIIRL QFLEEAEEYL GTIEKELVDL SQASDRRSQV DLILRAAHSI KGGAAMMGFN 
TLSELAHQLE DFFKVMRSGK NFDQSLERQL LQGVDRLHQV LQVNRQGTDP SPQWLESQVQ
PLFAVLREQF GDPQPDADLA MLSEEAGDDM RSLLFESEVE GCLQRLESVL QTPEHPCLLE
EFAIVAQELA GLGQMLELSQ FSQFCTSIAE LLEQQPTDLR ATATKILKDL RHCQALVLTQ
QINALPSQFV AQDQSPEGIT LEANSGLAET FPQTKEVIHQ NQITATRNSE NIQSSLTVAE
ARQLDFTPET ITTQTLHPPI NATDLTSEIN QARTVRVGIQ QLEDLSELFG ELNIERNGLS
LQLKRMRQLL QTLKTKVRRL EQSSFELRST TDRAATATTA FSLVGSGGSS WHQNFDILEF
DRYSPLHLVS QDVMETIVQI QEITSDIETN LEETEQTEQS LRRTAKQMQG RLTQVRMRPF
ADLVNRYPRM IRQLGQEYGK DVRLIINGEN TLVDRAILEL LADPLLHLVR NAFDHGVESP
QLRQSRDKSP TATIEVSAAY RGNQTVITIR DDGCGVDLQK IRQRAQRMGL DQSSLEAASD
RELLDLIFEP GFTTAEQVTE LSGRGVGMDV VRTNLQQIRG DIQIETKPNQ GTTFTITVPF
SLSVARVLLI EVSNILVAIP TDVIEEIIEL NPSWILNSAG QTVLNLDEIL VPLLQLDQWF
QFSRPCPPIS LDGVPTINQP TVLLVNQGNS FVGIQVDRYW GEQEVTIRQV NSTIPLPPGF
SSCTILGDGR IVPLVDTFSL LQWIQNSNVP SRLSLKAIQP ATYQLPSQKS ILIIDDSINV
RRFLANILEK AGYRVEQAKD GQEAIDKLKD GLKVEAAICD VEMPRLDGYG FLTQVKNIVA
GANLPIAMLT SRSGDKHRRL ALNLGAAAYF TKPFREPELL QTLQELIQVK QS