Gene Synpcc7942_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1014 
Symbol 
ID3773942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1025629 
End bp1028415 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content57% 
IMG OID637799434 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_400031 
Protein GI81299823 
COG category[K] Transcription
[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.186099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCA CCAGTGATAA CGTTTTGCGA TCGCTGTTTC TACGAGAAAC GCGTGAAAAT 
CTCCAGCTGC TGACCCAGAT GATTGGCCAG ATGGAAGCTG GGCTGCCGGA TGCAAAAACC
TTTGCAGACG CTTCGCGGGC CGCGCATTCG CTCAAGGGTG GAGCCGCCCT GCTCAAGTTC
GAAACCTTCC GGCAACTGGC AGCTGGACTC GAGACTTGCC TGTCTCTGCT ATCTGAAGCC
GAGTTGCCCG ACTACGAGGT GAGTCTCCCG CTGTTTCAAC GGGCCGCCGC GCTGCTGATT
ACGGCGACCG ATCGCATTGA AGCGGGCGTT GAAGAGCAGG CTGTGGTGCC CGAGGAAGGG
AACCCCCTGA TTGAACAGCT CCAGGTTTTG CAGGAGATGC TGCAGGCAGC TGTGCAAGTG
CCACCGGATC AATCCGCCGT TACCAGCACG CTCGAAGACT TGCTGACCCA AGATTTTGAG
ACAGTCTTGC TGCCGCCGAC TCCTGATTTG GAGCAGACCC TCGATCAGCT CCTGGGCGAC
CTCGAGGAAA CTCCCAGCCC GATCGCTGGC CAGGGAAGCG ACGGCGATCG CCCAGCTTTA
GAGGACCTCA ACTTTGACCT CGAGGACTGG ATTGAGGCCA GCGAAGAAGA GTCTGAGACG
GTCGAAATCG ACATTGTGGC CCAGGTCAAA GCCCTGGAAG CTGCGATCGC TGACTTGACG
CAGCACCCCC ACGAGCAGCC CCTCAGCTTG GGCATCAAGG ACGACATTCT GAGTGCGGTC
ATCCAAGCCG CTCCCCGTCT GCCGGTGGCC CTGACGCTAG AGCCAGTGGC TAAGGCCCGT
ACCCAGGACA ATGGCAAAAC CCCCGTTGTC TTTGAACAGA CTCTGCGGGT GCCGGTGCGT
CAGTTGGATA ACCTCAGCAA CCTAGTCGGG GAGATTGTCG TCAGCCGGAA TACCCTCGAA
AGCGATCAGT CCCACCTACG GGAAACCCTC GAAAACCTGA TGCTACAGGT GCACCAGCTG
GGTGATTTGG GTGAGCGGAT GCGGGAGCTC TACGAGCGAT CGCTTCTGGA AACTGCCCTC
GCTGCACGTC GCCAACTAGT TGGTAGTTCT ACAGCCGCTG GCTCTGCCGG AGCTTCGACC
TTTCCCACCG AGCAATTCGA CGACCTCGAA CTCGATCGCT TTAGTGGTTT CCATTCCATT
TCTCAAGAAA TGATGGAATT GCTAGTCCGT CTGCGCGAGG CAGCCTCGGA CATTGACTTC
ATGGTCAATG AACCCTTTGA CCAACTGACC CGCAACTTCC AGCAAATTAC CCTGCAGTTG
CAAAGCGAGT TGAACCAGAC TCGAATGGTC CCCTTCTCGG ACCTGACGGA TCAACTGCCA
CGGGCTGTCC GCGAAATCTC ACGGCGCTGT GGCAAGGGTG CCCGTCTGGA GATCGAAGGG
AAAGACATCT TGATTGACAA GATGATCCTG CAGAAACTGC AGGCGCCCAT GACTCACCTG
ATCAACAACG CCCTCAGTCA TGGCATTGAA ACGGCAGAAA TTCGCCAATA CCGCAATAAA
TCAGCGGAGG GAACCCTGGT CGTTCGTGCC TCACTCCAGG GCAATCGGAC CCTAATCACT
GTTACCGATG ATGGGGCCGG GATCGATGCT GAGCAGGTCA AGCTCAAAGC TTTGGCACGG
GGTTTGATCA CCCCAGAAGA AGCCGCAGGG ATGCGGGTGC GGGAGATCTA TGAACTCCTG
TTCCGACCCG GCTTCAGTAC CCGAGATCAG GCCGATGAAT TGGCGGGCCG CGGGGTGGGG
CTAGATGTGG TTCGCCGGGC GCTCCATGAA ATTCAGGGCG AGATCGTGGT GGATTCGGAG
CTGGGGAAAG GGACAACCTT TACCTTCCGC TTGCCCCTCA CTCTCTCGAT TACCAAGGCT
TTGACCTGCC TCCACGAACG GGCGCGTTTG GCCTTCCCGA TGGATAGCGT TGAGGAAATG
CTGGATATTC CCCAGCAGCG CCTCGCGATG GATCCCCAGG GGCAGTTGCG CCTGGATTGG
CACAATTTGT CGCTCCCCGT CTTCCCGCTC TCATCCCTCC TCGACTTCCA TCTGCCGTTT
GGGCGATCGC GCTATTACAA CAGTCTGGGC GATGACGGTG TCGTCTCGAT TGTCGTTCTG
CGTAATGACA ACGAGTACCT CGCCGTTCAG ACCGATCAGA TTGAAGGGGA ACGGGAAATT
GTGATCAAGC AGATCGAAGG TCCGATCGCC AAACCCCTCG GGATCTTGGG CATCACCGTC
CGCTCTGACG GTCAGGTCAT GCCGATCGTG GACGTCTTGG AACTGTTTGA TATTGCCTAC
GGCCGCGTCC GTCAGCGTCA AGTAGCCCCC GTAGCCCCCA GTGCTGGCGA GACGAGAACC
AATACAGAAC CGATGGTGCT GATCGTCGAT GACTCGATCA CGGTGCGTGA GCTGCTGTCG
ATGACCTTCA AAAAGGCGGG CTATGTGGTG GAACAAGCGC GCAACGGCCA AGAAGCCCTA
GAGAAGCTCT ATTCGGGTCT GCCCTGCGAT TTGGTTTTCT GCGATATCGA AATGCCGAAG
ATGAATGGGC TAGAGCTCTT AGAGCGGCTT CAGGCTGATC CCAAGCTGCG GAGTTTACCG
GTGGCGATGC TGACATCGCG AGGCGCTGAT CGCCATCGTC AAATTGCTGC CCACTTGGGC
GCTCGGGCTT ACTTCACAAA ACCCTATCTA GAAGAGCAGC TGTTGGAAGC CTCGGCTCGC
TTGAGAAATG GCGAACGACT CATTTAA
 
Protein sequence
MSSTSDNVLR SLFLRETREN LQLLTQMIGQ MEAGLPDAKT FADASRAAHS LKGGAALLKF 
ETFRQLAAGL ETCLSLLSEA ELPDYEVSLP LFQRAAALLI TATDRIEAGV EEQAVVPEEG
NPLIEQLQVL QEMLQAAVQV PPDQSAVTST LEDLLTQDFE TVLLPPTPDL EQTLDQLLGD
LEETPSPIAG QGSDGDRPAL EDLNFDLEDW IEASEEESET VEIDIVAQVK ALEAAIADLT
QHPHEQPLSL GIKDDILSAV IQAAPRLPVA LTLEPVAKAR TQDNGKTPVV FEQTLRVPVR
QLDNLSNLVG EIVVSRNTLE SDQSHLRETL ENLMLQVHQL GDLGERMREL YERSLLETAL
AARRQLVGSS TAAGSAGAST FPTEQFDDLE LDRFSGFHSI SQEMMELLVR LREAASDIDF
MVNEPFDQLT RNFQQITLQL QSELNQTRMV PFSDLTDQLP RAVREISRRC GKGARLEIEG
KDILIDKMIL QKLQAPMTHL INNALSHGIE TAEIRQYRNK SAEGTLVVRA SLQGNRTLIT
VTDDGAGIDA EQVKLKALAR GLITPEEAAG MRVREIYELL FRPGFSTRDQ ADELAGRGVG
LDVVRRALHE IQGEIVVDSE LGKGTTFTFR LPLTLSITKA LTCLHERARL AFPMDSVEEM
LDIPQQRLAM DPQGQLRLDW HNLSLPVFPL SSLLDFHLPF GRSRYYNSLG DDGVVSIVVL
RNDNEYLAVQ TDQIEGEREI VIKQIEGPIA KPLGILGITV RSDGQVMPIV DVLELFDIAY
GRVRQRQVAP VAPSAGETRT NTEPMVLIVD DSITVRELLS MTFKKAGYVV EQARNGQEAL
EKLYSGLPCD LVFCDIEMPK MNGLELLERL QADPKLRSLP VAMLTSRGAD RHRQIAAHLG
ARAYFTKPYL EEQLLEASAR LRNGERLI