Gene Cyan8802_4054 details


Gene Information

Locus tag: Cyan8802_4054
Symbol:
ID: 8393405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4169237
End bp: 4172206
Gene Length: 2970 bp
Protein Length: 989 aa
Translation table: 11
GC content: 35%
IMG OID: 644981973
Product: CheA signal transduction histidine kinase
Protein accession: YP_003139686
Protein GI: 257061798
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.264346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAACG ATCCCGACAT TCGAGACCAA GCGTACCAAT ATTTCCTTGA TGAAATTCCG 
GGGTTATTGG AAACCATTGA GCAAGAATTA TTAGCCTTAA ACCAGTCTGA TGAAGGGCGA
TCGCTTAAGG TTAATCATAT TATGCGGGCA ACCCATACCC TCAAAGGGGG AGCCGCTAAT
GTAGGGTTAG AAACTCTGCA AAAAATTGCC CATTCCCTAG AAGATATTTT TAAAGCACTG
TACCATCCTG AATTAACCAT AGATTCAGAG ATTAAAGGCC TATTATTGGA AAGTTATGAG
TGTATTCGCT TGCCAGCCAT GGCTCAATTA ACCCAAGCAG CGATCAATGA GCAAGAAATT
TTAGAGCGAT CGGCGGATAT TTTCGCTAAA TTACACGATA AACTGGGTCA TTATATGGCA
GATCAATCCG CTTTTCCTCC TTCTGAAGAA CTGGGTATTG ATGTTGTTAA AACCTTCTTT
GAAGACGTTG TTCCTCAACG CTTAGAAGAA ATTGCTAAGG TTTTAGAGAC AAATAATCCT
GAGCAAATTC AAACTATTTT ATACGAACAA ATAGAGGTTC TTTTAAGTTT AGGAGAATCT
TTGAATTTAC CCGGTTTTGA AGCTATTGCT AAAATGACTA TAGCAGCACT GAATAACTCT
CCTGATCAAG TTCAAGTTAT TGCTAAAACG GCTTTAGAAG ACTTTAGAAA AGGTCAAAAA
CAGATTCTTG AAGGCGATCG CGTTCAAGGG GGAAACCCTT CTGACTTTTT GCAACAATTA
GCTCATAATT CGCTCAATGA GCAGTTATTA GACGCACCCA AACAACCTAG TAATAATTTT
GACGAAGAGT TTTCAAAAAG TCTTAATGAA GTTTTAGGAA ACAAACAGTT AATTGAACAG
CATACAGAGA CTAAAAAGAA AAATTCATTA TCAGAAAAAC CTGAAAATTT AGATCATCAC
TTTTTAGCTT ATTCATCTAA TAAATCGCAA GAAAAAAATA AACCAGAAAA ACGGTTATCT
AGTCAAAATA TTCGAGTTAA ATTAGAAGGG CTCGAAAGAT TAAATCATAT TGTTGGAGAA
CTGGTTATTA ACCACAATAA ACAAGCAATA AAAAAACAAA AAATACAAGA ATTAATTGAC
CACTTGCTTG AAAACCTTGA AGAAAACCAA CAAAGTTTTT ATCAATTAAA CAATTTAATT
GATTCCTTAT TAATGCTAGT TGAGTATAGT CAAAATCCTC TTAATTTATC TTGTGTTAGC
CTAGATTCAA GTATAAGTTG TGATCTCAAT ATTAGTTCTT CATTAAAACT ATCCTATAGT
TATTGGCTAA AATCCGACCC GTATTTAAAC TTATCTCAAC AAATAAAAAC AGCCTTAAAA
AGCATTCTAC AATCTACTAA AACCGCCGAA AAAATTAGAA ATTTGACCAA AGAATCTAAC
CAAGCGTTTA AAAAACAGGA ACGAACTTTA TTTACCATGA GAGATGAATT AATAGAAACA
AGAATGTCAC CTTTAGGCAA TCTTTTAAGT CGTTTTCCTC GATTAATTGA ACAATTATCA
ACAGTTCAAA ATAAGCAAGT AGAATTAAGA CTAAAAGGCA GTCATATTTT AGTTGACAAA
GCCATTGAAC AAAAGCTTTA TGATCCCTTA CTTCATTTAG TGAGAAATGC CTTTGATCAT
GGAATTGAAA CCCCTGAAAT TCGCAGAAAA TTAGGAAAAC CCGAAACAGG AGTCATTGAA
ATTGATGCCT ATCATCAAGG CAGTCGGACA ATTATTGAAG TCCGAGATGA TGGACAAGGA
CTAGACTTTG AACGGATTAG AAATCGAGTT CTTGAACTGC ATTTAATGAC CCCTGAAGAA
GTCTCTACCC TAAGCGATTC TCAACTCTTG GAATTTCTGT TTGAACCGGG ATTTTCTACA
TCATCTCAAG TGAATGAAAT TTCGGGACGG GGAGTGGGAT TAGATATTGT TCATTCCCAA
TTAGAAGCTT TAAAAGGAAA AATTGCCATT GAATCTCGAC AGAACCAAGG GACAACTTTT
TCTTTGCAAA TTCCCCTAAC CTTGAGTATT GCTAAATTAA TGGTGTGTCA AACAGAAGGA
ATTGTTTATT CATTATTACC CGATGTCATT GAAAAAATTA TCTTACCCCA ATCCAAAGAA
ATTAAGCTAT TTAAAGGACG TAAAGTATTA TACTGGCAAA CTGAAACAGA TAATTATAAT
GTTCCCATTC GTAAATTATC TGAATTAATT AACTATAATC GAATTTTCGC TAACCAAACT
TCAAAATTAA ACGCTGATGA TAACCAACAA TCGATTAATC CCATTTTATT ACTTCGTCGT
CATCAAGGAT TAATCGGGTT AGAAGTAGAC CAAGTATTGG GAGAACAAGA GTTAGTGATT
CGTCCCTTGG GAACTACCTT AAATCCCCCC AATTATGTTT ATGGTTGTAG TATTTTAAGT
GATAATCGTT TAAGTTTAGT GATTGATGGA GCCGCCTTAG TTAATCAAAC CCACAATCAC
CCCTTAACCG CTAATCAATC TGCTACGAAA TTGAGCGATA AATCTAGCCA TAAATGGCTG
TCAAAATCCC CTGGAAGTTC TGATGTTTTA TTAGTCGTAG ATGATTCCAT TAGCTTACGA
CAAACAGCGA CTTTAACCTT GCAAAAATTA GGGTATCATG TATTACAAGC AGCCGATGGA
ATAGAAGCGT TAGAAGAATT AGAAAGACTT AAGGGAATTA GTTTAGTGAT TTGTGATTTA
GATATGCCTC GGATGAATGG TTTTGAGTTC TTAAAAACCT TGCGTCAACA TCCAGAATTA
TCCCATTTAC CTGTTATTAC TTTAACTTCC CACGATAGTG AACCCTATCG ACAATTAGCT
CAACAATTAG GCACAACAGC TTATATGACT AAACCCTATA AAGGAGACGA ATTAGTAGAG
ACAATTTTAC ACTTAATTCA AGGAACATAG
 
Protein sequence
MINDPDIRDQ AYQYFLDEIP GLLETIEQEL LALNQSDEGR SLKVNHIMRA THTLKGGAAN 
VGLETLQKIA HSLEDIFKAL YHPELTIDSE IKGLLLESYE CIRLPAMAQL TQAAINEQEI
LERSADIFAK LHDKLGHYMA DQSAFPPSEE LGIDVVKTFF EDVVPQRLEE IAKVLETNNP
EQIQTILYEQ IEVLLSLGES LNLPGFEAIA KMTIAALNNS PDQVQVIAKT ALEDFRKGQK
QILEGDRVQG GNPSDFLQQL AHNSLNEQLL DAPKQPSNNF DEEFSKSLNE VLGNKQLIEQ
HTETKKKNSL SEKPENLDHH FLAYSSNKSQ EKNKPEKRLS SQNIRVKLEG LERLNHIVGE
LVINHNKQAI KKQKIQELID HLLENLEENQ QSFYQLNNLI DSLLMLVEYS QNPLNLSCVS
LDSSISCDLN ISSSLKLSYS YWLKSDPYLN LSQQIKTALK SILQSTKTAE KIRNLTKESN
QAFKKQERTL FTMRDELIET RMSPLGNLLS RFPRLIEQLS TVQNKQVELR LKGSHILVDK
AIEQKLYDPL LHLVRNAFDH GIETPEIRRK LGKPETGVIE IDAYHQGSRT IIEVRDDGQG
LDFERIRNRV LELHLMTPEE VSTLSDSQLL EFLFEPGFST SSQVNEISGR GVGLDIVHSQ
LEALKGKIAI ESRQNQGTTF SLQIPLTLSI AKLMVCQTEG IVYSLLPDVI EKIILPQSKE
IKLFKGRKVL YWQTETDNYN VPIRKLSELI NYNRIFANQT SKLNADDNQQ SINPILLLRR
HQGLIGLEVD QVLGEQELVI RPLGTTLNPP NYVYGCSILS DNRLSLVIDG AALVNQTHNH
PLTANQSATK LSDKSSHKWL SKSPGSSDVL LVVDDSISLR QTATLTLQKL GYHVLQAADG
IEALEELERL KGISLVICDL DMPRMNGFEF LKTLRQHPEL SHLPVITLTS HDSEPYRQLA
QQLGTTAYMT KPYKGDELVE TILHLIQGT
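
The length fields in this record can be cross-checked against the sequences. The sketch below is illustrative only and is not part of the database: the helper names (clean_sequence, gc_fraction, expected_protein_length) are hypothetical, and it assumes an unspliced CDS that ends in a single stop codon, which matches the "Is gene spliced: No" field and the final TAG shown above. It strips the 10-character display blocks and relates the gene length in bp to the protein length in aa.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGATCAACG ATCCCGACAT TCGAGACCAA GCGTACCAAT ATTTCCTTGA TGAAATTCCG"
last_line = "ACAATTTTAC ACTTAATTCA AGGAACATAG"

start = clean_sequence(first_line)
end = clean_sequence(last_line)

print(len(start))                      # bases on one full display line
print(start[:3], end[-3:])             # start codon and stop codon
print(expected_protein_length(2970))   # compare with the Protein Length field
```

To check the GC content field, pass the full cleaned gene sequence to gc_fraction and compare the rounded percentage with the value listed under Gene Information.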