Gene PCC8801_4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4016 
Symbol 
ID7103499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4201345 
End bp4204314 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content35% 
IMG OID643477011 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_002374111 
Protein GI218248740 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAACG ATCCCGACAT TCGAGACCAA GCGTACCAAT ATTTCCTTGA TGAAATTCCG 
GGGTTATTGG AAACCATTGA GCAAGAATTA TTAGCCTTAA ACCAGTCTGA TGAAGGGCGA
TCGCTTAAGG TTAATCATAT TATGCGGGCA ACCCATACCC TCAAAGGGGG AGCCGCTAAT
GTAGGGTTAG AAACTCTGCA AAAAATTGCC CATTCCCTAG AAGATATTTT TAAAGCACTG
TACCATCCTG AATTAACCAT GGATTCAGAG ATTAAAGGCC TATTATTGGA AAGTTATGAG
TGTATTCGCT TGCCAGCCAT GGCTCAATTA ACCCAAGCAG CGATCAATGA GCAAGAAATT
TTAGAGCGAT CGGCGGATAT TTTCGCTAAA TTACACGATA AACTGGGTCA TTATATGGCA
GATCAATCCG CTTTTCCTCC TTCTGAAGAA CTGGGTATTG ATGTTGTTAA AACCTTCTTT
GAAGACGTTG TTCCTCAACG CTTAGAAGAG ATTGCTAAGG TTTTAGAGAC AAATAATCCT
GAGCAAATTC AAACTATTTT ATATGAACAA ATAGAGGTTC TTTTAAGTTT AGGAGAATCT
TTGAATTTAC CTGGTTTTGA AGCCATTGCT AAAATGACAA TAGCAGCCTT GAATAATGCT
CCTGAGCAAG TTCAAGTTAT TGCTAAAACG GCTTTAGAAG ACTTTAGAAA AGGTCAAAAA
CAGATTCTTG AAGGCGATCG CGTTCAAGGG GGAAACCCTT CTGACTTTTT GCAACAATTA
GCTCATAATT CGCTCAATGA GCAGTTATTA GACGCACCCA AACAACCTAG TAATAATTTT
GATGAAAACT TCTCAAAAAG TCTTAATGAA GTTTTAGGAA ACAAACAGTT AATTGAACAG
CATACAGAGA CGAAAAATAA AAATTCATTG TCAGAAAAAC CTGACAATTT AGATCATCGC
TTTTTAGCTT ATTCATCTAA TAAATCTCAA GAAAAAACTA AACCAGAAAA ACGGTTATCT
AGTCAAAATA TTCGAGTTAA ATTAGAAGGA CTCGAAAGAT TAAATCATAT TGTTGGAGAA
CTGGTTATTA ACCACAATAA ACAAGCAATA AAAAAACAAA AAATACAAGA ATTAATTGAC
CACTTGCTTG AAAACCTTGA AGAAAACCAA CAAAGTTTTT ATCAATTAAA CAATTTAATT
GATTCCTTAT TAATGCTAGT TGAGTATAGT CAAAATCCTC TTAATTTATC TTGTGTTAGC
CTAGATTCAA GTATAAGTTG TGATCTCAAT ATTAGTTCTT CATTAAAACT ATCCTATAGT
TATTGGCTAA AATCCGACCC GTATTTAAAC TTATCTCAAC AAATAAAAAC AGCCTTAAAA
AGCATTCTAC AATCTACTAA AACCGCCGAA AAAATTAGAA ATTTGACCAA AGAATCTAAC
CAAGCGTTTA AAAAACAGGA ACGAACTTTA TTTACCATGA GAGATGAATT AATAGAAACA
AGAATGTCAC CTTTAGGCAA TCTTTTAAGT CGTTTTCCTC GATTAATTGA ACAATTATCA
ACAGTTCAAA ATAAGCAAGT AGAATTAAGA CTAAAAGGCA GTCATATTTT AGTTGACAAA
GCCATTGAAC AAAAGCTTTA TGATCCCTTA CTTCATTTAG TGAGAAATGC CTTTGATCAT
GGAATTGAAA CCCCTGAAAT TCGCAGAAAA TTAGGAAAAC CCGAAACAGG AGTCATTGAA
ATTGATGCCT ATCATCAAGG CAGTCGGACA ATTATTGAAG TCCGAGATGA TGGACAAGGA
CTAGACTTTG AACGGATTAG AAATCGAGTT CTTGAACTGC ATTTAATGAC CCCTGAAGAA
GTCTCTACCC TAAGCGATTC TCAACTCTTG GAATTTCTGT TTGAACCGGG ATTTTCTACA
TCATCTCAAG TGAATGAAAT TTCGGGACGG GGAGTGGGAT TAGATATTGT TCATTCCCAA
TTAGAAGCTT TAAAAGGAAA AATTGCCATT GAATCTCGAC AGAACCAAGG GACAACTTTT
TCTTTGCAAA TTCCCCTAAC CTTGAGTATT GCTAAATTAA TGGTGTGTCA AACAGAAGGA
ATTGTTTATT CATTATTACC CGATGTCATT GAAAAAATTA TCTTACCCCA ATCCAAAGAA
ATTAAGCTAT TTAAAGGACG TAAAGTATTA TACTGGCAAA CTGAAACAGA TAATTATAAT
GTTCCCATTC GTAAATTATC TGAATTAATT AACTATAATC GAATTTTCGC TAACCAAACT
TCAAAATTAA ACGCTGATGA TAACCAACAA TCGATTAATC CCATTTTATT ACTTCGTCGT
CATCAAGGAT TAATCGGGTT AGAAGTAGAC CAAGTATTGG GAGAACAAGA GTTAGTGATT
CGTCCCTTGG GAACTACCTT AAATCCCCCC AATTATGTTT ATGGTTGTAG TATTTTAAGT
GATAATCGTT TAAGTTTAGT GATTGATGGA GCCGCCTTAG TTAATCAAAC CCAAAATCAC
CCCTTAACCG CTAATCAATC TGCTACGAAA TTGAGCGATA AATCTAGCCA TAAATGGCTG
TCAAAATCCC CTGGAAGTTC TGATGTTTTA TTAGTCGTAG ATGATTCCAT TAGCTTACGA
CAAACAGCGA CTTTAACCTT GCAAAAATTA GGGTATCATG TATTACAAGC AGCCGATGGA
ATAGAAGCGT TAGAAGAATT AGAAAGACTT AAGGGAATTA GTTTAGTGAT TTGTGATTTA
GATATGCCTC GGATGAATGG TTTTGAGTTC TTAAAAACCT TGCGTCAACA TCCAGAATTA
TCCCATTTAC CTGTTATTAC TTTAACTTCC CACGATAGTG AACCCTATCG ACAATTAGCT
CAACAATTAG GCACAACAGC TTATATGACT AAACCCTATA AAGGAGACGA ATTAGTAGAG
ACAATTTTAC ACTTAATTCA AGGAGCATAG
 
Protein sequence
MINDPDIRDQ AYQYFLDEIP GLLETIEQEL LALNQSDEGR SLKVNHIMRA THTLKGGAAN 
VGLETLQKIA HSLEDIFKAL YHPELTMDSE IKGLLLESYE CIRLPAMAQL TQAAINEQEI
LERSADIFAK LHDKLGHYMA DQSAFPPSEE LGIDVVKTFF EDVVPQRLEE IAKVLETNNP
EQIQTILYEQ IEVLLSLGES LNLPGFEAIA KMTIAALNNA PEQVQVIAKT ALEDFRKGQK
QILEGDRVQG GNPSDFLQQL AHNSLNEQLL DAPKQPSNNF DENFSKSLNE VLGNKQLIEQ
HTETKNKNSL SEKPDNLDHR FLAYSSNKSQ EKTKPEKRLS SQNIRVKLEG LERLNHIVGE
LVINHNKQAI KKQKIQELID HLLENLEENQ QSFYQLNNLI DSLLMLVEYS QNPLNLSCVS
LDSSISCDLN ISSSLKLSYS YWLKSDPYLN LSQQIKTALK SILQSTKTAE KIRNLTKESN
QAFKKQERTL FTMRDELIET RMSPLGNLLS RFPRLIEQLS TVQNKQVELR LKGSHILVDK
AIEQKLYDPL LHLVRNAFDH GIETPEIRRK LGKPETGVIE IDAYHQGSRT IIEVRDDGQG
LDFERIRNRV LELHLMTPEE VSTLSDSQLL EFLFEPGFST SSQVNEISGR GVGLDIVHSQ
LEALKGKIAI ESRQNQGTTF SLQIPLTLSI AKLMVCQTEG IVYSLLPDVI EKIILPQSKE
IKLFKGRKVL YWQTETDNYN VPIRKLSELI NYNRIFANQT SKLNADDNQQ SINPILLLRR
HQGLIGLEVD QVLGEQELVI RPLGTTLNPP NYVYGCSILS DNRLSLVIDG AALVNQTQNH
PLTANQSATK LSDKSSHKWL SKSPGSSDVL LVVDDSISLR QTATLTLQKL GYHVLQAADG
IEALEELERL KGISLVICDL DMPRMNGFEF LKTLRQHPEL SHLPVITLTS HDSEPYRQLA
QQLGTTAYMT KPYKGDELVE TILHLIQGA