Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4016 |
Symbol | |
ID | 7103499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4201345 |
End bp | 4204314 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643477011 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_002374111 |
Protein GI | 218248740 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAACG ATCCCGACAT TCGAGACCAA GCGTACCAAT ATTTCCTTGA TGAAATTCCG GGGTTATTGG AAACCATTGA GCAAGAATTA TTAGCCTTAA ACCAGTCTGA TGAAGGGCGA TCGCTTAAGG TTAATCATAT TATGCGGGCA ACCCATACCC TCAAAGGGGG AGCCGCTAAT GTAGGGTTAG AAACTCTGCA AAAAATTGCC CATTCCCTAG AAGATATTTT TAAAGCACTG TACCATCCTG AATTAACCAT GGATTCAGAG ATTAAAGGCC TATTATTGGA AAGTTATGAG TGTATTCGCT TGCCAGCCAT GGCTCAATTA ACCCAAGCAG CGATCAATGA GCAAGAAATT TTAGAGCGAT CGGCGGATAT TTTCGCTAAA TTACACGATA AACTGGGTCA TTATATGGCA GATCAATCCG CTTTTCCTCC TTCTGAAGAA CTGGGTATTG ATGTTGTTAA AACCTTCTTT GAAGACGTTG TTCCTCAACG CTTAGAAGAG ATTGCTAAGG TTTTAGAGAC AAATAATCCT GAGCAAATTC AAACTATTTT ATATGAACAA ATAGAGGTTC TTTTAAGTTT AGGAGAATCT TTGAATTTAC CTGGTTTTGA AGCCATTGCT AAAATGACAA TAGCAGCCTT GAATAATGCT CCTGAGCAAG TTCAAGTTAT TGCTAAAACG GCTTTAGAAG ACTTTAGAAA AGGTCAAAAA CAGATTCTTG AAGGCGATCG CGTTCAAGGG GGAAACCCTT CTGACTTTTT GCAACAATTA GCTCATAATT CGCTCAATGA GCAGTTATTA GACGCACCCA AACAACCTAG TAATAATTTT GATGAAAACT TCTCAAAAAG TCTTAATGAA GTTTTAGGAA ACAAACAGTT AATTGAACAG CATACAGAGA CGAAAAATAA AAATTCATTG TCAGAAAAAC CTGACAATTT AGATCATCGC TTTTTAGCTT ATTCATCTAA TAAATCTCAA GAAAAAACTA AACCAGAAAA ACGGTTATCT AGTCAAAATA TTCGAGTTAA ATTAGAAGGA CTCGAAAGAT TAAATCATAT TGTTGGAGAA CTGGTTATTA ACCACAATAA ACAAGCAATA AAAAAACAAA AAATACAAGA ATTAATTGAC CACTTGCTTG AAAACCTTGA AGAAAACCAA CAAAGTTTTT ATCAATTAAA CAATTTAATT GATTCCTTAT TAATGCTAGT TGAGTATAGT CAAAATCCTC TTAATTTATC TTGTGTTAGC CTAGATTCAA GTATAAGTTG TGATCTCAAT ATTAGTTCTT CATTAAAACT ATCCTATAGT TATTGGCTAA AATCCGACCC GTATTTAAAC TTATCTCAAC AAATAAAAAC AGCCTTAAAA AGCATTCTAC AATCTACTAA AACCGCCGAA AAAATTAGAA ATTTGACCAA AGAATCTAAC CAAGCGTTTA AAAAACAGGA ACGAACTTTA TTTACCATGA GAGATGAATT AATAGAAACA AGAATGTCAC CTTTAGGCAA TCTTTTAAGT CGTTTTCCTC GATTAATTGA ACAATTATCA ACAGTTCAAA ATAAGCAAGT AGAATTAAGA CTAAAAGGCA GTCATATTTT AGTTGACAAA GCCATTGAAC AAAAGCTTTA TGATCCCTTA CTTCATTTAG TGAGAAATGC CTTTGATCAT GGAATTGAAA CCCCTGAAAT TCGCAGAAAA TTAGGAAAAC CCGAAACAGG AGTCATTGAA ATTGATGCCT ATCATCAAGG CAGTCGGACA ATTATTGAAG TCCGAGATGA TGGACAAGGA CTAGACTTTG AACGGATTAG AAATCGAGTT CTTGAACTGC ATTTAATGAC CCCTGAAGAA GTCTCTACCC TAAGCGATTC TCAACTCTTG GAATTTCTGT TTGAACCGGG ATTTTCTACA TCATCTCAAG TGAATGAAAT TTCGGGACGG GGAGTGGGAT TAGATATTGT TCATTCCCAA TTAGAAGCTT TAAAAGGAAA AATTGCCATT GAATCTCGAC AGAACCAAGG GACAACTTTT TCTTTGCAAA TTCCCCTAAC CTTGAGTATT GCTAAATTAA TGGTGTGTCA AACAGAAGGA ATTGTTTATT CATTATTACC CGATGTCATT GAAAAAATTA TCTTACCCCA ATCCAAAGAA ATTAAGCTAT TTAAAGGACG TAAAGTATTA TACTGGCAAA CTGAAACAGA TAATTATAAT GTTCCCATTC GTAAATTATC TGAATTAATT AACTATAATC GAATTTTCGC TAACCAAACT TCAAAATTAA ACGCTGATGA TAACCAACAA TCGATTAATC CCATTTTATT ACTTCGTCGT CATCAAGGAT TAATCGGGTT AGAAGTAGAC CAAGTATTGG GAGAACAAGA GTTAGTGATT CGTCCCTTGG GAACTACCTT AAATCCCCCC AATTATGTTT ATGGTTGTAG TATTTTAAGT GATAATCGTT TAAGTTTAGT GATTGATGGA GCCGCCTTAG TTAATCAAAC CCAAAATCAC CCCTTAACCG CTAATCAATC TGCTACGAAA TTGAGCGATA AATCTAGCCA TAAATGGCTG TCAAAATCCC CTGGAAGTTC TGATGTTTTA TTAGTCGTAG ATGATTCCAT TAGCTTACGA CAAACAGCGA CTTTAACCTT GCAAAAATTA GGGTATCATG TATTACAAGC AGCCGATGGA ATAGAAGCGT TAGAAGAATT AGAAAGACTT AAGGGAATTA GTTTAGTGAT TTGTGATTTA GATATGCCTC GGATGAATGG TTTTGAGTTC TTAAAAACCT TGCGTCAACA TCCAGAATTA TCCCATTTAC CTGTTATTAC TTTAACTTCC CACGATAGTG AACCCTATCG ACAATTAGCT CAACAATTAG GCACAACAGC TTATATGACT AAACCCTATA AAGGAGACGA ATTAGTAGAG ACAATTTTAC ACTTAATTCA AGGAGCATAG
|
Protein sequence | MINDPDIRDQ AYQYFLDEIP GLLETIEQEL LALNQSDEGR SLKVNHIMRA THTLKGGAAN VGLETLQKIA HSLEDIFKAL YHPELTMDSE IKGLLLESYE CIRLPAMAQL TQAAINEQEI LERSADIFAK LHDKLGHYMA DQSAFPPSEE LGIDVVKTFF EDVVPQRLEE IAKVLETNNP EQIQTILYEQ IEVLLSLGES LNLPGFEAIA KMTIAALNNA PEQVQVIAKT ALEDFRKGQK QILEGDRVQG GNPSDFLQQL AHNSLNEQLL DAPKQPSNNF DENFSKSLNE VLGNKQLIEQ HTETKNKNSL SEKPDNLDHR FLAYSSNKSQ EKTKPEKRLS SQNIRVKLEG LERLNHIVGE LVINHNKQAI KKQKIQELID HLLENLEENQ QSFYQLNNLI DSLLMLVEYS QNPLNLSCVS LDSSISCDLN ISSSLKLSYS YWLKSDPYLN LSQQIKTALK SILQSTKTAE KIRNLTKESN QAFKKQERTL FTMRDELIET RMSPLGNLLS RFPRLIEQLS TVQNKQVELR LKGSHILVDK AIEQKLYDPL LHLVRNAFDH GIETPEIRRK LGKPETGVIE IDAYHQGSRT IIEVRDDGQG LDFERIRNRV LELHLMTPEE VSTLSDSQLL EFLFEPGFST SSQVNEISGR GVGLDIVHSQ LEALKGKIAI ESRQNQGTTF SLQIPLTLSI AKLMVCQTEG IVYSLLPDVI EKIILPQSKE IKLFKGRKVL YWQTETDNYN VPIRKLSELI NYNRIFANQT SKLNADDNQQ SINPILLLRR HQGLIGLEVD QVLGEQELVI RPLGTTLNPP NYVYGCSILS DNRLSLVIDG AALVNQTQNH PLTANQSATK LSDKSSHKWL SKSPGSSDVL LVVDDSISLR QTATLTLQKL GYHVLQAADG IEALEELERL KGISLVICDL DMPRMNGFEF LKTLRQHPEL SHLPVITLTS HDSEPYRQLA QQLGTTAYMT KPYKGDELVE TILHLIQGA
|
| |