Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_4054 |
Symbol | |
ID | 8393405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 4169237 |
End bp | 4172206 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644981973 |
Product | CheA signal transduction histidine kinase |
Protein accession | YP_003139686 |
Protein GI | 257061798 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.264346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAACG ATCCCGACAT TCGAGACCAA GCGTACCAAT ATTTCCTTGA TGAAATTCCG GGGTTATTGG AAACCATTGA GCAAGAATTA TTAGCCTTAA ACCAGTCTGA TGAAGGGCGA TCGCTTAAGG TTAATCATAT TATGCGGGCA ACCCATACCC TCAAAGGGGG AGCCGCTAAT GTAGGGTTAG AAACTCTGCA AAAAATTGCC CATTCCCTAG AAGATATTTT TAAAGCACTG TACCATCCTG AATTAACCAT AGATTCAGAG ATTAAAGGCC TATTATTGGA AAGTTATGAG TGTATTCGCT TGCCAGCCAT GGCTCAATTA ACCCAAGCAG CGATCAATGA GCAAGAAATT TTAGAGCGAT CGGCGGATAT TTTCGCTAAA TTACACGATA AACTGGGTCA TTATATGGCA GATCAATCCG CTTTTCCTCC TTCTGAAGAA CTGGGTATTG ATGTTGTTAA AACCTTCTTT GAAGACGTTG TTCCTCAACG CTTAGAAGAA ATTGCTAAGG TTTTAGAGAC AAATAATCCT GAGCAAATTC AAACTATTTT ATACGAACAA ATAGAGGTTC TTTTAAGTTT AGGAGAATCT TTGAATTTAC CCGGTTTTGA AGCTATTGCT AAAATGACTA TAGCAGCACT GAATAACTCT CCTGATCAAG TTCAAGTTAT TGCTAAAACG GCTTTAGAAG ACTTTAGAAA AGGTCAAAAA CAGATTCTTG AAGGCGATCG CGTTCAAGGG GGAAACCCTT CTGACTTTTT GCAACAATTA GCTCATAATT CGCTCAATGA GCAGTTATTA GACGCACCCA AACAACCTAG TAATAATTTT GACGAAGAGT TTTCAAAAAG TCTTAATGAA GTTTTAGGAA ACAAACAGTT AATTGAACAG CATACAGAGA CTAAAAAGAA AAATTCATTA TCAGAAAAAC CTGAAAATTT AGATCATCAC TTTTTAGCTT ATTCATCTAA TAAATCGCAA GAAAAAAATA AACCAGAAAA ACGGTTATCT AGTCAAAATA TTCGAGTTAA ATTAGAAGGG CTCGAAAGAT TAAATCATAT TGTTGGAGAA CTGGTTATTA ACCACAATAA ACAAGCAATA AAAAAACAAA AAATACAAGA ATTAATTGAC CACTTGCTTG AAAACCTTGA AGAAAACCAA CAAAGTTTTT ATCAATTAAA CAATTTAATT GATTCCTTAT TAATGCTAGT TGAGTATAGT CAAAATCCTC TTAATTTATC TTGTGTTAGC CTAGATTCAA GTATAAGTTG TGATCTCAAT ATTAGTTCTT CATTAAAACT ATCCTATAGT TATTGGCTAA AATCCGACCC GTATTTAAAC TTATCTCAAC AAATAAAAAC AGCCTTAAAA AGCATTCTAC AATCTACTAA AACCGCCGAA AAAATTAGAA ATTTGACCAA AGAATCTAAC CAAGCGTTTA AAAAACAGGA ACGAACTTTA TTTACCATGA GAGATGAATT AATAGAAACA AGAATGTCAC CTTTAGGCAA TCTTTTAAGT CGTTTTCCTC GATTAATTGA ACAATTATCA ACAGTTCAAA ATAAGCAAGT AGAATTAAGA CTAAAAGGCA GTCATATTTT AGTTGACAAA GCCATTGAAC AAAAGCTTTA TGATCCCTTA CTTCATTTAG TGAGAAATGC CTTTGATCAT GGAATTGAAA CCCCTGAAAT TCGCAGAAAA TTAGGAAAAC CCGAAACAGG AGTCATTGAA ATTGATGCCT ATCATCAAGG CAGTCGGACA ATTATTGAAG TCCGAGATGA TGGACAAGGA CTAGACTTTG AACGGATTAG AAATCGAGTT CTTGAACTGC ATTTAATGAC CCCTGAAGAA GTCTCTACCC TAAGCGATTC TCAACTCTTG GAATTTCTGT TTGAACCGGG ATTTTCTACA TCATCTCAAG TGAATGAAAT TTCGGGACGG GGAGTGGGAT TAGATATTGT TCATTCCCAA TTAGAAGCTT TAAAAGGAAA AATTGCCATT GAATCTCGAC AGAACCAAGG GACAACTTTT TCTTTGCAAA TTCCCCTAAC CTTGAGTATT GCTAAATTAA TGGTGTGTCA AACAGAAGGA ATTGTTTATT CATTATTACC CGATGTCATT GAAAAAATTA TCTTACCCCA ATCCAAAGAA ATTAAGCTAT TTAAAGGACG TAAAGTATTA TACTGGCAAA CTGAAACAGA TAATTATAAT GTTCCCATTC GTAAATTATC TGAATTAATT AACTATAATC GAATTTTCGC TAACCAAACT TCAAAATTAA ACGCTGATGA TAACCAACAA TCGATTAATC CCATTTTATT ACTTCGTCGT CATCAAGGAT TAATCGGGTT AGAAGTAGAC CAAGTATTGG GAGAACAAGA GTTAGTGATT CGTCCCTTGG GAACTACCTT AAATCCCCCC AATTATGTTT ATGGTTGTAG TATTTTAAGT GATAATCGTT TAAGTTTAGT GATTGATGGA GCCGCCTTAG TTAATCAAAC CCACAATCAC CCCTTAACCG CTAATCAATC TGCTACGAAA TTGAGCGATA AATCTAGCCA TAAATGGCTG TCAAAATCCC CTGGAAGTTC TGATGTTTTA TTAGTCGTAG ATGATTCCAT TAGCTTACGA CAAACAGCGA CTTTAACCTT GCAAAAATTA GGGTATCATG TATTACAAGC AGCCGATGGA ATAGAAGCGT TAGAAGAATT AGAAAGACTT AAGGGAATTA GTTTAGTGAT TTGTGATTTA GATATGCCTC GGATGAATGG TTTTGAGTTC TTAAAAACCT TGCGTCAACA TCCAGAATTA TCCCATTTAC CTGTTATTAC TTTAACTTCC CACGATAGTG AACCCTATCG ACAATTAGCT CAACAATTAG GCACAACAGC TTATATGACT AAACCCTATA AAGGAGACGA ATTAGTAGAG ACAATTTTAC ACTTAATTCA AGGAACATAG
|
Protein sequence | MINDPDIRDQ AYQYFLDEIP GLLETIEQEL LALNQSDEGR SLKVNHIMRA THTLKGGAAN VGLETLQKIA HSLEDIFKAL YHPELTIDSE IKGLLLESYE CIRLPAMAQL TQAAINEQEI LERSADIFAK LHDKLGHYMA DQSAFPPSEE LGIDVVKTFF EDVVPQRLEE IAKVLETNNP EQIQTILYEQ IEVLLSLGES LNLPGFEAIA KMTIAALNNS PDQVQVIAKT ALEDFRKGQK QILEGDRVQG GNPSDFLQQL AHNSLNEQLL DAPKQPSNNF DEEFSKSLNE VLGNKQLIEQ HTETKKKNSL SEKPENLDHH FLAYSSNKSQ EKNKPEKRLS SQNIRVKLEG LERLNHIVGE LVINHNKQAI KKQKIQELID HLLENLEENQ QSFYQLNNLI DSLLMLVEYS QNPLNLSCVS LDSSISCDLN ISSSLKLSYS YWLKSDPYLN LSQQIKTALK SILQSTKTAE KIRNLTKESN QAFKKQERTL FTMRDELIET RMSPLGNLLS RFPRLIEQLS TVQNKQVELR LKGSHILVDK AIEQKLYDPL LHLVRNAFDH GIETPEIRRK LGKPETGVIE IDAYHQGSRT IIEVRDDGQG LDFERIRNRV LELHLMTPEE VSTLSDSQLL EFLFEPGFST SSQVNEISGR GVGLDIVHSQ LEALKGKIAI ESRQNQGTTF SLQIPLTLSI AKLMVCQTEG IVYSLLPDVI EKIILPQSKE IKLFKGRKVL YWQTETDNYN VPIRKLSELI NYNRIFANQT SKLNADDNQQ SINPILLLRR HQGLIGLEVD QVLGEQELVI RPLGTTLNPP NYVYGCSILS DNRLSLVIDG AALVNQTHNH PLTANQSATK LSDKSSHKWL SKSPGSSDVL LVVDDSISLR QTATLTLQKL GYHVLQAADG IEALEELERL KGISLVICDL DMPRMNGFEF LKTLRQHPEL SHLPVITLTS HDSEPYRQLA QQLGTTAYMT KPYKGDELVE TILHLIQGT
|
| |