Gene PCC8801_4223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4223 
Symbol 
ID7103780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4432819 
End bp4434129 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content45% 
IMG OID643477207 
Producthistidine kinase 
Protein accessionYP_002374306 
Protein GI218248935 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAA AGCCATCAAC AGTTAGTCCT CAATTTAAAG CGTTACGCTG GCGGTTATTG 
CTGTCCTATT TAGTGGTTAT GATCGCTATT TGGATCATTT CTGATGTGTT GGTTTATCAA
TTTTTTGCCC GTAGTTTATA TCAACAGTTA GATAATCGTT TATTCAATTT AGCCCAAGCA
GCAACCCACA GTTTACTAGC GATTAAAAAT GATCCTAAAG CTGTGAATAA TTCTCCTTAT
CGTGCCTTAG ATGAAGATGG AGATTTAGAT CTTCCTTGGC AGAATTTGCG CCATCCACAT
CAGGGGGTAG AATGGTTTGG AGACGATCGC CAACGGTTAG CCCACTCAGG AACCCTCAAG
GGATCACAAC CCCTGATCAT GGGATTTCAA ACAGTCCAAG GAGGACAGGT TCGTACCTTA
ACCCTTTCTG CTTACCATGA TCCTGATGAA CAAAAACATT TAGAAGGTTA TGTGCGGGTT
AGCGAGTCTA CGGCATCTGT AGAGGCAGTT TTAGCGCGAT TACGGCTAGA ATCGGCCTTG
GGAGGACTAA TTGCCCTAGG ATTAATTAGC GTGGGGGGAA TGTGGCTAAC CCGACAGTCT
TTAAAACCCA TTGAACAGAG TTTTTGGCAA TTAAAGCAAT TTACCGCCGA TGCTTCCCAT
GAATTACGCA GTCCCTTAAC GGCGATTAAA ACCTCGGCGG AAATCTTGCA AACTCACGGC
GATCGCTTGC CTCGTGAGGA TCTTTCCAAA GTAGAGATTA TCCTCTGTGC TACCGATCAA
ATGTCTGAGT TAGTGGAGGA TTTACTGCTG TTAGCTCGTA TGGATGGAAA GCCGGTCATT
TCACAGCAAA AATGGCAAAA AATTCCGTTA ATAGAATTAT TAGACGATGT GGTGGAATTT
TTAGAACCGA TCGCTGAAGC GAAAAATATT GCCCTTAGCT GTCATTTTTT GGCGGAAGTG
ACGGTTCAAG GCGATAGCCA TCAGTTATTG CGGTTATTTT CTAATTTGGT TGACAATGGG
TTGCAATATA CCCCAAGGGA CGGAAAGGTC ACAGTTTCTT TAACCCAAAG CGATAAATGG
GCGGTGGTGT CTGTCGATGA TACGGGAATT GGCATCGCAC CCGAACACCT ACCCTATGTC
TTTGATCGCT TGTGGCGGGC TGAATCTGCG CGTACCTATC GCCCCCAAGG ATCGGGGTTA
GGATTGGCGA TCGCTAGGGC GATCGCTCAC TATCATGGGG GAGAAATTAC CGTTAATAGT
CAATTAGGAC TAGGAAGTCG TTTTCAAGTC CGTCTTCCTC TGATGTCTTA A
 
Protein sequence
MKQKPSTVSP QFKALRWRLL LSYLVVMIAI WIISDVLVYQ FFARSLYQQL DNRLFNLAQA 
ATHSLLAIKN DPKAVNNSPY RALDEDGDLD LPWQNLRHPH QGVEWFGDDR QRLAHSGTLK
GSQPLIMGFQ TVQGGQVRTL TLSAYHDPDE QKHLEGYVRV SESTASVEAV LARLRLESAL
GGLIALGLIS VGGMWLTRQS LKPIEQSFWQ LKQFTADASH ELRSPLTAIK TSAEILQTHG
DRLPREDLSK VEIILCATDQ MSELVEDLLL LARMDGKPVI SQQKWQKIPL IELLDDVVEF
LEPIAEAKNI ALSCHFLAEV TVQGDSHQLL RLFSNLVDNG LQYTPRDGKV TVSLTQSDKW
AVVSVDDTGI GIAPEHLPYV FDRLWRAESA RTYRPQGSGL GLAIARAIAH YHGGEITVNS
QLGLGSRFQV RLPLMS