Gene PCC8801_0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0407 
Symbol 
ID7103364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp410213 
End bp412903 
Gene Length2691 bp 
Protein Length896 aa 
Translation table11 
GC content43% 
IMG OID643473516 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_002370660 
Protein GI218245289 
COG category[K] Transcription
[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAGATA ATCTAGACTT AAGCAATTGC TCCATGCTAG ATCTGTTCAG CATGGAAGTA 
GAAAGTCAAG GGGAAGTCCT CAACGATAAC CTCTTGAACT TAGAAAATCA ACTTCAGGAG
TCCCAAGGAC AAGCCAGTGC TAGTTCCCTT GCCTTATTAG AGTCCCTGAT GCGAGCCTCT
CACTCCATCA AGGGAGCCGC TCGAATTGTC CAACTTGAAC CCGCCGTCAG AATTGCCCAT
GTCATGGAAG ATTGCTTTAT GGCGGCCATG GATCGCACCA TTAATTTGCA ATCGGACCAC
ATTGATCTGC TCTTACAAGC CGTCGATTTT CTCCTAGCCA TTGGTCAAGT GGGAGAAGCC
AATATCAATC ATTGGCTTGG AGAACATCAA GGGGAAGCAG AACAATTAGT GATCTCTATT
GCTTCTATCA TGGGAAGACG CAAAAGCGAT CGCGGTTCTG ATAAAAGTCC CCAAACACAA
CGAACCACAA CGACTCCACA ACCCCCACCC CCTTCCCCGG CAAAAAGTCA AAAAATCCCC
AAGACTCCAG AACTTTTAAC CCCTCCTCCA TCGAACTCAA ACCGTCGAAA AACCCCAGAA
ACCCCAGAAG ACGAATTTTT CTTAGGATCT ACCCTAGAAG CTGACGAAGA GCCAGAAAAC
TCAAGTTTTC CAGATTTTGA ACTCTCCTTA GATGATGTCT TTTCTGAGGA TACAGATATA
CAGTTAGGTA AATCCAAGTA TATACAAGAT ATTTCTCATA TTTCTGAGGA GGAGTCAACC
GGAAATCTGC TCACTGAGTC AGCGTCAAAA TTAGAGTCCA CTTTTGACCT ATCCGAAGAA
GTGATGACTT GGGTAGAAGA CGACAACGAA CCGATTAGTC AAGGGACTTC TTCCCTAACC
TCTACCTCAA AAGATGAACC CATCACCTCT ATCTCATCAG GCGGCGGTTC TTCTAAAGAT
CGTTTTGTCC GAGTAAGTGC AGAAGGATTA AATCGCCTGA TGGGATTAGC GGGAGAATCC
CTCGTGGAAG CGACGGCCTT AAGTCCGATG GCAGACTCCT TTATCACCCT CAAACGCAGT
CAACTGGATT TATCGCGCTT GCTCGAACAA TTACAAATGA TCTTGAGTCA GCTTTCTTTA
GGCAAAGAAA TGGAGGACTT TATCACTGAT ATTGTCGAAA AAGAACGAGA ATGTCGCACC
ATTTTAGGCG ATCGCCTCAG TGACCTAGAA CAATTTGCCT ATCGTTCCTT TAATTTATCC
GATCGCCTCT ATCGAGAAGT GATTGCCACC CACATGCGTC CCTTTGAAGA AGGAGTCACC
GGGTTTCCGC GCATGGTACG AGATATATCC CGAAAACTCA ATAAACGAGT CAAATTAGAG
ATTGCTGGAC GCATGACCAT GGTAGATCGG GATATTCTGC GAAAATTAGA AGCTCCTCTG
ACCCATATTT TAAGCAATTC CATCGATCAT GGCATTGAAT CCCCTGAAGA ACGGGTCAAA
AAAGGAAAAC CCCCAGAGGG ACATATTCTC CTCGAAGCCA GTCACCGTTT TGGGATGTTG
TCGATTAATG TGATAGACGA CGGACGGGGA ATTGAACTGG AGAAATTGCG TCAATCAATC
GTTGATAAAG GATTAGTTCC CGCAGAAATG GCCAAACAAC TCAATGAAGC TGAGTTGATG
GAATTTATCT TTTTACCCAA TTTTTCCACC AGTAAGACCG TCACCGATAT CTCCGGTCGT
GGGGTGGGAT TAAATATTGC GAAAACTATG GTACAAGAAG TTGGAGGCAA CCTTCAGGCA
GTTTCTCGGC CAGGAGAAGG TATGAGTTTT CATTTCCAAT TACCTCTAAC CCTTTCGGTG
ATTCGCACCC TTTTAGTCGA TATTGCTGGT CAACCCTACG CCTTCCCCTT ATCCCGTATC
GATCAAATTT TGACCCTCAA TTATAAAGAT ATTCACTCCG TAGAAAATCG ACAATACTTT
ACCCTTGAAG GGCAAAATAT TGGCTTAGTT AGAGCCGATC AAGTGTTAAA TATTTCGTCT
CCGGCTTCTC CCTTAGAACC CCTATCAATT GTTATTTTAA GTGACCAAAC TAATCGCTAT
GGCTTGGTTG TTGATCGCTT TATTGGGGAA AAAAGTTTAG TGGTTCGTCC CTTAGATTCT
CGCTTAGGAA AAGTTCAAGA TATTAGTGGT GCAGCCATTC TCGAAGATGG TTCGCCGATT
CTTATTTTAG ATGTGTTAGA TTTAGTGCGA TCGCTTGATA AACTCTTGGC TAATGTTCAA
GTTAATCAAA TTAAAACCGA GGAAGAAGCA GAGTGGAAAG AAAATAAAAA GCATATTTTA
GTCGTTGATG ATTCCATTAC CGTCCGAGAA ATGGAAAAAA AATTATTGCA AAATCAGGGC
TATCTTGTGG ATGTCGCTGT TGATGGAATG GAAGGGTGGA ATGCCGTGCG AATGGGCAAT
TATGACCTCG TGATTAGTGA TATAGATATG CCCCGGATGA ATGGGATTAA ATTAGTCAGT
CAAATCAAAA ATCATCCTAA TTTAAAATCA ATCCCTGTGA TTATTGTTTC CTATAAAGAT
CGCGAAGAAG ATCGCCTACA AGGCTTAGAA GCAGGAGCCG ATTATTACTT AACAAAAAGT
AGTTTCCATG ATGATACTTT AATCAATGCC GTTGTTGATT TAATTGGGTA A
 
Protein sequence
MRDNLDLSNC SMLDLFSMEV ESQGEVLNDN LLNLENQLQE SQGQASASSL ALLESLMRAS 
HSIKGAARIV QLEPAVRIAH VMEDCFMAAM DRTINLQSDH IDLLLQAVDF LLAIGQVGEA
NINHWLGEHQ GEAEQLVISI ASIMGRRKSD RGSDKSPQTQ RTTTTPQPPP PSPAKSQKIP
KTPELLTPPP SNSNRRKTPE TPEDEFFLGS TLEADEEPEN SSFPDFELSL DDVFSEDTDI
QLGKSKYIQD ISHISEEEST GNLLTESASK LESTFDLSEE VMTWVEDDNE PISQGTSSLT
STSKDEPITS ISSGGGSSKD RFVRVSAEGL NRLMGLAGES LVEATALSPM ADSFITLKRS
QLDLSRLLEQ LQMILSQLSL GKEMEDFITD IVEKERECRT ILGDRLSDLE QFAYRSFNLS
DRLYREVIAT HMRPFEEGVT GFPRMVRDIS RKLNKRVKLE IAGRMTMVDR DILRKLEAPL
THILSNSIDH GIESPEERVK KGKPPEGHIL LEASHRFGML SINVIDDGRG IELEKLRQSI
VDKGLVPAEM AKQLNEAELM EFIFLPNFST SKTVTDISGR GVGLNIAKTM VQEVGGNLQA
VSRPGEGMSF HFQLPLTLSV IRTLLVDIAG QPYAFPLSRI DQILTLNYKD IHSVENRQYF
TLEGQNIGLV RADQVLNISS PASPLEPLSI VILSDQTNRY GLVVDRFIGE KSLVVRPLDS
RLGKVQDISG AAILEDGSPI LILDVLDLVR SLDKLLANVQ VNQIKTEEEA EWKENKKHIL
VVDDSITVRE MEKKLLQNQG YLVDVAVDGM EGWNAVRMGN YDLVISDIDM PRMNGIKLVS
QIKNHPNLKS IPVIIVSYKD REEDRLQGLE AGADYYLTKS SFHDDTLINA VVDLIG