Gene PCC8801_1716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1716 
Symbol 
ID7101674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1800833 
End bp1804132 
Gene Length3300 bp 
Protein Length1099 aa 
Translation table11 
GC content45% 
IMG OID643474784 
Productputative PAS/PAC sensor protein 
Protein accessionYP_002371920 
Protein GI218246549 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAA CCAACTTCGA TTTATCATCT TCGTATTTAG CTAAACAACA AGCATTATTA 
GCAATAACTC GCCAAATTCA ACAAAGAGCA ACGTTAAGGG AAATTCTGGA ATTGATCCTC
TTAAAAGTTG AGCAATTATT AAACTTAGCG TGGGGAGGAA TTTATCAATT TAAGGATAAT
TTAGAGAAGA ATTCAGACCA ATTGATTGCC CAAACTGGAA AATTCCCTCC TTGTCTCTTA
AAACAAGGAG AACCATTAGA AACCCTTGAA GGAATTGGGT TACTCAAGGG CTTAAATAAG
ATAAAATTAG ACAAGAATTA TCCTTATTCG TTTCAACCCT TAAATCTGCC AAACCTGATA
GGCGTTCCCA TCTATCAAAC CGAGGAATTA TGGGGACTAT TGCTGATTTA TTCGAGTGTA
CCCCCTCATG AGTGGCAACA GGAAGACAGC TTTTGGGTTC AGCAGATTGC CCTATCTCTA
GGAATAGCCC TACAGCAAGA CCAAATGTAC CAACAACAGC AGCAACAATC AGAACGCTTA
ACCCAAGCAG TTGAAGGGGC AGTAGAACGG GAGAAAACTG TCGCTAAAAT TATCGATAAA
ATTCGGCGAT CGCTAGACTT AGAGACAATT TTCACCACAG CCACCCAAGA AGTGCGACAA
CTCCTCAACA GCGATCGCGT GGCCATTTAT CGCTTCAATC ACGATTGGAG TGGAAAAATT
ATCTCTGATT CAGTTGGCCA AGGATGGACA TCCTTGGTAG AGAAGCAATG CCAATATCCT
ATGGTTGACC CTAACATGGG GCAATGTCAA GTCAGAAGTC TGTGTAATCC TCAACTCACC
GACACCTATC TAAAAGCCAT TGAAGGGAAG ATTTTTAACC CGAACAACCT AGTTCGTGTT
TGCAATAATA TCTACGAAGC CGGGTTTTCT CCCTGTTATA TCGAGATCCT CGAAGAGTAC
GAAGCCAAAG CCTATATTAT CGTAGCGATT TACCATCAAG ACACCCTCTG GGGATTACTC
GCTGCCTATC AAAACGCAAA TCCCCGTCAG TGGCAAGACG AAGAAATAAA CTTTTTAGTA
CAAATCGGAG CTAATTTAGG CATTGCCCTA GACCAAGCCC AATTATTAAA CAAAACCCGA
CAAAATCAAG AAAAACTCGA AACATCCCTC GCCATTGCCC TGAGGGAACA GAAAGAAACC
CTCGCCAAAA TTGCCGAAAA AGAGCGATCG CTGTCCACCG TCATCGACAA AATTCGCCTC
ACCCTCGACC TAACCACCAT TTTCCAAACC GCCGCCACCG AAGTTCAAAA ACTCCTAGAA
GTCGAACATA TTGCCATTTA TCAGTTTGAA ACAGACGGGA AAGGTCGATT TATCTTTGAA
TCCGATCCAG GACCATTTCC TAGCGTCGTC GGACAGCTTT GGGATGACCA ATTTTTACAG
CAACATCATT CCTGTGTCAT CAACGACATC AAACACGATC AAGACTGTTG TCAAGACCAA
ATTAACCCCT TCAAAGAACT CGGTGTCATG TCCCTAGTAA TGGTTCCCCT CTTTCAAGGA
GAAGACCTTT GGGGACTCCT GACAGCCTTT CAACATACCA GTTCCCGACA TTGGCCAGAA
ACGGACGTGC AATTACTCGA ACAGGTGGCA CACCACCTCA GTATAGCTCT CCAACAAACC
CACTATCTGC AACAAATTCG AGACTATGCC AAGGAACAGG CTAGAGCAGT ACAGCAGGAA
AAAGCCCTCT CTCAAGTCAT TGATAAAATT CGGCGTACCC TTGACCTAGA AACCATTTTT
AAGACAACGG CCACCGAAGT CCGTCAACTG TTAGAAGCTG ACCGGGTAGC TGTCTTCCAA
TTTGATCCTA ACTCCCACTG GACAAGCGGG CAATTCGTCT CCGAGGATGT TAGTCCAGAA
TTTGACTCAG CCTTAGAAGC AAAAGTAGAA GATCACTGTT TTGGCGAAAA TCATGCGAAT
TATTACCAAC AAGGGCAAAT CCTAGCCCAT GATGACATCT ACCAACAGGG CTTACTAGAA
TGTCATATTG CCATCCTCTC TCGGTTTCAA GTTAGGGCTA ACTTGGTTGC CCCCCTCATC
AAAGGACAAC AATTATGGGG GTTACTGTGC ATTCATCAAT GTTCCGGTCC GCGTCAGTGG
CAAACCTCAG AAAAAGAATT TGTGGCAAAA ATTGCTAATC ATTTAGGCGT TGCCCTGCAA
CAAGCCCAAT TACTGCAAGA AACGCAACAA CGCTCCCAAG ATCTACAGAA AGCCCTAGAA
CAGGTTAAAC AACAAAAAGG ACATCTAGCC CAAGTCGCTG CCCAAGATCG GGCTTTAGCG
AGAGTAATCG AGCGGATTCG CCAAACCCTA GACCTAGAGA CGATTTTTAG CACCACCACC
CAAGAAGTGC GCCAGATGCT CAAATGCGAT CGCGTCGTGG TTTATCGCTT TAACCCTGAC
TGGGGAGGGG AATTTCTCTA TGAGTCCGTT GGGGAGGGAT GGACCCCTTT AACGGGAGAA
TCCGCCGAAC AACCCCTTTG GAACGACACC TATTTACAAG AACACCAAGG AGGTCGCTAT
CGCTACAATG AACTGTTTGC TGTGGATGAT ATCTACCAAG CTAACTTAAC CCCTTGCCAT
GTTGAGTTAC TGATTCAATT CCAAGTTAAC GCTTTTGTGG TCGTTCCTGT CTTTGTGGGC
GATCGCCTCT GGGGACTGTT GGCTATCTAC CAAAACTCAG CCCCCCGTTT CTGGGAAAAA
CGAGAAATCG AGCTCCTTAA ACAACTGGCC AATCAATTAG GGGTTGCGGT TCACCAAAGT
CAATTATTAA CCCAAACGCA ACAACAATCA GAAACCCTGC AAAACACCGT CGCGGATCTC
AACGCCATTG TTGATAACCT CGGAGATGGA CTCATCGTAA TCGACATCTA TGGTCAAATT
ACCCGTTATA ATCGCGCATT GACGGAGATG TTTAACCTAA AAGACTCTCC CCAAGGACAA
CACTTATTAG AGGTTTTTCC CCCCGTCTTA GCCACACTCC TCAAACCCAT AGAACCTGAT
CAGCAAAAAG TCGTCACCGT TGAGGTGGAA TTAGCCCAAG GGAAAATTGG ACAAGCATTA
ATTAGTAAGA TTATCCAAGG CAGTTACGGG CAAGGGACAC AAAAATGTAT CGGGGCAGTG
ATTTTAATTC GAGATGTCAC CGCAGAACGG GAAATTGAAC AGATAAAAAA TGATTTTTTA
GCCACCGTTT CCCATGAATT ACGCACCCCG TTAACCTCGG TTTGGGCTTT GCCGCCTTGA
 
Protein sequence
MSETNFDLSS SYLAKQQALL AITRQIQQRA TLREILELIL LKVEQLLNLA WGGIYQFKDN 
LEKNSDQLIA QTGKFPPCLL KQGEPLETLE GIGLLKGLNK IKLDKNYPYS FQPLNLPNLI
GVPIYQTEEL WGLLLIYSSV PPHEWQQEDS FWVQQIALSL GIALQQDQMY QQQQQQSERL
TQAVEGAVER EKTVAKIIDK IRRSLDLETI FTTATQEVRQ LLNSDRVAIY RFNHDWSGKI
ISDSVGQGWT SLVEKQCQYP MVDPNMGQCQ VRSLCNPQLT DTYLKAIEGK IFNPNNLVRV
CNNIYEAGFS PCYIEILEEY EAKAYIIVAI YHQDTLWGLL AAYQNANPRQ WQDEEINFLV
QIGANLGIAL DQAQLLNKTR QNQEKLETSL AIALREQKET LAKIAEKERS LSTVIDKIRL
TLDLTTIFQT AATEVQKLLE VEHIAIYQFE TDGKGRFIFE SDPGPFPSVV GQLWDDQFLQ
QHHSCVINDI KHDQDCCQDQ INPFKELGVM SLVMVPLFQG EDLWGLLTAF QHTSSRHWPE
TDVQLLEQVA HHLSIALQQT HYLQQIRDYA KEQARAVQQE KALSQVIDKI RRTLDLETIF
KTTATEVRQL LEADRVAVFQ FDPNSHWTSG QFVSEDVSPE FDSALEAKVE DHCFGENHAN
YYQQGQILAH DDIYQQGLLE CHIAILSRFQ VRANLVAPLI KGQQLWGLLC IHQCSGPRQW
QTSEKEFVAK IANHLGVALQ QAQLLQETQQ RSQDLQKALE QVKQQKGHLA QVAAQDRALA
RVIERIRQTL DLETIFSTTT QEVRQMLKCD RVVVYRFNPD WGGEFLYESV GEGWTPLTGE
SAEQPLWNDT YLQEHQGGRY RYNELFAVDD IYQANLTPCH VELLIQFQVN AFVVVPVFVG
DRLWGLLAIY QNSAPRFWEK REIELLKQLA NQLGVAVHQS QLLTQTQQQS ETLQNTVADL
NAIVDNLGDG LIVIDIYGQI TRYNRALTEM FNLKDSPQGQ HLLEVFPPVL ATLLKPIEPD
QQKVVTVEVE LAQGKIGQAL ISKIIQGSYG QGTQKCIGAV ILIRDVTAER EIEQIKNDFL
ATVSHELRTP LTSVWALPP