Gene PCC8801_2052 details

Gene Information

Locus tag: PCC8801_2052
Symbol:
ID: 7104298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2123995
End bp: 2125440
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 40%
IMG OID: 643475108
Product: response regulator receiver modulated diguanylate cyclase
Protein accession: YP_002372240
Protein GI: 218246869
COG category: [T] Signal transduction mechanisms
COG ID: [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR00254] diguanylate cyclase (GGDEF) domain
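The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2123995
end_bp = 2125440
gene_length_bp = 1446
protein_length_aa = 481


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1446
print(expected_protein_length(gene_length_bp))  # 481
```

Under these assumptions both computed values agree with the Gene Length and Protein Length reported in the record.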


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAACA AAGCCACCCA TAAAGCAGAT ATTTTAGTAG TAGAAGATCA ACTTGATAAC 
CTAAAGTTAT TATCTAATAT TCTCTGGGAT GAAGGTTATC AAGTGAGGCA AGCCATTGAT
GGAGAAATGG CCTTAATTGC CATTGATACT AAGTATCCTG ATTTAATTTT ACTTGATATT
AAAATTCCTC GTCTTGATGG TTATCAACTG TGTCAAACCT TAAAAGCCAA AGAAACCACG
GCTAATATTC CGGTAATTTT TTTAAGTGCC TTTGACGAAC TCTGGGAAAA AGTTAAAGCC
TTTGAAATGG GTGCAGTGGA TTATATTACT AAACCTTATC AAGCACCAGA AGTCTTAGCC
AGAGTAGAAA ACCAAGTTAA ATTAAGTCTT TTACAAAAAC AATTAAAAGC CCATAATAAT
ATCTTACAAC GGCAATTAGA TTTAACAGAA GCTGCCCAAG AATTTCATGG ACAACCTCAA
ATTGATGCTT ATTTATTACA TCAAGCCATA AAAGCAACCT ACAATGGAAT TATTATTACC
GATGCGACCC AATCCGATAA CCCAATTATT TACGTTAACC CTGGATTTGA AAGAATGACC
GGATATTCCT TAGAAGAAGT CAAAGGAAAA AATTGTCGCT TTCTTCAAGG AAACGATCGC
AATCAACCAG AGATTTTGTA CATGAAAACT TGTCTTCAAG AACACCGTCA ATGCTTTATT
ACCATTCGCA ATTATCGCAA AGATGGGTCA ATGTTTTGGA ATGAAGTTTC CTTGTCTCCG
GTTAAAGATG AATCAGGAAA ACTGGTCTAC TATATCGGAG TCCAGACCGA TGTAACCGTA
CGCAAACGAG TCGAAGAAGA AAGACAACGT TACGAGGCAT CAGTGCAAAA AATGAATCAA
GAATTGCATG AACTCAATAC GAAATTACAC CGCTTAGCCA ATCTTGATGG CTTAACAGAA
GTCGCTAACC GTCGCTGCTT TGATGATTGC CTGGAACAAG AATGGCGACG ATTATCCCGC
GAAGAGAAGC CCATATCTTT GATTTTAGGC GATATTGACT ACTTTAAACG ATTTAACGAT
ACCTACGGTC ATCTCCAAGG GGATGATTGT CTCAAACAAG TGGCCAAAGC CTTGAGTAAA
GGGGTACACC GTCCTGCTGA TTTAGTCGCG CGGTTTGGGG GAGAAGAATT TGCCGTTTTG
TTGCCGAATA CCCCGGCTTT TGGGGCTATG CAAGTTGCCC AAAAGATTCT AGAAGAAATT
CGACAGCTAC AAATTCCCCA TAAAGCATCT CAGGCTAAAC CCTATGTCAC CATGAGTTTA
GGGGTAGCCA CTGTTATCCC CTCCCTAGAC TTACCGCCGA AAACCTTAAT TGATACAGCC
GATGGGTATC TGTTTCAAGC AAAAAACCAA GGACGCGATC GCGCTATTGA TGGAGACAGT
CCCTGA
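
The gene sequence above is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used for anything else. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line of the sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and uppercase the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence in this record (sample input only).
first_line = (
    "ATGAACAACA AAGCCACCCA TAAAGCAGAT "
    "ATTTTAGTAG TAGAAGATCA ACTTGATAAC "
)

cleaned = clean_sequence(first_line)
print(len(cleaned))                    # 60
print(round(gc_fraction(cleaned), 2))  # GC fraction of this line only
```

Pasting the complete gene sequence into `clean_sequence` should give a string whose length can be compared with the Gene Length field, and whose GC fraction can be compared with the GC content field.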
 
Protein sequence
MNNKATHKAD ILVVEDQLDN LKLLSNILWD EGYQVRQAID GEMALIAIDT KYPDLILLDI 
KIPRLDGYQL CQTLKAKETT ANIPVIFLSA FDELWEKVKA FEMGAVDYIT KPYQAPEVLA
RVENQVKLSL LQKQLKAHNN ILQRQLDLTE AAQEFHGQPQ IDAYLLHQAI KATYNGIIIT
DATQSDNPII YVNPGFERMT GYSLEEVKGK NCRFLQGNDR NQPEILYMKT CLQEHRQCFI
TIRNYRKDGS MFWNEVSLSP VKDESGKLVY YIGVQTDVTV RKRVEEERQR YEASVQKMNQ
ELHELNTKLH RLANLDGLTE VANRRCFDDC LEQEWRRLSR EEKPISLILG DIDYFKRFND
TYGHLQGDDC LKQVAKALSK GVHRPADLVA RFGGEEFAVL LPNTPAFGAM QVAQKILEEI
RQLQIPHKAS QAKPYVTMSL GVATVIPSLD LPPKTLIDTA DGYLFQAKNQ GRDRAIDGDS
P