Gene PCC8801_4412 details


Gene Information

Locus tag: PCC8801_4412
Symbol:
ID: 7104858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4635314
End bp: 4638799
Gene Length: 3486 bp
Protein Length: 1161 aa
Translation table: 11
GC content: 38%
IMG OID: 643477391
Product: WD-40 repeat protein
Protein accession: YP_002374490
Protein GI: 218249119
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
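
The coordinates and lengths listed in the record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4635314
end_bp = 4638799

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print("Gene length (bp):", gene_length)
print("Protein length (aa):", protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.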


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCTAA CGACGGAAAA ACCTTATATC TATAAAGTGG GTGGTAGTCT CGCGTTTGAT 
CATCCTACCT ATGGAGAACG TCAAGCTGAT CGAGAGTTAT TAGACTCATT GAAAGTAGGT
AAATTTTGTT ATATTTTTAA CTGTCGTCAG ATGGGAAAAT CGAGCCTTAG AGTTCGTGCG
ATGCACCAAC TCCAAGCAGA AGGAATGAGT TGTGCATCGG TGGATATTAC CAGTTTAGGG
AGTGATATTA GTCAACAGCA ATGGTACAGT GGCATTATTA CGCAATTATT TCTCGGATTT
AATCTGGTTG GCAAAATTAA TCTGAAAATG TGGCTACGGG AACGAGACGA ATTATCGGGG
GTACAAAAGT TTAGCCATTT CCTTGAAGAA GTCTTATTAG TTCACTGTAA AGGCGAAAAA
ATTTACATTT TTATTGATGA AATTGACAAA GTTCTGAGTC TGAATTTTTC CCTAGATGAT
TTCTTTACTT TAATTCGCTT TTGTTATAAT CAACGAGCAG AAAATAAAAA CTATGAGCGC
CTGGTATTTG CTTTATTTGG CGTTGCGACT CCTTCCGATT TAATCCGAGA AAAAACCCAA
ACTCCTTTTA ACATTGGTCA ACCCATTGAA TTGACGGGGT TTACCCTAGA AGAAGTTGCC
CCATTAGCAT CAGGGTTAAG ACTGATTGCA AACCGTCCTA ACGAGGTTTT AAAAGCCATT
TTAGACTGGA CAGGAGGACA ACCTTTTTTA ACCCAAAAAC TCTGTCAATT GCTCTTAAAA
ACCCTAGAAA CGATTCCTCA AGGACAAGAA GAAGAAACCG TTGCAAAGGT TGTTAAAGAC
TTGATTATTA ATAATTGGGA ATCTCAGGAT GAACCCGTAC ATTTGAAGAC GATCCGCGAT
CGCCTTCTCC GCAATGAAAA ACGAACAGGA AGACTGTTAG GATTATATCA ACAGATTCTG
CAACAGGGAT TAATAGAAGC TGATGATAGT CCCGAACAAA CGGAATTACG ACTCTCAGGA
ATCGTGGTTA AACGAGATAA TAAGCTAGTG GTTTATAATC CTATTTACGA AGCTGTTTTT
AATCCTAAAT GGGTTAGTAA AGAACTCGAA AAGATTCGCC CCTATTCTGA ATCGATTACC
GCTTGGATTG CATCTAATTA TGAAGATAAG TCTCGTTTAT TGCGAGGACA AGCCTTAAAA
GATGCCCTAG GTTGGGCGAT GGATAAAAAC TTAGGCAATA TTGATTATCA GTTTTTAACT
GCTAGTCAGA AATTAGATAA ACGCGAAGCA GAATTAAATT TAGCTGCCGA AAAAGAAGCC
AATGAAATTT TAACTCAAGC CAATCAAAAA GCCCAACGAA TGATCCGAAT TGGGTTAGGA
ATTTTAGTCG TGTCTTTAGT TGGTGCGCTA ATTTCCTTTA CCCAAGCAAG ATCGGCTATT
CAAAAGCAAC AAGAAGCTAA AAAAGGAAGT CAATTAGAAC AAATGGGAGA TAGTGCTTGG
CGACAATTTG AATTTGAACA ACTTGATGGG TTAATATCAG CAATGGAGGG CGTACAAAAG
TTAAAAACTA TTGTTAAAGA TCAACGAATA CTAAAAGACT ATCCAGCCAC TCGTCCCATT
ATTGTCCTAG AACAAATTCT TGATCGCATA CACGAAAAAA ACAAATTAAC TGGTCATGAA
GATGCCGTCA ATAGTGTTAC TTTTAGTCCT AATGGTCAAT TAATTGCCAC CGCATCTAGT
GATGGAACCA TTCGTCTGTG GGATCGCCAA GGACGACAAA AAACAGTTAT TACAGGACAT
AAAGGTAATA TTTATCGAGT CACTTTTAGT CCCGATGGTC AACTTATTGC CAGCGCATCC
CAGGATAATA CCGCCAAAGT TTGGAACTTA CAAGGGCAAG AATTAATGAC ACTTAAAGGT
CACAATTCAT CCGTTTATAG TGTTAGTTTT AGTCCAGATA GCAAACACCT TCTAACCACT
TCTAGGGATG ACACAGCCAG AATTTGGGAC TTACAAGGAC ACCAACTTGC TATCTTAAAA
GGTCATGAAA AATCGATTGA TCATGGTGTA TTTAGTCCCG ATGGTCAACG CATTGCCACA
GCCTCACGGG ACGGAACCGT TAGAATTTGG GATAATCAAG GAAATCTCTT GAAAATCTTG
AAGGATAGCG TAGATTCTTT TTATAGCGTT AGTTTTAGTC CCGATGGTCA ACGTTTAGCC
TCTTCGGCAA AAGATGGAAC TGTTAGAATT TGGGATAATC AAGGAAAATC AATCTTAACT
CTCAAAGGTC ATCAAGAATT AGTAAAAAAT GTCACCTACA GTCACGACGG CAACTGGATA
GCTACTGCAT CGAGTGATGG AACCGCTAGA GTATGGAACA CTCAAGGACA AGAAGTGATG
GTCTTTCGAG GCCATCAAGA CCCTGTTTAT GATGTGGCTA TTAGTTCCAA TAGTCAAGAA
TTAGCAACCG CATCGAGTGA TGGAACAGTC AAACTTTGGC ACATCAACTC ACCTCAGCAA
CAAGGGTTTA ATACCCTTGA TACCTATGTG ACAGCCGTGA GTGTATTTCC TGATGATCAA
TTATTAGCGA TCGCGTCTGA AAATGGTCAA GTTTATCTGT GGAATTTACA AGGAAAATTT
CTCTGGGAAT TTGAAGGACA CAATAGCGGA ATTAATAGTT TAAATTTTAG TCCAGATGGT
CAAAAAATTG CGACGGCTGA TAACAATGGA CGAGTTAAAT TATGGGATAG AAAAGGAAAA
ATTTTAGCGG AATTATTTGA CAATTCAGTT AGGGTTTATA GTGTCACTTT TAGTTCTGAT
AGTAATTTAT TGGCGATCGC TACCCGTTCA GGAGAAGTTT GGCTATGGAA TATCGAAAAA
ATGCCTCCCC AATTGATTCA TCAATTTACA GCCCATCAAG AAACTATTTA TCAACTGAGT
TTTAGTCCTG ATGGACAAAC TTTAGTCACA GCTTCTGGTG ATAAAACAGC TAAATTATGG
GACTTACAAG GAAATTTACA GCAAGAATTT TTAGGACATA CTGCTCAGGT AAATGGGTTA
GCTTTTAGTC CTAATGGTCA ATATTTATTA ACTGCGTCTG AGGATAGCAC TGCCAAATTA
TGGGATCTCA AGGGTAATGT TTTAGCAACC TTAGAAAGCG ATCTTTTTCC AGTTTCGCGT
GTTAATTTTA GTCCCGATGG CCAAAAATTA GCGACTGCAT CACGCGATGG AACAGTTAGA
TTGTGGGATC TTGAAGGTCA TCTTCATACT CAAATGAAAG GACATCAAGA AGCGATTGGA
GAATTACAAT TTACTCAAGA TAGTCAACAG TTAATAACGA TAGATAGGGA TGGTGCAGTC
AAGATTTGGC CAGTACAAGA AGAGTTTGTT CGCCTAGAAA ATCTCTTTAA TAAAGGATGT
CAATGGCTAC AAGATTATTT AGTCACTAAT CAGGAGGAAA AAGCTAAACT AGAGGCCTGC
CAATAG
 
Protein sequence
MNLTTEKPYI YKVGGSLAFD HPTYGERQAD RELLDSLKVG KFCYIFNCRQ MGKSSLRVRA 
MHQLQAEGMS CASVDITSLG SDISQQQWYS GIITQLFLGF NLVGKINLKM WLRERDELSG
VQKFSHFLEE VLLVHCKGEK IYIFIDEIDK VLSLNFSLDD FFTLIRFCYN QRAENKNYER
LVFALFGVAT PSDLIREKTQ TPFNIGQPIE LTGFTLEEVA PLASGLRLIA NRPNEVLKAI
LDWTGGQPFL TQKLCQLLLK TLETIPQGQE EETVAKVVKD LIINNWESQD EPVHLKTIRD
RLLRNEKRTG RLLGLYQQIL QQGLIEADDS PEQTELRLSG IVVKRDNKLV VYNPIYEAVF
NPKWVSKELE KIRPYSESIT AWIASNYEDK SRLLRGQALK DALGWAMDKN LGNIDYQFLT
ASQKLDKREA ELNLAAEKEA NEILTQANQK AQRMIRIGLG ILVVSLVGAL ISFTQARSAI
QKQQEAKKGS QLEQMGDSAW RQFEFEQLDG LISAMEGVQK LKTIVKDQRI LKDYPATRPI
IVLEQILDRI HEKNKLTGHE DAVNSVTFSP NGQLIATASS DGTIRLWDRQ GRQKTVITGH
KGNIYRVTFS PDGQLIASAS QDNTAKVWNL QGQELMTLKG HNSSVYSVSF SPDSKHLLTT
SRDDTARIWD LQGHQLAILK GHEKSIDHGV FSPDGQRIAT ASRDGTVRIW DNQGNLLKIL
KDSVDSFYSV SFSPDGQRLA SSAKDGTVRI WDNQGKSILT LKGHQELVKN VTYSHDGNWI
ATASSDGTAR VWNTQGQEVM VFRGHQDPVY DVAISSNSQE LATASSDGTV KLWHINSPQQ
QGFNTLDTYV TAVSVFPDDQ LLAIASENGQ VYLWNLQGKF LWEFEGHNSG INSLNFSPDG
QKIATADNNG RVKLWDRKGK ILAELFDNSV RVYSVTFSSD SNLLAIATRS GEVWLWNIEK
MPPQLIHQFT AHQETIYQLS FSPDGQTLVT ASGDKTAKLW DLQGNLQQEF LGHTAQVNGL
AFSPNGQYLL TASEDSTAKL WDLKGNVLAT LESDLFPVSR VNFSPDGQKL ATASRDGTVR
LWDLEGHLHT QMKGHQEAIG ELQFTQDSQQ LITIDRDGAV KIWPVQEEFV RLENLFNKGC
QWLQDYLVTN QEEKAKLEAC Q
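
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in the start codons it permits; this gene starts with ATG, so that difference does not matter here). The function name `translate` and the helper constants are hypothetical, introduced only for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: internal codons of translation table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGAATCTAA CGACGGAAAA ACCTTATATC"
print(translate(first_blocks))
```

Passing the full gene sequence, with the 10-base blocks and line breaks left in place, should yield the protein sequence shown in the record, since the function strips whitespace before reading codons.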