Gene PCC8801_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3026 
Symbol 
ID7104513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3144275 
End bp3148945 
Gene Length4671 bp 
Protein Length1556 aa 
Translation table11 
GC content34% 
IMG OID643476053 
Productamino acid adenylation domain protein 
Protein accessionYP_002373166 
Protein GI218247795 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACTG AACAAAAATT CACTTCAATT AACAGCTACC TCCAAACCTT TAATCCAGAA 
GACGTTTTTA TTTTTCCTAC ATCTTCTGCC CAAGCAAGAC TCTGGTTTTT AGCAGAACTT
GAACCTGATA GTTCCGTTTA TAACGAACAG GTTATTATTG AATTACAAGG ACAATTTAAT
CAAACTGCCT TAGAAAAAAG CTTTAATGAA ATTGTCCGTC GTCATGAAAT TTTACGCACC
TTTTTTACAA CAGATCAAGG TAAACCAGTC CAAGTAATTT TGCCTTACCT ATCCCTAGAA
TTTCCTGTTA TTGATCTTAG TCATCTCTCA TCAAAAGAAC AAGAAACGAT TATTAAAAAA
CACACAATCA ATGAAGGACA AAAACCCTTT GATTTAACAC AGATTCCCCT CATTAGAATT
ATCGTCTTAA AATTACATCC TCAAAAACAT CTATTATTAA TTACTCTACA TCACATTATT
ATTGATGGTT GGTCAGGTGG CGTATTAATG AAAGAATTAG GGCTATTATA TGAAGCTTTT
TGCCATAAGA AATCCTCTCC TCTCCCTCCA TTATCTATTC AATATGCTGA TTTTACTATC
TGGCAAAATG ACTATTTACA AAGCGAAAAA CTGAAACAAC AACTCAGTTA TTGGCAAGAA
AAATTAACAA GTATTCCACC CTTATTAGAA CTGCCAACTG ATTATCATCG TCCTCCCATG
CAAACTTTCA AAGGTCAATG TCAAACCTTT GAATTAACAC CCGAATTAAG CTATCAATTA
AAACAATTAA GTCAGAAAAA TGGGTCAACC CTATTTATGA CCTTACTAGC AGCTTGGACA
ATTTTACTGT CACGTTATAG TCGTCAACAA GATATTGTCA TTGGATCTCC TATCGCTAAT
CGTAACCGAG TAGAAATAGA ACCATTAATC GGTTTTTTTG CTAACACCTT AGTGTTAAGA
ACAAACTTAT CAGGAAACCC CGCTTTTAGC GAATTACTCA ACCAAGTCAG ACAAGTATGT
TTAGAAGCTT ATACACATCA AGATATTCCT TTTGAGAAAT TAGTTGAAGC CTTACAACCC
GAAAGAAATT TAAGTCAAAC CCCTTTATTT CAAGTCATGT TTGTTTTACA AAATGCCAAC
TATGAAGAAC TCAAGTTATC AGATTTTACC TTCAGATTTC TTCCCAAAGA AAATACTATT
GCAAAATTTG ATTTAACCTT GTCAATGTTA GAAAGCCAAG AAAAATTAAC AGGAGAAATA
GAATACAATC AAGATTTATT TAACTGTTCA ACTATTGCTA GAATGGCAGA ACATTTTTTA
GGATTGTTAG GAGAAATTGT TAGAAATCCT CAAAAACCTA TTACTGAATT GTCTTTCTTA
ACAGAATCCG AACAAAACCA ATTATTAATT CATTGGAATA ATACAAAAGT TGAATATTCT
AAAACTCAAT GTATTCATCA GTTATTTGAA GAACAAGTTA TTAGAACTCC TGATGCGATC
GCCCTTGAAT TTGAAGGAAT AACTTTAACC TATTTTGAAT TAAATCAGAA AGCGAATCAG
TTAGGCCATT ACTTACAAAA ATTAGGAGTA AAACCAGATA GTTTAGTCGG AATTTGTGTC
AAGCGATCGC CAGATATGAT CATTGGGGTT TTGGGGATTC TCAAAGCAGG AGGGGCTTAT
ATTCCCATCG ATCCAACCTA TCCCAAAGAA CGGATTGCTT ACTTATTAGA AGATGCTCAA
ATTGACCTAT TATTAAGTCA AGCAGGGTTA TCAACTGAAC TACCCCAGAG TCAAACAACA
GTTATTAACT TAGATGAAAA TTGGTCAGAA ATTGCCTTAG AAAGTTGCAA TAATGTTCTG
AGTCAAGTCA CTCCCCAAAA CTTAGTTTAT ATCATCTATA CATCAGGTTC CACAGGAAAA
CCCAAAGGGG TAATGATTGA ACATCAATCC TTAGTCAATT TCACTAAAAC TGCTATTGAT
ATCTATAACA TTACGCCAAA AGATAGGGTT TTACAATTTG CTTCTATTAG TTTTGATGCC
GCAGCCGAAG AAATCTATCC TTGCTTAAGT TCTGGGGCAA CTTTAGTATT GAGAACTGAC
GAAATGATTG CCTCAACCGA TACCTTTTTC CATCAATGTC AAGCCAAACA GTTAACAGTT
TTAGACTTAC CAACCGCCTA TTGGCAACAA ATTACAACTG AGTTAGAAAA CGCTAATCAA
AAGTTACCTA ATACCTTACG TTTAGTAATT ATTGGTGGGG AAAGAGCAAT TCCCGAAAAA
ATCGAAACTT GGCACAAAAA AGTTGGAGAT TTCCCAAAAC TTTTAAATAC TTACGGACCT
ACTGAATCTA CCGTTGTCGC TACTGTTTAT CCTTTAATTT CATCGGTTAA AATAAAACAA
GAAGTTCCCA TCGGAAAGCC AATTAATAAT GTCACAACTT ATATCTTAGA TCCTAACTTA
CAACTGGTTC CAATTGGAGT TCCAGGGGAA TTATATTTAG GAGGTATGGG ACTAGCAAAA
GGCTATCTTA ATCGCCCTAA ATTAACCGCA GAAAAATTCA TAATTAATCC TTTTAATACT
TCAGAAAGAT TATATAAAAC AGGTGATTTA GTTCGCTATC TTTCTGATGG CAATATTGAA
TTTTTAGGCA GAATTGACAA TCAAGTCAAA ATCCGAGGAT TTCGTATCGA ATTAGGTGAA
ATAGAGTCCA TTTTAAGCCG TCACCCTGAG ATTAAAGAAA CAGTTGTTAT TGTCAGAGAA
GATACTCCAG CTCAAAAACG TTTAGTTGCT TATATTACTA GCGATAAAAT TGCTTCTCAA
ACTGACAATA ATTGGTATGA AACCTTAAAA CATTATCTCA AGACAAGCCT ACCTGATTAT
ATGATTCCCC AAAGTTTTGT TTTGTTAGAA AACCTTCCGT TAACTGTCAA TGGAAAAATC
GATTATTCTC GTTTGCCTTG TCCTGATTAT TCATTAATCA ATTCAGACAA AAATTATATT
GCTCCTCGTA CCCCTATTGA AGCTACCTTA GCCCAAATTT GGTCAGATAT TTTAAAGCAT
CAAAACATTA GCATTAACCA CAACTTTTTT GATTTAGGGG GAGACTCTAT TATCAGTATT
CAAGTGATTG CCCGCGCTAA GCAAGCCGGG ATTAAAATCA AACCTAAACA ACTCTTTGAA
TATCAAACTA TCGCTGAATT AGCTGCTGTC GCCAAGGTTG ATCAATCTTC ATTAAATGAA
CAAGAGTTAA TAACAGGTTC TGTTCTACTT ACTCCTATTC AACACTGGTT TTTTAATCAG
AATTTAGTAG AATCTCATCA TTGGAACCAA GGTATTTTAT TAGAAGTAGA AGCGGATATA
AATATTAACG ATTTACAGGA AGCTATTAAA CATTTATTAA TTTATCATGA TGGTTTACGA
TTGCGGTTTG AGTTCGTTAA TAATCATTGG CAGCAAATTT ATAGTCATCC TCAAGATAGC
ATTCCTTTAG AGATTGTTGA TCTTTCAAAT GTGTCACCTA ATCAACAATC TAATCTGATT
AAACAAAAAG CCAATGAATG CCAAAGTAGT CTTGATTTAA GTCAAGGTTT AGTATTCAAA
TCTCTATTTT TTTACTTAGG AAATAATCAG CCAAATCGTT TATTAATCAT TATCCATCAC
TTAGTCATAG ATGGAGTTTC TTGGCGAATT TTACTTGAGG ATTTAGTGAC AGTCTACCAA
CAACTGCATC AAGAACAAAC AATCCAACTT CCTCCCAAAA CGACTTCTTT GCAAAAATGG
AGTCAAAAAC TTTATGATTA TGCCCAATCA GAAAAAATCC AGAAAGAATT AGATTATTGG
CTAACCCTAT CACAAATAAA AATCAACTCT ATTCCTGTAG ATTATCAGGT TAGTTTAACC
CAAAATACCG TCGCTTCTAC CCAACAAATA ACAGTAGGGT TAAACGCCGA AAAAACTCGC
GCCTTACTTC AAGATGTTTC GGCTGTTTAC AATACTCAAA TTAATGATTT ATTATTAACA
GCTTTAGTTC AAAGTTTTGC TGGTTGGACT GGTGACTTTT CTCTTTTAAT AGAATTAGAA
GGTCATGGAC GAGAAGACTT GTTTGACGAT GTTGATTTAT CCAGAACAAT AGGATGGTTT
ACCTCAGTTT TTACTGTCTT ATTAACCTTA CCTCAAAGTC AGGATTTAGG AGACAGATTA
AAAAGTGTTA AAGAACAACT AAGACGCATT CCACAAAAAG GAATTAATTA TGGAATTTTA
CAGTATTTAA CAGACAATCA ACTTATCAAA AGCCAATTGA ACCAAATGCC AAAACCACAA
ATCAGTTTTA ACTACTTAGG ACAATTTTAT CAACAGATCT CTTCACCTCC TTTACTTAAT
TTAGCGCAAG AACCCATTGG GTTTATGCGG AGTCCTAAAG GAATTAATCA TCATCTGATT
GAGATTAATG CTTGGATTAT GTCAGAAAAA TTAGAAATTA CCTGGAGTTA CAGTGAAAAT
TTGTATTACA AAAACACAAT TGAAAAATTA GCTAAAGGGT ATAAAACAGC CCTAGAAAAA
CTCATTAGTT ATTGTCAATC TGCGGAAGCT GGTGGTTATA CCCCTTCAGA TTTTCCTGAA
GCTAATTTAA GTCAAGAGGA ATTAGATCAA CTCTTTTTAG AATTTAGTTG A
 
Protein sequence
MNTEQKFTSI NSYLQTFNPE DVFIFPTSSA QARLWFLAEL EPDSSVYNEQ VIIELQGQFN 
QTALEKSFNE IVRRHEILRT FFTTDQGKPV QVILPYLSLE FPVIDLSHLS SKEQETIIKK
HTINEGQKPF DLTQIPLIRI IVLKLHPQKH LLLITLHHII IDGWSGGVLM KELGLLYEAF
CHKKSSPLPP LSIQYADFTI WQNDYLQSEK LKQQLSYWQE KLTSIPPLLE LPTDYHRPPM
QTFKGQCQTF ELTPELSYQL KQLSQKNGST LFMTLLAAWT ILLSRYSRQQ DIVIGSPIAN
RNRVEIEPLI GFFANTLVLR TNLSGNPAFS ELLNQVRQVC LEAYTHQDIP FEKLVEALQP
ERNLSQTPLF QVMFVLQNAN YEELKLSDFT FRFLPKENTI AKFDLTLSML ESQEKLTGEI
EYNQDLFNCS TIARMAEHFL GLLGEIVRNP QKPITELSFL TESEQNQLLI HWNNTKVEYS
KTQCIHQLFE EQVIRTPDAI ALEFEGITLT YFELNQKANQ LGHYLQKLGV KPDSLVGICV
KRSPDMIIGV LGILKAGGAY IPIDPTYPKE RIAYLLEDAQ IDLLLSQAGL STELPQSQTT
VINLDENWSE IALESCNNVL SQVTPQNLVY IIYTSGSTGK PKGVMIEHQS LVNFTKTAID
IYNITPKDRV LQFASISFDA AAEEIYPCLS SGATLVLRTD EMIASTDTFF HQCQAKQLTV
LDLPTAYWQQ ITTELENANQ KLPNTLRLVI IGGERAIPEK IETWHKKVGD FPKLLNTYGP
TESTVVATVY PLISSVKIKQ EVPIGKPINN VTTYILDPNL QLVPIGVPGE LYLGGMGLAK
GYLNRPKLTA EKFIINPFNT SERLYKTGDL VRYLSDGNIE FLGRIDNQVK IRGFRIELGE
IESILSRHPE IKETVVIVRE DTPAQKRLVA YITSDKIASQ TDNNWYETLK HYLKTSLPDY
MIPQSFVLLE NLPLTVNGKI DYSRLPCPDY SLINSDKNYI APRTPIEATL AQIWSDILKH
QNISINHNFF DLGGDSIISI QVIARAKQAG IKIKPKQLFE YQTIAELAAV AKVDQSSLNE
QELITGSVLL TPIQHWFFNQ NLVESHHWNQ GILLEVEADI NINDLQEAIK HLLIYHDGLR
LRFEFVNNHW QQIYSHPQDS IPLEIVDLSN VSPNQQSNLI KQKANECQSS LDLSQGLVFK
SLFFYLGNNQ PNRLLIIIHH LVIDGVSWRI LLEDLVTVYQ QLHQEQTIQL PPKTTSLQKW
SQKLYDYAQS EKIQKELDYW LTLSQIKINS IPVDYQVSLT QNTVASTQQI TVGLNAEKTR
ALLQDVSAVY NTQINDLLLT ALVQSFAGWT GDFSLLIELE GHGREDLFDD VDLSRTIGWF
TSVFTVLLTL PQSQDLGDRL KSVKEQLRRI PQKGINYGIL QYLTDNQLIK SQLNQMPKPQ
ISFNYLGQFY QQISSPPLLN LAQEPIGFMR SPKGINHHLI EINAWIMSEK LEITWSYSEN
LYYKNTIEKL AKGYKTALEK LISYCQSAEA GGYTPSDFPE ANLSQEELDQ LFLEFS