Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3095 |
Symbol | |
ID | 8392425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 3149589 |
End bp | 3154259 |
Gene Length | 4671 bp |
Protein Length | 1556 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 644981041 |
Product | amino acid adenylation domain protein |
Protein accession | YP_003138773 |
Protein GI | 257060885 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720 [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACTG AACAAAAATT CACTTCAATT AACAGCTACC TCCAAACCTT TAATCCAGAA GACGTTTTTA TTTTTCCTAC ATCTTCTGCC CAAGCAAGAC TCTGGTTTTT AGCAGAACTT GAACCTGATA GTTCCGTTTA TAACGAACAG GTTATTATTG AATTACAAGG ACAATTTAAT CAAACTGCCT TAGAAAAAAG CTTTAATGAA ATTGTCCGTC GTCATGAAAT TTTACGCACC TTTTTTACAA CAGATCAAGG TAAACCAGTC CAAGTAATTT TGCCTTACCT ATCCCTAGAA TTTCCTGTTA TTGATCTTAG TCATCTCTCA TCAAAAGAAC AAGAAACGAT TATTAAAGAA CACACAATCA ATGAAGGACA AAAACCCTTT GATTTAACAC AGATTCCCCT CATTAGAATT ATCGTCTTAA AATTACATCC CGAAAAACAT CTATTATTAA TTACTCTACA TCACATTATT ATTGATGGTT GGTCAGGTGG CGTATTAATG AAAGAATTAG GGCTATTATA TGAAGCTTTT TGGCAGAAGA AACCCTCTCC TCTCCCTGCA TTATCTATTC AATATGCTGA TTTTACTATC TGGCAAAATG ACTATTTACA AAGCGAAAAA CTGAAACAAC AACTCAGTTA TTGGCAAGAA AAATTAACAA GTATTCCACC CTTATTAGAA CTGCCAACTG ATTATCATCG TCCTCCCATG CAAACTTTCA AAGGTCAATG TCAAACCTTT GAATTAACAC CCGAATTAAG CTATCAATTA AAACAATTAA GTCAGAAAAA TGGGTCAACC CTATTTATGA CCTTACTAGC AGCTTGGACA ATTTTGCTGT CACGTTATAG TCGTCAACAA GATATTGTCA TTGGATCTCC TATCGCTAAT CGTAACCGAG TAGAAATAGA ACCATTAATC GGTTTTTTTG CTAACACCTT AGTGTTAAGA ACAAACTTAT CAGGAAAGCC CGCTTTTAGC GAATTACTCA ACCAAGTCAG ACAAGTATGT TTAGAAGCTT ATACACATCA AGATATTCCT TTTGAGAAAT TAGTTGAAGC CTTACAACCC GAAAGAAATT TAAGTCAAAC CCCTTTATTT CAAGTCATGT TTGTTTTACA AAATGCCAAC TATGAAGAAC TCAAGTTATC AGATTTTACC TTCAGATTTC TTCCCAAAGA AAATACTATT GCAAAATTTG ATTTAACCTT GTCAATGTTA GAAAGCCAAG AAAAATTAAC AGGAGAAATA GAATACAATC AAGATTTATT TAACTGTTCA ACTATTGCTA GAATGGCAGA ACATTTTTTA GAATTGTTAG GAGAAATTGT TAGAAATCCT CAAAAATCTA TTACTGAATT GTCTTTCTTA ACAGAATCCG AACAAAACCA ATTATTAATT CATTGGAATA ATACAAAAGT TGAATATTCT AAAACTCAAT GTATTCATCA GTTATTTGAA GAACAAGTTA TTAGAACTCC TGATGCGATC GCCCTTGAAT TTGAAGGAAT AAGCTTAACC TATTTTGAAT TAAATCAGAA AGCGAATCAG TTAGGCCATT ACTTACAAAA ATTAGGAGTA AAACCAGATA GTTTAGTCGG AATTTGTGTC AAGCGATCGC TAGATATGAT CATTGGGATT TTGGGGATTC TCAAAGCAGG AGGGGCTTAT ATTCCCATCG ATCCAACCTA TCCCAAAGAA CGGATTGCTT ACTTATTAGA AGATGCTCAA ATTGACCTAT TATTAAGCCA AGCAGGGTTA TCAACTGAAC TACCCCAGAG TCAAACAACA GTTATTAACT TAGATGAAAA TTGGTCAGAA ATTGCCTTAG AAAGTTGCAA TAATGTCCTG AGTCAAGTCA CTCCCCAAAA CTTAGTTTAT ATCATCTATA CATCAGGTTC CACAGGAAAA CCCAAAGGGG TAATGATTGA ACATCAATCC TTAGTCAATT TCACTAAAAC TGCTATTGAT ATCTATAACA TTACGCCAAA AGATAGGGTT TTACAATTTG CTTCTATTAG TTTTGATGCC GCAGCCGAAG AAATCTATCC TTGCTTAAGT TCTGGGGCAA CTTTAGTATT GAGAACTGAC GAAATGATTG CCTCAACCGA TACCTTTTTC CATCAATGTC AAGCTAAACA GTTAACAGTT TTAGACTTAC CAACCGCCTA TTGGCAACAA ATTACAACTG AGTTAGAAAA CGCTAATCAA AAGTTACCTA ATACCTTACG TTTAGTAATT ATTGGTGGGG AAAGAGCAAT TCCCGAAAAA ATCGAAACTT GGCACAAAAA AGTTGGAGAT TTCCCAAAAC TTTTAAATAC TTACGGACCT ACTGAATCTA CCGTTGTCGC TACTGTTTAT CCTTTAATTC CATCGGTTAA AATAAAACAA GAAGTTCCCA TCGGAAAGCC AATTAATAAT GTCACAACTT ATATCTTAGA TCCTAACTTA GAACTGGTTC CAATTGGAGT TCCAGGGGAA TTATATTTAG GAGGTATGGG ACTAGCAAAA GGCTATCTTA ATCGCCCTAA ATTAACCGCA GAAAAATTCA TAATTAATCC TTTTAATACT TCAGAAAGAT TATATAAAAC AGGTGATTTA GTTCGCTATC TTTCTGATGG CAATATTGAA TTTTTAGGCA GAATTGACAA TCAAGTCAAA ATCCGAGGAT TTCGTATCGA ATTAGGTGAA ATAGAGTCCA TTTTAAGCCG TCACCCTGAG ATTAAAGAAA CAGTTGTTAT TGTCAGAGAA GATACTCCAG CTCAAAAACG TTTAGTTGCT TATATTACTA GCGATAAAAT TGCTTCTCAA ACTGACAATA ATTGGTATGA AACCTTAAAA CATTATCTCA AGACAAGCCT ACCTGATTAT ATGATTCCCC AAAGTTTTGT TTTGTTAGAA AACCTTCCGT TAACTGTCAA TGGAAAAATC GATTATTCTC GTTTGCCTTG TCCTGATTAT TCATTAATCA ATTCAGACAA AAATTATATT GCTCCTCGTA CCCCTATTGA AGCTACCTTA GCCCAAATTT GGTCAGATAT TTTAAAGCAT CAAAACATTA GCATTAACCA CAACTTTTTT GATTTAGGGG GAGACTCTAT TATCAGTATT CAAGTGATTG CCCGCGCTAA GCAAGCCGGG ATTAAAATCA AACCTAAACA ACTCTTTGAA TATCAAACTA TCGCTGAATT AGCTGCTGTC GCCAAGGTTG ATCAATCTTC ATTAAATGAA CAAGAGTTAA TAACAGGTTC TGTTCTACTT ACTCCTATTC AACACTGGTT TTTTAATCAG AATTTAGTAG AATCTCATCA TTGGAACCAA GGTATTTTAT TAGAAGTAGA AGCGGATATA AATATTAACG ATTTACAGGA AGCTATTAAA CATTTATTAA TTTATCATGA TGGTTTACGA TTGCGGTTTG AGTTCGTTAA TAATCATTGG CAGCAAATTT ATAGTCATCC TCAAGATAGC ATTCCTTTAG AGATTGTTGA TCTTTCAAAT ATGTCACCTA ATCAACAATC TAATCTGATT AAACAAAAAG CCAATGAATG CCAAAGTAGT CTTGATTTAA GTCAAGGTTT AGTATTCAAA TCTCTATTTT TTTACTTAGG AAATAATCAG CCAAATCGTT TATTAATCAT TATCCATCAC TTAGTCATAG ATGGAGTTTC TTGGCGAATT TTACTTGAGG ATTTAGTGAC AGTCTACCAA CAACTGCATC AAGAACAAAC AATCCAACTT CTTCCCAAAA CGACTTCTTT GCAAAAATGG AGTCAAAAAC TTTATGATTA TGCCCAATCA GAAAAAATCC AGAAAGAATT AGATTATTGG CTAACCCTAT CACAAATAAA AATCAACTCT ATTCCTGTAG ATTATCAGGT TAGTTTAACC CAAAATACCG TCGCTTCTAC CCAACAAATA ACAGTAGGGT TAAACGCCGA AAAAACTCGC GCCTTACTTC AAGATGTTTC GGCTGTTTAC AATACTCAAA TTAATGATTT ATTATTAACA GCTTTAGTTC AAAGTTTTGC TGGTTGGACT GGTGACTTTT CTCTTTTAAT AGAATTAGAA GGTCATGGAC GAGAAGACTT GTTTGACGAT GTTGATTTAT CCAGAACAAT AGGATGGTTT ACCTCAGTTT TTCCTGTCTT ATTAACCTTA CCTCAAAGTC AGGATTTAGG AGACAGATTA AAAAGTGTTA AAGAACAACT AAGACGCATT CCACAAAAAG GAATTAATTA TGGAATTTTA CAGTATTTAA CAGACAATCA ACTTATCAAA AGCCAATTGA ACCAAATGCC AAAACCACAA ATCAGTTTTA ACTACTTAGG ACAATTTTAT CAACAGATCT CTTCACCTCC TTTACTTAAT TTAGCGCAAG AACCCATTGG GTTTATGCGG AGTCCTAAAG GAATTAATCA TCATCTGATT GAGATTAATG CTTGGATTAT GTCAGAAAAA TTAGAAATTA TCTGGAGTTA CAGTGAAAAT TTGTATTACA AAAACACAAT TGAAAAATTA GCTAAAGGGT ATAAAACAGC CCTAGAAAAA CTCATTAGTT ATTGTCAATC TGCGGAAGCT GGTGGTTATA CCCCTTCAGA TTTTCCTGAA GCTAATTTAA GTCAAGAGGA ATTAGATCAA CTCTTTTTAG AATTTAGTTG A
|
Protein sequence | MNTEQKFTSI NSYLQTFNPE DVFIFPTSSA QARLWFLAEL EPDSSVYNEQ VIIELQGQFN QTALEKSFNE IVRRHEILRT FFTTDQGKPV QVILPYLSLE FPVIDLSHLS SKEQETIIKE HTINEGQKPF DLTQIPLIRI IVLKLHPEKH LLLITLHHII IDGWSGGVLM KELGLLYEAF WQKKPSPLPA LSIQYADFTI WQNDYLQSEK LKQQLSYWQE KLTSIPPLLE LPTDYHRPPM QTFKGQCQTF ELTPELSYQL KQLSQKNGST LFMTLLAAWT ILLSRYSRQQ DIVIGSPIAN RNRVEIEPLI GFFANTLVLR TNLSGKPAFS ELLNQVRQVC LEAYTHQDIP FEKLVEALQP ERNLSQTPLF QVMFVLQNAN YEELKLSDFT FRFLPKENTI AKFDLTLSML ESQEKLTGEI EYNQDLFNCS TIARMAEHFL ELLGEIVRNP QKSITELSFL TESEQNQLLI HWNNTKVEYS KTQCIHQLFE EQVIRTPDAI ALEFEGISLT YFELNQKANQ LGHYLQKLGV KPDSLVGICV KRSLDMIIGI LGILKAGGAY IPIDPTYPKE RIAYLLEDAQ IDLLLSQAGL STELPQSQTT VINLDENWSE IALESCNNVL SQVTPQNLVY IIYTSGSTGK PKGVMIEHQS LVNFTKTAID IYNITPKDRV LQFASISFDA AAEEIYPCLS SGATLVLRTD EMIASTDTFF HQCQAKQLTV LDLPTAYWQQ ITTELENANQ KLPNTLRLVI IGGERAIPEK IETWHKKVGD FPKLLNTYGP TESTVVATVY PLIPSVKIKQ EVPIGKPINN VTTYILDPNL ELVPIGVPGE LYLGGMGLAK GYLNRPKLTA EKFIINPFNT SERLYKTGDL VRYLSDGNIE FLGRIDNQVK IRGFRIELGE IESILSRHPE IKETVVIVRE DTPAQKRLVA YITSDKIASQ TDNNWYETLK HYLKTSLPDY MIPQSFVLLE NLPLTVNGKI DYSRLPCPDY SLINSDKNYI APRTPIEATL AQIWSDILKH QNISINHNFF DLGGDSIISI QVIARAKQAG IKIKPKQLFE YQTIAELAAV AKVDQSSLNE QELITGSVLL TPIQHWFFNQ NLVESHHWNQ GILLEVEADI NINDLQEAIK HLLIYHDGLR LRFEFVNNHW QQIYSHPQDS IPLEIVDLSN MSPNQQSNLI KQKANECQSS LDLSQGLVFK SLFFYLGNNQ PNRLLIIIHH LVIDGVSWRI LLEDLVTVYQ QLHQEQTIQL LPKTTSLQKW SQKLYDYAQS EKIQKELDYW LTLSQIKINS IPVDYQVSLT QNTVASTQQI TVGLNAEKTR ALLQDVSAVY NTQINDLLLT ALVQSFAGWT GDFSLLIELE GHGREDLFDD VDLSRTIGWF TSVFPVLLTL PQSQDLGDRL KSVKEQLRRI PQKGINYGIL QYLTDNQLIK SQLNQMPKPQ ISFNYLGQFY QQISSPPLLN LAQEPIGFMR SPKGINHHLI EINAWIMSEK LEIIWSYSEN LYYKNTIEKL AKGYKTALEK LISYCQSAEA GGYTPSDFPE ANLSQEELDQ LFLEFS
|
| |