Gene Information |
Locus tag | PHATRDRAFT_47104 |
Symbol | NRPS |
ID | 7202016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 408261 |
End bp | 412469 |
Gene Length | 4209 bp |
Protein Length | 1367 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | non ribosomal peptide synthase |
Protein accession | XP_002181204 |
Protein GI | 219121710 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAGAGAGA CGTCCTATCA TCGTTTCTCG AATCGCATTG ACGGCGAAGG CTACTAACCC ACTCCTTCGT ACGTATTCAC TGCTCCCTGT CAACCATCAT TCATCATGAC TCAGGACGAG CCGAAGTGCA TTCATGTTTC GTTTCACGAT CAAGCGGCTC AGACTCCTGA CGCCGTCTGT CTCATCGAAG AAGATCTGAC ATTCACTTAC GCCGAGGTCC AACGTCGCGT GATTCTACTG GCAAAGGAAC TCCGCGACAA TGGTTCCTGT ACGAACGCTG TTGTTGCCAT TTTCATGGAA CCTTGTGCGG ACTATATCAT CTCCATGTTG GCTGTGCTGA CGGCGGGGGC GGCGTACGTG CCGTTGGAAC TCGCTTATCC AATCACCATG TTGCAGCGTG TGCTGCACGA TGCTACACCT GTGGTGGTGG TGACTAAACA GGAACAACGA GCACTGCTGC CCGTGACAAA CACGGCCTTG GCGGTTCTTT GTCTCGACGA TAACGAACAT CACGAGCTGC AGGAAACTGC CGGACAGCCA GAATCACAGG CAGAACTATT GCAGACGTAT CAGTCCTTTC CTCCAGTTTC GCTGGACGAT CTCGCCTTCA TTGTGTACTC CAGTGGTACT ACGGGTCAAC CCAAAGGTAT TGCAAATCCG CACCGAGCTC CGGCCCTTTC GTACCGTTGG CGGTTTGACG AATTCGTCGA CCCTGGTCCA GGCAGTATTG TAGCGTGCAA CGTTTTCTTT GTTTGGGAAG CCCTGCGAGC CGTCATGCGA GGCGGAGCTG TGGTCCCCGT TCCCGCTTCA ATCGTCTTTG ATGGCGAAGC CTTATCTGTC TTTCTGCATC AACACAGCGT TACGGAAATG CTATTTACAC CTTCCCTTTT GGAAAACTTT TTCAATACCA TGTCGGAAGC CGATTTGCGA GCGCGATTGG TTGCCTTGAA GACAATTTTC CTTAATGGGG AAGTCGTCAC CCTGAATTTG CGCGAACGTT GTTTCCGTTT ACTACCCTCC GTTCGCTTCA TCAATCTGTA TTCGATTAGT GAGTGTCACG AAGTGGGTGC TGTCGATTTG CGCGAAATAG ATCTGAATCT TTCCACCAAG TATTGTCCGA TTGGTGCCCC ATGTACCTAT TCACCTGCAT ACATTCTAGA CGATGAAGGA AGACACGCTG TTGCACCTGG TGATGCGGGC GAACTCTACA TTGGCGGAGA CATGTTGGCA GTGGGTTATT TGAATCTACC TGAACTAACG GCCACCCGAT TCGTGCCGGA TCCTTTTCGA CCTGATGAAG GGTGCATGTA CCGGACAGGA GATCGTGCGC GAATGTTGGA AAACGGACAG CTAGAAATTC TTGGTCGCTG TGATTTTATG GTGAAAATTC GTGGATATTC TATCGTGCTA GGCGCCGTGG AAGCGGCACT GGTCGAAACC GTCTCTTTGT CGTCGTGTGT GGTTGTCGCC GACGGAGAAG AGGGCGAAGA TAAACACCTG GTGGCCTATC TGGTGCGCGC ACCCCATGAG GATGTTGAAA CACGCCTCAG CCACTGGTCC ATTGATACTC GTACCGGTGC TTGCCCAGAA ATTCGCCGCG CAGTCGACGG CGCCTTGCCA CATTACATGG TTCCTAGTGT TTTTGTAGAA GTTGAAACAT TGCCAGTCAG TGCGGTCGGA GCAAAACTTG ATCGCAAGGC ATTACAGGCA CAATCGGCCG ATCGCAGGGC CATGCTCCGG TCCTTGCAAT TGTCAGCCGA AACCCACACA ACCCCGTTAC ATACGGCTAC TAGTCATCAG 
CCAGCACGCT GGAAGCGCGT GGCGAAACAT TTACGGGTAC CGCATGGGTC GAGTCGAGAA GATGTGGAAG ATGTCATGCT CATTTTATGG GAAGTTGTTC TTGATCGCGA GCCAGGCATG TTGGACAGTA ATTCCGACTT TCACGAGCAC GGAGGCCATT CGCTTAGTGC TGCACGACTC GTCTCTTTGA TGAATAAAAC CTTCTCTTGC CGACTACTCG CAGTACAGCT GATGCAAGGA ATGTCCATAG GCACAGCAAC AGATGCTGTA GTGGCATCTT GGTTGGAAGA CCCGATCTCC AATGGTGGGG AATCCGGAAG CAATCGTGTA CATCAAATGA ATGGAAGCGG CGGGACGATT CCGAACGGCG CGTTGAGGAC AGCAGATGAA GATCAGATTA TCCAACAAGT ACGTGGAGCT GCGGTCTTGC CGGAAGATAT TATACCAAAG TCTCAGGGAT TTCCGACTCG TGGTCTCGGC GAGAGCAAAG AAGTATTTTT GACTGGATCC ACAGGCTTTC TCGGAGCTCA CGTGCTGGCT GAGCTCCTAC TCAAATATCC GTCCGCGACA GTGGTATGTC TGGCTCGCTC CAAAGATCCT AAAGTTGTTC AGATTAATCT GGAACGCTAC AAGCTGTGGC AACCAGAATT TTCTACTCGA ATTAAAGCCG TCAGCGGAGA TTTGTCGCTT GCGAAGCTTG GCTTGGATCT AAGCAGCTGG AAGCAAATAA CACAGGCTGC TGATGCTGTC GTCCATTGCG GAGCTGCTGT GTCACTAACA TCTCCGTATG CAATGCTTGA AGCTGTGAAT GTGTACGGCA CACTGAATAT TATTCGTCTT GCTTGTGAAT GCAAAGCCGG CACACCTCTT ATCTATGTCT CGTCCAACGG AATTTTCCCG TGTGACAAGG GCAAAGATGA AATTTTTCTT GAAAATGATG ATGTTGGGTG CCTGCCGGAT CGACTTGGAG CCATGAACGG TTATGGGCTT AGCAAATGGG TTGCAGAGCA GCTTGTTGTC GCTGCGCACA AGCGAGGGCT CCCCACAATG ACAATTCGTT TTGGCAATCT AGGATGGCAA TCAACTTCTG GGATTGGTAA CTCTTTGGAT TTTCAGAGTA TAATTCTAAA TGGCGCTCGG CGAATGGTGG TCCGGCCTCG TGTAAAAGGG TGGAAATTCG AAATCACGCC AATCGATTTT GCCGCAGCAG CGCTCGTCGG TCTTGCAGAC ACTGCTATAC ACCTAAAAGC CGGGTCTATC TTTAATTGTG TCCAGTCAGA ACTTGTCGAT GCAGACCGTG TCTTTGGTTG GGTGTCCGAG AGCGATACCC TTTCTCTCTT GGCGCTTGAC TTCGAAGACT GGCAACAGCG GGTAGACGAG GCGAGCAACG ACGACCTGTC GCTATCCACA TTGCAGGCCT TTGCCATGGG GCTCCCAGGT GGAGCCTCGT ACTTATCCGA ATGTGCACAT CTAGATTGCA GCAAGTTCGA TGCAGCCGTA GCCTCGCTTC ATCCCCCGTT ACGGCGTCTT GGTCCTTCGG AACTTTCGGA GTATTTCAAA ATCTTCCTTA GCGCCAACCC GATTATATCG TCTGTGGCGG CCGACAGCGT CATAAAGCCG TCTGCGGTCG ATCCCTCTGT TTCAACTGAA CATCAAGGTC CTCTGGCTGG TCAAGTTGCC GTCGTTACAG GCGCCTCGTC CGGAATTGGT CGAGCAATCG TCCTGTCACT GGTCCAAGCC GGATGTAATG TTGCTATGGC TGCTCGTAGA TTATCTGAGC TCGAAAAGAC TCAAAAGGAA GTAGCTGAAG 
CGTGCAGCGG CTCTCCGGTT AAGATGATGT GCGTACGTAC GGACGTTACG AAGCGCGACG AAGTGGCTCA TTTAGTACAG GTTGTAGAAG TTTCTCTGGG GCCAATTGAT ATCATGGTAA ACTGCGCCGG GGTCATGTAC TTCACTTTGA TGAAAAATGT AGTCTGGGAT CAGTGGGAAG CGCAAGTGGA TGTCAACTGT AAGGGAACGA TGTACGGAAT CGGATCTGTA CTTCCCAGAA TGCTCGATCG AGGAAAAGGG CACATCGTGA ACATTACAAG TGATGCCGGC CGCAAGGCGT TTCCTGGGTT GGCGGTGTAC TCTGGTTCAA AGTTTTTTGT CGAAGGGGTG AGCCAGGCAC TTCGCGCGGA GACTGCCTCT ACAGGGCTCC GAGTGACCTG TATTCAGCCT GGTAACGTGG AGACTCCTTT GCTCTCGAAA TCAACCGATC CCGATGGGCT CGCAGAATAT GGGACACCAA CTGGCGCGAA GGTTCTCGAG CCGGCAGATA TAGGCAGGGC TGTCGTATAC GCCGTGTCCC AGCCTGAGTG GTGTGCAGTA AACGAGATTC TCGTCGAACC TCGAGACGAG CCCGCCTAA
|
Protein sequence | MTQDEPKCIH VSFHDQAAQT PDAVCLIEED LTFTYAEVQR RVILLAKELR DNGSCTNAVV AIFMEPCADY IISMLAVLTA GAAYVPLELA YPITMLQRVL HDATPVVVVT KQEQRALLPV TNTALAVLCL DDNEHHELQE TAGQPESQAE LLQTYQSFPP VSLDDLAFIV YSSGTTGQPK GIANPHRAPA LSYRWRFDEF VDPGPGSIVA CNVFFVWEAL RAVMRGGAVV PVPASIVFDG EALSVFLHQH SVTEMLFTPS LLENFFNTMS EADLRARLVA LKTIFLNGEV VTLNLRERCF RLLPSVRFIN LYSISECHEV GAVDLREIDL NLSTKYCPIG APCTYSPAYI LDDEGRHAVA PGDAGELYIG GDMLAVGYLN LPELTATRFV PDPFRPDEGC MYRTGDRARM LENGQLEILG RCDFMVKIRG YSIVLGAVEA ALVETVSLSS CVVVADGEEG EDKHLVAYLV RAPHEDVETR LSHWSIDTRT GACPEIRRAV DGALPHYMVP SVFVEVETLP VSAVGAKLDR KALQAQSADR RAMLRSLQLS AETHTTPLHT ATSHQPARWK RVAKHLRVPH GSSREDVEDV MLILWEVVLD REPGMLDSNS DFHEHGGHSL SAARLVSLMN KTFSCRLLAV QLMQGMSIGT ATDAVVASWL EDPISNGGES GSNRVHQMNG SGGTIPNGAL RTADEDQIIQ QVRGAAVLPE DIIPKSQGFP TRGLGESKEV FLTGSTGFLG AHVLAELLLK YPSATVVCLA RSKDPKVVQI NLERYKLWQP EFSTRIKAVS GDLSLAKLGL DLSSWKQITQ AADAVVHCGA AVSLTSPYAM LEAVNVYGTL NIIRLACECK AGTPLIYVSS NGIFPCDKGK DEIFLENDDV GCLPDRLGAM NGYGLSKWVA EQLVVAAHKR GLPTMTIRFG NLGWQSTSGI GNSLDFQSII LNGARRMVVR PRVKGWKFEI TPIDFAAAAL VGLADTAIHL KAGSIFNCVQ SELVDADRVF GWVSESDTLS LLALDFEDWQ QRVDEASNDD LSLSTLQAFA MGLPGGASYL SECAHLDCSK FDAAVASLHP PLRRLGPSEL SEYFKIFLSA NPIISSVAAD SVIKPSAVDP SVSTEHQGPL AGQVAVVTGA SSGIGRAIVL SLVQAGCNVA MAARRLSELE KTQKEVAEAC SGSPVKMMCV RTDVTKRDEV AHLVQVVEVS LGPIDIMVNC AGVMYFTLMK NVVWDQWEAQ VDVNCKGTMY GIGSVLPRML DRGKGHIVNI TSDAGRKAFP GLAVYSGSKF FVEGVSQALR AETASTGLRV TCIQPGNVET PLLSKSTDPD GLAEYGTPTG AKVLEPADIG RAVVYAVSQP EWCAVNEILV EPRDEPA
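The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region ends with a single stop codon. The function names are hypothetical and are not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(408261, 412469)   # reported Gene Length: 4209 bp
cds_len = coding_length(1367)            # 1367 aa plus one stop codon
upstream = gene_len - cds_len            # bases not accounted for by the ORF

print(gene_len, cds_len, upstream)
```

Under these assumptions the coordinates reproduce the reported 4209 bp, and the open reading frame accounts for 4104 bp of it. The remaining 105 bp match the stretch of the listed gene sequence that precedes the first ATG (the codons ATG ACT CAG translate to MTQ, the start of the protein sequence).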