Gene PHATRDRAFT_47104 details

Gene Information

Locus tag: PHATRDRAFT_47104
Symbol: NRPS
ID: 7202016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 408261
End bp: 412469
Gene Length: 4209 bp
Protein Length: 1367 aa
Translation table:
GC content: 52%
IMG OID:
Product: non ribosomal peptide synthase
Protein accession: XP_002181204
Protein GI: 219121710
COG category:
COG ID:
TIGRFAM ID:
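
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the coding region ends with a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 408261
END_BP = 412469
GENE_LENGTH_BP = 4209
PROTEIN_LENGTH_AA = 1367


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
noncoding = gene_len - cds_len  # bases in the span not accounted for by the CDS

print(gene_len == GENE_LENGTH_BP)  # do the coordinates agree with the stated length?
print(cds_len, noncoding)
```

Under these assumptions the span works out to 4209 bp, matching the Gene Length field. The 1367-residue protein plus a stop codon accounts for 4104 bp, leaving 105 bp. This is consistent with the listed gene sequence, in which the ATG encoding the leading M of the protein begins at position 106.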


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CACAGAGAGA CGTCCTATCA TCGTTTCTCG AATCGCATTG ACGGCGAAGG CTACTAACCC 
ACTCCTTCGT ACGTATTCAC TGCTCCCTGT CAACCATCAT TCATCATGAC TCAGGACGAG
CCGAAGTGCA TTCATGTTTC GTTTCACGAT CAAGCGGCTC AGACTCCTGA CGCCGTCTGT
CTCATCGAAG AAGATCTGAC ATTCACTTAC GCCGAGGTCC AACGTCGCGT GATTCTACTG
GCAAAGGAAC TCCGCGACAA TGGTTCCTGT ACGAACGCTG TTGTTGCCAT TTTCATGGAA
CCTTGTGCGG ACTATATCAT CTCCATGTTG GCTGTGCTGA CGGCGGGGGC GGCGTACGTG
CCGTTGGAAC TCGCTTATCC AATCACCATG TTGCAGCGTG TGCTGCACGA TGCTACACCT
GTGGTGGTGG TGACTAAACA GGAACAACGA GCACTGCTGC CCGTGACAAA CACGGCCTTG
GCGGTTCTTT GTCTCGACGA TAACGAACAT CACGAGCTGC AGGAAACTGC CGGACAGCCA
GAATCACAGG CAGAACTATT GCAGACGTAT CAGTCCTTTC CTCCAGTTTC GCTGGACGAT
CTCGCCTTCA TTGTGTACTC CAGTGGTACT ACGGGTCAAC CCAAAGGTAT TGCAAATCCG
CACCGAGCTC CGGCCCTTTC GTACCGTTGG CGGTTTGACG AATTCGTCGA CCCTGGTCCA
GGCAGTATTG TAGCGTGCAA CGTTTTCTTT GTTTGGGAAG CCCTGCGAGC CGTCATGCGA
GGCGGAGCTG TGGTCCCCGT TCCCGCTTCA ATCGTCTTTG ATGGCGAAGC CTTATCTGTC
TTTCTGCATC AACACAGCGT TACGGAAATG CTATTTACAC CTTCCCTTTT GGAAAACTTT
TTCAATACCA TGTCGGAAGC CGATTTGCGA GCGCGATTGG TTGCCTTGAA GACAATTTTC
CTTAATGGGG AAGTCGTCAC CCTGAATTTG CGCGAACGTT GTTTCCGTTT ACTACCCTCC
GTTCGCTTCA TCAATCTGTA TTCGATTAGT GAGTGTCACG AAGTGGGTGC TGTCGATTTG
CGCGAAATAG ATCTGAATCT TTCCACCAAG TATTGTCCGA TTGGTGCCCC ATGTACCTAT
TCACCTGCAT ACATTCTAGA CGATGAAGGA AGACACGCTG TTGCACCTGG TGATGCGGGC
GAACTCTACA TTGGCGGAGA CATGTTGGCA GTGGGTTATT TGAATCTACC TGAACTAACG
GCCACCCGAT TCGTGCCGGA TCCTTTTCGA CCTGATGAAG GGTGCATGTA CCGGACAGGA
GATCGTGCGC GAATGTTGGA AAACGGACAG CTAGAAATTC TTGGTCGCTG TGATTTTATG
GTGAAAATTC GTGGATATTC TATCGTGCTA GGCGCCGTGG AAGCGGCACT GGTCGAAACC
GTCTCTTTGT CGTCGTGTGT GGTTGTCGCC GACGGAGAAG AGGGCGAAGA TAAACACCTG
GTGGCCTATC TGGTGCGCGC ACCCCATGAG GATGTTGAAA CACGCCTCAG CCACTGGTCC
ATTGATACTC GTACCGGTGC TTGCCCAGAA ATTCGCCGCG CAGTCGACGG CGCCTTGCCA
CATTACATGG TTCCTAGTGT TTTTGTAGAA GTTGAAACAT TGCCAGTCAG TGCGGTCGGA
GCAAAACTTG ATCGCAAGGC ATTACAGGCA CAATCGGCCG ATCGCAGGGC CATGCTCCGG
TCCTTGCAAT TGTCAGCCGA AACCCACACA ACCCCGTTAC ATACGGCTAC TAGTCATCAG
CCAGCACGCT GGAAGCGCGT GGCGAAACAT TTACGGGTAC CGCATGGGTC GAGTCGAGAA
GATGTGGAAG ATGTCATGCT CATTTTATGG GAAGTTGTTC TTGATCGCGA GCCAGGCATG
TTGGACAGTA ATTCCGACTT TCACGAGCAC GGAGGCCATT CGCTTAGTGC TGCACGACTC
GTCTCTTTGA TGAATAAAAC CTTCTCTTGC CGACTACTCG CAGTACAGCT GATGCAAGGA
ATGTCCATAG GCACAGCAAC AGATGCTGTA GTGGCATCTT GGTTGGAAGA CCCGATCTCC
AATGGTGGGG AATCCGGAAG CAATCGTGTA CATCAAATGA ATGGAAGCGG CGGGACGATT
CCGAACGGCG CGTTGAGGAC AGCAGATGAA GATCAGATTA TCCAACAAGT ACGTGGAGCT
GCGGTCTTGC CGGAAGATAT TATACCAAAG TCTCAGGGAT TTCCGACTCG TGGTCTCGGC
GAGAGCAAAG AAGTATTTTT GACTGGATCC ACAGGCTTTC TCGGAGCTCA CGTGCTGGCT
GAGCTCCTAC TCAAATATCC GTCCGCGACA GTGGTATGTC TGGCTCGCTC CAAAGATCCT
AAAGTTGTTC AGATTAATCT GGAACGCTAC AAGCTGTGGC AACCAGAATT TTCTACTCGA
ATTAAAGCCG TCAGCGGAGA TTTGTCGCTT GCGAAGCTTG GCTTGGATCT AAGCAGCTGG
AAGCAAATAA CACAGGCTGC TGATGCTGTC GTCCATTGCG GAGCTGCTGT GTCACTAACA
TCTCCGTATG CAATGCTTGA AGCTGTGAAT GTGTACGGCA CACTGAATAT TATTCGTCTT
GCTTGTGAAT GCAAAGCCGG CACACCTCTT ATCTATGTCT CGTCCAACGG AATTTTCCCG
TGTGACAAGG GCAAAGATGA AATTTTTCTT GAAAATGATG ATGTTGGGTG CCTGCCGGAT
CGACTTGGAG CCATGAACGG TTATGGGCTT AGCAAATGGG TTGCAGAGCA GCTTGTTGTC
GCTGCGCACA AGCGAGGGCT CCCCACAATG ACAATTCGTT TTGGCAATCT AGGATGGCAA
TCAACTTCTG GGATTGGTAA CTCTTTGGAT TTTCAGAGTA TAATTCTAAA TGGCGCTCGG
CGAATGGTGG TCCGGCCTCG TGTAAAAGGG TGGAAATTCG AAATCACGCC AATCGATTTT
GCCGCAGCAG CGCTCGTCGG TCTTGCAGAC ACTGCTATAC ACCTAAAAGC CGGGTCTATC
TTTAATTGTG TCCAGTCAGA ACTTGTCGAT GCAGACCGTG TCTTTGGTTG GGTGTCCGAG
AGCGATACCC TTTCTCTCTT GGCGCTTGAC TTCGAAGACT GGCAACAGCG GGTAGACGAG
GCGAGCAACG ACGACCTGTC GCTATCCACA TTGCAGGCCT TTGCCATGGG GCTCCCAGGT
GGAGCCTCGT ACTTATCCGA ATGTGCACAT CTAGATTGCA GCAAGTTCGA TGCAGCCGTA
GCCTCGCTTC ATCCCCCGTT ACGGCGTCTT GGTCCTTCGG AACTTTCGGA GTATTTCAAA
ATCTTCCTTA GCGCCAACCC GATTATATCG TCTGTGGCGG CCGACAGCGT CATAAAGCCG
TCTGCGGTCG ATCCCTCTGT TTCAACTGAA CATCAAGGTC CTCTGGCTGG TCAAGTTGCC
GTCGTTACAG GCGCCTCGTC CGGAATTGGT CGAGCAATCG TCCTGTCACT GGTCCAAGCC
GGATGTAATG TTGCTATGGC TGCTCGTAGA TTATCTGAGC TCGAAAAGAC TCAAAAGGAA
GTAGCTGAAG CGTGCAGCGG CTCTCCGGTT AAGATGATGT GCGTACGTAC GGACGTTACG
AAGCGCGACG AAGTGGCTCA TTTAGTACAG GTTGTAGAAG TTTCTCTGGG GCCAATTGAT
ATCATGGTAA ACTGCGCCGG GGTCATGTAC TTCACTTTGA TGAAAAATGT AGTCTGGGAT
CAGTGGGAAG CGCAAGTGGA TGTCAACTGT AAGGGAACGA TGTACGGAAT CGGATCTGTA
CTTCCCAGAA TGCTCGATCG AGGAAAAGGG CACATCGTGA ACATTACAAG TGATGCCGGC
CGCAAGGCGT TTCCTGGGTT GGCGGTGTAC TCTGGTTCAA AGTTTTTTGT CGAAGGGGTG
AGCCAGGCAC TTCGCGCGGA GACTGCCTCT ACAGGGCTCC GAGTGACCTG TATTCAGCCT
GGTAACGTGG AGACTCCTTT GCTCTCGAAA TCAACCGATC CCGATGGGCT CGCAGAATAT
GGGACACCAA CTGGCGCGAA GGTTCTCGAG CCGGCAGATA TAGGCAGGGC TGTCGTATAC
GCCGTGTCCC AGCCTGAGTG GTGTGCAGTA AACGAGATTC TCGTCGAACC TCGAGACGAG
CCCGCCTAA
 
Protein sequence
MTQDEPKCIH VSFHDQAAQT PDAVCLIEED LTFTYAEVQR RVILLAKELR DNGSCTNAVV 
AIFMEPCADY IISMLAVLTA GAAYVPLELA YPITMLQRVL HDATPVVVVT KQEQRALLPV
TNTALAVLCL DDNEHHELQE TAGQPESQAE LLQTYQSFPP VSLDDLAFIV YSSGTTGQPK
GIANPHRAPA LSYRWRFDEF VDPGPGSIVA CNVFFVWEAL RAVMRGGAVV PVPASIVFDG
EALSVFLHQH SVTEMLFTPS LLENFFNTMS EADLRARLVA LKTIFLNGEV VTLNLRERCF
RLLPSVRFIN LYSISECHEV GAVDLREIDL NLSTKYCPIG APCTYSPAYI LDDEGRHAVA
PGDAGELYIG GDMLAVGYLN LPELTATRFV PDPFRPDEGC MYRTGDRARM LENGQLEILG
RCDFMVKIRG YSIVLGAVEA ALVETVSLSS CVVVADGEEG EDKHLVAYLV RAPHEDVETR
LSHWSIDTRT GACPEIRRAV DGALPHYMVP SVFVEVETLP VSAVGAKLDR KALQAQSADR
RAMLRSLQLS AETHTTPLHT ATSHQPARWK RVAKHLRVPH GSSREDVEDV MLILWEVVLD
REPGMLDSNS DFHEHGGHSL SAARLVSLMN KTFSCRLLAV QLMQGMSIGT ATDAVVASWL
EDPISNGGES GSNRVHQMNG SGGTIPNGAL RTADEDQIIQ QVRGAAVLPE DIIPKSQGFP
TRGLGESKEV FLTGSTGFLG AHVLAELLLK YPSATVVCLA RSKDPKVVQI NLERYKLWQP
EFSTRIKAVS GDLSLAKLGL DLSSWKQITQ AADAVVHCGA AVSLTSPYAM LEAVNVYGTL
NIIRLACECK AGTPLIYVSS NGIFPCDKGK DEIFLENDDV GCLPDRLGAM NGYGLSKWVA
EQLVVAAHKR GLPTMTIRFG NLGWQSTSGI GNSLDFQSII LNGARRMVVR PRVKGWKFEI
TPIDFAAAAL VGLADTAIHL KAGSIFNCVQ SELVDADRVF GWVSESDTLS LLALDFEDWQ
QRVDEASNDD LSLSTLQAFA MGLPGGASYL SECAHLDCSK FDAAVASLHP PLRRLGPSEL
SEYFKIFLSA NPIISSVAAD SVIKPSAVDP SVSTEHQGPL AGQVAVVTGA SSGIGRAIVL
SLVQAGCNVA MAARRLSELE KTQKEVAEAC SGSPVKMMCV RTDVTKRDEV AHLVQVVEVS
LGPIDIMVNC AGVMYFTLMK NVVWDQWEAQ VDVNCKGTMY GIGSVLPRML DRGKGHIVNI
TSDAGRKAFP GLAVYSGSKF FVEGVSQALR AETASTGLRV TCIQPGNVET PLLSKSTDPD
GLAEYGTPTG AKVLEPADIG RAVVYAVSQP EWCAVNEILV EPRDEPA
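
Both sequences are displayed in blocks of ten characters, six blocks per line, so they need to be joined before any analysis. The following is a minimal sketch in Python of one way to do that, with a GC-content helper and a search for the first ATG. The helper names are hypothetical, and only the first two lines of the gene sequence are used here, so the GC value it prints describes that excerpt and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two lines of the gene sequence, pasted as displayed in the record.
excerpt = """
CACAGAGAGA CGTCCTATCA TCGTTTCTCG AATCGCATTG ACGGCGAAGG CTACTAACCC
ACTCCTTCGT ACGTATTCAC TGCTCCCTGT CAACCATCAT TCATCATGAC TCAGGACGAG
"""

seq = clean_sequence(excerpt)
start_index = seq.find("ATG")  # 0-based index of the first ATG, or -1 if absent

print(len(seq), start_index)
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `excerpt` would allow the result of `gc_percent` to be compared with the GC content field of the record. The index returned by `find` is 0-based, so adding 1 gives the position in the 1-based numbering commonly used for sequence coordinates.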