Gene PHATRDRAFT_47693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47693 
Symbol 
ID7202701 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp537867 
End bp541959 
Gene Length4093 bp 
Protein Length1297 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181930 
Protein GI219123227 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAAACCTAC CGATTAAAAC AATCGGAGCC TCTACGAGGC TTTCCCCCCA ACTTCTACTC 
TTTCGTCGTA TTCTACAAAT ACACTCGTAA GCAGGTAGAA CTTATCAGTT TACTCAGACC
CAACCCAGCT CTAATTTTTT GAGTTTTTGA AACCTCGACA ACGAATGCGT CGTCCGAGGA
AGGTCACTCC CGCTGTTCCG GCCCCTGCCG CAGCGACGGA CTCACCGGCT GATGCCGCGT
CCGCATCCAA AGAGGATGAG GAGTTCGGAG GATTCGACTC CTCCGACGGT GAGGAGCCTT
CGGGCACCGC ACCGCCATCG CCAGCATCTT CGGATGATGA AGGTGATGGC AAAAAGACTG
CAAAGCCTTT GGCTCGCAGC AAGAACACGT CTGACGAAGT CAGTGTGATC GAGAAAAGCG
TCATCGACGC AGAGCCTCAC TTGTCTAAAG ACAGTGACGG TCTCGACTCC GTTCCTCGGC
AAGACCGTGT TGAACGAAAG GCCTTGATGG TCGTCCTCCG TGACGTCATC TGTGTTCCAT
TGTCAGTTGC GGCTGCAATG TTGAACAACG GCATTAAATC ATCTGATGAT TTCCGTCTTC
TCACGAAGGA GGACATCAAT GATCTCTGCA TGCGGCTCAA AATGGGCTCC ATGCATACCA
AGCGAATACT CGTCTTCGCC AAATGGATGC ATCACGCACC CAACTCAGTC GATGTCGCCA
AAGAGTTCAC GGCTTCCGTG CTACGCTTTG AGATGATGAC TAGAGCCGCG GCGTCGTATG
ATAATGTGAC TACGACGGCT GCAAAGGCTG AAAAATCGGC TACTAGCCTC TTGCCTGAAC
CGTTTGATGG TTCGCAGAAA AAGTGGCTCA CTTTTCGTTA CGGTTTCGAA GCGTGGGCAG
GCGCAAGTGG GTCCACTTTT ACCGCGTGCA TCGCGCACCA TTCGGATCGG TATTCGAAAG
CCGACCCAAC CGGACCCCAT ACGTCGCCCC GTGACGTTTC AGATTTGTTT GCACTCTCCC
CAGTTGTCAA CATCACCAGG AACGCAACAA TCTTCTATAC TCTCATGTCG CTAACCAGCG
CTGGGGACGC CTGGGGACTT GTTGAGCCCC ACGAGCACAC TAAGGACGGA CGCAGTGCCT
GGATTTCTCT ATGTGCCTTC TATGAAGGAA CGAGCCAAGT GGGTCTCACT ACCGAGCAGG
CTCGCGCGAC AGTTATGGAG TCGGTGTATA CAGGACTGTC CAAACAGTTT TCCTTCACCA
AATATGTCGC TCGGCATATT TCTGCCAACA ATGCCCTTTT GCGTAACAAG GAGGGCTATT
CGGACGCTCA GAAAACGAAT TTCTTTCTTA AAGGGATTAC TGATCCGGCA CTCCTTCCTT
ATAAGGCAAC TGCCGAAGCG CGACTCGATG ACTGGAATTT TAATCGGGTC GTCAACTACA
TGCGTACGTC CGCGACGAAA CTCAGTTCCA AGGACAGAAG CGACTCACGG AACGTACGTC
AGACAAAGAC CACTGGCAGA GCCACCGGCA ACCAACGTGG TAACGACAAC AAACGGCGTG
GCTCGTCCAA CCGTCCGTCG AACAAGGGGG CTGAGAAACC TTCCCGCCCT CATAAACACG
TCTTACCTCC TGAGCTGTGG GAAGCCCTAA CCCCAGCTAT CAGGGAGAGT ATCTTGAGCG
CAAAACGCAG TATTGCACCC CCTGGCCGTG AGGCCAAAAG GGCTAAATCC TCAGATACAG
ATAACTCTAG TTCAACCGTT GAATCTTATT CACAACTGCC TAGTAGTAAA AAACCTATTC
GTAAACATAC ATGCGAAGAT CACGTCCAAG TAGATTCCAG TACCCCTGAA ACCCTACTTC
GTGACGCACC CACAGACATT TCACCCCACG TCACCACCAA AAAAGTGACA TTTGGTGCAG
GTGTCCTCTT TGGTCGGTAC GCTAATCGCG TATCGTTGAA TCGTATGGTC CGCTCCGGCA
GTCATTTCGA TCAAGCCCCT TGGCGCAAGT CGGATTTCCG ACTTAACGAT GCGACACTAG
TTCGTATTCG TCAGAACCGC TCACGCGGAA CAAAAACTCC CACCAATTAT GGTGAAGCGG
TAATTGACAC TGGTGCAGAC ACCGTCTGCG TCGGTGCCGG GTACTCTGTA TTGTCATACA
CGGGTCGATC AGTCAGCCTT CGCGGTTTTC ATGATGACGG TGAAACGTTT GAACGGATTC
CGGTTGTCAC GGCGGCAACC GCCTATGATT ATGACGACGG AACAACCGTG ATTCTCATCT
TTCATGAGGC ACTGAACCTC GGGCCCACAC AGACCACCTC GCTCATTAAT TTGAATCAAA
TCCGACATGC CGGACATCAA ACCGATGACA TTCCAAAATT TTTGTCGCAA GGCAAATCCC
TTCACGGCAT CGAAACTCTC GACGGTGATT ATATCCCGTT TGAGCTCAAA GGTCGTGCAT
CCCTGTTGTA TTCTCGCGTA CCTACTCAAC ATGAGCTTGA CAACTGTCAG CACATTGATC
TCACTTGCGA TCAACCATGG GACCCCAACA GTAAAGATTG GGAAGAAAAT GAAGCAAAGT
ACACGCGACA CGATCGTTCT CGTCGTGCCT GCTACACCGA CAGCGTACCG GTTGACATTC
TCCCGGATTG GCCTCCACTA CCCGTTTCCC CTGGATCCGT TGTACCGGAT TTCCATAACC
GTGTCATGAA CCCTCGCGAC ATCGTTCGCG AAATCAAATA CGCCACTATC GGTGCGTCCA
TATCCAGCCC TCGGGTGTTG GACGTCGACC GCGATAAATT ACGGCGAATT CTCGGACATG
TGCCGATGGA AGTAGTTGAG CGTACTCTTA CCGCCACTAC ACAACTAGCG GAGCGCACGG
GCGAAATGCC TTTGCATCGT CGTTATAAAA CCAAGTTTGA ACAACTTCGG TATCGACGTT
TGAAATGCAC ACTTTATAGT GATACTTTTA AATCCTCTAT AAAATCCTCG CGTGGACATA
CCCATACTCA GGGTTTTGTC TGTGGTGACT CTTACTTTAT CTATCATTAC CTAATGAAAG
CAGAGTCCGC GGCAGACCAG GGTCTCGCCG AATTCATCCA CAACATCGGT ATTCCTGCAC
AATTGCACAC CGATAACGCG AAAGTGGAAA CACTTAGCAA ATGGAAAAAA TTAACTTCCA
GTCACTGGAT AAAGACGACG GTCACTGAAC CCTACTCTCC GTGGCAAAAT CGTTGCGAAC
ACGAATTTGG TGCTGCGCGC ATTCACACGC GCCTCGTTCT CGAAACCACC AAGTGTCCCG
AACAATTATG GGACTACGCC CTTGCCTACG TCATTTTCGT ACGTAACCAC ACGGCACGAA
AAGCGCTGGC CTGGATCACG CCTATTACTG CGATGACTGG CGACACCCAT GATATTTCTG
AAATCCTGGT TTTCGAATTT TTCGAACCAG TTCAGTATTT TGACAATCCT GATGTCAAAT
TTCCACAGAA TAAAGCCAAA GTCGGCCGTT GGTTAGGTAT TGCCACCAAT GTGGGCCAAG
CCTTGTGCTA CCATATTTTG ACGGACAAGG GTACTGTAAT CACTCGATCT ACTGTTACAC
CTCTCCAAAA CCTCGATTCG TCTGCTCTGC AGACTGCCCT CGCTACTTTT GACGCCACCA
TAAGGGAGAT TTATCAGCCT TCTGATTTCG CCCTCGGTAA CAAAATCAAA GCGCCGGCTT
TCCGCCGTGA CGAAGCGATG AAAGTCGCTC GGCGATCCGA CGATCCCGGC GATGGCAACA
CCCGTAACAG ACACGTGTTA TACGATCTGA ACGAAGGGGA TGACCATATT CAACTGGATC
CCGGGCTCAC GGTTGACGAT TTCTTCGAGA ACGACTCACC GGATCAGGAC CCCACCTCCT
TAATTATTGG TACTGACGTT CTACTCACTT CGGGTGCGGT TCAGCGCCAG GGCCGAGTTA
CCAAGCGCGA TCGCGACGGT ACTCCAGTCC CTAACGACGA CCCTGGAAAT TTCGTCGTCG
AATTCGACGA CGGTACCGAG GAAGTCCACG GTTACCAAGC TCTCCTTGAT GCTGTTTATA
AGCAGGTCGA TGA
 
Protein sequence
MRRPRKVTPA VPAPAAATDS PADAASASKE DEEFGGFDSS DGEEPSGTAP PSPASSDDEG 
DGKKTAKPLA RSKNTSDEVS VIEKSVIDAE PHLSKDSDGL DSVPRQDRVE RKALMVVLRD
VICVPLSVAA AMLNNGIKSS DDFRLLTKED INDLCMRLKM GSMHTKRILV FAKWMHHAPN
SVDVAKEFTA SVLRFEMMTR AAASYDNVTT TAAKAEKSAT SLLPEPFDGS QKKWLTFRYG
FEAWAGASGS TFTACIAHHS DRYSKADPTG PHTSPRDVSD LFALSPVVNI TRNATIFYTL
MSLTSAGDAW GLVEPHEHTK DGRSAWISLC AFYEGTSQVG LTTEQARATV MESVYTGLSK
QFSFTKYVAR HISANNALLR NKEGYSDAQK TNFFLKGITD PALLPYKATA EARLDDWNFN
RVVNYMRTSA TKLSSKDRSD SRNVRQTKTT GRATGNQRGN DNKRRGSSNR PSNKGAEKPS
RPHKHVLPPE LWEALTPAIR ESILSAKRSI APPGREAKRA KSSDTDNSSS TVESYSQLPS
SKKPIRKHTC EDHVQVDSST PETLLRDAPT DISPHVTTKK VTFGAGVLFG RYANRVSLNR
MVRSGSHFDQ APWRKSDFRL NDATLVRIRQ NRSRGTKTPT NYGEAVIDTG ADTVCVGAGY
SVLSYTGRSV SLRGFHDDGE TFERIPVVTA ATAYDYDDGT TVILIFHEAL NLGPTQTTSL
INLNQIRHAG HQTDDIPKFL SQGKSLHGIE TLDGDYIPFE LKGRASLLYS RVPTQHELDN
CQHIDLTCDQ PWDPNSKDWE ENEAKYTRHD RSRRACYTDS VPVDILPDWP PLPVSPGSVV
PDFHNRVMNP RDIVREIKYA TIGASISSPR VLDVDRDKLR RILGHVPMEV VERTLTATTQ
LAERTGEMPL HRRYKTKFEQ LRYRRLKCTL YSDTFKSSIK SSRGHTHTQG FVCGDSYFIY
HYLMKAESAA DQGLAEFIHN IGIPAQLHTD NAKVETLSKW KKLTSSHWIK TTVTEPYSPW
QNRCEHEFGA ARIHTRLVLE TTKCPEQLWD YALAYVIFVR NHTARKALAW ITPITAMTGD
THDISEILVF EFFEPVQYFD NPDVKFPQNK AKVGRWLGIA TNVGQALCYH ILTDKGTVIT
RSTVTPLQNL DSSALQTALA TFDATIREIY QPSDFALGNK IKAPAFRRDE AMKVARRSDD
PGDGNTRNRH VLYDLNEGDD HIQLDPGLTV DDFFENDSPD QDPTSLIIGT DVLLTSGAVQ
RQGRVTKRDR DGTPVPNDDP GNFVVEFDDG TEEVHGR