Gene PHATRDRAFT_38066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38066 
Symbol 
ID7202748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp792236 
End bp796795 
Gene Length4560 bp 
Protein Length1519 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181977 
Protein GI219123325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00336128 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAGA AACCATCGTT TTCGGTTGAC ATTTGTGCCA CTTGCATCAC GTCCAAAGCG 
CCTCCTTTGC CGTTGCGAGA GGAACTCCCC TCGTTGGGAT TCGAAATCGT TTACAACGCC
GGTGGTCTGG GAGCGAAGGC GAGCAAACAC GAAACCAAGA TAGCGTTGGA TCAGTATCGG
AAACAATGCC AAACCAAAGA ACTCAAAAAA CGACGATTCT TTCCCAAGAG TTCGAGACGA
ATCAGAGAAA CCCTGTTGGA GCGGAAAAAG ATTGAGCACC AGCTCAAGAT ACCCCATGCT
CTCTCCTATG CAGCATACGA GCATAGATTG GCCCGTCTCG AGGAGAAAAG TTGGACGTCA
ACTTCGACTT CCTTCACGCC CACTACGCAT GCGAGCTCCC ACAAATCGAT ATCGTTATAC
AAGTACAAAG ACGGCAAGAA GCCTTTCACA AAACGATGCA AGGGAAGATG GGGTATTTCG
CTGCGTAGAC GTATAGGCTC GAATCGTAAG AAAGACACGA AAAGTCTTCC TTCCGAAACA
TCGGAATTTT TGCAGACTTC AAGCATCAGG GATACTATCG GTAAGGCCGC CTTCGGAGAC
TTTCTTGTGC CTGTGAATTC ATCATCGCAC GAAAAGGAAA ATGGCGTAGA TTGCGGCATG
TATACCAAGC GCAATCCACT GAAGTGGAAA AGGGCGTTTG ACTCAGAATC TTTCGACAGC
CACAGCAATG ACTCGTCTAT CTCGAGCAAC GGAGACTGCT CCACCAACGA CATCAAGTTG
CAATTTGTCG CTTCTGACGA AATGGAAAAA AAGTTACCTG TTTTCAACAC CTTCAATGAG
ACGGAACGAG AGGGCGCCGC CTTTGCCAAT CAAGGGATGA ACAAGGATGT TTCGCTGATC
ATGGCTGCTG GTGACTCAGC GAAATCATCA GCCAAGCTTT ATGACGATCT TGAGGTGGAG
GAGGCGATTG CCGCGGCGTT TAAAGAATTT GCAAATCCAA AGCTAAACTC AGCATCTATT
TCGCCTATCG CGGCTCAATC GTCAAATAAC CCGTTCAATA TCCATGAAGC ACCTGCTCCG
TGGTCGGGGT CGACTAAATA TGCCATTGAA AATCAATCTG CGGAAGCAAA CGGAATCAGC
TCGAAAGTTG AACCACAAAG ACCGATGGAG AGAGAGCCAA TTCGTGATCG GGACAAATCG
CACATCAAGC AACAAAATTG GTTTTGGTTT GCCAAACATC TGTTTGAAGA CGTTGCAGGA
GATGTCGTCA AAGCTTCTAT TACTTCAACG CCAGGTTTCA AAATAGAAGC AAAAAGTTTA
GAGGTCGCGG ATGAGCCAGT GGTGGTCGAA GAAAGGCGCG CCGAAGTCAA GAAGAGTATT
GCGCATCTAC CTGCTGTGAC TTTGTCGATC AAAACCAAAG CAGGAAGCGA AGGGCTAGGA
GACTCCGGTG GCCCCAATCA GTGCAACACT CGTCTGCCTT TTCTTGAGAT GCTTACAGGA
CTTGGTAGTT GTATGTCCGA CATTTTGGGG GCCGTTCAAG AAACTCCGGA ACTCGCTCAT
GTCAAATGCC AAGACGACGG TAGATTCCCA TTGCATACTC TCTGCAATAG AGACATTATT
GATCGTTCAT CGGTTGCTGA ATCACCACTG GCAGGTGTTT TGCTGGCCGA CATCTTGGAG
TACAAAGAGG TCCTTAGGAA CCTGTTGGAA GCCTACCCTG CGGCAGCGAC TGTTATGGAC
AAGACCGGCG ACCTACCAGT GCATTTGCTT GCCAGGACGC TCATGAAATG GGAAGCCTAC
TGGTATACAA CTGTTTACGC TCAAGCTGCG AAAGTCTTCA ATCCCGATGG AAAAGATGCT
AACGCAATCT CGTCGTTGTA TCATACGATG TCGACATGTG TTGAACTCGT TCTAGAACCC
CTGGTTACAA AAGAGGATTT GTGTCGCCAA CCAGGTAGTG TTGGTAGAAT GTTTCCATTG
CATATCGCAT CTATATTTAC ATGTTCAGTC GAATCACTCC AATCACTCCT CGAAGCATAC
CCGAATGCAG CAAAAAAAAG GTGTGATCTC AACACCCTCA ATACCTTTAC CCCCGACGAT
TCTTTTCCTT TAATTTTGCA TGATTCGCTT TCAACAGACT TTCCAAAATG GGAGGTTGAA
ATCTTCAAGC AAAATGAATC TTATGTGGCA GGCCTTGGTT GCCACAAGTA CGGTGATGAT
ACCGAGAGGT ACCTCCGCCG ATCGGACCTG CTCTTCGCCT ACAACCCAAC CATTGAGCCA
TATAGGCTTG ATGGCGAACG CATAAGACGT CTCGAAAGAA AAATAATTTT TGAAGCAATT
CAAGTCGTCG AAGGGAAAGA AGACTGCCTT AGCAAAGCCA TGCAGCGAAT TTGGGTATGG
CTCTGTACAT TCAAAGAACA TGGAAGCAAG CGACCCACTT ACAGTCAAAG CGTGAAGAGA
ATCGCAAGAA TGTTAACGCT TCGTGCTGTT CAATATCTAG CCTCAATCTC AACTCCACAT
GGAAAATTAG TCTTGGACGA GGCTACTCCT GAATGTTCAA GAGTCATTCA ACTCCGGCTT
GCAGAAAGCC AACCCATTGA CCTATGGGGA AAAAGGGTTC CACAATCTCG TCGTTTGACG
AAGCAAAATG TGAGGCAATT TGTAAAAAAG ATCTTCAATG TTGAGCAGCA GCGCTTCCCT
ACGAGCTTCA TTGTACTACC GTATCGTCTG CAGGTGAATA AAGACGGTTT TAACAGCATT
GGAACACCAG AAGCGATCGA GGTTGCGGAC TTATTCACCA GTTATATGCT ACGTCTGACA
GATCCAAGAG CTCTTCTTCA CTATCTTAAC GTAAAATCAC AGAAACACTA TGGTGAACCA
CTTATCCAGG GAAGCAAGAC GGATGAGGCC CGACAAAATT TTCTCGATTC AATCAAGAGA
GTAGAAGACA ATATGCTTTC TCTATACAAA CCAGGATCGT CGTTTTTATA TCTTCTAGAT
GAAGGAACTG GCTTTCCAGT AGTTCCAGGG AGCACTGACG AGTACCCAAT CATCCTCGCT
GAACCTATGA GCATCATTCC CAAAATATTC CCGCTGATGA TGCCTGGATT GGTCATGATG
CGAGGCGAAC AGTATCTAAT CACATTGTCG AAGGTCCTCC TCGACAGCGA CGTAACGATG
GTACCACGAA ACTGGTTGAC GACTACAGAT AATATCGAAA GAAGGCTATT ATCCGCTGAG
GTTGCGGATT TGACTGACAG TGAAGCGTCG ACATTAGGTG ATCGTATATC TCAGCTTTCG
TCTAGCAACA AGACTCGATG GGAGGGGATT ATGTCACCCA AGAATGGAAA TACTGAATGG
GCTGCGGAAA TGTCTATTCT CAACACTCTG ATGGAAGTAA ACGACCCATC GATGAAGTTT
GCTGGTCTCA AAGTTCAACG TGACGATGAT TCGTCTTTGT TTTGGTCTGT GGACTCAAAC
AATGATTGCT GTAATTCGAC TCTTGCGCTA GGAGCATTGC ACATTGATGA CCTCTCCAAT
AAACAAAAAG AATTGGCCAA AAAGCTGGAA CGCATGAGAG TCGACAGGAA AGATTCGGGT
CCAGATTTTC GATACCCTAG AAAGAACTTC GTTGAGCCAC ATAACGAGAT GCTTAACAAA
CCATTTTGGC AAAGGCTAGC TAACGCTTGC GACTACGTCA AATATCCTGA CAAAGCAGTA
GACTCAGAAA GAAAAAAGCT TCCCTTGAAA ATTGCTATTG ACTGTAAATC AAAATTGGAT
TCTGAAATTT CCTCAGATGA TATGAAAGGG GAGATCCGAA AGAAGTACAG CTTTCTTTTA
TCCGACTTGG CAGTTAAGGA TCGCGATAAT AAGGGACAAG AAGCCTTCTG CTGTCGACAA
CGAGAACGAG GTTCGAGTAG TGAAAGAATT TGGGAGGATG TGACTTCGGA ATTAGACCGT
GCTGGAATGT TTTACGAGGA GAGTGCAATC TTGCACCTGA AAGTCGGCAT TGCTGAAGAA
GCAAAACGAA GCGGGCTTCT CGCCAAGCGA ATCGCTTCGC TCCAGGAAGG CGGGGCTCAT
GTATGTTTTA AGGCGGAAGC ACTAGACACG GAGCTGCCAT ATCACGAAAT TCAAAACGAA
GTGACTGTCC TTCCAGGACT TTCTGATTCC AGGAAATTGT TGATTAGGCT GTTCGACTTG
GAGGACCGTT TGTTGTGTGA CGAGATTGAT ATTCAGCATC TCGCAATTGA GTCCCTTTCT
ATGTTTTACC AGATTGAAGA GGTCGACCCC GACAATCTGT TTGAAGAAAC GAAAGAAGAA
TTCAATAGCC GCGAGAGCGT TTTACATCGT GCACATCCTC GTGTACACTC TGCTTCTTTT
AGAGCAGATG CGAGAGGATT TTGTGGTGAG AAAACGCCTA GTCTAGCCAC GTCGTTGTCT
AATGACCCTG GAAGTTTCGT CAACCCTCTT GACCTGACGA ATCTGAATGA TAATGTAATG
GAAGACATCG CAGTGCCGTG GGTTGTGTAT CACGAATCGA CTGGCCAAAT TGAGTTCTGA
 
Protein sequence
MTKKPSFSVD ICATCITSKA PPLPLREELP SLGFEIVYNA GGLGAKASKH ETKIALDQYR 
KQCQTKELKK RRFFPKSSRR IRETLLERKK IEHQLKIPHA LSYAAYEHRL ARLEEKSWTS
TSTSFTPTTH ASSHKSISLY KYKDGKKPFT KRCKGRWGIS LRRRIGSNRK KDTKSLPSET
SEFLQTSSIR DTIGKAAFGD FLVPVNSSSH EKENGVDCGM YTKRNPLKWK RAFDSESFDS
HSNDSSISSN GDCSTNDIKL QFVASDEMEK KLPVFNTFNE TEREGAAFAN QGMNKDVSLI
MAAGDSAKSS AKLYDDLEVE EAIAAAFKEF ANPKLNSASI SPIAAQSSNN PFNIHEAPAP
WSGSTKYAIE NQSAEANGIS SKVEPQRPME REPIRDRDKS HIKQQNWFWF AKHLFEDVAG
DVVKASITST PGFKIEAKSL EVADEPVVVE ERRAEVKKSI AHLPAVTLSI KTKAGSEGLG
DSGGPNQCNT RLPFLEMLTG LGSCMSDILG AVQETPELAH VKCQDDGRFP LHTLCNRDII
DRSSVAESPL AGVLLADILE YKEVLRNLLE AYPAAATVMD KTGDLPVHLL ARTLMKWEAY
WYTTVYAQAA KVFNPDGKDA NAISSLYHTM STCVELVLEP LVTKEDLCRQ PGSVGRMFPL
HIASIFTCSV ESLQSLLEAY PNAAKKRCDL NTLNTFTPDD SFPLILHDSL STDFPKWEVE
IFKQNESYVA GLGCHKYGDD TERYLRRSDL LFAYNPTIEP YRLDGERIRR LERKIIFEAI
QVVEGKEDCL SKAMQRIWVW LCTFKEHGSK RPTYSQSVKR IARMLTLRAV QYLASISTPH
GKLVLDEATP ECSRVIQLRL AESQPIDLWG KRVPQSRRLT KQNVRQFVKK IFNVEQQRFP
TSFIVLPYRL QVNKDGFNSI GTPEAIEVAD LFTSYMLRLT DPRALLHYLN VKSQKHYGEP
LIQGSKTDEA RQNFLDSIKR VEDNMLSLYK PGSSFLYLLD EGTGFPVVPG STDEYPIILA
EPMSIIPKIF PLMMPGLVMM RGEQYLITLS KVLLDSDVTM VPRNWLTTTD NIERRLLSAE
VADLTDSEAS TLGDRISQLS SSNKTRWEGI MSPKNGNTEW AAEMSILNTL MEVNDPSMKF
AGLKVQRDDD SSLFWSVDSN NDCCNSTLAL GALHIDDLSN KQKELAKKLE RMRVDRKDSG
PDFRYPRKNF VEPHNEMLNK PFWQRLANAC DYVKYPDKAV DSERKKLPLK IAIDCKSKLD
SEISSDDMKG EIRKKYSFLL SDLAVKDRDN KGQEAFCCRQ RERGSSSERI WEDVTSELDR
AGMFYEESAI LHLKVGIAEE AKRSGLLAKR IASLQEGGAH VCFKAEALDT ELPYHEIQNE
VTVLPGLSDS RKLLIRLFDL EDRLLCDEID IQHLAIESLS MFYQIEEVDP DNLFEETKEE
FNSRESVLHR AHPRVHSASF RADARGFCGE KTPSLATSLS NDPGSFVNPL DLTNLNDNVM
EDIAVPWVVY HESTGQIEF