Gene PHATRDRAFT_39357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39357 
Symbol 
ID7195067 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp275296 
End bp278558 
Gene Length3263 bp 
Protein Length1049 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183456 
Protein GI219126421 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.807834 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGG TGCATGCACT CCTTGTCAAG TCGTCGTCCT TGTTCACACA AACCAAGTCT 
CTACGGCTCC CACAGACTGT TCGCGACCAC CAACTCTCGA CATTGGTATC CACTCCACTT
TAATCCTTTT TCAGTAATGT CGACTCCAGT CAAGTCACCG ATGAAGAGCG GCCACAAAGC
AGCGACTCTG AATCATATTT CGCTGTTGAC GCCGTCGTCT ACGGAATGGT CCGTCGATGC
CGTAGGAATT TTGCCACAAT CGCTCATGCC ATTATCTACG TCGTCGTCTT CGAATGCCGA
GTCGGCTGTT TTGGTAGCGT CTCGCTACGG CGCGTGGTCC CTGATTGGTA AACGAGAACG
GGACAGTAGC CAACACGTCA AGATGGTCTT GTTCGTCTGG TACTCGTCGC CGACGGCGTC
CGTGAAAGAG CGAGCGTTGC AAAAGCAAGT ACTGAAACTT TCTCATCCGC ATTTGTCGAC
TTTTGACAGC AGCAGCGGTA GTAGTTGTAG TAGTGCGGAT CCTCCACCAC TCGTACAACT
CGCGACGAGT CCGACCAAAC CGGAAGAAGT CTACGTATAC GTTGCTAACG TGGTCTCGGG
ACGCATCTTG ACCTGGAAAC TATCGCGCCC CGATCTCTCT CGGGTATTAG CACCGCCTCC
TGCAGCCGTT ACATGGCTGG ATGAATTGCA ATCGGTAGAG GAAGGGGCAG AAACCTTGAC
GTCCCTGTCG GCGACCTGGA ACTCCGGCAT GACAACCCCC TTCCTCTTGG TCGGTACCTC
GGCGGGCCAC GTCTACAGCC TGCAGCAAAC ACACGTACCG CTGGCAATAC AGGTGCAACG
TGTCGCAACC TCGTCGCTAT TCGGATCCAT CAACAGCAAC AGTAACGGTC TATGGGGCAG
TATTCTTCAT TCGATGATCC CCGCAAGTCG GAGTGCAGCC ACTGGAGATG CCGTGGTCAC
TACGCTGCTT CACCCTCCGG ATGATGCCAG CAGTAGCAGT AACTATACCC AAGGCATGTT
TCATGTGTTG ACCAAGTCTG GTCGGTTACA AGGGTGGCGT ACGCACAAAC CTACCGAAAG
TCCACATTGG ATCTTTGAGG CTGGTCCGAA GCCAATTGAC CTAGTCGCGC TATTACACGA
CAAAATTAAG GCACCCGCAG TACGCAGTCT TCGCGTCTTG CAAGCAGCGA TTTCACCACA
AGGACACCAT CTTCATTTAC TGGTGCTAAC GGAACACGAC GATGATGAAT ATCGTTTGTA
CTGGAGCGTC TTGACCATGG GTAGTAAAGG CGGACACGGT GATTTTGCCA AGGAAGAACC
CACCCTCACA TTGACCACCG CACACTGGTT GGATCGATTT TACGATCCTC GCAGTGTGAC
GGTGGTCGGA TTGATCGTGG CTGCCAATAA TACTGCCTAC GCTGCCTTTC AAAGCCCAGG
CAGTGCACCC ATAGCCATGG CTCTGTGCGG ACAAGATAAC ACATTCGATG CTATCCACGA
ATGCGACCTC CCTGTCGGTA ATATTCCTGC ATTGATAGTC GATACCATGG TTCCCGATAA
AGTGGTATAC GGCTGTGCCG TTTGGTCTAT TAGTGGTGCC AATGTACGTA TACAGTACCG
TCCAACAACG GCATCACCGG CTCCGTCGAC TGTTCCACAC GGCTCTACCT CGCGAAGCCG
TTCCGCGATT GCTACGTTGA CCACTCATCT ACAATCCGCC TTTTGGAACT ATTATCAACA
TCCGGATCGG TCGGTTCGCC TCCCCCCTTC CCTGATTTCG GCCAACATTG CAGATTTGCA
AGAAGCGGTA GTCGGCGTTG GTTGTGCCTT AGCATCTCGT CAACAAGCAC TGTCGAACGT
CTTGGAGTGG CATTTGGCAT TTCTGAATTT GCTGCAACTA AGTGGCCTGT ACCGCAGCTT
GCCCGAGGTC ACTCAGTGGC AGCTTTTGGC CATTGGACAA GAGTTGACCG TTGCAGACCG
ACTAGGTTCC TTGCCCTCGG ACATGTCAAC CAGAATCCCA TCCTGGCAAT TGGAAGCGTT
GGAGTCAAGA CCCTGGCAAG GTCTGGGGCC TTGGTTCGGA GATCTTGTGG CAAAACACTT
TGGAGCTGGG CAAGATCGCC AGCAAGCACT TGTGGAATGG CTAGTAGCCA TGTTGGAGAC
AGCCGAAGGC TACCGTGAAG AACACGGTCA AAAGACTTAT TACTTGACGT CAGGTAGCCG
AATACCCAAA ATGTCGAGGA TTGAAGAAGT GCCAATTTGG ACGAGTCAGA TTGCTTTGCA
AAGAACTCTA CTAGGGTTGT TGGAAAGTTG GCATGATGGT GGCTTCGTCG GGCACGGTGG
GCAGGCTGTG GTCTTAGCAA CCGGTATTCT GCAGTCATTT GGCGACACGT ACGCCTCTGT
GCCGACCGAG GAAACCAAAT CGACGTACGC AAATGCACAG CGAATGACGA TCGGACTGCT
ACGTCGCCAA ATAAACCCAC CAAACGACGT CGTAGCTTTC CGTCTATCCG TCAAACACCA
TTCCTTTAAA GGCGTGTGCC AGATTGCCTT TGATCACGAA AAGAAAGAGG ACGCGGAAGA
ATTCTCCGTA GTTCCATTGT TTACCGAGTT AGCCCACGCG AAGGACATTT CTACCAGTAT
GTTGTTTCCG GCATTTTTCC TGTATTGGCA TTCGCAACGC CAACATTTTG GCCATGTCCT
CGACTATGGT CAATACTGTC CTGACACGTT TAGGGCATTT TTGGAAAGCA GCGAAGAATT
GCGTCCGTAT CGGTGGATTC AGGCGGCGCG GGCTGGGGAT TTCGAAGGCG TAACGAACTC
ATTACTTCGC AATGCGGAGA AACCGGAAAT TTTGTTGCAC GATGCCCACC TGTCCCTTAG
TCTGGCGAAA CTTGCAAACT CTGTAGTAGA GTCTGAGTCC ATGGATAAGG AACTCGCGGC
GAAGCGCGCT CGACACATTG ATCAAAAGCG AGAATTGGTT AATGCTCAAA ACGAACTCTT
TGATGAGACC GCACCTAGCT CTTGCCTCTG GTCGGCCGAA CGTCTACTCA ACTACGCCTA
TGACAGGGCG AATCAGGCCA AAGATGCGGA AGACAAATGT CGAATCTATT TCACGGCCCT
AGCAATTTGT GCAACGATGG AAGAAATCGA TCAAGTAGAA AAGAACGCTT CTCACGTGTG
GTTCCGTGTT TTGCAAACAG AACTGGACTG GTGGAACAAT CTGATTCAGA GTGCTACTGA
TTTGACAGAT ACAGATATAC TGA
 
Protein sequence
MATVHALLVK SSSLFTQTKL FATTNSRHWY PLHFNPFSVM STPVKSPMKS GHKAATLNHI 
SLLTPSSTEW SVDAVGILPQ SLMPLSTSSS SNAESAVLVA SRYGAWSLIG KRERDSSQHV
KMVLFVWYSS PTASVKERAL QKQVLKLSHP HLSTFDSSSG SSCSSADPPP LVQLATSPTK
PEEVYVYVAN VVSGRILTWK LSRPDLSRVL APPPAAVTWL DELQSVEEGA ETLTSLSATW
NSGMTTPFLL VGTSAGHVYS LQQTHVPLAI QVQRVATSSL FGSINSNSNG LWGSILHSMI
PASRSAATGD AVVTTLLHPP DDASSSSNYT QGMFHVLTKS GRLQGWRTHK PTESPHWIFE
AGPKPIDLVA LLHDKIKAPA VRSLRVLQAA ISPQGHHLHL LVLTEHDDDE YRLYWSVLTM
GSKGGHGDFA KEEPTLTLTT AHWLDRFYDP RSVTVVGLIV AANNTAYAAF QSPGSAPIAM
ALCGQDNTFD AIHECDLPVG NIPALIVDTM VPDKVVYGCA VWSISGANVR IQYRPTTASP
APSTVPHGST SRSRSAIATL TTHLQSAFWN YYQHPDRSVR LPPSLISANI ADLQEAVVGV
GCALASRQQA LSNVLEWHLA FLNLLQLSGL YRSLPEVTQW QLLAIGQELT VADRLGSLPS
DMSTRIPSWQ LEALESRPWQ GLGPWFGDLV AKHFGAGQDR QQALVEWLVA MLETAEGYRE
EHGQKTYYLT SGSRIPKMSR IEEVPIWTSQ IALQRTLLGL LESWHDGGFV GHGGQAVVLA
TGILQSFGDT YASVPTEETK STYANAQRMT IGLLRRQINP PNDVVAFRLS VKHHSFKGVC
QIAFDHEKKE DAEEFSVVPL FTELAHAKDI STSMLFPAFF LYWHSQRQHF GHVLDYGQYC
PDTFRAFLES SEELRPYRWI QAARAGDFEG VTNSLLRNAE KPEILLHDAH LSLSLAKLAN
SVVESESMDK ELAAKRARHI DQKRELVNAQ NELFDETAPS SCLWSAERLL NYAYDRANQA
KDAEDKCRIY FTALAICATM EEIDQIQIY