Gene PHATRDRAFT_45854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45854 
Symbol 
ID7200960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp472830 
End bp478008 
Gene Length5179 bp 
Protein Length1689 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180245 
Protein GI219118957 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0900508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGGAC AACAGCCCAA CACTCCGTTC GGCGGAGGAG GTTTTGGACA ACAGAACCCA 
ACATCCGGTT TTGGAGCGCC AGCTCCAGCC ACCGGGGGTC TGTTCGGTAG TCCAGCGGGG
GCTCCAGCGT TTGGACAAGC TCCTGCTCCA GCGTTCGGAC AGGCTCCTGC CCCAGCATTT
GGTGCCCCTT CGGCTCCAGC GTTTGGACAA GCTCCTGCTC CCGCCTTTGG TTCCCCTGCA
CCCGCATTTG GAGGTGGAGG GTTTGGACAA ACTCCTGCCG CCCCCACTGG AGGACTCTTT
GGACAACCGT CTCCCGCACC AGCCTTTGGG GGCGGTTTTG GACAACCCGC TCCCGCACCG
GCACCCACCT ATGGTGGTAT GTTTGGACAA CCCGCTCCCA CTCCCGCGCC TGGACTCTTC
GGAGCTCCAG CTCCACAAAC CAATACTTCA CCTTTTGGCG GAAATCCGGG TGGTTCCGCT
TTCGGGGCTC CGGCGCCTTC GGGCTTTGGT GCACCCACAT CCTCGCCCTT TGGTTCCACG
GGGACTACTG GTGCCTTTGG AGCAAACACT ACCAGTTCGT TTGGCGCACC CGCACCTGCC
GCTGGTGGCT TATTTGGACA ACCCGCACCG GCTCCAGCCT TTGGCGGTGG TACGTTTGGA
TCGCCGGCTC CCGCTCCCGG CGCCTTTGGC GCACCACCAG CCTCGGGTGG GCTGTTTGGA
CAACCCGCAC CGGCACCCGC AGCCGGCGGC TTGTTCGGAT CCCCCAGTCC CAGCGCTCCC
GGAGCAGAGG GAACCCGGGC GGTCCCTTAT CAAGCAACGA ATCGACAAGA TGGCACCGCC
ACCATTACTC TACGTTCTAT TACGGCCATG CCACAGTACG AGAACAAGTC GTTTGATGAA
TTACGCATGG AAGACTTTTC GCAAGGCAAT CGCGGATCTA CTGTGACTCC ATCCACGAGT
AACGCTTTTA GTGGAGGGTT CGGAGCACCG GCTCCCGCAC CTAGTGGCGG TTTGTTTGGC
GCCCCCGCAC CAGCTCCCTT TGGCGCTCCC GCACCAGCTC CCTTTGGCGC TCCCGCACCA
GCTCCCTTTG GCGCTCCCGC GCCGGCGGGA GGATTGTTTG GCAGTCCTTC TCCAGCACCG
GCTCCGTTCG GGGCGCAGTC GAGTAGCTTA TTTGGCAGCA ACCCTGCACC AGCGCCCTTT
GGTGCACCGG CTCCGTCGAG TGGTCTATTC GGCTCCAATC CAGCACCTGC CCCGTTTGGA
GCCCCCGCTC CTTCCGGGGG TCTCTTCGGG TCTTCCCCCG CACCGGCACC TTTTGGTGCT
CCAGCCGGCG GCGGGCTCTT TGGTAGCAAT CCAACGCCCG CGCCCTTTGG AGCTCCGGCA
CCCAGTGGCG GTCTGTTTGG TGCCACCCCG GCGCCCTTCG GAGCTCCGGC TGGCGGTAGT
CTGTTTGGAT CCAATACAGC TCCGGCACCG GGAGGCTTTG GCTTTGGATC ACCAGCCCCA
GCGCCCGGTG GTAGTCTCTT TGGAGCACCA GCTCCCGCAC CGGGAGGCTT CGGGTATGGA
GCGCCGGCTC CGTTTGGTGC TCCCACACCG GGTTTGTTTG GAGCCCCCGC GCAAGCTCCG
CCGCCTGCCG CTTTGCCGCA AAACGCTGCT ATTATACCAC CTGTTGTCAA TGAAGTCATG
GAACAGCAAT TGCGGGCAAT TGAAAATAAG CAGGCCGAAC TTCAGAAGAG TGAAGCCTGG
AAGGGTAGCG CAACGAAAGA TCCAGTCACG ACACCTACGA GTTTGTCTGA AGCGGACGGA
CTCTTTGCGT CGCGCTACTC GGCTTCGCCT TATGTTACGA CGACCCCTCG ATCAGCCGTC
AAGATTCGAC CCCGTGGATT TCCCAGAAGT GAACCGAGCA AGTCGACAGC CCTTTCGCTC
AGCGCTGTTG GGCGCGACAA TAGCGGTCTC TTGTCACCCG AGTCTCACTT GCGTTCATCC
GTTATGAGTT TGCATATCAA GCCAGAGAGT ATGAATCGCA AATCCAGTTT CCGTTTACAG
ATCAACAAGC CCTCAGCCTC TTCACCGGTG CCGACGCCAT CTGATCCAAA ACCGCAACAA
CCTTCGTTCC TTTCTCCGAC CTTTGCTGAG ACTCTCACGT CGCCTCCACC TGATGTCTCT
CCAACTACCG GGGCCTCTCT GGTGCATGAG TCGCCCCAGC CTTTTACAAC TCCAAATGCA
TCCGCAACCC CCAAGAGCCC TGCCTATGAG TTGTACCAAC AAGTTATCGG CAGTGGAGAG
GCGTCCAGCA AGCCTCAAAG TCAGGTGAAG AAGCCTACCC GTACCTCGGT ACCCACACTA
ACGCGAAAGG GATATATCAT ATCGCCGACT TTGGAGGAAT TGGAAAAGAC TGAGGACGCC
GATCTGGCTG CCGTTAGTGA CTTTAGTGTC AAGCGCCCGG GATTTGGAAT GGTGGAATGG
GAAGGCGACG TGGATGTACG GGGGGCAGAC CTGGATCGGA TAATCACCAT TGATCAAGCA
GATGTATCGG TATATCATGC CGATGAAGCC GAAGGTAGCA AGCCCAAGGT GGGCTCCAAA
TTGAACCGTC CCGCTATTAT TACTTTCTAC AATATTTTTC CGAAAAACGG TGGGGCCAAT
GCATCCAAAG AAGAAAAAGA AAAGCACGCG AAGAAAGTAC AGCGCAGTAC TGCCAAGATA
GGCGCCGAGT TTATGTCCTA TGATCGCAAC AATGGTGTCT GGAAAATCCG AGTTCTCCAT
TTCAGTCGCT ACGGTTTGGA TGACGACTCA GACACCGAAA ACGAAGTGCC CCTGCCGGAG
CAAAACAGCG TGCAGTTTCA ACAACAGACA CCTCCAAACG CTCAATCTCT CCTGCGTCGA
AATCCCACAC CATACAAACC GAGTCGAATT CAATTCGACG AAATGGAAGT TTCCGAAAGC
GCTGATGACA GTGATGTAGT TTGCGTCCAG GACATTCAAA TGACGGACTC GGAAAAAATC
GCCTTGGTTC AAAAACGCGC TGATGAAGCG GCTAAGGAAG TCTTTCACAT TGTACCACAA
CAAGATCCTG ACTATGAGGT GCAACATCCA CTACGGCCCG CGAAGATCAC AACGTTTGAG
AATGTCGGCG ATTGCGATTC CGAGGAGGAA TCTGACTATG TGGTACCACC CGATGGCGAA
GACTGGCATG CAGCTCGATT GGCATCAAGC TTCTGCAGAG GTATTGCAAT TGAATCAGGC
ATGCATTCTT CCTCTACTGA TATGGGCCTG CGTATGGGGC GAGTCTTTCG ACCTTGTTGG
CTTCCTAACG GTTCGCTTCT GAAGTTAAAG CCAAGCAGTT TCAATCGGTC GCCTACACTT
ACCTCTTTGC GTCCTGTCCT TTCTGACTCA TATATTTCTC AACTCACTAG TGAACACCTT
CTGGAAATTC ACCGTTCAGA GTCAGTAGCA CTCGAATCGC AAGACGGCTG TCCCTTGTTC
AGCCTCCCAC GAGCTCTTCA GAACAAAGGC TCTTTGATGT CTCATAAAGC GTTGTATGAA
ACCGTCACCA AATTCCGCTC TGTCCGCAAC GAAAATAATG AAGTCCAGTC AGCTTTTGAC
CTGATAGCGC GTTTGATGGA CAGTGAATCT TTTCCTCCCA CAGAATCAGT AGATGGTGTT
CGCTATATTG CAAACTCAAT ATCCTTCGAC TCTCGAAAGA ACACTGCCGT GCTCGCTTGG
CTTGTTGACG TTTGCGCGCC ATCTGTTGAC TCTGAGATTG CTGAAGCAAA GCTTCGAAAT
TTCAATATTC TTGCTATATT TGCGGCTTTG GCTGGTGGAG ATGTCGACAA AGCCTGCATT
GTAGCAATCA GCTCTGGTCT CAACAATCTC GCTGCGATTC TCGCAAGTGG TTCGGAGGGC
AGGAAGGATG TACTTTTGAG TGTGAGTAAG CTTGCCGAGA GCAATCATGC GTCATCAGTG
CCAGCGGAAT TGATTCGCTT AATGAAAGAA TCTGGGGGAG ATGTTCACTC AGAGTGCACT
TTGTACAAAC AAGGCTCTAG TTCTCTCGAC TGGAAACGCC GTCTTGCTCT TCGCCTCTTG
CAAGATAGCG ATAAAAGCCT CGTGCAACTG TTGTGTCAAT ACGAAAACGA TATTTCGTCT
AACCACGCAC CACCACCAAA TCTTCCCCAT TCACAACAAA CAGACATGAG AAGCCTCACT
TTCCAGCTTC TTAAGTCATA TTCGGTTCCA CAGTCCATGG AAGTTTCTGA TGTAGTACAT
CCTCTTGGGT TCTCTTCAAT GAGCCACGAT TTTTCACTGG TCTTTCATCT GACAGCCTTG
ATTTGTGCGA CGGGCGCTAC GAAAAACGAT TCATTCAAAA CTGAATATAT CCTAAACAGT
TTTGAAGCCC AGCTGATACA AGCTGGGCGC TGGGATCTGG CTGTGATCGT TTGCTTGTCG
GCGATAGGTG AAATGTCTGA AACCTTGCAT CATTGGAAAG CTCACCGCGC AAAAAGTCTG
ATTTGTCGAT TTTCATCGGA CAATGATGGT AAGCGTCTTT TTCTCGAGGA GACTGGAATC
CCTCGCCGGT GGTTTGAAGA AGCCCTCGCC TACCGAGCTC TATACCGAGA GGACGCCTTC
GGCTTTGTTG TGCACGGATT GGAATGCGAT CTCAAATCTG CGCAAGATGT GCTTGAAGGT
TGTTGGTTGC CCAACCTATT TTTCTTGAGC TTGAAGGACA TCCGTACGTT AATGGAGCGA
ATCGGTATGG CTTTTTCTCC CAATTCCCTT TCGGCTGCAA TGCACCGCTT CTTTGACTTG
AATGACGGGG TAAATCTACT CATTGGGAAA AGTCAGGACG AAATAGAGAG CATTGTGCCG
TCTTTGATCG AGTCGTGCCA GGGTATTGAA CGTACGCTTG TCTCTACAAA ACAGCATGGT
CTAGAGTCTA ATCGTACCAC AAGCTTGCTA ATGCGAGAAA ATTCGATACC GCTTCAGTCG
ATGATTTCTG AAGCTTTGGA GCATCTCAGT TTTCTACGTC TCCAGTTGAG AGCTATCGGG
GAAGCCAGGC TGACCTCGAA GTCGACGTAA ATCCGAGGCT GGAGCCAAAG AAAGATGCGC
AATGCGCAAG GAATAGCACA TAGTTACAAT CAGTCAGACA TCGCATTATG CGGAAATTCT
CTTAAAGACC TATCTCATC
 
Protein sequence
MFGQQPNTPF GGGGFGQQNP TSGFGAPAPA TGGLFGSPAG APAFGQAPAP AFGQAPAPAF 
GAPSAPAFGQ APAPAFGSPA PAFGGGGFGQ TPAAPTGGLF GQPSPAPAFG GGFGQPAPAP
APTYGGMFGQ PAPTPAPGLF GAPAPQTNTS PFGGNPGGSA FGAPAPSGFG APTSSPFGST
GTTGAFGANT TSSFGAPAPA AGGLFGQPAP APAFGGGTFG SPAPAPGAFG APPASGGLFG
QPAPAPAAGG LFGSPSPSAP GAEGTRAVPY QATNRQDGTA TITLRSITAM PQYENKSFDE
LRMEDFSQGN RGSTVTPSTS NAFSGGFGAP APAPSGGLFG APAPAPFGAP APAPFGAPAP
APFGAPAPAG GLFGSPSPAP APFGAQSSSL FGSNPAPAPF GAPAPSSGLF GSNPAPAPFG
APAPSGGLFG SSPAPAPFGA PAGGGLFGSN PTPAPFGAPA PSGGLFGATP APFGAPAGGS
LFGSNTAPAP GGFGFGSPAP APGGSLFGAP APAPGGFGYG APAPFGAPTP GLFGAPAQAP
PPAALPQNAA IIPPVVNEVM EQQLRAIENK QAELQKSEAW KGSATKDPVT TPTSLSEADG
LFASRYSASP YVTTTPRSAV KIRPRGFPRS EPSKSTALSL SAVGRDNSGL LSPESHLRSS
VMSLHIKPES MNRKSSFRLQ INKPSASSPV PTPSDPKPQQ PSFLSPTFAE TLTSPPPDVS
PTTGASLVHE SPQPFTTPNA SATPKSPAYE LYQQVIGSGE ASSKPQSQVK KPTRTSVPTL
TRKGYIISPT LEELEKTEDA DLAAVSDFSV KRPGFGMVEW EGDVDVRGAD LDRIITIDQA
DVSVYHADEA EGSKPKVGSK LNRPAIITFY NIFPKNGGAN ASKEEKEKHA KKVQRSTAKI
GAEFMSYDRN NGVWKIRVLH FSRYGLDDDS DTENEVPLPE QNSVQFQQQT PPNAQSLLRR
NPTPYKPSRI QFDEMEVSES ADDSDVVCVQ DIQMTDSEKI ALVQKRADEA AKEVFHIVPQ
QDPDYEVQHP LRPAKITTFE NVGDCDSEEE SDYVVPPDGE DWHAARLASS FCRGIAIESG
MHSSSTDMGL RMGRVFRPCW LPNGSLLKLK PSSFNRSPTL TSLRPVLSDS YISQLTSEHL
LEIHRSESVA LESQDGCPLF SLPRALQNKG SLMSHKALYE TVTKFRSVRN ENNEVQSAFD
LIARLMDSES FPPTESVDGV RYIANSISFD SRKNTAVLAW LVDVCAPSVD SEIAEAKLRN
FNILAIFAAL AGGDVDKACI VAISSGLNNL AAILASGSEG RKDVLLSVSK LAESNHASSV
PAELIRLMKE SGGDVHSECT LYKQGSSSLD WKRRLALRLL QDSDKSLVQL LCQYENDISS
NHAPPPNLPH SQQTDMRSLT FQLLKSYSVP QSMEVSDVVH PLGFSSMSHD FSLVFHLTAL
ICATGATKND SFKTEYILNS FEAQLIQAGR WDLAVIVCLS AIGEMSETLH HWKAHRAKSL
ICRFSSDNDG KRLFLEETGI PRRWFEEALA YRALYREDAF GFVVHGLECD LKSAQDVLEG
CWLPNLFFLS LKDIRTLMER IGMAFSPNSL SAAMHRFFDL NDGVNLLIGK SQDEIESIVP
SLIESCQGIE RTLVSTKQHG LESNRTTSLL MRENSIPLQS MISEALEHLS FLRLQLRAIG
EARLTSKST