Gene PHATRDRAFT_49712 details


Gene Information

Locus tag: PHATRDRAFT_49712
Symbol:
ID: 7198397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 27544
End bp: 32841
Gene Length: 5298 bp
Protein Length: 1765 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002184475
Protein GI: 219128555
COG category:
COG ID:
TIGRFAM ID:
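
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS includes its stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(27544, 32841)
print(length_bp)                  # 5298
print(protein_length(length_bp))  # 1765
```

Both values agree with the Gene Length and Protein Length fields of this record.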


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATCGT TATGCGCTGG GGACAGCGGC ATTCTAGGTG AAAGCAGCTC GAGTTTTGCC 
AGTGTCAGAG ACGACACCGT TCGAGACGTC AGTGCCAGCA CGAGTGATGG GGAAGAGGAG
GTTGATGCGG TAGAATTTGC GTTGAACGGT GGAATGCAAG GCACGGAGCA GTATATCAAA
TCTGCAGCCG GTAACGTTAC AAGCATGGCA TTTGACTGTG AGGACTCCCG GCAGGACAAT
CGCTCGCGCG ATACCGCCTG TGGAAGACCT GACGAGCCAG TACCAAACCG TAGCCGCAAG
AGAAACAGCG CCAAAGACAG GCATCCGCGT ACTACAACGC CATCAAAAGT CCAGTCGAAA
AATTCGATAT CACATCCGTT GTCTCCTTTT TCTCCTTTGC TTCTGCAAAA ACAACAGCGC
AAACGACGCA CTCGGGCGAG ACCCGTCGAT CAACATCATC AACCAAACAC GACTTCGCCC
GGCGTTCCGT GGCAAGACGA TGACAACGAC GACCAAGACA CTGTTATTGG TACTTTATCC
TTGAATGCGA CGCGGAAGGT GTCGGTTATT GTACACGTGC GCAATCCAAT CGCGTCCGAA
AAAGAAAAGG ACAAAATTTG TCTCTTTCCC CTTCTGCGCA AACATGACGA AGCCACCGCA
ATACTTTCGA CATCCGAGGC GGCACTTTCT CCCACATCTG CCACACTACA ATCCGCTGGA
CATACCCGCG AACTAATAGT CGTCAATCCG ACTGCTTTTG GCAAGCTTAT TCCGTCCAAA
GTTACCATGG AAACGGCTCG ATTGGTAGCA CAAATGGCTA ACGTCGAAGA CTGGGCGCGA
TCCTTTCGCT TCCAGCAAGT TATGTGGCCC GGAAACGAGT TGCAAGCGCT GCAGAATCTG
ACTCAAGCAA TGGTTAATGA TGTCATGGCA CCAGGGTCGA TCGCCTCCCG AACGATTCTG
GCGATTGGTA GCGATGCCGC CAGCAAAAGC ACTCTTTTGT TTGGTTCCGT CGGAACACAG
AGTATCGCCC GCGTATTGGC TACTACGGCA CACGATCTCA CACCCAGTGA TATTTTGTCG
CGCTACGGAC TCATCGGAAT AGCAGTATCG CAATTGCTGG ATCAAATGCC ATCGCACGCC
ATCGTGACCT TGTCTGTTTT GGAAATCGCC CCAGACGATG TTCTTCATGA TTTGTTGGCC
GCTCGGCCGT TTAATTCTAG CTCACGGCTT CAATTTCGGT ACTCCAACAG CACTAATGCG
ATGTCAGAAA GCGGCGGGGG AGCGGTGGTT CACAATCTTA CTAATGTGCC CCTGGATTCG
CTCAAGTCGT TGGGGCATTT GTTGCGGCGT GCCTTTACTG CCGCTCGGAA TCCCAAACGA
GCCCATTCTC CGGCCCATAT TATGGCAACG CTTAAAATAT GGCAATCTAA TGGCACACGG
AACGACTCTA CAATGGAGGC TGCGCATACT GTTGTACAAT TTGGGGATAT TGTCAACGCA
TGTACCAATT CAAGGAGAGA CGCAGCCTTG AAGCAAAGTG TTACGACGTT GGGTGGCGTC
TTACGTGGTG TACTGCTGCG GGAAGCAGGC AACGACTCCC CATGGAAATT CCGTGAATCT
TCTTTGACCA AAGTGTTGCA ACGATACGTC GATCACCTGG ATGGCAAAGT TGTCATGCTG
GCGTGCACCT CGCCTTTAAG CGACGTTTAC GACGAAACCT TGGCAACTCT CAGGTTCGTT
TCGAACTTGC TCTATAGTCC GAATCGCGCT GCGTCAAGTC CATTCCATCG CAGAGAAGAT
GATACAGCTC CCACGTCACC GTTGTCCGAC ACTTCCTCGG CAATGAGCCT CATGGCAGCA
GAGTTTTCGG GATCGGATCG ACAAACGTTG CTGACGAATT TGTTGTCGGA TCCTCGGCAG
CGTCTCGCCA AAGTGTGGAA ACAAAAAGCG CGCGAAAATC CAGCAAATGC AGAGCTGCAA
GAAAACAAGA ACGAGTACAC GCCTACTCAG TACGAGCTCA TTGAGATAGC GACGTTGGAC
AGAGCTGAAA CGCGACTACC GAGAGCCGAA GAGCAGTCTT CGACTTCTCA AATTGTTCAA
CCAGAGATAT TAATTCCGAC ACCTATTGCC GCCAACCGAA ATGCCATGAG GAATAAATGT
CGCTCGTGGC GTCCACCCGA CACAGATGGT CCCTTTGGTA AGGTCCTGGA AGATTGCGAT
TATGCTGCTG TAGAAACAAG CTTTCCCTCG GACTTATCGG AACGCGACGC TGATGTTGGG
CATAATTGTA TGGACGAACC GAGTCCATTG ACGATTGATA CAAAGGATCC ACCTTTTAAT
CTTGACACTT CCTATACGTT GGACGAGATG GCAGAGCGTG TGTTTAAAAA CAGCAAAATG
AAGGATACGG AATCAAAGAC GCGTGCTCCG TCAGGATATC TTGAAGGGCC TGGGAAGGAA
CCATCGAACG TCCACATGGA CGAGAGGCTA GCCGTTGGTG ATACCGTGGT TGGCGAGAAA
CATGCTTTGT ACAATTCATT GCACAAGCAA ACGGGTAATT TGCCCTGGGC GTCCCGCGAA
AAACCGAGTA GCCATGGATT GCATGAAGTG ACAGGAATGC AATCAAATAC GGGTACATTA
GCACATTTGA GCAAGAAAGA GAGGCAAGAG GAAAAGGAGT CCTTCCCAGA CAAGAAGGAC
AATGCAAAAG TGGACCCGCC ACAAGATCTA CTGTTGGACA ATAGAGAAGA GCGCGACAGT
ACTTTGATGC ACGATCCGCG ACAAGTCTTC AGCACTAAGG AAGGAGAGGA TAGATATGTT
TCATTTTCTT CGGCAAACCA TCAGGCTTCA AATGTTCCTC TCCGTCACAA TTCGGATTTC
GACCGGAAAT TCCCTAGCCT TATGTTTGAA GATAAAGATC CGAAAGTTGA GCGATCGGAT
GCAGAAGGGG CCTCGACGAC GGATCCTTGC GCGGGTCGGC ATAGCGCCTT GTTTGGCAAT
TCCACTGAAC CCAACAATCG GAAAGATTCG GGCAACAAAA GCAGCTTGGC TCACACTCCT
TTTGAAAGTG ACTCGTACAA CGCAGAAATC GACCCGAGCT TGGAGACTTC GCATCATATT
TTGCTTGTAA ACGTGCCTGA GGATGTGACC GTTGGTCACA GCGCGCATCC GGACGTGACG
GTAGGACGGG CTTCAGGAAA TGAGATCTTC GATCGCTCAT CGGTCGATGG GGCTAGTCGT
GTTTCAGCCG AGGCACAGCT ATTGCGAGAA AGTCTGTCAA AAGCGAAAGC GGAGCGGGAC
AGTGCAATTC AAAAGTATCA AAGACTTGAG GAGGAAAAAA CAGCGCGAGA TGAAGAAGAA
GACTTTTTGT TGAAAAAGTT GCGCGCTTTG ACCGACGAGC GAGATGGCGC TTTTCGAAAG
TTGGAAGACC TTGAGCGAAA GCTCACCTCG GAAGCAAAAT TGAAGTCTGA TACTCGAGAA
GCTGAATGGA ATAAATTGAC CACCGAAAGA GCGAAGGCAA TTAACCAAAT CGAAGAACTT
GAAGCTTCAT GCCTCGAAGC GCAAGAATCC CGAGACATCC TCCGCGAAGC ACTGGAAGAA
CGCGAAGAAG CTTTGCGACT GATACAACTT TCCGAGAAGG ATCGTATCCG TGACGCGGAG
GAACATGATA AGGCTCTGAA GGAAGCCACT TTACAGATAG AAGTTCTGCA AGATCAGACT
GTCAAAATGT GTGCCGACCG GGGAGAGCTA GTCAAGATTG CTGAGGAAGC CATTGGGACC
AACGCTCAGC TCGAACAGAA GATCGTAGAG CTCGAGAAGG AAGTTTCTAC AAACGTTTTA
TCGTCGGTCT CAAAGGATGA GGTTGAAATT CTGGAAAAAG AAAATTTGAA GCTATCAGAG
GAGAGCCAAG AACTGCGCAA GCAGTTATCC CAACACAAAT TACAACTCAA GGAGAACGGT
CTCAGTCTCT CCGAGCTCAC CTTTACGGTC AGTAGTCTGG AAGACGAAAG GGCTCAACTT
CTCAACAACA AAAAAGCAAA AGACGACGAA ATTCGTCGTT TAAAGCGCGA ATTGACTTCC
AAAGATATTG TAGAAGACAG AGCTTCGGCG CTTCAACGCC AACTGGAGCA CCAATTGACG
TTAGGCGTAG AATGGCAGCA GCGTGAAACG GATTTCAATC GCACTATACA GAACAATATC
ACCGAGCTGG AACAACTTGA GAGCGCCGTA AAAAACCTAC AAGCGAAGCT GGATGAAAGG
AACGAAATGC ACTCACAAGA GGTCGAATCG GACAAGAGCC AAATTGCACT CTTGCTAAAC
AAAGTGTCAT ACTACGAGAG AGAGCGGTTG TCTGCGGCTG CAAGCGTCGA AGAACTTGAG
CTCAAACTGG GATCTGAAAT CCGCACTCGT CACAAACTAT CATCTGATTT GAAGCAAGCA
CAGATTGATC TCCACAGTCG CCTCGCGGAT GTACAAGAAT TGTCGTCCAA TTTAAAGCAA
CTCCTAGTCG AGAAAGACGA AGACGAGCGA AAGGTTGCGC GCATGCAACT TGCTTTACAA
AAATTTCAAA GCGAAACGCG AACCAGGGTA AAAATGGTAG TCATGCATCG TGACGAAGCG
GCCAATCTCT TGGACGAAAC TTTGACCGAA AATCGAGCAT TGACCGAAAA GCTTCAAGAG
CTACGGCAGG CTTTGGAAGA AACGCAACAT GCTCGCGTTG ACTGGCGTTC AAATAGTCTC
ACGGCAAGTC GCTTTCAGGC AGATCTCGAT GACATGGTGT CCAAAAACAA AGCTTTAAGT
CAGCGCAATC AATTACTGGA GACTCAAGTT GAAGAACTCC TACAACAACA ACCACAAAAG
CAGTCTTTTC CCCGCAATCG CGAAAACGAT AATCTCCGAG ATCACGATAA ATACAGAGGC
ATTGACAGTG AACACCAATT TGGAATTACC AATTCGTTAA AACGGCGAAA CGATTCTGGT
AGCGGAAAAC ACACCATTCA GCGTAGAGAC GAAGATGGCT TTGATCACGG AGCTTTGCAT
CTTGAGCCCT TCTCAGCACT GCCTGGGCAT GCTTTGGCGA ATTCTGAGTC GAGTCTTCAC
GCACGGGCAG AAGAAGTAGC TGCGTACCTC GCTTTGTCCG CAAAGATGAC CGTGGAGAAG
AGTCAGACGG AGGTGATCCG AATGCAGCAT CGATTGCATG CGGTGGAAGA CACCAAAGAT
ACTGAGATTG ATGCTCTAAA GCGACACGTC CGCAGGCTAG AACGGCATCT CGAGAATTCG
AATTGGCCTG AGGGGTGA
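
The sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first line of the gene sequence as sample input, so its result describes those 60 bases, not the whole gene; the helper names are illustrative.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence block, copied from this record.
first_line = "ATGGAATCGT TATGCGCTGG GGACAGCGGC ATTCTAGGTG AAAGCAGCTC GAGTTTTGCC"

seq = join_blocks(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # 0.533
```

Passing the full gene sequence block to `join_blocks` in the same way would give the input needed to check the GC content field.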
 
Protein sequence
MESLCAGDSG ILGESSSSFA SVRDDTVRDV SASTSDGEEE VDAVEFALNG GMQGTEQYIK 
SAAGNVTSMA FDCEDSRQDN RSRDTACGRP DEPVPNRSRK RNSAKDRHPR TTTPSKVQSK
NSISHPLSPF SPLLLQKQQR KRRTRARPVD QHHQPNTTSP GVPWQDDDND DQDTVIGTLS
LNATRKVSVI VHVRNPIASE KEKDKICLFP LLRKHDEATA ILSTSEAALS PTSATLQSAG
HTRELIVVNP TAFGKLIPSK VTMETARLVA QMANVEDWAR SFRFQQVMWP GNELQALQNL
TQAMVNDVMA PGSIASRTIL AIGSDAASKS TLLFGSVGTQ SIARVLATTA HDLTPSDILS
RYGLIGIAVS QLLDQMPSHA IVTLSVLEIA PDDVLHDLLA ARPFNSSSRL QFRYSNSTNA
MSESGGGAVV HNLTNVPLDS LKSLGHLLRR AFTAARNPKR AHSPAHIMAT LKIWQSNGTR
NDSTMEAAHT VVQFGDIVNA CTNSRRDAAL KQSVTTLGGV LRGVLLREAG NDSPWKFRES
SLTKVLQRYV DHLDGKVVML ACTSPLSDVY DETLATLRFV SNLLYSPNRA ASSPFHRRED
DTAPTSPLSD TSSAMSLMAA EFSGSDRQTL LTNLLSDPRQ RLAKVWKQKA RENPANAELQ
ENKNEYTPTQ YELIEIATLD RAETRLPRAE EQSSTSQIVQ PEILIPTPIA ANRNAMRNKC
RSWRPPDTDG PFGKVLEDCD YAAVETSFPS DLSERDADVG HNCMDEPSPL TIDTKDPPFN
LDTSYTLDEM AERVFKNSKM KDTESKTRAP SGYLEGPGKE PSNVHMDERL AVGDTVVGEK
HALYNSLHKQ TGNLPWASRE KPSSHGLHEV TGMQSNTGTL AHLSKKERQE EKESFPDKKD
NAKVDPPQDL LLDNREERDS TLMHDPRQVF STKEGEDRYV SFSSANHQAS NVPLRHNSDF
DRKFPSLMFE DKDPKVERSD AEGASTTDPC AGRHSALFGN STEPNNRKDS GNKSSLAHTP
FESDSYNAEI DPSLETSHHI LLVNVPEDVT VGHSAHPDVT VGRASGNEIF DRSSVDGASR
VSAEAQLLRE SLSKAKAERD SAIQKYQRLE EEKTARDEEE DFLLKKLRAL TDERDGAFRK
LEDLERKLTS EAKLKSDTRE AEWNKLTTER AKAINQIEEL EASCLEAQES RDILREALEE
REEALRLIQL SEKDRIRDAE EHDKALKEAT LQIEVLQDQT VKMCADRGEL VKIAEEAIGT
NAQLEQKIVE LEKEVSTNVL SSVSKDEVEI LEKENLKLSE ESQELRKQLS QHKLQLKENG
LSLSELTFTV SSLEDERAQL LNNKKAKDDE IRRLKRELTS KDIVEDRASA LQRQLEHQLT
LGVEWQQRET DFNRTIQNNI TELEQLESAV KNLQAKLDER NEMHSQEVES DKSQIALLLN
KVSYYERERL SAAASVEELE LKLGSEIRTR HKLSSDLKQA QIDLHSRLAD VQELSSNLKQ
LLVEKDEDER KVARMQLALQ KFQSETRTRV KMVVMHRDEA ANLLDETLTE NRALTEKLQE
LRQALEETQH ARVDWRSNSL TASRFQADLD DMVSKNKALS QRNQLLETQV EELLQQQPQK
QSFPRNREND NLRDHDKYRG IDSEHQFGIT NSLKRRNDSG SGKHTIQRRD EDGFDHGALH
LEPFSALPGH ALANSESSLH ARAEEVAAYL ALSAKMTVEK SQTEVIRMQH RLHAVEDTKD
TEIDALKRHV RRLERHLENS NWPEG
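
The protein sequence can be spot-checked against the gene sequence by translating the first codons. The record leaves the translation table field empty, so the sketch below assumes the standard genetic code (NCBI table 1); the function name is illustrative.

```python
from itertools import product

# Standard genetic code (assumed, NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record, whitespace removed.
print(translate("ATGGAATCGTTATGCGCTGGGGACAGCGGC"))  # MESLCAGDSG
```

The output matches the first ten residues of the protein sequence above.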