Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49712 |
Symbol | |
ID | 7198397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 27544 |
End bp | 32841 |
Gene Length | 5298 bp |
Protein Length | 1765 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184475 |
Protein GI | 219128555 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCGT TATGCGCTGG GGACAGCGGC ATTCTAGGTG AAAGCAGCTC GAGTTTTGCC AGTGTCAGAG ACGACACCGT TCGAGACGTC AGTGCCAGCA CGAGTGATGG GGAAGAGGAG GTTGATGCGG TAGAATTTGC GTTGAACGGT GGAATGCAAG GCACGGAGCA GTATATCAAA TCTGCAGCCG GTAACGTTAC AAGCATGGCA TTTGACTGTG AGGACTCCCG GCAGGACAAT CGCTCGCGCG ATACCGCCTG TGGAAGACCT GACGAGCCAG TACCAAACCG TAGCCGCAAG AGAAACAGCG CCAAAGACAG GCATCCGCGT ACTACAACGC CATCAAAAGT CCAGTCGAAA AATTCGATAT CACATCCGTT GTCTCCTTTT TCTCCTTTGC TTCTGCAAAA ACAACAGCGC AAACGACGCA CTCGGGCGAG ACCCGTCGAT CAACATCATC AACCAAACAC GACTTCGCCC GGCGTTCCGT GGCAAGACGA TGACAACGAC GACCAAGACA CTGTTATTGG TACTTTATCC TTGAATGCGA CGCGGAAGGT GTCGGTTATT GTACACGTGC GCAATCCAAT CGCGTCCGAA AAAGAAAAGG ACAAAATTTG TCTCTTTCCC CTTCTGCGCA AACATGACGA AGCCACCGCA ATACTTTCGA CATCCGAGGC GGCACTTTCT CCCACATCTG CCACACTACA ATCCGCTGGA CATACCCGCG AACTAATAGT CGTCAATCCG ACTGCTTTTG GCAAGCTTAT TCCGTCCAAA GTTACCATGG AAACGGCTCG ATTGGTAGCA CAAATGGCTA ACGTCGAAGA CTGGGCGCGA TCCTTTCGCT TCCAGCAAGT TATGTGGCCC GGAAACGAGT TGCAAGCGCT GCAGAATCTG ACTCAAGCAA TGGTTAATGA TGTCATGGCA CCAGGGTCGA TCGCCTCCCG AACGATTCTG GCGATTGGTA GCGATGCCGC CAGCAAAAGC ACTCTTTTGT TTGGTTCCGT CGGAACACAG AGTATCGCCC GCGTATTGGC TACTACGGCA CACGATCTCA CACCCAGTGA TATTTTGTCG CGCTACGGAC TCATCGGAAT AGCAGTATCG CAATTGCTGG ATCAAATGCC ATCGCACGCC ATCGTGACCT TGTCTGTTTT GGAAATCGCC CCAGACGATG TTCTTCATGA TTTGTTGGCC GCTCGGCCGT TTAATTCTAG CTCACGGCTT CAATTTCGGT ACTCCAACAG CACTAATGCG ATGTCAGAAA GCGGCGGGGG AGCGGTGGTT CACAATCTTA CTAATGTGCC CCTGGATTCG CTCAAGTCGT TGGGGCATTT GTTGCGGCGT GCCTTTACTG CCGCTCGGAA TCCCAAACGA GCCCATTCTC CGGCCCATAT TATGGCAACG CTTAAAATAT GGCAATCTAA TGGCACACGG AACGACTCTA CAATGGAGGC TGCGCATACT GTTGTACAAT TTGGGGATAT TGTCAACGCA TGTACCAATT CAAGGAGAGA CGCAGCCTTG AAGCAAAGTG TTACGACGTT GGGTGGCGTC TTACGTGGTG TACTGCTGCG GGAAGCAGGC AACGACTCCC CATGGAAATT CCGTGAATCT TCTTTGACCA AAGTGTTGCA ACGATACGTC GATCACCTGG ATGGCAAAGT TGTCATGCTG GCGTGCACCT CGCCTTTAAG CGACGTTTAC GACGAAACCT TGGCAACTCT CAGGTTCGTT TCGAACTTGC TCTATAGTCC GAATCGCGCT GCGTCAAGTC CATTCCATCG CAGAGAAGAT GATACAGCTC CCACGTCACC GTTGTCCGAC ACTTCCTCGG CAATGAGCCT CATGGCAGCA GAGTTTTCGG GATCGGATCG ACAAACGTTG CTGACGAATT TGTTGTCGGA TCCTCGGCAG CGTCTCGCCA AAGTGTGGAA ACAAAAAGCG CGCGAAAATC CAGCAAATGC AGAGCTGCAA GAAAACAAGA ACGAGTACAC GCCTACTCAG TACGAGCTCA TTGAGATAGC GACGTTGGAC AGAGCTGAAA CGCGACTACC GAGAGCCGAA GAGCAGTCTT CGACTTCTCA AATTGTTCAA CCAGAGATAT TAATTCCGAC ACCTATTGCC GCCAACCGAA ATGCCATGAG GAATAAATGT CGCTCGTGGC GTCCACCCGA CACAGATGGT CCCTTTGGTA AGGTCCTGGA AGATTGCGAT TATGCTGCTG TAGAAACAAG CTTTCCCTCG GACTTATCGG AACGCGACGC TGATGTTGGG CATAATTGTA TGGACGAACC GAGTCCATTG ACGATTGATA CAAAGGATCC ACCTTTTAAT CTTGACACTT CCTATACGTT GGACGAGATG GCAGAGCGTG TGTTTAAAAA CAGCAAAATG AAGGATACGG AATCAAAGAC GCGTGCTCCG TCAGGATATC TTGAAGGGCC TGGGAAGGAA CCATCGAACG TCCACATGGA CGAGAGGCTA GCCGTTGGTG ATACCGTGGT TGGCGAGAAA CATGCTTTGT ACAATTCATT GCACAAGCAA ACGGGTAATT TGCCCTGGGC GTCCCGCGAA AAACCGAGTA GCCATGGATT GCATGAAGTG ACAGGAATGC AATCAAATAC GGGTACATTA GCACATTTGA GCAAGAAAGA GAGGCAAGAG GAAAAGGAGT CCTTCCCAGA CAAGAAGGAC AATGCAAAAG TGGACCCGCC ACAAGATCTA CTGTTGGACA ATAGAGAAGA GCGCGACAGT ACTTTGATGC ACGATCCGCG ACAAGTCTTC AGCACTAAGG AAGGAGAGGA TAGATATGTT TCATTTTCTT CGGCAAACCA TCAGGCTTCA AATGTTCCTC TCCGTCACAA TTCGGATTTC GACCGGAAAT TCCCTAGCCT TATGTTTGAA GATAAAGATC CGAAAGTTGA GCGATCGGAT GCAGAAGGGG CCTCGACGAC GGATCCTTGC GCGGGTCGGC ATAGCGCCTT GTTTGGCAAT TCCACTGAAC CCAACAATCG GAAAGATTCG GGCAACAAAA GCAGCTTGGC TCACACTCCT TTTGAAAGTG ACTCGTACAA CGCAGAAATC GACCCGAGCT TGGAGACTTC GCATCATATT TTGCTTGTAA ACGTGCCTGA GGATGTGACC GTTGGTCACA GCGCGCATCC GGACGTGACG GTAGGACGGG CTTCAGGAAA TGAGATCTTC GATCGCTCAT CGGTCGATGG GGCTAGTCGT GTTTCAGCCG AGGCACAGCT ATTGCGAGAA AGTCTGTCAA AAGCGAAAGC GGAGCGGGAC AGTGCAATTC AAAAGTATCA AAGACTTGAG GAGGAAAAAA CAGCGCGAGA TGAAGAAGAA GACTTTTTGT TGAAAAAGTT GCGCGCTTTG ACCGACGAGC GAGATGGCGC TTTTCGAAAG TTGGAAGACC TTGAGCGAAA GCTCACCTCG GAAGCAAAAT TGAAGTCTGA TACTCGAGAA GCTGAATGGA ATAAATTGAC CACCGAAAGA GCGAAGGCAA TTAACCAAAT CGAAGAACTT GAAGCTTCAT GCCTCGAAGC GCAAGAATCC CGAGACATCC TCCGCGAAGC ACTGGAAGAA CGCGAAGAAG CTTTGCGACT GATACAACTT TCCGAGAAGG ATCGTATCCG TGACGCGGAG GAACATGATA AGGCTCTGAA GGAAGCCACT TTACAGATAG AAGTTCTGCA AGATCAGACT GTCAAAATGT GTGCCGACCG GGGAGAGCTA GTCAAGATTG CTGAGGAAGC CATTGGGACC AACGCTCAGC TCGAACAGAA GATCGTAGAG CTCGAGAAGG AAGTTTCTAC AAACGTTTTA TCGTCGGTCT CAAAGGATGA GGTTGAAATT CTGGAAAAAG AAAATTTGAA GCTATCAGAG GAGAGCCAAG AACTGCGCAA GCAGTTATCC CAACACAAAT TACAACTCAA GGAGAACGGT CTCAGTCTCT CCGAGCTCAC CTTTACGGTC AGTAGTCTGG AAGACGAAAG GGCTCAACTT CTCAACAACA AAAAAGCAAA AGACGACGAA ATTCGTCGTT TAAAGCGCGA ATTGACTTCC AAAGATATTG TAGAAGACAG AGCTTCGGCG CTTCAACGCC AACTGGAGCA CCAATTGACG TTAGGCGTAG AATGGCAGCA GCGTGAAACG GATTTCAATC GCACTATACA GAACAATATC ACCGAGCTGG AACAACTTGA GAGCGCCGTA AAAAACCTAC AAGCGAAGCT GGATGAAAGG AACGAAATGC ACTCACAAGA GGTCGAATCG GACAAGAGCC AAATTGCACT CTTGCTAAAC AAAGTGTCAT ACTACGAGAG AGAGCGGTTG TCTGCGGCTG CAAGCGTCGA AGAACTTGAG CTCAAACTGG GATCTGAAAT CCGCACTCGT CACAAACTAT CATCTGATTT GAAGCAAGCA CAGATTGATC TCCACAGTCG CCTCGCGGAT GTACAAGAAT TGTCGTCCAA TTTAAAGCAA CTCCTAGTCG AGAAAGACGA AGACGAGCGA AAGGTTGCGC GCATGCAACT TGCTTTACAA AAATTTCAAA GCGAAACGCG AACCAGGGTA AAAATGGTAG TCATGCATCG TGACGAAGCG GCCAATCTCT TGGACGAAAC TTTGACCGAA AATCGAGCAT TGACCGAAAA GCTTCAAGAG CTACGGCAGG CTTTGGAAGA AACGCAACAT GCTCGCGTTG ACTGGCGTTC AAATAGTCTC ACGGCAAGTC GCTTTCAGGC AGATCTCGAT GACATGGTGT CCAAAAACAA AGCTTTAAGT CAGCGCAATC AATTACTGGA GACTCAAGTT GAAGAACTCC TACAACAACA ACCACAAAAG CAGTCTTTTC CCCGCAATCG CGAAAACGAT AATCTCCGAG ATCACGATAA ATACAGAGGC ATTGACAGTG AACACCAATT TGGAATTACC AATTCGTTAA AACGGCGAAA CGATTCTGGT AGCGGAAAAC ACACCATTCA GCGTAGAGAC GAAGATGGCT TTGATCACGG AGCTTTGCAT CTTGAGCCCT TCTCAGCACT GCCTGGGCAT GCTTTGGCGA ATTCTGAGTC GAGTCTTCAC GCACGGGCAG AAGAAGTAGC TGCGTACCTC GCTTTGTCCG CAAAGATGAC CGTGGAGAAG AGTCAGACGG AGGTGATCCG AATGCAGCAT CGATTGCATG CGGTGGAAGA CACCAAAGAT ACTGAGATTG ATGCTCTAAA GCGACACGTC CGCAGGCTAG AACGGCATCT CGAGAATTCG AATTGGCCTG AGGGGTGA
|
Protein sequence | MESLCAGDSG ILGESSSSFA SVRDDTVRDV SASTSDGEEE VDAVEFALNG GMQGTEQYIK SAAGNVTSMA FDCEDSRQDN RSRDTACGRP DEPVPNRSRK RNSAKDRHPR TTTPSKVQSK NSISHPLSPF SPLLLQKQQR KRRTRARPVD QHHQPNTTSP GVPWQDDDND DQDTVIGTLS LNATRKVSVI VHVRNPIASE KEKDKICLFP LLRKHDEATA ILSTSEAALS PTSATLQSAG HTRELIVVNP TAFGKLIPSK VTMETARLVA QMANVEDWAR SFRFQQVMWP GNELQALQNL TQAMVNDVMA PGSIASRTIL AIGSDAASKS TLLFGSVGTQ SIARVLATTA HDLTPSDILS RYGLIGIAVS QLLDQMPSHA IVTLSVLEIA PDDVLHDLLA ARPFNSSSRL QFRYSNSTNA MSESGGGAVV HNLTNVPLDS LKSLGHLLRR AFTAARNPKR AHSPAHIMAT LKIWQSNGTR NDSTMEAAHT VVQFGDIVNA CTNSRRDAAL KQSVTTLGGV LRGVLLREAG NDSPWKFRES SLTKVLQRYV DHLDGKVVML ACTSPLSDVY DETLATLRFV SNLLYSPNRA ASSPFHRRED DTAPTSPLSD TSSAMSLMAA EFSGSDRQTL LTNLLSDPRQ RLAKVWKQKA RENPANAELQ ENKNEYTPTQ YELIEIATLD RAETRLPRAE EQSSTSQIVQ PEILIPTPIA ANRNAMRNKC RSWRPPDTDG PFGKVLEDCD YAAVETSFPS DLSERDADVG HNCMDEPSPL TIDTKDPPFN LDTSYTLDEM AERVFKNSKM KDTESKTRAP SGYLEGPGKE PSNVHMDERL AVGDTVVGEK HALYNSLHKQ TGNLPWASRE KPSSHGLHEV TGMQSNTGTL AHLSKKERQE EKESFPDKKD NAKVDPPQDL LLDNREERDS TLMHDPRQVF STKEGEDRYV SFSSANHQAS NVPLRHNSDF DRKFPSLMFE DKDPKVERSD AEGASTTDPC AGRHSALFGN STEPNNRKDS GNKSSLAHTP FESDSYNAEI DPSLETSHHI LLVNVPEDVT VGHSAHPDVT VGRASGNEIF DRSSVDGASR VSAEAQLLRE SLSKAKAERD SAIQKYQRLE EEKTARDEEE DFLLKKLRAL TDERDGAFRK LEDLERKLTS EAKLKSDTRE AEWNKLTTER AKAINQIEEL EASCLEAQES RDILREALEE REEALRLIQL SEKDRIRDAE EHDKALKEAT LQIEVLQDQT VKMCADRGEL VKIAEEAIGT NAQLEQKIVE LEKEVSTNVL SSVSKDEVEI LEKENLKLSE ESQELRKQLS QHKLQLKENG LSLSELTFTV SSLEDERAQL LNNKKAKDDE IRRLKRELTS KDIVEDRASA LQRQLEHQLT LGVEWQQRET DFNRTIQNNI TELEQLESAV KNLQAKLDER NEMHSQEVES DKSQIALLLN KVSYYERERL SAAASVEELE LKLGSEIRTR HKLSSDLKQA QIDLHSRLAD VQELSSNLKQ LLVEKDEDER KVARMQLALQ KFQSETRTRV KMVVMHRDEA ANLLDETLTE NRALTEKLQE LRQALEETQH ARVDWRSNSL TASRFQADLD DMVSKNKALS QRNQLLETQV EELLQQQPQK QSFPRNREND NLRDHDKYRG IDSEHQFGIT NSLKRRNDSG SGKHTIQRRD EDGFDHGALH LEPFSALPGH ALANSESSLH ARAEEVAAYL ALSAKMTVEK SQTEVIRMQH RLHAVEDTKD TEIDALKRHV RRLERHLENS NWPEG
|
| |