Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40325 |
Symbol | |
ID | 7198243 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 112921 |
End bp | 121096 |
Gene Length | 8176 bp |
Protein Length | 2361 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184307 |
Protein GI | 219128202 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTAA ATCTCGAATT TGGGTTGCGA TCCGTCTGCG CTGCCGTCAG GTACCGTGCC CGAGGCGTTC GAACGGTGGC AGTGGCGGCC AATCCGCAGG TATGGTCCGT CGTCCAGACC ATTCCGTTGT GTTGACTGCT ACAATTATGT CAAGTTCTCA TTTTTGACCG GATATACAGA CAGACGGGGG CGGCAATTCT GGAACACTGA AGTACCGAGC CGTAAACGGG ACGGGACCGC ACGGCACCAT GGGATCGCAA CCCTGTGGTT TCCATTCCGT TGACATTCTC ATCTTTTATA TTTCATATTC TCGACTGGGA CGACGACGAC GACTGCTAGT TGTACGTCGA CAATATTCCC AAGTAGAGAG AGAGAGAAAG TCGGCCATTG GGGTGTGCCT TCACGTCTCT TTTGCGGTAG TATGAACGAG AAAGAGGCCG TCCCGTCTGG TGCGTCCCTT CCGGCACACG ACGCGGCCTG GGTGGGATTG GATTCCACGC CATTACCAAC ATCGTCGTCG CCGTTGTCGT CCACCACAAC AGTCTTGCCT TTGTTGCCAC TCGCGACCTC CGGATCTTCT CCAACCAGTC AGGCCCCGTT GTCCTCCGGT CCCACTACCC CGATGGATAC GAGCCACAAC GGTAGCAACA CTGCGCGCAA CAACAACAAC AACAGTGTGA GGAGCAGTAC CACTATACAC AACACTTCCA CTACCAGCAT CGGTTCCAGC GTCAGCAACA ATAGCAACGC CAAGAATCCG CACGTTCCGT TCCGTATGAA ACTCAATCTC GCTTCCACGA GAGGCCGGAG TATACATCGC CGGAAACGCA AACGAGCCTT GACGAGTCCC AGTTCAGGAT CCAGTTCCAG TTCGGGTAGT TCCGAAGAAG ACGAAGATGA TGATTCCTCT ACCGGGGCTC TCGTATCCGT GGCCGTCCCG TCCAACCAGG GACGTGTCGA TGGCACCGGG ACGACCTCGT CACCCCAAAC CGGCGAACCC ACCGCCTCCA GCACTGGCCA CCACCACCAG CACCACCCAC GACAATAACA GCAGCAGTGT CAACAATCCT CAGCCTGAAG GATGGCGGGT CAAACTTTAC CGACTCAATG CCGACGGCTC TTGGGATGAT TGTGGCACCG GACGCATCCT TTGCTTGTAC CGTACCCCGG GCGTACAGAA ACCAACATCA CCCGTCCAGG ACGGCCACCA AGCCAACACG AATACCTCCG TCGCCTCGAC CTTGCCATCC GAAGCAACCT ACTCTGCCGA AACTTCCACA ACAACATCAA CCTTGACAAA AGCGGCAACA TCTCCGGCCT CGCCCACTTC CAACAAAGTA AAGACCAACG TGGATCAATG GTTGCATACG GAAACGGGCG AAGCTACGCT CTGCGTTCAC GCAGAAGCGT CCAAGCAAAA TTCCTCCCGA CGTGTACTGC TACGCACGCG GGTCTTACTG CGCGATGCCT ACCAACGACA AGGTGACAAT ATCATTACCT GGTGTGAACC CTACTACGGA AATCGGGCCA CCTCCCCCGA TGGCAACGCC ACTAGCACTA GCGGAAATAG CAACACCAGT AGTACATCGG GAGTGGACTT GGCCTTGTCC TTTCAGGACA ATGCTGGTTG CTTGGATATC TGGCGGCAAA TTACGCAGGT ACAAGGCCAG GCCGCCGAAC TGCTTCAGGA AACCCTGGCG GCGTCTTCCT CCGTGGAAGA CATGGCCGCG CACGTGGCGG CTCAGCACCA TGCCGATCTG CAAGCACGAC AAGCACATAG TGATGCCGAG ATGTGGAACT TTTCCAACAC CAACAACAAT AACAACAATG ATGAGGAAGT ATACGCCTTG GAATCATCTC CCGCCATCCC GATGCCTCCT TTGCCGACTC CACCAAGCCT CCAAAACATT GCTTCCATTG CCGACACGAT TGCCGCTCTG CAACACACAC AACAACGGGA TTCTTTAGCC ATGCGCATTG CCACGGACGA TTGTGCATAC CTAAAATCGT TACTTGCCTT GTTTCCCGCC GCCGAAACCC GCGGAGATTA CGGCAAACTG GCCATGCTCG CCGCGTGCGT CAAAACGATT CTTCTACTCA ACGATCCCTC TATTTTGGAA TGGATCATTT CGGTGGCGCG CGTCTTTGAA GATATCTGTG CCTGTCTGGA ATACGATCCT GACTTGCGCG AAAAAGCTAA TCATCGGTGG TTTTTACGGG ATCGCGCTAA GTTTCGAACC GTGGTGCCCA TGGAAGATCC CGAACTCGTC TCGGCAATTC ATCGGAGCTT TCGTGTGCAA TATTTGCGCG ATACTCTCTT GCGACCAACC ATGGACGAGT CGGCCCTGTC TTCGCTAGGC TCCCTACAGA CCTTCACGCA TGCTGACGTG GTAAAAGGGG TTACCATGTC GAGTAATGGA GATGTTAGTC TGAAGGATAG CTATTTGATA CGGGTGATTC GATTGTTGGG GGTTGAAACA GACGCCGTGG GAAGGCTGGA ATGGTCGGAA TTGGAAGCCG ACCCCGACGC TGGTGGGATT GAGACGGCAT CATTGACAAC CACGCCGATG CTAGCTTTGG CGGAACTTCC CGCCGATGAA ATGGTGCCGG ATGGGTCCAC CGTGGTCGGC AGACATGGTC TGGATGGCAC CGCCACTTGG AAACAGTATT TGGCCCCGCA GGATTCTTCC TTGGTGTCTC GAAAGAATCG TCGACGTGGG TGCGTTTCGT TTTTGCGAGA ACTCTTCAAT ATGGTGCGAA CAAGTCTGCA ACAGTCGGAC AAGGACGATT TTTTTGCGTT TATTTGTTCC TTGGAGATTG ATGTGAACGA TGGTATTGAA ATACCCGACA ACGTTTCACA AACCTCCCAA CAGGTGGAGG TCGGCAGTGT AGCGAGTACT ATCAAATCGG AACGGACTGA TGAAAAGACC GAGAGTATGG TAAACACTGC ATTGTCTTCA CACCTGGAAT GGTCGCCACC CTCGTCGCCC GCCAATATTT TATCCTTGCT AGCCAACATT CTTGCTGATC CACACATTGA CGTCACGGAA AAATGTTTAG TTCTGGAAAT TGTTGCTGGC GTTGCTATGC ACGATCCCGG CCTTATACGG AGGCATTGCT TGGAGTATCA TACAGTCTGG AATAATCACG AAAAGGCCGT CCCCAACGTA ACCATTGGAC GCCCGGATGC CAACGAACGC CGGCAAGTGT TGTTTCTGTG CCCTCCGAAT GACCTGTTGG GGTCGTTATT GTTTCTTCTG GACGTGGAAC CGGACGCTGG TCTTTTGCTG CAAGTAACTG AAATCATGAG GATTGTCTTG GATACCGACA TGATGGGTGA TCACGGTCCG ATGAGCGCCT TTGCGGACGA AGCCGAAGGT ATTCCTCCTG GAGTTCCAAG TCAGCCGCAT CAGGCGTCTG GTCCAATCGG CACCACCAGT GGTACGGATC AAAAGCAGTT TCTTTCCATA TTTTTTGAAA ACTATGTCGA GTGGCTCGTG GCTCCTTTCC AATTTTCCAT TCTTCATGTG ATTCGGCGGG TTCCGGATGA CGTGTTGAGG TGTCAATCAA AATCAACTTT GTTGCAACGA ATTGTACGAT TATTTCAGCA AGGGGTTACG TCGAAAAATG CATTGCTCAA GATTGTTCCA TCGTGCGCAA TCCGCAGCTC GTTTGCAGTC GAGATGTTAA GTTTCTGTGT TCGAGCTCAT CTCTATCGCA TGAAATTCTT TTTGCTCAAA TCTAGAGTGT TGGGGAATGT CCTGAAAGTG TTAAGTCCTT CTTCTATAGT ACGAAGTCAT TCCGGCGACC GTTGCTTGAA GTTGGCAGCT CTGCGATTCC TACGCGCTGT TCTCTCTGTG AACGATGAGT TCTACCATCG GCACATTATT CAACACAAGT TGTTTGCGCC TGTATTTGAG GCTTTCCGGG CAAATCCAGT CGGAGATAAT CTGGTTTCTT CTGCGATTGT TGAAATGTGC GACTACATTC ACAATGAGAA CATCAAATCG CTCATTGAGT ACATAGTTTC AGACTACATG TCATCGGCAC AGACGGGAGA TGATGTACCG AGTCTGGAAG ATGTGTCGAG CCCCTACGTC AGTACGCTCA CCGTGCTCCG GAAGGCCTAT GAGACCAACA TTCACGCCAT CAGGCAGTCT CATAATTGTG AGGAGGGGGC GTCCTCACCT GGTGGGTCAC GTTACTTTCC CGGCGGAGCA AATCATTACC CGCATGGCCC TAGAGTACTT AGTGGCAAAG CTTTGGAAGA TCAACGGAAG TTTCTGGAAG TGGACGAGGA GGAATCATAT TTTGAGTCGG ATGACGAAAG CGCCTGCAGT ACAATGATTC TTCCAGCTAC GGAGGCGGAA GTTGCACAGC AGCAGGTGGA GAGCAATTTG GAGAGGATCC CTCGAATGTA TTCACTATCA CAAGCGCCAC CTTTGAGTGA AGTGGAAGAT GCTAAGGAAA TTCGACACGC CGTATACGCG AGTGATGTTG CAGCGGAAGG TTCAGAGTTG ACTGAGGCAG GGTGACCACT TAGCATGACA TTTTCTAAGT TCGCACTTGT TTTCAAGACC AAGACGGCCG TGTATTTTCA AATAGAGAGT CTGTTCCTGT GCGTTGCTAG GATTTATCTC AAATACAGCC GCACCAGACG CCGTACGCGA CTGCGGCGCA TATTCCAGTA CTGCCGCTCG TTTGATTGTT GTTTCCCAAA GCTTCACCCA CAAAGCATGT TGGCATGCTG GAAGAGTGGT TTTGGGCACG CATGTGTTCC GCGCGGGTGC CATATTGTCA ATTACATTAC GAGAAGTCCT TATTTCAATG AAGCGGCGGA TTCATGCATC GGCTTTCAAG ATGGTAAAGG TTGACAGTTA GTCGGTGTGG TATCCTTTGA CGGAAACGCG TATAATGAAT CTTAAATTGT CATTTCTGAT TCGGACAAAG ACACTTTCAT TCGGTTTGTC CCCGTTGTGG TAGTAGGAGA TTAGAGATTT TACTGCATAA TATGCCATAG GTGCAAATGA CAGAACCCGA CATTAGCAAA AACACGTGTT TTGCGGACCG AGAATGACAT CTTTATGGTT CACAATGTCC TTCATAATGA GCCCTATACC AGATAGTCAG CATTCGTCAG CGAGAGGTGT TGTGACTGGT GAGAACACAA CCAAGAAACG GTCATTGAGA GGTACAATTC TCCCGTAAAA AGTGGCAACT GTGAATATTG GATATTACTG GTTCCTTGTT TATGACTAAC TGTAAAATAA CTTCATTCCG GGTTGATAAA GTGTCTCTAG TTTCACGTGG TCATAATGAT GACTATCGAC TGAGGCTTAA GTGTAGTTTC CATGGTACAG CACTTTTTCA GGATGTCTGC GGTCGCAGCC AAACACTGAC CACAAAGTCC GCCCAACTGA CAAATCGGCC GAAACGAGAG AAGACAATAG TCACCGACTT TTTTTCGCAT GAAATGCTTT CGGTAGCCAT GAGAAGCTAT ACAGAAGTCA AGTGGAGTCC TCCTGGAGAG ACTGACGGAA ACGAAGGCCC GAGCAAGAAT GCTATCTTGG CATTGCAATG TGTTCAAGCA GCAGCGAATC AATTTGACTC GAATTTCAAG CTGCGCAATC CAGAACGCGA GGCACTTATT CCTCTAGTTG GTACCGACGA GATTGTAGCT GGGATCGACA AAATTCGATC TTCTGGATTC TATACGGAGT ACGAGCTAAA AGGAATTCGC ATTATTATGG ACCTCACTCC TGGTGATTTC GGCTCTACGA CCGAAGAAGC TCGAAGGAAG CTATCGGAGC GCTGCCAAGA GGGTCGATAC TACGCAATTA AATATCTTCA GAGCGGTTTG ATCGATTCTG ATCACGGCCC GACTGCGGCA TGCGATATGA TCATCGAAAC CAAGATCCTG ATGAATCTTG CACCTCATCC CAACATTGCC CAGGTGTATG GAGTAAATAT TGAAGGAATT GATAGTTTCC TGGAATCTGG AAGGAAATCT TTCTTCTTCA TTACTGACCT CATCACAGAA ACCCTTTCTC AAAGACTGGA GAGCTGGAAA CAAGATAAAA GTTATGTAGG CGAGGACTTG GACGACAAGA AAAAAGAAGG ACAACGACTT GAGATTGCTT TAGATGTAGC CATGGCGTTG GTTTACCTTC ACGACCGGTT TCTGGTTTTT AATATTCGAC CCGATAAAGT GGGCTTCGAC GGTCGTTATG GAAGGATAAA ACTCTGCAAT TTTAGTCAAG CCCGGCAGGA TGGAAGGACC GAGCACGCCA CAAGTATTAC AAAGTCGGAT GACATCAAAA CGCTCGCGTA CACGGCCCCT GAAATGCTCT GCAGAGCTCC GGCGACGGTC AGCTCTGATG TTTATGCCTT TGGGATTATG CTTTGGGAGA TTCTCAGTCT CACCAAGCCT TTCGAAGGCT ATGACCGATC GACGCACTTT GAAGATGTTG TAAAATGCAG TCAGCGTCCA ACAATTATTC CGAATTGGCC TAAAGCAATT TGCGATTTGA TTCAAGAGTG CTGGCATCCA CATCGGAGAC CGACGATGAA GGTAGTTCAT GAAACTTTAG AAACAACTCT ACTTTTTGCG GACGATCCCG ATTGTGTTCA TCCCACACTA ATATGTCGCA CTGCTTCTAG CCAATCTGCG GAAGAACCGC GTCAGCGGCC TTCGTCCAGT GGAATGACGG TTGAAGCATT GCGCAGGGCG CTTTCACAAA GAGATCTTCA CGACTCAAAT ACAAAACAAA ACATTCGAAG CTCATGCCGA CAGCAGCGAA GTCAGTCAAC AGGGCGACCG AGATCGAAAT CCAAGTCTCC AGGTCGAAGG TCGACTACTC CAAGTGGACT GGAAAGGCCT CGATCAACGA CGACTGGCTA CGATAATACA CGGTCCAAGT CTCCCAGTGT TCGGGGTATG CAAAGGTCAA ACTCGTCTCT TGGACATGCG AGGACTAGAT CTAAATCTCC GGCCCCACAG GCTGCTCAAG TCCGTATAAG AGTCCGGGAA GGTAGAGAAA GTGAAAGCAG GAAGCTTCAA AGCCAGATCA GTTCTCACAA TTCGGAGGCT GCCACGGATC AAAACGTTCA AATGCAACCC GTGTCCCGAA CGTATCGCCC TACAATGGAT CGGACTTCGC CGGCTTCCGA GTCGGCCCTG CGCCAACTGA AACGAAGCAA ATCGTTTACA ATGGAACATG GTCGGGGCGC ACTTGGTAGA GTTGATTCCA AGCGTAGCCT AAGAAAAGGA GTGGGCGACG AAAAGTCAAT AAAATCTGAA GGCACAACAC CGACTCACAC AGACGTGAGT TCTAGCATCG AATCCTCGCC TATCCCTCCA GGATATCCTA AGCGTACTCG TCGAACATCT CGTCACCGAA GCGCATCCTC CGAAGAATGC GAGATCAGTC GAAACGCTCT TTTGGCATCG ACGGAATCGA AATCATCACC AATTGCAAAC CTCAAGAAGG CATCGACCAC CGAAGTTGAG GGGTGGGCTT CCTTTGAAGA TGTCACACGA AACATTGTTC GACGCAACAA TATCACAAGA CCCATCTTGA CGCGGAGAAA GAGCTCCTCC ACTCCAGAAC GCCAGCTGAG TCCCTTGCGG CGAACACCAT CGAGATCCAA ATTGCATTCG GGACACGCTC CCTCGACGCC TACGACTCGT TCGCCATCGT CGGTAAAAAC TCCTTCGCCA ACGACGTCGT TTAACAGCTC CTTGAGCGGT AATCAATTCG GGTCGCCACG TAGGCCTTCG CGGCGCCAAT CAAGAAACAC CATGGATATT CTCAAGGCAA ACGGGGCGTC GACCGGAACA CCGGACGGGG ACAGTAGTCC CAAAATGGAT CTGGCGGAAG CATTAAAAGC TTTTTCAGCC AGTTTTACGG AGCAGGGGTT CGATAACATT CCTGCCCCAT CTCCAACCGT ACCGGTGAAG ACTCGTACGA TGCAAAAGAG CGCTTCGCTG CGGCAGGTTG CTACCGTTTC GCCCCTCGTG TCTCCGGATG ATCGCCGCTC GACCCGACTG GAACGCACGA GTAGCTTTCT GGAAGGCATG CGGCGCCTCC GTTCTCCCAA GGGAGGCGTT TCCCATCCTC CCGTAACTCG TGGTCCCGGA TGGTCCCCAC GTAGCCGTAG CATGAGCTTT CAGTAA
|
Protein sequence | MTVNLEFGLR SVCAAVRYRA RGVRTVAVAA NPQTDGGGNS GTLKYRAVNG TGPHGTMGSQ PCGFHSVDIL IFYISYSRLG RRRRLLVYER ERGRPVWCVP SGTRRGLGGI GFHAITNIVV AVVVHHNSLA FVATRDLRIF SNQSGPVVLR SHYPDGYEPQ RDVSMAPGRP RHPKPANPPP PALATTTSTT HDNNSSSVNN PQPEGWRVKL YRLNADGSWD DCGTGRILCL YRTPGVQKPT SPVQDGHQAN TNTSVASTLP SEATYSAETS TTTSTLTKAA TSPASPTSNK VKTNVDQWLH TETGEATLCV HAEASKQNSS RRVLLRTRVL LRDAYQRQGD NIITWCEPYY GNRATSPDGN ATSTSGNSNT SSTSGVDLAL SFQDNAGCLD IWRQITQVQG QAAELLQETL AASSSVEDMA AHVAAQHHAD LQARQAHSDA EMWNFSNTNN NNNNDEEVYA LESSPAIPMP PLPTPPSLQN IASIADTIAA LQHTQQRDSL AMRIATDDCA YLKSLLALFP AAETRGDYGK LAMLAACVKT ILLLNDPSIL EWIISVARVF EDICACLEYD PDLREKANHR WFLRDRAKFR TVVPMEDPEL VSAIHRSFRV QYLRDTLLRP TMDESALSSL GSLQTFTHAD VVKGVTMSSN GDVSLKDSYL IRVIRLLGVE TDAVGRLEWS ELEADPDAGG IETASLTTTP MLALAELPAD EMVPDGSTVV GRHGLDGTAT WKQYLAPQDS SLVSRKNRRR GCVSFLRELF NMVRTSLQQS DKDDFFAFIC SLEIDVNDGI EIPDNVSQTS QQVEVGSVAS TIKSERTDEK TESMVNTALS SHLEWSPPSS PANILSLLAN ILADPHIDVT EKCLVLEIVA GVAMHDPGLI RRHCLEYHTV WNNHEKAVPN VTIGRPDANE RRQVLFLCPP NDLLGSLLFL LDVEPDAGLL LQVTEIMRIV LDTDMMGDHG PMSAFADEAE GIPPGVPSQP HQASGPIGTT SGTDQKQFLS IFFENYVEWL VAPFQFSILH VIRRVPDDVL RCQSKSTLLQ RIVRLFQQGV TSKNALLKIV PSCAIRSSFA VEMLSFCVRA HLYRMKFFLL KSRVLGNVLK VLSPSSIVRS HSGDRCLKLA ALRFLRAVLS VNDEFYHRHI IQHKLFAPVF EAFRANPVGD NLVSSAIVEM CDYIHNENIK SLIEYIVSDY MSSAQTGDDV PSLEDVSSPY VSTLTVLRKA YETNIHAIRQ SHNCEEGASS PGGSRYFPGG ANHYPHGPRV LSGKALEDQR KFLEVDEEES YFESDDESAC STMILPATEA EVAQQQVESN LERIPRMYSL SQAPPLSEVE DAKEIRHAVY ASDVAAEAAP DAVRDCGAYS STAARLIVVS QSFTHKACWH AGRVVLGTHV FRAGAILSIT LREVLISMKR RIHASAFKMV KVDTLFQDVC GRSQTLTTKS AQLTNRPKRE KTIVTDFFSH EMLSVAMRSY TEVKWSPPGE TDGNEGPSKN AILALQCVQA AANQFDSNFK LRNPEREALI PLVGTDEIVA GIDKIRSSGF YTEYELKGIR IIMDLTPGDF GSTTEEARRK LSERCQEGRY YAIKYLQSGL IDSDHGPTAA CDMIIETKIL MNLAPHPNIA QVYGVNIEGI DSFLESGRKS FFFITDLITE TLSQRLESWK QDKSYVGEDL DDKKKEGQRL EIALDVAMAL VYLHDRFLVF NIRPDKVGFD GRYGRIKLCN FSQARQDGRT EHATSITKSD DIKTLAYTAP EMLCRAPATV SSDVYAFGIM LWEILSLTKP FEGYDRSTHF EDVVKCSQRP TIIPNWPKAI CDLIQECWHP HRRPTMKVVH ETLETTLLFA DDPDCVHPTL ICRTASSQSA EEPRQRPSSS GMTVEALRRA LSQRDLHDSN TKQNIRSSCR QQRSQSTGRP RSKSKSPGRR STTPSGLERP RSTTTGYDNT RSKSPSVRGM QRSNSSLGHA RTRSKSPAPQ AAQVRIRVRE GRESESRKLQ SQISSHNSEA ATDQNVQMQP VSRTYRPTMD RTSPASESAL RQLKRSKSFT MEHGRGALGR VDSKRSLRKG VGDEKSIKSE GTTPTHTDVS SSIESSPIPP GYPKRTRRTS RHRSASSEEC EISRNALLAS TESKSSPIAN LKKASTTEVE GWASFEDVTR NIVRRNNITR PILTRRKSSS TPERQLSPLR RTPSRSKLHS GHAPSTPTTR SPSSVKTPSP TTSFNSSLSG NQFGSPRRPS RRQSRNTMDI LKANGASTGT PDGDSSPKMD LAEALKAFSA SFTEQGFDNI PAPSPTVPVK TRTMQKSASL RQVATVSPLV SPDDRRSTRL ERTSSFLEGM RRLRSPKGGV SHPPVTRGPG WSPRSRSMSF Q
|
| |