Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42442 |
Symbol | |
ID | 7196645 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 67387 |
End bp | 70840 |
Gene Length | 3454 bp |
Protein Length | 980 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177010 |
Protein GI | 219110517 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.285973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGCAATC GCGGCATCCC CCGGCATTTG TGTCGTTCTT GTCGCCTCAT TGCCGCCTTG CATTCCCTTG CTTGAGTTTC TTCTTTCCGT ACTTTTTCCG TATTGAAAAC TGCCTTTCAC CATGGCTTCT CCCTCTTCCG GCGGTAGCCT TTCGTACGTT CCAACGCCGG TAGAACGGGT ACGTGTGCAG TGTGTGTATG TGTATGTGTT TGTGTGTGTC TCTATCGTGT ATGGGATGGA CTGAAGGGTG CGTTAGTCCT GTCGGAAGCG AGCGCTACGA CACCATTGCA ACAACAAGCA ACGAGTTCAA GCTTGTCACA GTGTGACTGT CACTTGGACG GGGAACTTTG CCTTTTGAAT TGGACTGACT GTGCCTGTGA CTGTGTGCAT ACAGCCCTAC TTTGAAGGAC TCTTTGCGGC AGCCGATACC CAAGGCGGAG GGCAGATTGG TGGAGCCCAA GCTGTCCCGT TCTTTCAGCG TTCGCAGCTT CCCACCGAAG CTCTCCGCAA CATTTGGACG ATAGCCGATC AACCCCCCAC CAATGCTTTG GATCACCGTA AGTTTGCCGT CGCGATTCGG CTCATTCAGC TTTTGCAGAA CGGAAAGCAA GGCGAAGGAC CCACTCTACA GGCTCCACAG GGTGTGGATT TGCGTCCCGT GTACTTTGAA GGAATCAGTG GTGTTTCCGT CCCCCTCCCG TCGATGGAAC AACAGCAACA TCCACACCAA CCGCAGCCAC AGATTCCTCC CGTGCAGCAA CAGCCACATC CACAGCAACA ACAACAGTAT CCTCAACAGC AGCACGCCCA CCACACTCCG CCCCGCCCTC CGTCGTCAGC ATCGCAGTAC GCTCCACCCG TGCAGCAACA ACAGCAGCCG CCGCGCCCTC CTCCCTCCAC GAGTATGGCC TTGACGCCAC AAGATCCGTA CACGCTCCCA CCCAATGAAC AAGCCCGCTA CGAGTCCATC TTTGCCGAAT ACACGCAACC GGACGGATTT GTTCACGGCA AGGAAGCCGT CGCGCTATTT TCCAAGTCGG GCCTTCCCCA AACACAGTTG GCAAGCATCT GGAACATGGT CGATACACCC GTGGATAATA AACTCGACAA GGTGGAATTC GCGATCGCCA TGCATTTAAT TGTTTGCATT TCCAAAAAGA ACCTACCAAT GCCACCCTCG TTGCCGCTTT CGCTCAAACA GCTTAAATCG CAGGCCCCGC CTCCGACCTC GGTGCAAACC CAGCTTCCCA CAGTTGGTGC AACTTCCTCC CAAGGATTGC CCTATCAACA AGAGCACCAA CAACACCAGC CGCAGCAGCA GCAGATTCAT CGTACCATGA CCAACGATGC CGGTTCGGTG GCTTCCGTTC CACCGCCTCC GGTTATCTCC GGGGGTCCCC CACGGTCGAT TCAGTTGCAG CCCCAATCCC AGCAGCCTCC GCCCAGTGGG TTGTCGGTTT CCGCGTCTCT CCAGGGTCCA CCGCCACTGC CAGCTCGTGG TGAGGGTGCG CTGAGTATCT CGGACGCGTT TGAAGGATTA TCCGTGGACG GAGCAGCGGG TTCGTCGTTT TTGCCCCAAA CACTCGCCCC CGCCTCCTTT GGTGCACCGA ACAACCTCGG CACGACGGCA TCCTTCGACA ACGCCAGTCA CGCCAGCAAT ACTGGTGCAG TCAGTGACGT CGGTGGGGGC ATTCCCAGCC CGGGACGCAA CGCCGCCTCG TCTTTTGCCA TGGGACCTCC GGCGATTGTG ACGGCCACCA GTCCGGCGCC AGCGCCCAAA ACAACCCAAC AGCTCGCGTC TAGCTACAGC ATGGGTGACT CAACGCAAGA ACTGGAAAAG CTCAAGGACG TTTTGCAAAA GTTGCAGGCG GAGAATATTG CTCTCAAAGC ACAGCTGGGC ACGATGACGG GCGACGAGAA GGATGTCCTA AAGCAATTGG GTGCGACGGT CGCCGAAATA TCCACTCTCT CCAATGAATT GACTACCGTA CGTGCACAAG TGCTAGCATC CAAGTCCCGC TTGGTGGAAG CAACTGCGGA ATTGCAGGCA GCCAAGGAAA AGAAAAGGTA AGGAGATTGT CTAAATGGGG GAAACACAAT GAAGGAAGGG TAATCATCGA CTTACCAACA CATTTTGTTT TCTCTTTAGT GTCGTCAAGG ATCTGATTTC GGAAGCCAGC GAAACGAAGA GTGCTATTCA GCAGGCTCAT ACAGGTGTAG AAGAAGCAAT TGAAATGGCG AAGGCCCCAC CTCCGGCAGC AAACGGATTT GACGGCGACT TGTTTGATTT CGGCGGAGCA GCTCCCGCCC CATCGGGTCC TGTCGCCCAG GACAGTTCCT ACGCTTCGAA TGCGGAGTCC ATGCACCCGA ACCCAATCAC GGAGCCGCCG GCGTATCAAA ACAACCAGGT ACTAAAAACA GTTGCGTCAA ACGACTCCGA GTATGGACAA TTGAAGGAAG CGGTCCTGTC AACAGACACG TTCAATTCAA GCTACGGAGA AGCATCGAAA GCAGGGTTGT CGAACTACGC CTCTCACTAC GGTCAGCTGG AAACAGTGAC ATCGTACGAA TCCAACCAGT CGGATGGAGG TCCGGGTCAC AACCGCACCG CTTCCGCAGC TTCGTTGGGT TTCGACAGTA GCATGGTAAT GGGCGGCGCA CCGCTGGACT ACTCCACGGG CTCTACTTTA GCCGGACCGC CCCCGCCAAC CGATCGTTAT CAAGGCAAAA ACGCAGATGA CCACAATTCG ACGCCGTCGA TTGGAGATGT CAACGAATTG AGGCGCAGAG CCAAAGAGGC AGAGGACGTC GCACGAGATG CGGAAGAGTC GCGTCAGCAA GTGGCAGCTC AAGTCGAAGA GCTGCGTCGT GTGGCCGATG AGGCGGAAGC GGAAGCTCGC AAACATTTGG CCGGTGGAGA CGGCAAGAAG AAGAAAGTCG GTATGCTGGG TCGGGGTAAA AAGCGAGATG CGGTACGTAA GCTTCCTGGG TCCATTGTAC CGGCCAATGA TTCTTAACGA ATCTTGGTTG AAAAAGACTT ACCTTTATTG CTTTTCGTAC ACAGAAAGAA GGAGAGCGGC TCGCACTGGA GGCAAAAACC AAGAAGGATA CGTTTCTCCA GGCACAGTCG CAAGCCAATG ATGCCCAGGC TTTGGCACTG GACACAAAGC GCGAAGCCGA TCGTTTGCGG CAGCAAGCCG AGGAGGCCGA AATCAACGCT GCGTCGGCCG CTTCTATGCA GCATAGCCAG CCGGTCGCTC CTTCTCAGCA GCCATCCAAT GGATACCCAG CACCAGCTGC TACTCCAGCG TACGGAACAG GAATGGGACA GCAACCCCAA TACGGGGGTA CATCTTTTGG CGGACAATAC AATCCCAACG TCATGGGTAG TGGTGGCGTT GAGATTCCCA CACCGAGCGG AGGCGAAGAT CCATACTCAA ACCCTTTCGG CTAA
|
Protein sequence | MASPSSGGSL SYVPTPVERP YFEGLFAAAD TQGGGQIGGA QAVPFFQRSQ LPTEALRNIW TIADQPPTNA LDHRKFAVAI RLIQLLQNGK QGEGPTLQAP QGVDLRPVYF EGISGVSVPL PSMEQQQHPH QPQPQIPPVQ QQPHPQQQQQ YPQQQHAHHT PPRPPSSASQ YAPPVQQQQQ PPRPPPSTSM ALTPQDPYTL PPNEQARYES IFAEYTQPDG FVHGKEAVAL FSKSGLPQTQ LASIWNMVDT PVDNKLDKVE FAIAMHLIVC ISKKNLPMPP SLPLSLKQLK SQAPPPTSVQ TQLPTVGATS SQGLPYQQEH QQHQPQQQQI HRTMTNDAGS VASVPPPPVI SGGPPRSIQL QPQSQQPPPS GLSVSASLQG PPPLPARGEG ALSISDAFEG LSVDGAAGSS FLPQTLAPAS FGAPNNLGTT ASFDNASHAS NTGAVSDVGG GIPSPGRNAA SSFAMGPPAI VTATSPAPAP KTTQQLASSY SMGDSTQELE KLKDVLQKLQ AENIALKAQL GTMTGDEKDV LKQLGATVAE ISTLSNELTT VRAQVLASKS RLVEATAELQ AAKEKKSVVK DLISEASETK SAIQQAHTGV EEAIEMAKAP PPAANGFDGD LFDFGGAAPA PSGPVAQDSS YASNAESMHP NPITEPPAYQ NNQVLKTVAS NDSEYGQLKE AVLSTDTFNS SYGEASKAGL SNYASHYGQL ETVTSYESNQ SDGGPGHNRT ASAASLGFDS SMVMGGAPLD YSTGSTLAGP PPPTDRYQGK NADDHNSTPS IGDVNELRRR AKEAEDVARD AEESRQQVAA QVEELRRVAD EAEAEARKHL AGGDGKKKKV GMLGRGKKRD AKEGERLALE AKTKKDTFLQ AQSQANDAQA LALDTKREAD RLRQQAEEAE INAASAASMQ HSQPVAPSQQ PSNGYPAPAA TPAYGTGMGQ QPQYGGTSFG GQYNPNVMGS GGVEIPTPSG GEDPYSNPFG
|
| |