Gene Information |
Locus tag | PHATRDRAFT_54789 |
Symbol | |
ID | 7202729 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 697009 |
End bp | 701055 |
Gene Length | 4047 bp |
Protein Length | 1205 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182117 |
Protein GI | 219123613 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGGT TACTCGCCTC CGTGAGTAAA GGACGAGATG TTAGTGACTT TTATCCGCAC GTCGTCAAGC TGGTCGGAGC TTACAGTCTG GAGGTACGCA AAATGGTCTA TATGTACCTC GAACAGTACG CGGATCACGA CCCAACAACA CGCGAACTCT CTCTGTTGTC CATCAACGCT TTCCAACGTG GTTTAGCCGA TACGGAACAA TGGATTCGAG CTCTGGCTTT GCGTGTCTTG ACCTCGATTC GACTCGCCGA TATTTTGCAA ATTCAAATAT TGGGCGTCCA AAAATGCTCT CAGGATTCGT CACCCTACGT GCGTAAGTGT GCCGCGAACG CCTTGTCCAA GCTGCATCCG CGGTGTGCAC CAGATCCGTC CCAGCAGACC CTCTTATTGG AGATTTTACA GTCCATGCTG GATCGAGACA AGGCTACCAT GGTGCTAACG TCCGCCTTGA TTGCGTTTCA AGAACTGTGT CCGGAACGGC TGGAACTCTT GCACGGTTCT TTTCGAAAAA CGTGTCATCT CTTGACCGAC ATGGACGAGT GGGGGCAAGT CGTGACTATT GAGATTCTGG CACGATACTG TCGACGTTTT TTTAAAGAAC CCCTGGGATG GCGGAACGGG TCTGCGGAGC AGATTGATCG CGAACGTCGA GTACGGAGGA CCGTTGCTAC CACACGTCCC GTAACAACCT ACAATGCCAA CTCTCAGGCT ACCAGCGCTA CGTCGGCATC TCCGCTACCG GAGCCTCTAT CCACTAGAGC TGCACAAACA GGGGTGTCCC TACCAACCCA TTTTAGAGAT CATGTGGACG ACAAGACTTC TTCTACCGCT CATCCGCCTC GCAAAGTCAA ACGTCGCGTT GTGAAAGAAG GTTTCTATTC CGACGAAGAG GATGCAAGCA CCGAGGAAGA AGTGTACGTG GATGAACTTA ACAGCCCTTC ATTGCCATTG GCGGCAGCTA TGCGGCAACG CAACATTTTG GGTCTTGCAG GTCCCGATGG TACGAAAACC GTTCGACAGT CTTCCAACGT TTTGTTTTCG ACTCAGGAGG ACACCGAGCT GGCCGAAGAT CATCAACGTC TCCTACATGC CGCTATGCCT TTGCTCAAAA GTCGCAACGC TGGCGTCGTA CTCGCCACCT GCTCTCTGCA ATATTACTGT GGTATCTCCA GTATTCAAGT ACGTGCCGCT ATGGGAAGGG CACTTGTCAG GATCCATAGA GATTGCCGCG AAATTCAATA CGTGGTATTG ACCGCCATTC GCGATTTGGT GAAGCATTGC CCATCAGCGT TTGCCCCATT TTTGCACGAT TTTTTCGTCA AGGCTCTAGA TCCGCCCTTT ACTCGTCTGA TCAAGCTCGA TATTCTGACT TCGCTGGCGC TGGAGCCTGC TGCCATTAAA GCCGTGCTGC AAGAAATGCG CTCCTACGTG CGAGACGGAC ACGTCGAATT CGTGCGGCAT GCAATTCGAG CAGTTGGACG TACCGTCGAA TTAGCTCGCA TCGTGTATGA TCGACACGGT CAAAAATCTG GCAAAACCAG CGTTCTGGCT AAAGAACGTG CCGAAACGAA TAGTATCGCA TTGGATTGCT TGCATGGACT ATTGACGTTG ACGCAAACAT CAGATCACGT TGTCATTGTT GGAGAATGTG TTTGTGTGAT GCAGCGCATT TTGCAGCTGT TGCAAGCGCC TGAGCCCTAC ACCGGCGAAA TTTCTGTGGT TAAAGATCCT AATAATGTTC AGCAACGAGC CGTGCAGCGC ATTTTGATAC TGCTGGTGTA TACCCTATCT 
TCACGCGTCG AGAACGCACC AGAGGATGAT GAGGACGCTT CTGAACCGAC TGTGTTGGCA AAAATCGCCG TTTCGCTTTC ATCCGATGCA ACAGCATCCG CCTTATGGGT TGTCGGAAGT TTGTGTTTTG CGCCTCTAAC GGAATCACCG CTTAGTGAAT CGGTGGGCGT TGGCCTGGTT AAGGGTTCTG CTCGTTTAGA AGTGGCTCGT CTAATAGCAC GGGCGTTTCT GGAAATGGAA GCGGTCGAGA AGGAGCAAGC AATTCATTTC GCATCTCGTA TTATGGTCTC TAAGGCCACC TCTTTGAACG GATCGTCAAC TGAAGAGTTT GCCCTGTGTG AGGCTATCTT GTCGATGGCT CGTACCGACG TCAACGTCGA TGTTCGAGAT CGTGCCCGAT TCGAGTCCAA CCTTGTTCGA GCCACCGTCG GCCTTCAACA TGACACAGAC GCAATGGAAG ACCTACCAGT ACTAAAACGA CAGCTGACGG TCGGAGATGC AAAACGAATG TTGTTGACAT CCAAACCGGC ATGTTCTTCT CTTCCACTGG AAGATGATTT CAGTACCGTT TCGGGCGAGA ACGGTGGCTT TCGTTTTGGA ACTCTCAGTA GCTTGGTTGG CCATCGTGCC CGTAAAGCAT ACTTGCCATT GCCCCGCTGG GCGGATCAAA ACAGTTCTGA TACGTTACGT GTGCCAATTG AAGACAAAAA GACAGATGCT TTAAAAGATG TTGAAGGTGA GACGAGAACG AAGAACACGA ACGGTGCAAA TGAGTTTTAC GAGTCCTCAG ACGATGACGA GCAGGACAGC TCTTCGGAAA GCTCCTCGCA GGACAGTAGC GATGAAGCCG GATCTTCCTC GGACTCATAC AGCGATGAAT CATCTTCTTC TGACGATGAC GACGAGTCTT CTAGCGATGA TAGTGATGTA GGCATGCAAA GCCTTGGTCA AGATGCCACG TTGATACCGA TGGAAGTTGA ACAGCGGAAA GTCGCGCACG ATTTGAATTC TCAGAAACTA CCTCTTCCAG TGGTAGAGAA TGTTGACGGG TCTTCCTCTT CCGAAGAGGA GGCCAGCAGC ACGTCTAGCG ATGATGAGAC CTCGACCGAT AGTTACAAAC TCAGTCCAAA AGCCAACGGT GGGACACATG ACGGCACTTT CATACCCCTA GACGCTTCCA GCAAAGCTGC GCCTGCTGCA ACTTCCACCA TTGCCTCCTC CTTAGCTAGT GACTTTGAAG GCATGACATT GGCACCCGCT ATCCAAAATC AAAAGCCGCA ACTGGATCCC GACCGCGACA GGGATTCTAG CGTTTGGCAA GTCTGGGTAC GGCCCGAACA CGCGAATGGA TTGTTGGTGA AGATTCGCTA TCTACGAGGA CCAACTCGGT CCAAAGAGGC GCAGGTTTTG GTCGGCACGG GAGCCGAGAA ACCTTCCCTG GTCCTGTTGC AAGTGAGATT TGAAAACAGT AAGGATACAA CAGTTCGGCG ATTGCGCATT CTCCAACGGG CTTCCGCTTC GGGTACGTCT TCATCCATTG CACCCCGCAA AATGCTTCTT CCTCCCGAAA TCGACCAACT GAAAAAAGGA CAAACCGTGG ATCACATCGT GGCCATTGAA TTCGCCAGTG TTTCCGATCG GGAAGGTACA ATGTTGGCAA AACTGGAAGT CAAGTTTAGC ACTGGCGGCA TACCGGTGGA AATAAAGCCG AGTCTTTGCG ATTTATTGTT GCCCTGTTTT CGATCGGTGG CAGACTTTGA TCAAGCCGTA GCCCGACTGC AAGGCTTTCA ACGGGTGGAT ACACGCTTTC 
CTATGTCCGA CGATTCCCAA GCCCAGCGTG ACACCCTGAT GTCCCGTTTG ATGCGAACGG CGCCCTGGAC ATTGATCCTC GAAGGTGATG CCGAAGCTAC CAGAGATGAA ACATGGCCCG GCCAAAAGTT GCGTTTGGCG GGCACGCTGC CAGCATCGTC CGATCCCGTG TACGTCTTGG TGACAATCAC GGGGTCTGGT ATTACGGGCC CGGGTAGTGC CGGTGGTGGA TGCCAGGCAC TCTTGTCTGT TTGTTCCGAC AACGCATTGG CCGTGAATAG CATTTTGAAC ACGTTGAAAA AGACGGTTCA GAACTTGAGC GATACGGAAA CACAGTAGTC TTCTATAGCC AGATAGGTAG TACACAAAAA AAAACTCTGT CTAGAGCAAC TCTTCGCGAA ACCGTAAACG TCTTAAGATA ACGCAGCCGT AAAGGTA
|
Protein sequence | MKWLLASVSK GRDVSDFYPH VVKLVGAYSL EVRKMVYMYL EQYADHDPTT RELSLLSINA FQRGLADTEQ WIRALALRVL TSIRLADILQ IQILGVQKCS QDSSPYVRKC AANALSKLHP RCAPDPSQQT LLLEILQSML DRDKATMVLT SALIAFQELC PERLELLHGS FRKTCHLLTD MDEWGQVVTI EILARYCRRF FKEPLGWRNG SAEQIDREQG FYSDEEDAST EEESSNVLFS TQEDTELAED HQRLLHAAMP LLKSRNAGVV LATCSLQYYC GISSIQVRAA MGRALVRIHR DCREIQYVVL TAIRDLVKHC PSAFAPFLHD FFVKALDPPF TRLIKLDILT SLALEPAAIK AVLQEMRSYV RDGHVEFVRH AIRAVGRTVE LARIVYDRHG QKSGKTSVLA KERAETNSIA LDCLHGLLTL TQTSDHVVIV GECVCVMQRI LQLLQAPEPY TGEISVVKDP NNVQQRAVQR ILILLVYTLS SRVENAPEDD EDASEPTVLA KIAVSLSSDA TASALWVVGS LCFAPLTESP LSESVGVGLV KGSARLEVAR LIARAFLEME AVEKEQAIHF ASRIMVSKAT SLNGSSTEEF ALCEAILSMA RTDVNVDVRD RARFESNLVR ATVGLQHDTD AMEDLPVLKR QLTVGDAKRM LLTSKPACSS LPLEDDFSTV SGENGGFRFG TLSSLVGHRA RKAYLPLPRW ADQNSSDTLR VPIEDKKTDA LKDVEGETRT KNTNGANEFY ESSDDDEQDS SSESSSQDSS DEAGSSSDSY SDESSSSDDD DESSSDDSDV GMQSLGQDAT LIPMEVEQRK VAHDLNSQKL PLPVVENVDG SSSSEEEASS TSSDDETSTD SYKLSPKANG GTHDGTFIPL DASSKAAPAA TSTIASSLAS DFEGMTLAPA IQNQKPQLDP DRDRDSSVWQ VWVRPEHANG LLVKIRYLRG PTRSKEAQVL VGTGAEKPSL VLLQVRFENS KDTTVRRLRI LQRASASGTS SSIAPRKMLL PPEIDQLKKG QTVDHIVAIE FASVSDREGT MLAKLEVKFS TGGIPVEIKP SLCDLLLPCF RSVADFDQAV ARLQGFQRVD TRFPMSDDSQ AQRDTLMSRL MRTAPWTLIL EGDAEATRDE TWPGQKLRLA GTLPASSDPV YVLVTITGSG ITGPGSAGGG CQALLSVCSD NALAVNSILN TLKKTVQNLS DTETQ
|
| |
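The coordinate and composition fields in this record can be cross-checked with a short script. This is a minimal sketch, assuming 1-based inclusive coordinates (which matches Start bp 697009, End bp 701055, and Gene Length 4047 bp) and a sequence string laid out in the space-separated 10-base blocks shown above. The function names are hypothetical and are not part of PanDaTox.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace separating the 10-base blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly space-separated) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(697009, 701055))
    # First two blocks of the gene sequence, as a small illustration.
    print(gc_percent("ATGAAATGGT TACTCGCCTC"))
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare against the reported GC content of 51%. The 4047 bp span is longer than the 3 × (1205 + 1) = 3618 bp needed to encode a 1205 aa protein plus a stop codon, which is at least consistent with the record marking the gene as spliced.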