Gene Information |
Locus tag | PHATRDRAFT_43724 |
Symbol | |
ID | 7197013 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1291176 |
End bp | 1294201 |
Gene Length | 3026 bp |
Protein Length | 890 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177797 |
Protein GI | 219112091 |
COG category | |
COG ID | |
TIGRFAM ID | |
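The gene length reported above follows from the start and end coordinates if both are treated as 1-based and inclusive. The short sketch below checks that arithmetic. The function names are hypothetical and not part of the database. The coding-length estimate assumes three nucleotides per residue plus one stop codon, which is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein, assuming one trailing stop codon."""
    if protein_length_aa < 0:
        raise ValueError("protein_length_aa must be non-negative")
    return 3 * (protein_length_aa + 1)


# Values taken from the record above.
length = gene_length(1291176, 1294201)
coding = coding_nucleotides(890)

print(length)           # 3026
print(coding)           # 2673
print(length - coding)  # 353
```

Under these assumptions, about 353 bp of the 3026 bp span would not be protein-coding. That is consistent with the gene being marked as spliced, although the record itself does not give the intron or untranslated-region boundaries.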
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTCGGGGCA TTCAACATAA CGTGCGCAGC CTCAAGCCAG TCTAAAAGTT GCCAGGCTTA TATCAAAGGT GCAACACGAT GAAGAATGGT AGCCAAAATC CTCAGAAAGA TAAGGAGGAA TTAGCCGAAG GCGAGCATCG ATCGAGGGAT TTCTCCCCGC CGAGCAGCGA CACGGAAGCT GACGCTCGTC CTTTGTCTGA AGATGGCACC AAAAGCTCCT CCAATCTGCC AGCGGATTCT CCTGGAACTT TATCCACTGC CCCTGAAGCG TCTGCGGCAT CTTTGAGATT AAAGCAACTC GAGCAAGATG TCGTCTCGAA GACATATAAC AGGGGTAACG CCAAACCCTC CCAGCCTGGC GCCGTCGCTG AAGGCGGTGC GTCCAATCTT TCGCGACTGG AACAAGAAAT CGCCGCCAAG ACACGAGGCA CCGATCGATC GTCTAGTGGA GGGTCTGTCG GTCTGATTCA ACTGGAACAA GACGTTGCTG CCAAGGCGCG TGGCAGGAAC ACTACCGCTG CCACTAGACC GGGTGCAGCA CCTTCGGGCG CAGCTGGCCT CTTGGAACTG GAGCGTGACG TCATTGCCAA GTCGGTGCGT AATTCGAATA CTTCGCAGCC TGGAGCTGTG AGTGAATTGG ATGTAGTGAA TGGTGTGGGC GTGACATCCG CCTCGACAGC TTTGAATCAG CTAGAGCGGG ACGTCATAGC CAAGTCGAAG TCGTATGATG CACGATCGAG CGCCAGCGTC GGCCTGCACC AGTTTGAGAA CGATATTTTG GCAAAGAACC AGGTACGTCT ACCAAAGGAG GCTAGTGCGA GTGCCGCCCA GTCGTTAGTT CAAATGGAGA ACAATCTAGC GGCTAAGCTG AGTGCTTCGG GAGCGGCTGT CTCGCCTAGT AATTTCACTT ACTCGGGGAA AACGGACACA CGCTCCAGGG CACTGATGGC TCCAAACGCG CTGACGGCGA TGCAGCAGCT GGAGCACCAA ATTGCGCAAA AATCAGAAGC GACTGGCGGT ACCACCCCTC CCCCCGTTGG AGTCCTTAGC TCCGCTGTCC GAGGACCAAG TGTTCCATCT ACAGGAAGCG GTCGTCCACT TCGAGCTGAG TTTACTCCTG ATTTTTCAGC CTACGATATC CCCCCTCCAT TTTCTTATGA ATCACATCCC GTAGGAGAAC ACGCACCACC GGATATCCAC GGCTTTCATG GCACACAGTA TAGCCAAGAT ATTCCTGGTG CAGAATCAGG CGGTATTGAA GCATTCGTTG CTGACAACGT CGTCGAAGCA GTGGGCGTTG CGGTCATCAT GAGCGAAGAA GAGGAGGAGG CGGTTGGCCG CAAGCGTCGG AAAAGGTACC TTTGTTTCGG TGCACTCTTC TTGTTGTTAC TGGTTGTTGC GATTGTGGTA ACAGTGGTTA TCATTACGGG TCGGTCTAGT ACTACTGTCC TAGATTTGCC CCCCACTGCA GCACCGTCGA GCGCTCCGTC TTCGGCACCG TCATTTGCCC CCACAACGGA TGGTGTACAG GCTTTGATCT CGTGTCTGAC GCCTGCAACA AGCATCGAAA CTTTTCAAGA CCGTGCATCG GCGCAATATC GAGCTGTCGA GTGGTTGACA AACACGGATC CGTTTGTGCA GATCAATGGG CTTCAATGTG ACAGCCCCAA ATTTTTGCAG AGATACGCAT TGGCGACCTT TTACTTCGCC CTTTCAGGAG AAGAATGGGA AATCTGTGGC CTTCAAAATC CCGAATGTAC CAGCGACCCA GCTGACTTTG GATGGCTCTC TACTCAAGAC 
GAATGCAATT GGTACAAGGT TAGATGCAAC ACTCTAGACA TGGTGGAATC CATCAATTTC GGTATGTTAT CCTGCCGAGT CGCGATTTGT TCCCAATAAA ACGGTCTACC TAACATCGCA ATATTCCCAT TTTCAAATAG CGGACAATAC CGCGGTCGAA CGGGCACTGA CGATCCTGAA GGGTGCCATT CCGAAGGAAC TGCAGTACTT GACGGACATG GTTCAATTTG TTGTCGCCGA CATGCAGATT GAAGATTCCA TTCAGGATAG TTTTTCTACT TGGTTGAAAC TAGAGCGTCT GATCTTGAGC AGAAACAATT TTACGTCGAG CATCCCAGAC GATATTGCTA TCACTAATCC GCTCTTGTCT GATTTCCAAG TAAGTGACAA CCAGCTTTCG GGTCGTCTTC CAGATGGATT GGTCAGCTTA TCATTACAGG ATTTGAGACT GGATGGCAAT AGATTCACAG GAAGTCTACC ATCATCTTTT GGGGAGAATT CCGAAAGGTT GAGTAAGTAG CGCAGCAGGT ATTGCCTCGA CGAAAGCTTA CAGAAAATTT TGCAACATCT CATGTCGTCT CATCTATCGC AGACAACCTC GCAGTACAAA GGAATCAATT GGGTGGTCCT CTCCCCAGTT TGCTCTGGAC GCTTCCAAAC CTTAGAACCC TTGATCTCAG TGAAAATGCT TTCAGTGGTG AGGTACCGAC CACGATTGGA TTGATGCAGA ACTTACGCGT ACTTCGTTTG CATAGCACGC AGCTCGGAGG AGAATTGCCA GCTGAGTTCT TTGGTATTCC CAATTTTTCG ACACTAAACA TTGCCAATTG TCGCTTCCGC GGAGCTCTTT CGGAGAACTT TATCAATTTT AACCAAACAC TACAGGAAGT GATCGTAGCG TTCAATAACT TTACCGGTCC TATACCCGTT GAAGCATTCG AAGCCGCTCA ATTTCTTGGT ACGTACGATC TGTATCACAT TCACGCTCTA GATATTGAAC GGATGCGTTT CTAATACAAC TTTCCAATTA TAGAGGAGCT TAACCTACAG GGAAACCAGC TTTCAGGCGT TATATCCGAA GCACTGTGCA ATACAAGGGG AACGGCTTTT GGCCAACTAG CCTTTCTAAT TGTTGACTGC AACATTGATT GCAATTGTTG TGATCCGGTG TCGGATTGTG GCTGATGGTG CGACCTACCG TCGCTTTGAC ACATGTTTAT GATGAT
|
Protein sequence | MKNGSQNPQK DKEELAEGEH RSRDFSPPSS DTEADARPLS EDGTKSSSNL PADSPGTLST APEASAASLR LKQLEQDVVS KTYNRGNAKP SQPGAVAEGG ASNLSRLEQE IAAKTRGTDR SSSGGSVGLI QLEQDVAAKA RGRNTTAATR PGAAPSGAAG LLELERDVIA KSVRNSNTSQ PGAVSELDVV NGVGVTSAST ALNQLERDVI AKSKSYDARS SASVGLHQFE NDILAKNQVR LPKEASASAA QSLVQMENNL AAKLSASGAA VSPSNFTYSG KTDTRSRALM APNALTAMQQ LEHQIAQKSE ATGGTTPPPV GVLSSAVRGP SVPSTGSGRP LRAEFTPDFS AYDIPPPFSY ESHPVGEHAP PDIHGFHGTQ YSQDIPGAES GGIEAFVADN VVEAVGVAVI MSEEEEEAVG RKRRKRYLCF GALFLLLLVV AIVVTVVIIT GRSSTTVLDL PPTAAPSSAP SSAPSFAPTT DGVQALISCL TPATSIETFQ DRASAQYRAV EWLTNTDPFV QINGLQCDSP KFLQRYALAT FYFALSGEEW EICGLQNPEC TSDPADFGWL STQDECNWYK VRCNTLDMVE SINFADNTAV ERALTILKGA IPKELQYLTD MVQFVVADMQ IEDSIQDSFS TWLKLERLIL SRNNFTSSIP DDIAITNPLL SDFQVSDNQL SGRLPDGLVS LSLQDLRLDG NRFTGSLPSS FGENSERLNN LAVQRNQLGG PLPSLLWTLP NLRTLDLSEN AFSGEVPTTI GLMQNLRVLR LHSTQLGGEL PAEFFGIPNF STLNIANCRF RGALSENFIN FNQTLQEVIV AFNNFTGPIP VEAFEAAQFL EELNLQGNQL SGVISEALCN TRGTAFGQLA FLIVDCNIDC NCCDPVSDCG
|
| |
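The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, with hypothetical helper names, of removing that whitespace and computing a GC percentage comparable to the "GC content" field. It is demonstrated only on the first two blocks of the gene sequence, not on the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
raw_fragment = "CCTCGGGGCA TTCAACATAA"
fragment = clean_sequence(raw_fragment)

print(fragment)              # CCTCGGGGCATTCAACATAA
print(len(fragment))         # 20
print(gc_content(fragment))  # 50.0
```

Applied to the complete gene sequence, the same two functions should yield the length and GC percentage for comparison with the values listed under Gene Information.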