Gene Information |
Locus tag | PHATRDRAFT_43552 |
Symbol | |
ID | 7197582 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 795123 |
End bp | 798302 |
Gene Length | 3180 bp |
Protein Length | 971 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178005 |
Protein GI | 219112507 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.774573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGCTGCTC TTGGTGAAAT ACGCAGCCAG ATTCGTAGAG CTGGATTTGA TTCTAGTCTC TTGAGCGACC TCACTGTACG CAAGCTACGC GCAGGAAGCA GCAGATCCGT TAACGGCCAC AGATAGATAC ATCGTTCGCG CAGAGAAATA CGGAACAACT TGTACTCACA TTCTGGACGA CTGTAACTTG AAATGCTGCA CACCCCGGCG TCTCCATCTG GAAGTGTTGG ATCGTTGAAA CGAGATAAGG AGGAAACTGA TTTTGATTTT GATCACGACG AGGGCTTCAT CATACCGTTA CAGATGTATT CTAACACAGG TTCACGCATG CGGAACACCG CTCTGACTCC ACCGTCCCGG CGCAAGTTTC CCTGGATGTG GACAATGCTT GTGTTCCTAA CATTGATTAG CATAGCCTTT GCTTTCTGGA GCGAGCTCCT GGAAGCCTTC CCGACTATTA CGATACATTC AAATATCGCC AATCACGAAA GACAAAGTGG AGTTTTTCCG ATCCAACCCG TTGCGTCCGC TTCACGACGA GGCTCTGGCG CGCCGGTTGC GGTCGTTGTC GATACGCGGG AATCGGCGGG AGAACAACAA CAAACAATAA TGACGACTGA AGCGGCTGCA TTTCCACCTC CGAAATTGTC GGGAATTCCT TTGCACGCCG CCCAGGCACT AGCTTGTCGC CAGTCCGTAA TTAACTTCGT CATCAACGCG ACGGACGGCA AGGATGAATG CGAAGGATTG AAAAAAGCCT TCGAGCGTAC GTGCAGCAAT GACGGTTACG AAGACTCCGA CTCTAAAGAT AAAGCAGGGA GACGCAAGCA CCGTAACATG AGTGAGAAAC TATGGGACAA ATCAATACCC AAAGTAGATC GATGGCACGA GTCGGTGTTC TACCTTTCGC AAACGCTTCG CCGGATAGGT GACTGGCTTA TGTGGAAAGA ACCTGCCCCC TTCTTTTTGG CCGAAGACGA GGTTGCCACG GACGAAACTT GGCAAACTGC TCGCTTTCTC GTAAAAGAAG ATTTAGACCG CGTTGTATAC CGAGATATTC TGCATTCTTG GGCAACGCTG GATTGTCAGA TCAGGCCTGA GCTTTGCCAA GCAAGAGTAC GCCGAAAACT CGATGAAAGC ATCGAGTCCT CCTTTGCCAA TGATGGAAAC AAGGAGGTAC AACACTCGAA CCACACACAT TCTGGTGGTG GATTGTCTCT AGATCTTCCT TTCGCCAGCG GCCATGTTTC GGAAAAGGTC ATGGGTGAAG CTCTCATGCT TCAACAAGGA GATAAGCTCA TTGAAAAAGC CACCAATCAC ACGTCGACAA ATGCAGCTAA ATCGGAAGCG GCAGCCTCTT CCAAGGCTGT ATCGGATGCG TCTGCTGCAG TATCGGCCGT CCTGAACGAT CCTTCGTCTA TTGAAGCCAG GACGTGTTGT GCGTCCATTC TGAACGTCTA TCATGACAAC TGCAGTACCG ATGTAGACGA TCAAGTCTCA GACAGCCGGC TTTTTTTTGT CGTGTTTGTC ATGGCTTTAT GTGGAATGGT GAAAAGCCTG ATTCGTCATT TCAAAATTCT GTGGTTGCCC GAAGCTGCAG GCTGCATCAT TGTCGGAGGT ATGTTGGCAA GTTTATTTAC GTTTTCCTAT TCATCACGAG CTCACCACAA ATTAACTTCC TTTTGAACAG TATTGAGTGG ATACGGTATG TTGCTGTTGC CACACCACGA CATTAGCTTT GATGGAAACT GGTTTTTGCG CATATTGGTA CCACCAATCA TCTTTGAAGC TGCAATCAGC 
ATTGACAAGC GAGCTTTCAA CCGCCACATT GTGCCAATTC TGATTTACGC AGTTGCTGGT ACACTGGTGG CGACTGTTTT GACAGCATCA ATTCTTCATC GAGGCACGAC GATGCTGTCA GACTGGTGTT ATCCTATTCC TTACGTTGAG GCCCTTGCTT TTGGTGCGCT GATTTCATCT ATTGATCCAA TTGCTGTCTT GAGTGTGTTA AGTAACATGG GAATGACAGA TACAGATACA ATATATGTCG TGATTTTTGG GGAGTCGTTA TTGAATGATG GCGTCGCAAT TGTTCTCTTT CATACGCTTG TGCATTTTCT TGACGAAACA CTTGTGATTG ATCGGGCAGC CGTGATAGCT GCCGTCATTC ATTTTGTGGT GGTAGCGTTT GGCTCATTTT TAATCGGTGT CGCATCAGGT ATGCTCTGTA CCGTCTACTA CTGGATTTTC CACGGATGTC AGACTCCGTT GGTGGAGGTG CTAATGTTTT TTTGTTGGGC ACTCTTGCCC TATTATGTTT GCGACGGTAT TGGCTGGTCC GGCATTGTCT CTGTAGTAGC TGCCGGGTTC GTGATGGATT TGTACATCGT CGGTGACGAG CATGGCGAGT CTGAGATCGG AGACACGAGA GAACCTTCGC CGAAAGTCGA GTCGGCTCGA AAGCGTGGCC AGATCTTCTC GCCTATGGGA CAACTATCGA ATGAAGCCAA GACACATATT GGCTTTGTCA CGGAAATCAT TTCGACGATG ATGGAGACTG CTATTTTTGC TTATCTGGGC CTTTTCCTTT TCAGCCATCG TTATCACTGG AACATATGGC ACACCTTGAT CTCGATTACG GCGTGTTGTC TTAGTCGCGG CATTATGATT CCGTGTCTGA GCTGGGTTGC CAATTTTATT TTACGTATGC AACAAAATCG GCCGTCTTGT CGAATGCAGC AATCGGCAGG ACGAAAAAGC CCGCAGTCGG CTGGTGTTGT TATAGATAAA AAGATGCAAC TGGTCTTGTG GTTTGCTGGG CTACGTGGGG CAATGTCTTT TGCATTAGTC GAGCACATTC CGTTGTACGA TGAAGTCAGT GGTATCGGAA CACGTCTCAA ACCGGAACTC AAGGCCATGA CTTCTGCGTG CATTATGTTT ACGGTATTTG TTTTGGGGGG TCGTACCTAT CACATGATGG AATACTTGGG TATTGCACCC TCCGCCAGCG CACGAAAACA ACAGCAGAAT CCGTCACCAC TTGAGTTGAC GGCACTTATG ACGTCCAAGA GCTACGAAGA CTCAATGGAA ATTGAGGACG ACTCGAGTCG AACACCGTCA AGGCCCGGTC ATGTCTTTCG AAGACAACGG CACAAAGAGC CGATGCCCGA AGGTGAATGA
|
Protein sequence | MLHTPASPSG SVGSLKRDKE ETDFDFDHDE GFIIPLQMYS NTGSRMRNTA LTPPSRRKFP WMWTMLVFLT LISIAFAFWS ELLEAFPTIT IHSNIANHER QSGVFPIQPV ASASRRGSGA PVAVVVDTRE SAGEQQQTIM TTEAAAFPPP KLSGIPLHAA QALACRQSVI NFVINATDGK DECEGLKKAF ERTCSNDGYE DSDSKDKAGR RKHRNMSEKL WDKSIPKVDR WHESVFYLSQ TLRRIGDWLM WKEPAPFFLA EDEVATDETW QTARFLVKED LDRVVYRDIL HSWATLDCQI RPELCQARVR RKLDESIESS FANDGNKEVQ HSNHTHSGGG LSLDLPFASG HVSEKVMGEA LMLQQGDKLI EKATNHTSTN AAKSEAAASS KAVSDASAAV SAVLNDPSSI EARTCCASIL NVYHDNCSTD VDDQVSDSRL FFVVFVMALC GMVKSLIRHF KILWLPEAAG CIIVGVLSGY GMLLLPHHDI SFDGNWFLRI LVPPIIFEAA ISIDKRAFNR HIVPILIYAV AGTLVATVLT ASILHRGTTM LSDWCYPIPY VEALAFGALI SSIDPIAVLS VLSNMGMTDT DTIYVVIFGE SLLNDGVAIV LFHTLVHFLD ETLVIDRAAV IAAVIHFVVV AFGSFLIGVA SGMLCTVYYW IFHGCQTPLV EVLMFFCWAL LPYYVCDGIG WSGIVSVVAA GFVMDLYIVG DEHGESEIGD TREPSPKVES ARKRGQIFSP MGQLSNEAKT HIGFVTEIIS TMMETAIFAY LGLFLFSHRY HWNIWHTLIS ITACCLSRGI MIPCLSWVAN FILRMQQNRP SCRMQQSAGR KSPQSAGVVI DKKMQLVLWF AGLRGAMSFA LVEHIPLYDE VSGIGTRLKP ELKAMTSACI MFTVFVLGGR TYHMMEYLGI APSASARKQQ QNPSPLELTA LMTSKSYEDS MEIEDDSSRT PSRPGHVFRR QRHKEPMPEG E
|
| |
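The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `span_length` and `coding_bases` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon. Under those assumptions the span matches the listed Gene Length, and the bases left over after counting codons are consistent with the record marking the gene as spliced.

```python
# Values copied from the record above.
START_BP = 795123
END_BP = 798302
PROTEIN_AA = 971


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)  # compare with "Gene Length"
cds_len = coding_bases(PROTEIN_AA)        # codons for 971 aa plus a stop codon
noncoding = gene_len - cds_len            # bases in the span that are not coding codons

print(f"span: {gene_len} bp, coding: {cds_len} bp, non-coding within span: {noncoding} bp")
```

The non-coding remainder is only a bookkeeping figure; it does not by itself say where intron boundaries lie, which would require the annotated exon coordinates.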