Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47275 |
Symbol | |
ID | 7202360 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 156891 |
End bp | 160633 |
Gene Length | 3743 bp |
Protein Length | 1055 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181494 |
Protein GI | 219122318 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.328258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAATC CGTTCGATGA CTCGTTTCAA GGAGACAATG GACCGACTGT TGGGCCGCCG AACCCTTTTG ACGCGGACGA CGCGTTGGCC AGCTCCTCGG ACGTTGTCTC CACAAACGAT CTCGACGAAG ATGAATATGC AGCAGTCGAA TCAACTTGGA AGTATCTGAA AGACCTTCCT TATCGACAAA TTCCCATTTA CTCGAACGTG CTTTGGGAGT TAGACCCCGA GGACGAGGAC TGGTTTGCGT ATGGTTTGGA ACGCTACCCT TCTTCGGCTT TGAATCCTTC CTGGCCTCGG AGTGAACGTA TGACCTTGCT CCGCAAAACC ACGACGACGA AAGTTTCTGG CTGTCCCTAC GGAGGACCTT TGGCATCGAT TACGACACCG GTCATGTCTA CGCCTACCTT TGCAAAGACT CAAATTACGA TCTGGACGAA TGCTGGCAAA GTCTTGACAC GGATTCCGTT CCCTCCCCAA TCGCATGCCA ACTACTCGCC CTCACTAATC ACGACCATTG GTTTTACTTC CCGCGCGCAG TTGGTTGTTG TCTTACAGGA TTCCCTGTGT CTGACGTACA ATCTTCGAGG CGAACCAATT TTAGCGCCCT TCTTTATTCT CCAGCAACCA TCACAAGGGA AGGCAATTTC AGTTACCCAA GCGGTCGTCT TTGCTGGAGG TGTCGCCGTG TTGGCACAAA ACCAATCCTG CGCTTTAGTA GAGCTCTTGG ATGAACACGA TAATTTGTCC TACTCAGCCT CGGCACCGCT CTCTGCTCGT AAGGTAACAT TCGACACCAA CCAACACACC GACGTAACCT CTTCTGCCGA CGGTATTTTT GCTCTCGTCA CGCCGCTCGA AACGGCCGAA TTTAGTCGAG CACACGGGCT TTCCTACTTG ACCTTGGCCG TTCTACCTCG GCATTGCACA AGCCATGGGC ATCCTGAAGT GTTTATTTCC ACTATATGCC ATTCCGTGGT GGTGGTGGAA GCTCGCGACG GCTCTATGAC AGACCTAGAT TGCCGAGCCC GTATGGTAGC GCCGTTAACA CACATGGCGT TCGCACCGAA TGGTAGATTT TTGGCCTGTT TCACGACCTC CGGGGTGTTG ACAGTAGTGT CTACGGACTT TGATGTTATG GTATTAGACT TCGACGCTTC ACACAGTCGC GAAACCTCCT CTTCGAGTTC CCAGCCACCT TTGGATATGC AATGGTGTGG AGAAGATAGG TATGGTACAT TTGTGCTCTC GCTCATTGTT GGAATAAGAA TTTACATTTT TTACACTGAC TGAATTGCTC TTGCGCTTCC TTAAAAGTGT TGTGTTGCAC TTTAAAAACT TGGGGGTGCT GATGGTGGGC CCTTACGGAG ACTGGCTACG CTTCCCATAC AACGATACCT CTGATCAAGT ATACCTCGTT CCGGAAGCAG ATGGCTGCCG TGTTGTGACT GAGAGCCGTG TCGAGATGCT GCAGCGTGTC CCCCCGGCGA CGGCATTGAT CCTACGGATG GGATCGATTG AACCAGCCGC GATTTTATTA GACGCCGCAG ATGCGTTCTA TTCCAAGTCC ACGGTCACAG TCCTGAATAG TGACGAGATG GTACAAGGCA TGGTCGAACA AGGGACTTTA AATGCGGCAA TTACCAGCTG TTTTGAGGCT GCCACGAATG AGTTTGATAT TTTTACACAA AAACGTTTGC TGAGAGCGGC ATCTTTCGGT ATGCATATTT CAGATAAGAA ACAGGTCAAC GAAGAACGTA TGATTGTTGG GGGATCGACT ACAATTTCGG AAGCAGAACA AGACGGAGAA GACTGCGAAA ACCAGGATCC TTACAGTTTG CCATCGAGAG TCACACGTCG TTTTGTGGAA AGCTCCCGTA AGCTTCGTGT CTTGAACGCA CTCCGACATC CGCTGGTTGG CGTTGTGATG ACATTTCCAC AGTGGCAGAG CATCGGGGCA ATTGGTGTCG TTGCTCGCTT GGTAGCAATG AATCGCCCGG AGCTGGCCAC TTCAATCTGT GATTACCTAG CTTTACCCAA ATCAATTCAG CTATTTGCGC GAGCGTCCAA GGCGTCTTCG TTTGTGGAGC AAAAGGCACA GGCTGATGAG CACTTATCAG ATTCAGAGAT CGCCCAGGGC GCCATTATGA TAATTACAAA AGAGGTGGTT TCATCGGCTG TATCGCCCGG GGCTTCAGCG AGTATGTTTC GAGGAGCCTA CGCGACAGTG GCTCTTGCCG CCAACAAAGT CAACAAGCCA GGCGTTGCAA ATCTCTTGCT CATGTTGGAG TCCAGTGTTG CCGATAAAGT ACCAGCTCTC ATTGCGGGTG GCTCTTACGC TGACGCTATT GCAGTTGCTA CCACCGCCAG GTACGAGCTA ACTTATGCCT AATTATTTTC TTCGCTTTCT GATGCTCTCA CAACTCACAT CCCGTTTATT CTGAAGGGAT GCGGACTTCA TTTTTTCCAC TCTCATGGAT TTCGAAAGAA ACTGTATGAT AGCTGCGTCG CCGACAGATC TCTCGCAGGC TCAATCAGCA TTCTTGTCCA CCGTTGTCGG TAAATTTACA CTTGAGGCAT TTCATACGCT CCGGCGCTAT CTACGTAGTA CATCAGATAT ACAAAGGGTA CTCAACCTTT TGCTCAGAGG GCAAAAGTTC TCCTGGGCTG GTCGGGAAAT GGCGCAAAAG GCGCTCGTCG AGGTTGATGT TCGAGAGAAG CAAGGAATGT TAGCAGTAAG TATGAACTTT GTACGGTGTC GATGGTAAGC AATAGCTTGA AAGTCAACAA ATTCCCTCAT AGTTTTCTCG TTACAGGAAG CTTCCCGCAT TTTTGGTATA AGCAAAGAAA CTGCTTTCCA AAAATCGTGC ACAGATGATT ATTTGGATTT GAGAAAGGAT CAAGAAGTTT TACGTAATAA GTATGGCTCT GTCGACGTTG CCCCGGAAAG CTCGTCGGTG ACGGCGACAA TTTCGTCTAT TGTAAAGTTT GCTGCAAGTA ACATACGAGA ACAGCACCGT CTACTGGCAG ATGCGGACAA GGTGGCGAAG AAATTTCGAG TTGCTGAGAA GCGTTTGTGG CATATCAAAG TAATTGCTTT TGCGGCCAGT GAGCAATGGA GTAATTTACG TATTCTTGCA GATTCCCGGG CGAAACCACC AATTGGATAC AAACCGTTTG CACGAGCCGT TATTGATGGA AACCAAAATA GCAGCGAGAT TCTTCGGTAT ACTGAAAGAA TTTCTGATCT CGAGGAGCGG TACGACATGC TCTGCTATGG TCAGCTTTGG AGCAATGCAT TGGACGAAGC TTTTAAGATG AAGGACACCC GGCGCATTTT GAATGTGAAG AATCTGTGCA ATTCTGCCGA CATCCAAATT AAGGCAGACC AATTAATGGG CCGTCTTGCC TAATTGAAAA CGGTCAAGAT TCCCTAATCT GTCACAACTG CTTCTTTCCA GTTGAACCAT TCCTTGACCC CCAAAACGCA CAGGCTCATG CTACTGTATA TCGGTTCTGT GTCTCCGAGA ACAAAGTGAT ACTATCTGTA ATGTAAAACA GATTGTACGG ACACCATCGA TTAGTTTACA GTTATCAACA ACTGTGAGCA AAGAACCGTC TCCCTTCCGA CCATTCCCTG ATTGCGGAAC TTACCTTGTA ACGAAACACC TACTTGTAAA TCCCTCTCTG GCTAGTAGTA TTGTTTGCCT TACTGTTAGA GGAGCCATTT TGGAAAACCG AAC
|
Protein sequence | MSNPFDDSFQ GDNGPTVGPP NPFDADDALA SSSDVVSTND LDEDEYAAVE STWKYLKDLP YRQIPIYSNV LWELDPEDED WFAYGLERYP SSALNPSWPR SERMTLLRKT TTTKVSGCPY GGPLASITTP VMSTPTFAKT QITIWTNAGK VLTRIPFPPQ SHANYSPSLI TTIGFTSRAQ LVVVLQDSLC LTYNLRGEPI LAPFFILQQP SQGKAISVTQ AVVFAGGVAV LAQNQSCALV ELLDEHDNLS YSASAPLSAR KVTFDTNQHT DVTSSADGIF ALVTPLETAE FSRAHGLSYL TLAVLPRHCT SHGHPEVFIS TICHSVVVVE ARDGSMTDLD CRARMVAPLT HMAFAPNGRF LACFTTSGVL TVVSTDFDVM VLDFDASHSR ETSSSSSQPP LDMQWCGEDS VVLHFKNLGV LMVGPYGDWL RFPYNDTSDQ VYLVPEADGC RVVTESRVEM LQRVPPATAL ILRMGSIEPA AILLDAADAF YSKSTVTVLN SDEMVQGMVE QGTLNAAITS CFEAATNEFD IFTQKRLLRA ASFGMHISDK KQVNEERMIV GGSTTISEAE QDGEDCENQD PYSLPSRVTR RFVESSRKLR VLNALRHPLV GVVMTFPQWQ SIGAIGVVAR LVAMNRPELA TSICDYLALP KSIQLFARAS KASSFVEQKA QADEHLSDSE IAQGAIMIIT KEVVSSAVSP GASASMFRGA YATVALAANK VNKPGVANLL LMLESSVADK VPALIAGGSY ADAIAVATTA RDADFIFSTL MDFERNCMIA ASPTDLSQAQ SAFLSTVVGK FTLEAFHTLR RYLRSTSDIQ RVLNLLLRGQ KFSWAGREMA QKALVEVDVR EKQGMLAEAS RIFGISKETA FQKSCTDDYL DLRKDQEVLR NKYGSVDVAP ESSSVTATIS SIVKFAASNI REQHRLLADA DKVAKKFRVA EKRLWHIKVI AFAASEQWSN LRILADSRAK PPIGYKPFAR AVIDGNQNSS EILRYTERIS DLEERYDMLC YGQLWSNALD EAFKMKDTRR ILNVKNLCNS ADIQIKADQL MGRLA
|
| |