Gene Information |
Locus tag | PHATRDRAFT_36284 |
Symbol | |
ID | 7201897 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 65431 |
End bp | 68358 |
Gene Length | 2928 bp |
Protein Length | 924 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180746 |
Protein GI | 219119995 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0182118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACGAG GACAACGACA ACTCCCCCCC CAAAAGGCCT GGTTTATCAT TCCTTCCTTG TCTCGTACGT AGTAAGAAGA ACAACTGTAC TCAACAACCA GAGATACAAT TCTTCGTAGA ATGGAGGATC CTTCGCTTCG CACGACTGTC CGTGTCGGGA ACGACTATCG CGTGAACGAC GACGTCGACC TGGACGAAAA AGACGGAGGG GACGCTCACC TCCGCGACGA TGACGACGAC TCCACAAACA GCCCTCGTCG TCGCGGTGTA CCGCCGTTAC TCCCCGCGGG TCGTCGTCGG ATGATTCTGA CCGACGAAGC GGACGAAGAG ACGTCGTCTA CCACTGTATC GAGCCTCACA ACCAAAGTCG GGTCATCCTC GACACGACAC CAGCCCCCCC GACCTTCCGT CCGTTCCGTC CGACACAGCG TACTCCCCAC CCACGCGCCG GGACGTCGCC GACGTCGCTC GTCGGCACGC TTTTTGCGTC TCAGTGGTCA GCATCATTCC CGTCACAGTC GCAACGATGG CACAATGACA ACGACGACAT CTTCGTCCGC GGCACAACTC GGCGAACTCT ACAAACAAGC CATTCGAATG AACGCCGAGA ATAGGATCAA CGCCAGCAAT AGTTGGAATC TAGCACTCAT TGAGAATATT GATCAATTCC TGCTCCTTGA AGAAGAAGAA GAAGAAGAAC AGCACGAGGA CCTCCCTCGC GATCACCGCC GTCCCGAAAA CGACAAGAAT CGACTCACTC TCAACAACGC GCAACCCACG CCAACGCCAC GCCGTCAGCG CGTTAACTTT ACCAAGGCAT CTTGTACACT TGACGCATCG GTCAAGATTT ATTCCTACCG GGTCGACGAC GTACACCTTT CCAGTTACAA GGTGCTCGCC AATCTCAACC GCAACGACCA AAACGCCAAC CACAAGAACG CCGATAGTGA CAAAGACAAA AACACCCATC CGGATCCCGA TCACCACAAT GCTGGCAATC ACAAAAAATC CACTCACTCG TCCCACGCAT CCACCTTGGA AACCAATATG GGTGCGTAGC CAACCTTCCT GTAGATGTAC GCCCTTTGTC CCCCTCTCAC ACCTTGCTAC TTTCCCATTC ATTCGTTCCA TCCACATATC CATTCTTCTT GCGCCTACGG AACAGCCAAC ATTAACCTCA ACAAGCTCGA TGCCGCATTC GATATTGATC CACTCTTTCA CAAAATGTCC AAGACTTTTG ATGAAGGGGG TGCCAAGGGA CTGCTCCTCG CAAATCTGGG CGTCAGCAGT CACGGTTGCA ACGTCGTCTT TGACAGTACC AGTAATGACT CCAATCCAGT CGCCGAAGAA AAAACAGAAG ATGACCGTCA CGTTAACGAG ACAACTTACG CTCCCGTTGA CATTTCCTCA CTTCGCGCCA AACTCGAAGC TCTCGTCAAC ACAAACGACC CCGACGGCAA CACCGGTGGT ACCTTTTTGG AAGATTTGGC GCTCGTCCCG CAGCTTACAT CCCTCCGTGC CGAACACGAT CGTCTCGCCG CGGAAGGATT CGTTCTCGAC GACAACCCGA CGATGACGAG CACCAAAAAT CGTGGTTCGC AACGCTACGC CCCCACAGCG GACGAAGAAA CGCAGGCCGA CCAAAGCATT CACCAAGAAG CCTTGGAACG AAGTCGCCGG ACCAACAAGT CGTTTCTATC CGAAACGGAC CAAGAATATG AAGAAATCAT CGGTTCGAGT ACATCACAGC AGCCTCCGCA ATCATTATCG ATTGGATACG ACGACGCCGA CGACTTTGGT 
GGTGGTTTTG ACGATGGGGA CGACGATGAT GCGGGTTTTG ACGATTTTCT CCAACGCGAC CAGCAGGGGG CTCGGTATTC CTCCATATCC TTCTCCGGAT CGGTGCAAAA CTTTCAAGCC CAGGAAGGCG CATCCGACAC CGACGTACCG ACATCCACCG CATTGTTGGA GGCGTTGCTG GGGTCGCAAG CGTTGACGGA CCAGGATCAG TACCGTTATT TCGACGCCGA GTTGTTGTCG TCCGCCGTTC ACGCGAATAA TGCTTGGGCC GGAGCCACGC ACTGGAAACG CACGCCCAAA GTTGCAACCA CCACGGGACC TTCCGTCGCC AAGACCAAGT CTCAACGCAA GAAACCCCGT GCTTTGGTTG ATTTGACTGC CACGGCCTGT CTGGACGATG TACTTCGTTC ACCACCGAAG ACGTCGAGTT TGTCCTGGAG TCAAGCCATT GTGCAAAAGT ACACGAATGC GGAGCATTCC AACCTGTTGC CTCCCGACGC CGAAATGGAC GTGGAGACTT TGTCGACCCT CTTTTTGCGG CCTCAGAGTG TCTGTCGTGG TCTATCGGTC GCCGGGGGAG GGGACAAGGT ATCTACGCCC AAGGCGGTCG GCTTTAATAT GGGTGGCGTC GAAACCTTTG GTTGGGATGA TGGTCACGAT GACGACGGTG AAGGTGGCGG CTACGACTTT GGTGGTGACG ACGATGACGA TATGAGTTTC GTTGCACCGC TCGAAGACAT CCGCAAGGTG GACAAGGTTC ACGTGGGCTA CGCCACGGTC GCCAAAAAGG TGGACGTCAA GCGACTCAAG AAAGACTTGT GGATCGAGCT GGAAGCAAAA CTGGCCGAGC CAGCCAAGCT CGGCGAACAC AAGGATCATG ACGCGGACGA CAGCTCCATG TCATTGAGCG ATGCTGTGAC CCCTTCTAAA CCATCACTGC CACTATCCTT TCAAAAGGCC GTGCAAGACC TGGAAGCCAC CAAAACGCAA GCAGACGTGA CCTTGCCGTT TTACTTTATT TGTATTTTGC ACTTGGCCAA CGAAAAGGGA CTGCGACTGG ATAGTCACGG TTTGGAAGAT TTCGGTATTG TCTATGATGC AGCCGGGGTA CCGTTGGCGG GGGTGTAG
|
Protein sequence | MGRGQRQLPP QKAWFIIPSL SHTILRRMED PSLRTTVRVG NDYRVNDDVD LDEKDGGDAH LRDDDDDSTN SPRRRGVPPL LPAGRRRMIL TDEADEETSS TTVSSLTTKV GSSSTRHQPP RPSVRSVRHS VLPTHAPGRR RRRSSARFLR LSGQHHSRHS RNDGTMTTTT SSSAAQLGEL YKQAIRMNAE NRINASNSWN LALIENIDQF LLLEEEEEEE QHEDLPRDHR RPENDKNRLT LNNAQPTPTP RRQRVNFTKA SCTLDASVKI YSYRVDDVHL SSYKVLANLN RNDQNANHKN ADSDKDKNTH PDPDHHNAGN HKKSTHSSHA STLETNMANI NLNKLDAAFD IDPLFHKMSK TFDEGGAKGL LLANLGVSSH GCNVVFDSTS NDSNPVAEEK TEDDRHVNET TYAPVDISSL RAKLEALVNT NDPDGNTGGT FLEDLALVPQ LTSLRAEHDR LAAEGFVLDD NPTMTSTKNR GSQRYAPTAD EETQADQSIH QEALERSRRT NKSFLSETDQ EYEEIIGSST SQQPPQSLSI GYDDADDFGG GFDDGDDDDA GFDDFLQRDQ QGARYSSISF SGSVQNFQAQ EGASDTDVPT STALLEALLG SQALTDQDQY RYFDAELLSS AVHANNAWAG ATHWKRTPKV ATTTGPSVAK TKSQRKKPRA LVDLTATACL DDVLRSPPKT SSLSWSQAIV QKYTNAEHSN LLPPDAEMDV ETLSTLFLRP QSVCRGLSVA GGGDKVSTPK AVGFNMGGVE TFGWDDGHDD DGEGGGYDFG GDDDDDMSFV APLEDIRKVD KVHVGYATVA KKVDVKRLKK DLWIELEAKL AEPAKLGEHK DHDADDSSMS LSDAVTPSKP SLPLSFQKAV QDLEATKTQA DVTLPFYFIC ILHLANEKGL RLDSHGLEDF GIVYDAAGVP LAGV
|
| |
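
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the coding sequence ends with a single stop codon; the function names `span_length` and `coding_length` are hypothetical helpers introduced here. Under those assumptions, the difference between the gene span and the coding length gives a rough estimate of the intronic bases, which is consistent with the record marking the gene as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(65431, 68358)   # should match "Gene Length"
cds_len = coding_length(924)           # from "Protein Length", plus an assumed stop codon
non_coding = gene_len - cds_len        # bases in the span not accounted for by the CDS

print(f"gene span: {gene_len} bp")
print(f"coding length (with stop): {cds_len} nt")
print(f"unaccounted (likely intronic) bases: {non_coding}")
```

If the stop-codon assumption does not hold for this annotation, pass `include_stop=False` and the estimate shifts by three bases.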