Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37572 |
Symbol | |
ID | 7202424 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 451561 |
End bp | 455246 |
Gene Length | 3686 bp |
Protein Length | 977 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181729 |
Protein GI | 219122805 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.421794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCGC TGATTGCGAA AAATGGAGCG CTCCTCTTTG CGCTGATAGG ACTTCCATAT TGCCAGGCAT TCATCACTAT ATGTCCAACC TCCACATGCA AGCAGACACT CCTACGCGGA ACGCTTCCAA GTGAAGAAAT CCTTGCAGAT AAGCCCGAAG CTGACAATGA ATATTCGTCG CAAAAGAGTC GTCGGGAAAT GTTGGCCTCC GCAGCATCTT CCTTGGCTTA CTTTCCAGTA TATTCATCCC TGGAAGCTAA TGCTATAGAA AAGAAGGAAG CTGTCTTCTT CAATTCCTTG TCGGATCTTC CTCCTTTAGC TGACGACAAT GTACGTATTT TTCTTTGTCG ACATGGCCAA ACGGAAAACA ATCGCCTCAA GCTAATTCAA GGATCACGAC TTGATCCTTC CATAAACGAA ACAGGGCAAG AACAGGCTCG GAGACTGGGG AAAGCACTAT CTTTTGCGCT TCCGGTTGTC CCTACGATCG TTTTTCATTC GCCCTTGATT CGCGCCCGCC AAACGGCCCA AATCGCGGCT TTGCAGTTTT CCAGCAATCC ATCCGTCTCG TCGCCGACCC TCCGTCAATT AGACAGCTTG AACGAAATCG ATTTTGGAAG CGCTGCTGAA GGCGAATCAG TCGAGCCTTA CCGGGCGAAG ATGATGGCTA CCTATGCTGG CTGGTCCGTA GGAGAGCTAG ACCTCAGCAT GGGGGAAGGC GGCGAAACGG GAGGGGAGGT ACTGGCACGA ATCGAAAAAT CGTTACAGGA CCTCGCCAAG AGTGCGTCCA ACGCTTCAAA TCGATGTGTT GCAGCCATAG CGCATTCAAC CTTCCTCAAA ATACTGTTGG CCACGGCCCA AAATATTCCT TTGGCGCAAG TAGCGATGTT GGAACAAAAA AACTGTTGCG TCAATGTGCT CGATTTAAGC ACCAAGCAAG CCATAAACCT GGCATCTAGG AGCGAGTTAC TAGGAGGCCC CCTCTCGCTA GCACCATTGG AGTTTACGTT GTCGATCCCC AAGACAGCCG TAATCCGGAT GAACGAAAAG CGACACCTTG GTGACCTTGC TATATGACGC ATAGTTTATT GGGACAAGAC GCTAATCTAA TATATTGTCA CGTGCGTATT ACAATCTATC GTCAGATTGC CTCAATCAAA TTTGGACAAC GATTGCCGTA GCTTGTTGAG CTAGGTCTCG CCAATCAAGA AAGGCCATTG GCAGTTTGGT CGGCAAATCA AGCTGACAGT GAAAACTGAC GTCATACCTC AGCCATTGGA TCTTTTGATT TCCAGTTAGA TCAGTCAAAT CCGATGTTAC GATACTGCAG CGAAATTTAC CCGTGATTGA AGTCTTTAGA CGGTTCATTT TCGAGAAAGT GAGGAAGTAA AGATCGTATT AAACACGTAC GCAATTGAGG GTAATGATAC GATCACTTTT CTGTCTGTTG TACTCAGCTG TACCGTCTTT AGGCGTCGCC GCTTTCCAAA CGACAGTCTC CAATTCTGCG ACTCGACGAT TCCATTCGGC CGGATTACGG ACGACCCATG AGGAGAGCGA CATAGATTCA CGTAGGGACT TCCTGCTACA GACGGTTTCA CTACTATCGG GTGGCGTTGC GTTGTCAGGT TCTCCAGACA GTGCCACAGC TGTCGTTGGC GCGTTACCTG AGTTTGCCGA TTCCAATGCA ATCTTGCAGG GTGTTACTAT AAAAGTAGCC GATCAGTCGC AGCAGGAGGC CATGGTTTTA TTTTTGAAAG ACTCTTTTGA TTTCGAAGTT TTACGACGGC GTGTTCAAGG ATCAATCGAA GAAACATGGT TAGGCTACGG TCCTGAACAG CTGCGAATAC CAGACGACTT CACCCTTCCA GTGTCGTCCT TCAACACTTA CGGAGGCCAT GCTTCCGTTC GTCTCGTATA TGACGCCAAG GCGACGGTTC CCTTGTATCG GACGGGAGAG AAAGCGCCAG GCGAAAATTT TGCTTTTCTG CAAGTTGCTG TTCCAGGCTA CCGAATTTCG CAAATGGTTA AGCACGGTGG CAACATCATT GACGCTTACG GTTTTGTCAA CGTGGTTTCA CCATCGGGAC TACCAATGCG AGGGATTGTA GGGATCACCC CGGACCCAAT AATGTTTGTG GCAATCAATT GCATAGACGT TAAGGCCAGT CAGGCCTTCT ACGAAAAGCT GGGATTCCAA AAGCAAGAGT ATCCGTATGC ACGACCCTCG AAGGGAACGG GACCGTTTGA GCCAGCACAA CCATCAAAAT CGGTTTACAT GGCGCCTTCC GCTAACTGTA TGGGATTACT CCTGCTCCCG TCGAAGAAAA AACGTCTTCA AGCCAACCCT GTTGTGCAGT CTTTGAATCT CGTATACACG CCATCGGAAG AATCCGATTC TGCTGATACC ATGCCGACAC TATTTGATCC TTCGGGTATT GCCGTTTCTT TCCAATCTGT TCCTCAGTTT GAGCTAGAAG AAAAAGAAAC TAGGTAGGTT AAAGGAGAAA ATTGAAAGCT CCTATACCGA GTCTGAAGCA CTGCTAATCG TCAAGAATCG ACTAGATTCG CATCAACTGC AAACGACCAT TTCCTATCTG TAGCAACTTT TCCTTAGGAA CATAATATGA CCTCGTCGGT ACCTTCTTGA TTTGTGATTC CAACCATTGT TCCAGTTCGG TCGCTGTTGA ACCAAAGACA GCTCATAGTT AGATGTGGCG CCCTTGATTT TCGAAGGAAA ATCTACTGTG ATATATCAGA TTTGCACCGG GCAACACAAA TCTGATTTCG GAAGTCTTTT TACGTTGCAC AGTACATTCA CGATGGTTCG TAGAATAGTG AATAGAATTC TCATCTGTGT CGCCGCCTTA TCAAACGTTG CTTCCTTTCT TCCGGCTTCG CTCCCGAGGA GTCAAACAAA AAGCAAAGTA AAGCAACATT TGGTACCAGT GGATGCTTTC AGCTCTTCGC TGGTAACCTC TGTGGTGGAA TATTTTGATG GTAGTACAAT TGTAGACCCT ACCATCGTTT CCGATGTGTA TTGGCAATCA CTAGGTAGCA GAATTGTTTC CGTGGTCATT GCACAGGTTC TAGTTATCGC CACTTTTGCC GTTGTTTCTT GGGTTGCTTC GAAGCAAATT GGTAACACTA TAAATTTCAT TGCCCTCAAG ATTTTTGGAC AAGACAGACG AACGGAGGCA AAGAATATCG ATGGAGCTTC CGGCCAGCGG CTCAAAGTAC CACCAAACTT ATCGAATGTG CCACCTCGTA GTCCCGACTT TGGCAAGCTT TTTGTTTGCG TGATAATTGA CATTGTTGGG TCGTCTTCGG AGCTGCTTCC CATCATCGGA GAATTGTCGG ATGTTGTGTA CGCACCGATT GCGGCACTCT TGTTACGCAA ATTATACAAC AGTAACGTTA TCTTTGCGTT GGAATTTGTG GAGGAAATTT TGCCCTTTAC TGATATTTTG CCCCTCGCCA CGATTTGTTG GACTGTCGAT ACCTTTGCAC CCGATTCGGA TGTTGCCAAG TTTTTGAATG TTGGCAATTA CGGGAATTCT CAGGCTCGTG CTAGCACTTC CTTTGATGAT GGGATGGACG CAATTGACGT AAATGGAGAA GTGAAATCTC CGGCAAAAAA AGCTTCTACA ACGATAGCGA GTAGAGACGA CGGGCAACTA AGGTGA
|
Protein sequence | MPSLIAKNGA LLFALIGLPY CQAFITICPT STCKQTLLRG TLPSEEILAD KPEADNEYSS QKSRREMLAS AASSLAYFPV YSSLEANAIE KKEAVFFNSL SDLPPLADDN VRIFLCRHGQ TENNRLKLIQ GSRLDPSINE TGQEQARRLG KALSFALPVV PTIVFHSPLI RARQTAQIAA LQFSSNPSVS SPTLRQLDSL NEIDFGSAAE GESVEPYRAK MMATYAGWSV GELDLSMGEG GETGGEVLAR IEKSLQDLAK SASNASNRCV AAIAHSTFLK ILLATAQNIP LAQVAMLEQK NCCVNVLDLS TKQAINLASR SELLGGPLSL APLEFTLSIP KTAVIRMNEK RHLGVAAFQT TVSNSATRRF HSAGLRTTHE ESDIDSRRDF LLQTVSLLSG GVALSGSPDS ATAVVGALPE FADSNAILQG VTIKVADQSQ QEAMVLFLKD SFDFEVLRRR VQGSIEETWL GYGPEQLRIP DDFTLPVSSF NTYGGHASVR LVYDAKATVP LYRTGEKAPG ENFAFLQVAV PGYRISQMVK HGGNIIDAYG FVNVVSPSGL PMRGIVGITP DPIMFVAINC IDVKASQAFY EKLGFQKQEY PYARPSKGTG PFEPAQPSKS VYMAPSANCM GLLLLPSKKK RLQANPVVQS LNLVYTPSEE SDSADTMPTL FDPSGIAVSF QSVPQFELEE KETRIVNRIL ICVAALSNVA SFLPASLPRS QTKSKVKQHL VPVDAFSSSL VTSVVEYFDG STIVDPTIVS DVYWQSLGSR IVSVVIAQVL VIATFAVVSW VASKQIGNTI NFIALKIFGQ DRRTEAKNID GASGQRLKVP PNLSNVPPRS PDFGKLFVCV IIDIVGSSSE LLPIIGELSD VVYAPIAALL LRKLYNSNVI FALEFVEEIL PFTDILPLAT ICWTVDTFAP DSDVAKFLNV GNYGNSQARA STSFDDGMDA IDVNGEVKSP AKKASTTIAS RDDGQLR
|
| |