Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44239 |
Symbol | |
ID | 7204070 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1406812 |
End bp | 1410580 |
Gene Length | 3769 bp |
Protein Length | 1180 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186246 |
Protein GI | 219113325 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.153589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGAAG AACCGCCTGC CGATTGGAGG GAGAACTCCA CCGGGGGATT GGCGGATCCC AACGTCCCCG TCGCTCCGTA TCCACCGATG ATTTCCGCAC GTTTTCCTTC CCATCATCTT CGCTCTTCCC TTTCCCATTC TCCACACTAC ACGTCTCCCA CTCCTAGTTT TATCGGCAGT CGCGCCGAAT CCAATGTGAG TGGGACTAGC AGTCTTTTCC CCGGATACGC CACGAACGTC GGAGGAGCCT CCACCGCGAC GAGTTTGGAA GACGGCGGCA CGGCGGATGA AGCTTCTCGG CAGACCAACG CATTCGCAGT AACCGACGAT TCCGCGGCAA TCGCAAACAT TGAATTTTGT CGACCGGATC TCCTAGCTTG TATTAATTGT GTTTCCTCCT CTAGCAATAC CCACACCACG TCCAACACTC CAACGTCACT CAACAGCGCC AAATTCGCAG ATTCTACCAC CCCGGCGCGC AGCAATCGCA AAATGGGACC TGTTCTCTTG GAGCTCCGAA AACTTCAGCT TAACGAAGGA ACTCACGGAA ATCATGATTT CTTTACGACA TCACAAATTT CCGTATCGAG AGGTAACCCC TCACTGGGCA TGAGCATGTC GAGCACCTGC TTGCATGTAT CGCCACAGGC AATGCGGACC GGCGAAAACG CACTGCACAG ACCACCACCG CCAATTGCGA CGGGACTAAC GACCGGAGCC TTATGTATTC ACACTTTTGT CATAAAGTCG GATGGCGACA ATGAAAACAA TCAGGATTGG ACGCCCAACG TGGAGTACTA CCATACACCA CGACATCATC GTGCCTCCAC TGCAGTACAG TGGTGTCCAA CGACGGTGCG TCCTCAGCAT GTAGCAATTG GTTTACTTTC TGCTTCTTCC AGTGGTAGTC ACCCTACCAC AAACAGTGTC GTACCCGGAC GGCGAGGCGT CTCGGGTGGT GGTGTGGCCG CTAGTGTGGG ACTAGGCGCC CGATCGGCTA GTACTGGCGA CAAGGATTAT TGTTGTTTCG TATGGGATGT TGAGCACCAA AGCGCTTCTC GACGGACGAA GACATCGCCG ATTTACAAAC TAAGTCATCA GTCCGGCGTT GCTTCATTAG GTTGGCTCAT GGGCGGGGAA ACCTTGGCCA TAGGCGGACA ATTGCGTCAT GTTCAATTGT ACGACTTGCG CGAAGCCACG ACGTCGGCAC CCATGACTGT TATGGCCCAC AATTTTGCCG TACACGGAAT TGTCCCCGAC CCTCACAAGT CCTGGCAATT TGCTACTTAC AGTCGGGTAT CGAACGAGCC CGTAAAAATC TGGGATTGTC GAAGAATGGA CACCAACTTG ACGGAAATCA AAATTCCTTC CCAGTCAATA TCTCCTTCGT CAGTATCGGG TGTGACACCT CCCGTCTCGC AAGTGCACTG GTCACCACTA GAAGCAGGCT TCTTGTCAGT AGCGGTTGGA GACGCCATCT ACGAATTCGA TACAACGACG CCGGCCTCAC GACCGATTCA TGTCAATACG ATGTATGCTC GGGGATCGGT TCTCGACGTG GCATTGTACC CGTTTGTGGC GGAGATGGGG ACCGCCAAGG AAGCGAGTGT GCATAAACTC AAGGCTGAAC AACGTATTCC AACTTTATTG ATCCAAGAAG ATGCGATGGA AGCAAAGCAC CTGGAGGAGA TGAAGCTCAA TCATTTTCTG GAAAAACGTT CGGTACGTAG CAACCAACGC ATTCTTGGAG AACTCTACCC CAATCGTATG ATGGTAGTCT ATACAGACAG ATCTCTACAC GATTTTCCAC GCCATACCAT TGCTCCGTTG GCAGTGTCCA GCAGGGATGG CAGGTTGGTG CACTCCATCG GACGTACGCT ATGGGTAGGG TCCAGCAGGC AAGGACCGGC TGCCATTGAA CGTCTTACCG CAGCGCAAGA TGAGGATGTA TCCGCCGTAA TGTTGCGCCG GGCGCGCTGC ATACAGTCCA TCAATTATTC TATGGATCCA TCAGCCAATA TTCAAATCCT TGCGCATGAT GGGAGCGGAG TCGACTCGCT GTTACGGCTA TGGAGTTGGA TTGAACGAGT GGAAGTCTTG TGCTCCAGTA CAGAAACAGA CGATGGATGG GATGATGGCA TGTCATGGCC GGCGAAGACT TTGATGGACG CGGGTGCTTG GCGCCTGTTG CATATTGCCG GATGTGGCGA GGGGGAAATA CGGGGCTTTT CTGAACATTC CTGCTGCTCA ATTTATGATA GTCCAGGCCG CCGGTAAGGA AAAGTGAATG AAAAGATCCT TTACATTGTG TGGCCATATC ATTTCTAAAC TACATGAGAC TTTTCTATGC AGTGCGGCGT TAACTTCATG CGGGTGGGCG GGAAGGTTTG ATCTCTCGAC GGTTATGGGG GAATGTGAGG AGCTCGGTGA GTACGAAAGG TCGGCGGCTC TGGCCGTATG GCACGACGAC ATCGGGGCCG CAGTTGACTG CCTGCAGCGT GGAGCCTCGG TGATCAGGCA ACAAATGAAG AGGGGTGGAG AAAGTGTTAA TATGTATTGC TCCTCCGAGC ATGCCGAAAC GCTGGATCTC GTTTCTTTCT GTGTAGCTGG TCATCGGGGT GACAGCATGG ATTCACCGGC TTCCGGAATC TGGAGAAGAA CGTGCGCGAC TTTGATGAAA CGAAGCAGCT TTTCTGGCCA ATCCCGATGT TTTGCCTACG TTCGTGGAAT GCTCAAATTT CTCATGACCT CGGGGTCGGA CCAAGGGCAT GACGAGGTTC TCTTGTGCGA TGATTTGAGT CTTTGCGATC GCGTTGGCTT TGCTTGTCGC TTTCTCTCCT GGAACGAGCT TCTGCAATAC TTGGAAACGT GCATCGTCAA TTGTCAAAGA TCAGGTGACA TCGAGGGTAT GATAATTACA GGGCTCGAAA AGGAAGGTAT TAAAATCTTG CAATCCTTTG TGGATCGAAC TGCTGATGTG CAAAGCGCTG CTTTAATAAC GAGTCGAGTC ATTTTTCCCG TTGGTTGGAA TGGTGAACGT CGAGCCAGTA TAGAGTGGTT GGAATCTTAC CGATCACTGT TAAATACTTG GCAAATGTGG CAGTCTCGCG CCTTGTTTGA TGTCGATCGT GCGGACCTTT TACGCAAGGT AAAGTCGCGT CAATTTGATG CGTCCGGCAA ATTTGGCAGC GTTCCCATTA GTCGTCGGCA AGTGTCTGCT GGTGGTAAAC CAGGGCTGCG CCAACCCGAT CCGGACATTC AAGCCACCAT TCCGGCACAG CTTGACGCCC GCTGTAACTA CTGCTCCGCT CCATTGAGCT TGAAGCTAAA AGACACGCAC GCCAATCAAT GGCTGTCCAA AATGAAACCG GTGCTACCAT GCTGTGCACA ATGTCGCAAG CCGCTTCCGC ATTGCGCTAT TTGCATGTTA TCAATGGGTA CCTTAAATCC ATACATGGAA TTGACGAAAG ACCGATCAGG GCGGTCGTCC CGTAGTGGCC TTTCGTCGCT GCAGACCGCG GATGACATGT CGTCTTTGGG GAATTTGCCC TTTGCAGAAT GGTTCACTTG GTGTCTACGA TGCAAGCATG GCGGCCACGC CCACCATTTG GTGGGATGGT TTGCGAAACA TGAAGTATGC CCCGTGAGCG GGTGTGACTG TCATTGTCAA TTCGACGGAA TTCATGAGTT GAATCGATAT AAGCAATCTT CAGAGAGAGT AACAAACGAA AACGAGCAGG ACACGACAAG CAACACCGAG GCCGACTAA
|
Protein sequence | MSEEPPADWR ENSTGGLADP NVPVAPYPPM ISARFPSHHL RSSLSHSPHY TSPTPSFIGS RAESNVSGTS SLFPGYATNV GGASTATSLE DGGTADEASR QTNAFAVTDD SAAIANIEFC RPDLLACINC VSSSSNTHTT SNTPTSLNSA KFADSTTPAR SNRKMGPVLL ELRKLQLNEG THGNHDFFTT SQISVSRGNP SLGMSMSSTC LHVSPQAMRT GENALHRPPP PIATGLTTGA LCIHTFVIKS DGDNENNQDW TPNVEYYHTP RHHRASTAVQ WCPTTVRPQH VAIGLLSASS SGSHPTTNSV VPGRRGVSGG GVAASVGLGA RSASTGDKDY CCFVWDVEHQ SASRRTKTSP IYKLSHQSGV ASLGWLMGGE TLAIGGQLRH VQLYDLREAT TSAPMTVMAH NFAVHGIVPD PHKSWQFATY SRVSNEPVKI WDCRRMDTNL TEIKIPSQSI SPSSVSGVTP PVSQVHWSPL EAGFLSVAVG DAIYEFDTTT PASRPIHVNT MYARGSVLDV ALYPFVAEMG TAKEASQPTH SWRTLPQSYD GSLYRQISTR FSTPYHCSVG SVQQGWQVGA LHRTQGPAAI ERLTAAQDED VSAVMLRRAR CIQSINYSMD PSANIQILAH DGSGVDSLLR LWSWIERVEV LCSSTETDDG WDDGMSWPAK TLMDAGAWRL LHIAGCGEGE IRGFSEHSCC SIYDSPGRRA ALTSCGWAGR FDLSTVMGEC EELGEYERSA ALAVWHDDIG AAVDCLQRGA SVIRQQMKRG GESVNMYCSS EHAETLDLVS FCVAGHRGDS MDSPASGIWR RTCATLMKRS SFSGQSRCFA YVRGMLKFLM TSGSDQGHDE VLLCDDLSLC DRVGFACRFL SWNELLQYLE TCIVNCQRSG DIEGMIITGL EKEGIKILQS FVDRTADVQS AALITSRVIF PVGWNGERRA SIEWLESYRS LLNTWQMWQS RALFDVDRAD LLRKVKSRQF DASGKFGSVP ISRRQVSAGG KPGLRQPDPD IQATIPAQLD ARCNYCSAPL SLKLKDTHAN QWLSKMKPVL PCCAQCRKPL PHCAICMLSM GTLNPYMELT KDRSGRSSRS GLSSLQTADD MSSLGNLPFA EWFTWCLRCK HGGHAHHLVG WFAKHEVCPV SGCDCHCQFD GIHELNRYKQ SSERVTNENE QDTTSNTEAD
|
| |