Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35237 |
Symbol | |
ID | 7200595 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 410727 |
End bp | 414474 |
Gene Length | 3748 bp |
Protein Length | 1052 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179843 |
Protein GI | 219118123 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGATA TTGCTCCCAA CAACCCTACC ATTGACGACG CGTTCAACGC TGTTGTCGTT GACGGAACTG TTCAAGCCAA TGTCGCCCCT GTTATCGAAC ATCCTAACGT TGCGAACGAA AACCCTGTCT TTCCCATTGT TGTTAACCAT TATGACAATC TTCTTGCCAT GACCGGAATG AACATTGGTA CCCGCAAGGC TCTCGTGTCG GAAGGGTTTG GCTCTATGTC CTCTTTGGTC CGCTTGACCA CTAAGGACCT TGACTCCCTT GTCGATATGA TGAACAAGCA ACACCGTGGA AAATCCTTCA AGAGCACTTA TCCCCCTGGA ACGGCTGATA ACGAGAAGGA GATCCATATT GGCTTTAACT CTAAGTCAAC TCTTTCGGTT ATCATCCACT GGGCTGAACG TTTGAAACGT CTTGGAAAAG AGGTTCGTGC TGAGGACTAC ACGAGTGAAG TTGATCAGCT TGCCCGCGAC CGCATGGAAG AGGAAAAGGA AATCCGCGAG GTGGCCAAGA CCCAGACACC GGTCAAGCCT ACTCCCCTCA AGGACATGAC CAAATGGCGT TCTTTCTTCG AGAATTGGAA CCAGTACATG AGTTGCTGTC GCGGTCCCTC GCTGACCCCC CTCACGTATG TTTACCGTAC CCAGGAGGAG CCTGAAACTG CCCCCCTTGA AGATTATGAC GACATGGATG CTTACTTGGT GGCCCAGACG ATTTTATCGG GTCCCAAGTT TGTAATTGAC AACAAGCGAG TTTTTGATGA ATTCAAAGAG GCAATTACCA CGACCGGCCC AGGATGGGCT TTCATTAAGG AGTTCAACAA GGGCAAGAAC GGTCGCGCTG CTATTTTGAA GTTGAAGGCA CAGGCAGAAG GAACCCTCAA CGAATCTATT CGTCGGGATG ATGCAATCAA AATCTTGTCA ACAACAACGT ACAGCGGTCC TAGTCGCAAT TGGAACATTG ACACCTTGTT GCAGAAATTC CAATATGCGG TCTCTGAGCT GGTCGAAATT GACGGGGCCT CTTTACCGGA TGGCCAGCTT GTCACCTACT TGGTGCAGGC ACTCAAAGAT CCCAGTTTGA GTTACGTTCG CGACACTATT CGCACTAATG CTACCTACCG GAACAGTTTC CACGAGGCTC AGCTTTTTGT GAAAACGTTC GTTTCTTCAT CTTCGATCAA GACCGAAACA ACGCCCCGAC AGATAAATGA TGTTCAGACG TCCAATGGTG GATCTGGTGG TGGAAAATCA GGTGGCGGTG GAAAAGGAGG AAGCAAGTCG GGCGCTATTA AGGGCCCAGT TTTGGCTCGT AGTTATTCTC CAGGCGAATG GAAGAGTCTT ACCAAAGAGC AGCAAGAGAA AGTGAAGTCT ATGCGTTCTA AGAAGAAGCA AGTGAGCAAA CCAAGCGAAG AAAAGGAACG CAGTGTTGAC AGTGTCTCGC GGGAGGATAC CGCAGGCATT AAAGAAGATC ATGCCAATGA TAAGTCTCAA CCAATAGACG ATGCTGCCGG TCTTCAATTT GGACGTGGTG CACACAAAAA ATCGGTTGGA TTTGTCATGG ATAAAGCTTC TCCCTCTGGA AACGGAGCGA AGAAGCAGAA ACTATTGTGA AATGCGGCCG CGGTGGCTTG CGTTTTGGGG ACTGAAATAC GCAAGGCACC GGACAGAGAA ATAGTTGCCC TCACCTCTAC ACGCAGTATT GGGGATGTCA ATTCAGGCTC GAATCACCAC ATTGGAGCTA GATGCGAATT GGATTCCCAT GCGGACACAT GCGTTGCTGG GGCTAACACT ATCCTCATTA GTGACTCGCA AAAGTCCGTT ACCGTTCGAC CTTTTTCGGG GGAATACTTA GCAATGACAA ATATTCCTAT TGGAACGGTC GCTACGGCCT ACACGGTACC AGATGATGGA CGAGTGGTGA TTCTAATCAT TAATCAGGCT CTCTACTTTG GGGACCGATT GAAGAACACA CTACTCACCC CAAACCAGAT GCGAGATTAC GGTATTGAAG TCGACGATGC CCCTCGGCAG TATGTTGCCG GCTCTCAACA CTCCTTATAT GTTCCTGATT CTAACTTACG GATTCCGCTG CAGCTGCGCG GTATATTCTT GTTTTTGGAA TCGCGGAAAC CCACGCAACA GGAAATGGAC GAGTGTGAGC ATATTGTACT CACCTCCGAT GCGCCGTGGG AGCCATGCTC GGTTGAATTT GCCAATCGAG AGCAAGAGGC CGTCAGAAGT GACCGTCGCG TATCTTTGGT GGACTCTGGG GGAAATTCCA CTGGCCAGGT GCATTACATA GCATACCCGA GTGACACTCG GACTGTTGCA GCTGCCCAAC GAGTACTCGA AACATTTCGG TCCCTAACCG AAATTGAATT TTGTGAGACC AAATTGGTGG ACCGCTTAAT TGCTTGCGTT AATGTCGCTT CGGACGACCA CTGTGGAGAC GGGTTGGACG GACGTGCTGA TCCGGACGTA TACCCAGCTT CAGACGACTT CATAAGAGTT GTCTCCGGTA TGACGTCAAG CGAACGAAAG TCAGCGTTGA CTCCTGAGGT TTTGTCGCGA CGATGGAATA TTGGCTTAGA TTCGGCCAAA CGGACCCTGC AAGTAACAAC ACAGAAAGGC GTGAGAACTG TGTTGCACCC TTTGACTCGA CGGTACCGGA CCCGGCAGTC TCATCTGCGA TTCCCTACCA TTCGGACAAA GGTTTACACA GACACTATGT TTTCCTCCGT TACTTCCATT CGTCAGTACA AGTGTGCCCA GGTTTTTACC ACAAACACTG CCTATTCGCG CGTTTACCCT CTGCAATCCA AGCAGAATGC CCCGGATGCC CTCATGAAAT GGATCCAGGA CGTCGGGGTA ATGAGTGACC TTGTTTATGA TGGGTCTAAA GAGCAGGGAG GTGGCAAGCA TTGGCGTGAG ATTGAACAAC GCCATCACAT ACAACGGCAC GTAACGGAAC CACACAGTCA GTGGCAGAAT CGAGCTGAGG GGGAAATCCG AGAGATTAAG AAATCTGTTC GCCACCGTTT ACAAGCATCC CGAGCACCAA AACGACTTTG GTGCTTCTGT ACGGAATGGG TGTCGGCCGT CCGACGGTTG ACAGCCCTTA GCCTGCCTGC TTTGAACGGC CGCGTCGCAA CGGAACTTCT GGAGGGGGAG ACCCCAGACA TTTCAGAATA CGCACAATTT GACTGGTATG AACCAGTATG GTTCATTGAT CCGACCTCTT CTTTCCCGGA ACCTAAGCGA AAGCTTGGTC GTTGGATTGG GGTTGCATCG GATGTGGGGC AGGCCATGAC CTTTTGGATT TTACCAAAGT CCTGCTCCCC AATTGCGCGT TCCTTGGTAG CTCGAGTTGA CCCTGATGTG TCCTGCACGG ACGAGTTTAA AGCTGATCTT GCCATGTTAG ACTTGTCCAT TGACAATAAA ATTGGCAATA ACAAAACTGC AGAACAGAAC AAAGAAATTG ACAGCTCGCT CGGCAACCTG GTGTCGGGTC CAGCTGACGA TTTATTTGAA AAGGTAGCCA ACAAAGAGTT TTATCCGCTC GAAGAGGCTG CTGAAAAAGC CGAAGCTGAT GATTTTACGC CCGAATCCAT GGACGAGTAC CTGACAGCTG AAGTTTTACT TCCATTTGGT GGTGAATTGT TACGTGGGGT GGTTAAAGCT CGGAAGCGCG ATGCCGACGG AAACCCTCTG GGGACTCGGA ACAGCAACCC AATTCTAG
|
Protein sequence | MSDIAPNNPT IDDAFNAVVV DGTVQANVAP VIEHPNVANE NPVFPIVVNH YDNLLAMTGM NIGTRKALVS EGFGSMSSLV RLTTKDLDSL VDMMNKQHRG KSFKSTYPPG TADNEKEIHI GFNSKSTLSV IIHWAERLKR LGKEVRAEDY TSEVDQLARD RMEEEKEIRE VAKTQTPVKP TPLKDMTKWR SFFENWNQYM SCCRGPSLTP LTYVYRTQEE PETAPLEDYD DMDAYLVAQT ILSGPKFVID NKRVFDEFKE AITTTGPGWA FIKEFNKGKN GRAAILKLKA QAEGTLNESI RRDDAIKILS TTTYSGPSRN WNIDTLLQKF QYAVSELVEI DGASLPDGQL VTYLVQALKD PSLSYVRDTI RTNATYRNSF HEAQLFVKTF VSSSSIKTET TPRQINDVQT SNGGSGGGKS GGGGKGGSKS GAIKGPVLAR SYSPGEWKSL TKEQQEKVKS MRSKKKQVSK PSEEKERSVD SVSREDTAGI KEDHANDKSQ PIDDAAGLQF GRGAHKKSVG FVMDKASPSG NGAKKQKLFI GDVNSGSNHH IGARCELDSH ADTCVAGANT ILISDSQKSV TVRPFSGEYL AMTNIPIGTV ATAYTVPDDG RVVILIINQA LYFGDRLKNT LLTPNQMRDY GIEVDDAPRQ YVAGSQHSLY VPDSNLRIPL QLRGIFLFLE SRKPTQQEMD ECEHIVLTSD APWEPCSVEF ANREQEAVRS DRRVSLVDSG GNSTGQVHYI AYPSDTRTVA AAQRVLETFR SLTEIEFCET KLVDRLIACV NVASDDHCGD GLDGRADPDV YPASDDFIRV VSGMTQWQNR AEGEIREIKK SVRHRLQASR APKRLWCFCT EWVSAVRRLT ALSLPALNGR VATELLEGET PDISEYAQFD WYEPVWFIDP TSSFPEPKRK LGRWIGVASD VGQAMTFWIL PKSCSPIARS LVARVDPDVS CTDEFKADLA MLDLSIDNKI GNNKTAEQNK EIDSSLGNLV SGPADDLFEK VANKEFYPLE EAAEKAEADD FTPESMDEYL TAELGSAMPT ETLWGLGTAT QF
|
| |