Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21667 |
Symbol | |
ID | 7202588 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 733423 |
End bp | 736847 |
Gene Length | 3425 bp |
Protein Length | 976 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181619 |
Protein GI | 219122578 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0681414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACAAGAA ATACTTTGCG ACCATGGCGA CCCAGCAGCT TCAACAAATT ATTTCCGAGA CGCTGTCTCC TTACGCGGAA ACGCGAAAGA CTGGTACGAT TTGAGGGTGT CGAAAAGATC GACGTAGTTG GTGTCGCGAA AGAATTAACC GACGGACCGG ACCGGGCCGT CACTGCTCAC AAGCACTGTT TGGTTCCTTT TTCTCACCTC TTCTTTTGTC TCCCTGCACC AGCCGAAGAT CATCTGAAAG CTGCCAAATC TAGTCCTAGT CATCCGCTGC AAGTCCTGGA AATTGTCGCC AAGGCGGACG GTAACGACGC GGCGGTGCGG CAAGCGGCTG CGGTGCACTT TAAAAATGTT GTCAAAAAGG GCTGGGACGT TCAACGGGAG GAGGGTAACG AGGGGATCGT CATCAACGAT CAAGACCGTA TCACCATCAA GTCACATTTG GTTCAACTCA TGTGTACAAC GCCGCCACAG ATTCAAGTAC AGCTCAGCGA AGCCATCTCC TTGATTGCGG CCGTCGACTA CCCAAAAGCC TGGGACAATC TACTGCCCGA ACTCGTAAAG CAATTTCAGT CTCCCGATCA GACGGTGGTT AACGGTGTAC TGAAAACTGC CAACGGAATT TTCAAGTCGT TCCGATTCGT CCAACGATCC GACGATTTGT ACGGGATTAT CCTTTACTCT CTCAATATTG TGCAAGGACC ACTTTTGGCT CTTTTCAAGT CCACCGGCCA AAAGGTGGAC GCCGTCGCCA ACAATACGGC TCAGCTCAAA CCACTCATGC AGTCGCTACG CCTCATGTGT CGCATTTTTT ACTCGCTCAA CTACCAGGAC TTGCCTGAAT TCTTTGAAGA TCACATGACG GACTGGATGT CCGAATTTGC CAAGTACCTC ACGTACCAAA ATCCGGCCTT GGTGGATACC GACGAAGAAC TCGAACCCAG TCCGATTGAT ACTTTGCAAG CGGCTATTAT TGAAAATTTG GCCCTTTACG CGGACAAGGA TGAAGAGCCA TTTATGGAAT ACCTACCCAA CTTTACTCGA CTCGTTTGGA ACCTGCTCAT GACGATCAGT GCCTTCCCGA AACACGACAG TCTCGCTACC ACCAGCATTC GTTTCCTGTC CAGTCTCGTC CAAAAACGAA TGCACCACCA CCTTTTTCAG GAAGAAGCCA CCCTCCGCGA AATTGTTTTA AAAATCGTCA TTCCCAATTT GCTTTTTCGC GAGTCCGACG AAGAACGATT CGAAGACGAT CCGAGGGAAT TCATTGTCAC CGAAGTCGAA GGTTCTGACA GTGAATCTCG CCGTCGATGC AGTCAAGACT TGCTCAGGGC CATGTGTCGC CAGTTCGAAA CGCAAACCAC CACAATCTGC TCCGAACACG TTGCCAGTAT GCTGCTCGAG TTCACCAATA ACCCTAATGG TAAATGGGCA TCCAAAGATG CCGCGGTACG TACCCATTAT TGGGTTCATG TGCGAACGTA CGTAATTTGA TCCGCATTGG CTTACCCTTG CGCACTCATT TTTTGTCCTT GCACAGATTC ATCTCATGAT GGGCATTGCC ATTCGACGAG AGAGTTCATT GGGGGTTTCT GAGCTCAACG ATGCGGTCAA CTTGATGGAC TTTTTCCAAT CGCAAATCTT GCCAGAACTA CAGGATCCGA ACCATTCGAA TCGACCAGTG GTCAAAGCGA CTGCAATCAA GTTCGTCAGT GTATTTCGCC AACAGTTTAC GAGGGAGCAC TTGACTCAGA TCATGCCCAT GCTGATTGCG CAACTCGGCT CACCAGCGGT TGTAGTCCAC ACCTTTGCCG CGTATGCGAT TGAACGCATT TTGTATACGA AAGAGACCAT CAACGGAAAA AAGCATCCCA AGTTTGGCGC GGCCGATCTC CAACCCTTTT TGGAACCCCT CTTCAATGGA CTGTTTGCGA TTGTAGACAA CGTGGAGCAC AACGAAAATG ACTACGTCAT GAAGTGCATC ATGCGATCTT TGGCGACGCA AGGCGAGGGT ATCATTCCCG TGACACAGAT TGTTCTCACC AAACTGACTG CGGCATTGGG TCGCGTCGCC AAGAATCCTC GCAACCCACA GTTCAACCAC TTCTTGTTTG AGTCCATTGC CGTCTTGGTT CAATCGGTTT GCTCCGTAGA CCGCAATGCC ACTGCACTAT TCGAACCGCT TTTGTTCGAA CCATTCAATA TTGTGTTGCA AATGGATATT GCGGAATTTA CACCTTATGT CTTCCAAATC TTGGCGCAGC TACTAGAGTA TCGCCCGACT GGCTCGGGTT TGGGGACGGC CTACCAAGCA CTCTTTTCCC CGTTGCTGAC CCCGGGTCTT TGGGACAAGC GTGGAAATGT TCCAGCGTTG TCACGTTTGA TGCAAGCCTA CATTCGTAAG GCGGCACCGG AACTGGTGGG ACAACTTAAC CAGATACTGG GTGTTTTCCA AAAGCTGCTT TCATCGAGAG CTACAGAGGC CAATGCGTTT GACTTGCTGT CGTCAGCAAT TCTTCACTTT CCACAAGAAG AAATGGAAAC GCGCATTGCT ACAATTTTTC AGCTTGTGTT GACACGGTTG CAAGCGGGCA AAACGCCCAA ATATGTCCGG CTTTGCACGC ATTTCTTTGC CCTCTTCATT GGCAAGTATA GCGCGAATGT GTTTATGGAT CGTATGAACG CAATCCAGAA TGGCTTGTCA TTGAATTTGT TGGAGCATGT ATGGATCCCA CGCGTGACGA CGGATCCCCC GGTCCAGCGT ACGGAAGCCA AAGTGCAGGT TGTTGGGCTC ACCAAGCTGC TTTGTGAATA CCCCACCCTG TTGAACGATG CCCATGGGCA AGCCATTTGG TCAAAAGCAG TCGTTGCCAC AATCACTATC CTTACATCAT CATCATTTAA AGCCACGGAA GAAACAGGTT TAGATGAGGA AGAGATCGAA ATCGGGTATG ATGCCCAATT TTCACAGCTC AAATTTGCGA GAAAGGCCGC AGAAGATCCC TTCCCAGAAG TTGCGGACCC TACACTTGGT TTTGCCCAGG CTCTTCATCA AGTTTCGAGT GCACATCCGG GACGTATATT GCCCTTAATC CAGCAGGGGC TGAACGGGGC GGACCCAAAG TTGTCGGTTG GTCTGGAATC CATGCTACAA GCCGCCAACG TGCAACTATC GTAAAATCTC ATGTGTGTAT CGGATACGAC TCCATGAATG CTTCGACTTT TATTTGATAA AGCTCTCCAC TTTAGAACAC ACACATAGTT GTGCTTCTAT AATTTCTGAA TAGGCAAGCT GACCGAGAGA TCAACATCCT TTTTACGTCA TCCAAGGCCA AAATGGGCTG CTCATGACTG TGGCTTACAA AACAAGTAAC GGATTTGAAA AGCCGGTCTA AAGAAGTATG TTTATAAATG TTTTA
|
Protein sequence | MATQQLQQII SETLSPYAET RKTAEDHLKA AKSSPSHPLQ VLEIVAKADG NDAAVRQAAA VHFKNVVKKG WDVQREEGNE GIVINDQDRI TIKSHLVQLM CTTPPQIQVQ LSEAISLIAA VDYPKAWDNL LPELVKQFQS PDQTVVNGVL KTANGIFKSF RFVQRSDDLY GIILYSLNIV QGPLLALFKS TGQKVDAVAN NTAQLKPLMQ SLRLMCRIFY SLNYQDLPEF FEDHMTDWMS EFAKYLTYQN PALVDTDEEL EPSPIDTLQA AIIENLALYA DKDEEPFMEY LPNFTRLVWN LLMTISAFPK HDSLATTSIR FLSSLVQKRM HHHLFQEEAT LREIVLKIVI PNLLFRESDE ERFEDDPREF IVTEVEGSDS ESRRRCSQDL LRAMCRQFET QTTTICSEHV ASMLLEFTNN PNGKWASKDA AIHLMMGIAI RRESSLGVSE LNDAVNLMDF FQSQILPELQ DPNHSNRPVV KATAIKFVSV FRQQFTREHL TQIMPMLIAQ LGSPAVVVHT FAAYAIERIL YTKETINGKK HPKFGAADLQ PFLEPLFNGL FAIVDNVEHN ENDYVMKCIM RSLATQGEGI IPVTQIVLTK LTAALGRVAK NPRNPQFNHF LFESIAVLVQ SVCSVDRNAT ALFEPLLFEP FNIVLQMDIA EFTPYVFQIL AQLLEYRPTG SGLGTAYQAL FSPLLTPGLW DKRGNVPALS RLMQAYIRKA APELVGQLNQ ILGVFQKLLS SRATEANAFD LLSSAILHFP QEEMETRIAT IFQLVLTRLQ AGKTPKYVRL CTHFFALFIG KYSANVFMDR MNAIQNGLSL NLLEHVWIPR VTTDPPVQRT EAKVQVVGLT KLLCEYPTLL NDAHGQAIWS KAVVATITIL TSSSFKATEE TGLDEEEIEI GYDAQFSQLK FARKAAEDPF PEVADPTLGF AQALHQVSSA HPGRILPLIQ QGLNGADPKL SVGLESMLQA ANVQLS
|
| |