Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46863 |
Symbol | |
ID | 7204422 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 649792 |
End bp | 656793 |
Gene Length | 7002 bp |
Protein Length | 1597 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185758 |
Protein GI | 219121053 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAAAA AGACAAAGAA GTCCGCCTAT TTGCAGAAAA ACAGCTGGCC ACGAGAGTTC GAGTTTCTAC CTCCTTCTGG CGACGCCCTT CCAGCGGGTC CCTTCTTTCC GCCAGCTATC TTCACCGATG TCATCTACAC GAACGGTGTT CGTGTCGATG CTCCTTTGAT CATGGAGGGC CATAAAATTC GGGCTTCGCT CCCATCCCAC TCGTACGCAT CGCACGGTCG TCTTCCTGTC CTCACGCCAA CTGATTCAGC TATATGGAAG CAAAATATCG CAGCACAGTC AGCCACTACG GATCAATTTC CTGTGAATGT TGGTAGATAC TTGGGGACCA TCACACGCAC TATAACCGAC CGGGGCGATA CGGGGGGTCC TTCTTGCGTC CCTTACCTGA TTGCAGAAGC CGGACACCAG TTGCGTAAAT GGAAGAAACA ACCTACCAAT GCTTCGGCCA AAAAGGAGGC ACTCCTCGAA CGTGGGCTTG GAACGTTTAA GGAAGCTGAA ATTGACAATC GCGTCAATGA GATTTTGGAA GAATGGAAAC GCATATCGAC GGAACCCAGG AAGAGCGCGA CTTGGGATGC GTTTGGGAAT CTCCCACCAA AAAGCGTAAA TGACAGCTTT TACGTCCCCT TATCTTCACT TTTGTCTTGG ATTAAGGCTG AAGGCGACAA CCTGACCGAT TCCCATGCTG GTGTTGTTCG ACAAATTTGT CATGAGCGGA TGTGTATTCC GGTCGACTCT GTGAGTGTGT TAGAGGCCTG TTTGAACCTT GCCGATCCCT CCCATCTTTG GCACATTCCA TTGATATTTG TCTATCCCGT GGTAATGCAA GTCGATGCAT CTAGCAAACC CCAAACTACT GCATCGCGTC CGTCAAAACG CTCAAAGCCT ACGTGTACGA CTTGGAGGTT GAAACTGGGC ATCTACGCTC ATCGCCTCTT GCCAGAAGCC ACCACGACCG TCCTCAAAAC AGTAATGGCC GCTCTCGATG AAGGTTCGTA CAGGTGTACA CAAGGCCTGA GCCTCCCCAC TCATCCGAAC GAGCCTTCTT TTGATCCGAG TCCGTATCCA GTGGTCTTCG TCGACGATGA TGATTTGCAA AAAACTTATT CAGCGTCATC GAAGACGGAA AGTTCGACTT TTATAGATTC AACTCGTGAA GAATCGTGTA TTTCCGCTTA CACAACGAAG GGCTTGCTCA AACTACTGGA GAATCGCGGT TGTGACATCT CAAATTGGGA CGAAATTGCC CCCACTTTGG GAGCCTTGCA ATTGGATCTG ATGCTGCACC AACAGCATGC TATTTGCTGG ATGCATGAAA TGGAACACTT ACCAGGGTTC GGCATAAACA GTATTTTTTG GGAGGAGCGC GAGTTTGGTG ACCGAGGCAA ATACTACTAT GCCCCAGCAT TGGGTCAGTT GCGTTTACAT CCGCCTCCGA CTATGAAAGG TGGTTTGCTT TGTGACGAAA TGGGTTTGGG AAAGACGATC GAGATCATCG GGCTTGTCCT ATCGACTCTA GACGAGCTCA AATCCGAAAC TCAAAACGCT TTAGACATGG ACAAAACTCA CGCGACTCTT ATAGTTGTCC CTCCCGTCTT GGTGGAGCAG TGGCGGAACG AGATTGTCAA ATGTGCTGGT CCTTCTTTGC TTGTCGACGT CATGGAAATT CAGAACAACG AAGTTTATTG GCGTGGCGAT GTCCATAATC CTGATATCGT CCTGGTATCC TACAACGTGA TGCAAAAAAT GACATCGGTG AAACATTTGG CCAAAAGGTG GGGCCGGATT GTTTTAGACG AGATGCAAGA GGTTCGCTCG TCCACGACAA AGATTGCTCG AATGTGCGAG AAGCTGAAGT CTGATCGACG CTGGATGATT AGTGGTACGC CACTGTTCGA AGGCATTGAA GATTTGAAGG GTGAACTGAA CTTTTTGCAT TTAGAACCAT TTTCTGCTAA CAGTGAAGAC GGATTTTTCA ACTTTGCCGT TACAAATCCG TGGGAAAGCC ATCAAAAGCA AGTTATTGCA ACACTAGGGG TCTTGGGGAT GATAATGCTA CGACGCTCGA AGAGTATGAC AATTCATAAA ACGCACCAAC CAATTCTGGG CTTGAAGCCG CTGACCGTAG AATTCATCCC TGTAGTACAA TCGCTCTCGG AACGTACTCT CTACTGCTTT CTCGAATGTA TTACGGCGCG CGAGTTTCGG TCCACGGAAC GGACCGAAAT CGAGAGTATA GCAACTAAAG AGGGCAGGTC ACGATGTCTC TACCTTACGC TTTTACGAGA CATTTGTAAC TCAGCGGTAA GCCGAACGAA AGAGTTTTCA GCTAAAAGCA AAAATGAGCT GCGCAAGATA CTTATCTAAT GATCATTCTC AACAGGTTCT TCTAAATGGC GGCATGGGAG TACCATCCCA GCTTGGAAAG CTGAATGCGA TGCTGATTGC TCAACATCGA CGAGCAACTT CGAAGCCATT GTGGGATAAA AGTCTTAATC AGGGCGTCAC AAATCTTGTA ATGACTTGTG ATGAGGCCAT CGTTCATTTA GCGCAAGTGC AAGAAGCAAC CCGTGCTGGA GACGGCGAGG TGGCTTTTTT GCAACTTGGA TTTGGACAGG GACTTGTCCG ACGAGTGCGT GCCTTCGAGT CCGTTGAAGC GCAGCTCGAC GCCACAAGAA GCGCGTTGGA TACCGCTATA CGTAGTAATG CTGCCGCAAA GCGGCAAAAA GCGAAAGTTC TTTGGCATTT GGCGCTAGAA AAAGTTACCA CAGGGTTTTT GCAAGATGCC GACGTTGTTC ATATCCGTAC TCGTGTGAAA AAACTCTGGA AACGGAGGTA TTCAGTCATT TCTCAAGCGT CAAACGTAAA ACAATCAATT TCCCATACTT CCGATTCTCC GTTTAGCGAC ACCGCCAGAC CTCCTTACGA CGAGGAGCCC TACGAACATG CTATGTTTTT AAGGGGATGG AGGCTCTCCA ATATTTGTCT CGTAGACTAT TTCTGGGCAC ACCCAAATAG TTTCCGCATC GAAGGCATTC CTCCAGAATT ATCTATCGAT GATGTCGCAC ACACGTTGGC AGTCAACTTT CGGAGTGGGG ATGGTTCGCC AGCGGCCATG CGCGACTACT GCCTTGCGGA ACATATTACA GACTTGTATG CTCAGGACAG CTATCGCTTT TGCATTCTTC GCGTTCCTTC GTCATCTACG ACATCGACTT GGACAGCTGG CTTGGCGTTC AAATCGCAAA AAGATGCTAT GCTTCTGTCA GGGCAGACAA AGAAGGTCCA TGGAATTCAG ATTGCTTCCA AGCATATTCC ACCACTCATG AAGAGAAATA TTGATGCTTC TATCGCGGAT TTAGAGGAAG CGAGAGCTTT GCAAGCGGTA AGTCTTTAGC TCTATTGCGA ATCTGGGATC CCAATTACTC ATCCTGCCAC GAAACTATTT GCAAAGCGAC ATCCAACCCT CGAGAATCGG AAAAAGCTCA CGCTTGCGAA AAACGAATTT GAAAAAGCTA AACGCGGCCT TTGCATATGT AGCAAAGGCC AGTATAAGAT GCATGCAGGA ACTGATTGTG AGAATCTGGT ATGGAGTTCG GCTCGAGTTG CTTATAGATT TGACGATCTA GAAAGTGAAA AACTTATTTC AAGCTTATCT GCTGTGCGAC GTGATTGCAG TGCTACTATT GGTGAAACAG ATCTCAAGAT ATATGAATAT CGGAATCGTA TGCAAAAGTT GCGTTCCATT GCCGAAAGAA AGTATTCGGA AAATGTTCAG GCTATGTCTT CCTTTGATGC GCTACAGGCT TTAGCAAAAG GAAGATTTAA TGATACGCAG TGCATCGTAT GCTTGGGTCA TTTGGGATCG GGTGGATTGT GCGGGGACGA TGCACCTTGT CGGCTCTGTA ATAGTGATGC AAAGAGCGCT GATATCAGAG GAGCAATAAC TATGACCAAA TGTGGGCATT TATATTGCGT GAAGTGTTTC GAGCAATGGA GTCGAACCCA ATCGACTGTG GCTTGTATTG TATGCCGAAA AAAAATAGAC AAAAACTCAG ATTTCGTCAC GATCGATCCG ACGGATAGGG AAAGCCCAAG TTTTATTGCT GATAGGCGTT CTGAGGCGCG ACTTATAATA AAGAAAGCAT CCGACATGAT ATCAGAAAGC AACGGCGAGC TAGATCCTGA ACTTTGGGAG CAGCTCTATC ATTCCATCGA GATCCCTCAG AATGTTGACT CAAGTCGCCA CTGTAAGGTA TCGGTACTCC CTGGCGATTT TATGGGACAT CTGCGCTCAT GTACAGGTCT TCCAGTGCTT TGCCATCCTT CTCAGACACC GGTCACCTTG TCTGCACAAA GTCAGTGCCT TTCCAGCAAA ATTAAGGCCC TTCTCCGCGA TTTGCCATCA GAAGAGCGAA GCGTTGTCTT CTCTAGCTCG GCCAAAACGC TCCATCACTT AGAGGTCGTT TTTAAGGGAA TCGGTATTGG ATACCAGTCG TTGTTCTCAG GACAAGCAAC TTCCACAGCA GAACGCGCAG TAAGTGAGTG GCACGCTACC AAAGCTAATG CGATTCCTTG TCCAGTTCTG CTCGTCCAAG CTGGAGCAGC CGCTAGTGGT CTAACATTGA CTAAAGCGTC AAAAATGTTT ATCATGGAGC CGTTTGTGCG TCACGAAGAA GAGCAGCAGG CGTATGCTAG GTGCCATCGC TACGGACAGA CAAACACTGT GCACTGCAAG TGCTACTATA CACCAGTTTC CGTGGAATCC CGTCTTCTGG AGTGGCGAAA GCGAGCAGTA AACGCTGGAG TAGAGGCTCT ACCCCTACAG GAAGGCGAAG CTCCGCAAAT CATTTACGCT CCGCTTCTGG ACCTAGAAGA TGAGGCGGAG ATTGAAACCA GTGAAGTGTC ACAGATGAAC TTTCTTCTTG GATTAGGGAC GGAGCAGATA CGATAGGTCA TCGTAAGCTT CTTTAATAAC ACAAATGACA AGATTAACAT ATACATTGTA TTGTCACGTC CCGCAAAAAA TAGCAGCCCT AACGTCTCGA AGAATTTCGT CATCAGATGA CGGTAACGGC TCCTCTTCTC CTCCCGGGGC TTCTAAGCTG ACGTTTGATT CGCGAAGAGT TTGGCTCTGT GTTCTTCCAC GTCGCACATT ACCAAGTCCA TTGGAGACGG TCCATCCAAT CAAGCTAGAA ATTAGAATTG GCCCTGCGTC ATCGACTGAA CCACTTAAAG TGAGTATCGT CACGGCAGTA ACGGACAAGG GAATGGGAGA CGCTGATACA ACACATGCAG TCATGCAACA TGGTAACGTC AAGGCGAATG GCAATCCTTC GACAACTGAA GCGAGCGCGG CCCCAACTAG CAGGCCAACA ACACCAGCTG GAACGAGAAG GCCCCCGACG AGTCCACACG CAAGGCTCAT ACTCAGCCCC ACAATTTTTG CGAGGGCAAC CATGAGCAAT TGATAAACAG CCAGTGTCTC GTCTTCGTTC GAAATCACAT CGCTCCAGAT ACGCGACGCT AAATCAATTC CACTCCCGAC GGCCAACGGG TCAAAAATAC TGATGAGACC GTGAAGGAGC CCGCCGAGTG TTGGAAAGAG TAGTACGGGC AGCCAAGGGG ACGATCCCAT CCGGTTTGTC ATGACCGTCT CTATGGACTG CTTGGAGCTT TGTAGAGCTC GCGTTAGCGC ATGAATCGTC AAACCCACAA TGCCACCAAC CAGTCCGAGT GGAACCGCGG CGGCCCAATG CCAAAGTTCC AAGTTGATAC CTACCGCCAA AAAACCCACG ACCTCGACCA TGCTGGGGCC CGGAAGGAGA CGACCAATGA GCAAGGTAGA AACTGTAGCA CAACTTTGCA AAACGAGACT TTCCATGTAT CCGAATCGTG ATGGTAATGA TTGTCGTTCG AGAGAGTGCG AAGAAGAGCT TTCATCCAAA TCAGTTAGCT CGACTTGGGT GCCGTCTATG TTGGCAAGCG GTGATCCCTC CATAGACGAT TCACTGTCAA GCTCCAACTC CAAATTGCTC CCAAAAAAAA GTTCCGTAAC GAGTAATGGT CCCACTAAAG GAAACGCCGA GAAAATGCTT CCTACAGCTG CCGCCATTCC CGATTGCACT AGCAACACTT GATGACGCGA CGAGATTCCC GTTGATTGCA GCCTCGGTAC GACCCTGATC AAGAAACCAG CGGCTAACGC TCCAATGCTT AGAACAAGAA TCTCTGGTCC CAAAGGGGCC CCCGTCGACA AAGCCACGAT ACTTGATAGA ACGGTCAGGG TCCATTGCTG CCAACAGTCG GCTGCGTCGT ATATACACGG TATCATGTTT TGTGGTAAGC TCGGAGTTCT GCAGTTTCGG ATCGAACGCG GTTGGGACGA GGGCAACAAA CAAGCCTGCA GACAACCGCA AAGAAATCCT CCGGTGACGG TAGTGTAGAC CCATGACAAT CGACTCTTCC ATGGTAAAGC GTCATCGTCT GGTTCCATCC AACACTGTAC ACTGCAATTG AACCATTGTG ACAGGACCAG CTCGATCGCT TTGAGGGAAC TGTAAATTCC TATCCGAGTC ATTGCACCAA TCAAGAATAC CAACGGCAGC CTCAAAAGCC AAAAGGAAGG GTCCCGGGCG AAGGATCTCG TCGACGACAC TGCTGATCGC GATATTACTG GAGGATTTGC ACCAGGTTCT TCGTAATACT GGACCGACTG GGGAACCGCA ATTTCTCTCG CGTCGACCCG CACCTCCGTG TTGTAGTCGT CATCGTAGTA TGTCTGTTGA GCCCCGGGTT GTCCTTCTTC GGTTGCGGTA CGATTCCGAA TCTGAAAGGC GTGCCTTGGA GGTGTGCGGG TAGTACCCCT CGCGGCGGGA TTACGAATGG TTGATCGAAC CGGCAACAAC GGCTGTTGTA GACCTCCTGT GGCATCGTTA ACGATTGTAG CATCCTCGTT CGTGTTGATC GTCATAACGG AATAAAATCC AG
|
Protein sequence | MTKKTKKSAY LQKNSWPREF EFLPPSGDAL PAGPFFPPAI FTDVIYTNGV RVDAPLIMEG HKIRASLPSH SYASHGRLPV LTPTDSAIWK QNIAAQSATT DQFPVNVGRY LGTITRTITD RGDTGGPSCV PYLIAEAGHQ LRKWKKQPTN ASAKKEALLE RGLGTFKEAE IDNRVNEILE EWKRISTEPR KSATWDAFGN LPPKSVNDSF YVPLSSLLSW IKAEGDNLTD SHAGVVRQIC HERMCIPACL NLADPSHLWH IPLIFVYPVV MQVDASSKPQ TTASRPSKRS KPTCTTWRLK LGIYAHRLLP EATTTVLKTV MAALDEGSYR CTQGLSLPTH PNEPSFDPSP YPVVFVDDDD LQKTYSASSK TESSTFIDST REESCISAYT TKGLLKLLEN RGCDISNWDE IAPTLGALQL DLMLHQQHAI CWMHEMEHLP GFGINSIFWE EREFGDRGKY YYAPALGQLR LHPPPTMKGG LLCDEMGLGK TIEIIGLVLS TLDELKSETQ NALDMDKTHA TLIVVPPVLV EQWRNEIVKC AGPSLLVDVM EIQNNEVYWR GDVHNPDIVL VSYNVMQKMT SVKHLAKRWG RIVLDEMQEV RSSTTKIARM CEKLKSDRRW MISGTPLFEG IEDLKGELNF LHLEPFSANS EDGFFNFAVT NPWESHQKQV IATLGVLGMI MLRRSKSMTI HKTHQPILGL KPLTVEFIPV VQSLSERTLY CFLECITARE FRSTERTEIE SIATKEGRSR CLYLTLLRDI CNSAVLLNGG MGVPSQLGKL NAMLIAQHRR ATSKPLWDKS LNQGVTNLVM TCDEAIVHLA QVQEATRAGD GEVAFLQLGF GQGLVRRVRA FESVEAQLDA TRSALDTAIR SNAAAKRQKA KVLWHLALEK VTTGFLQDAD VVHIRTRVKK LWKRRYSVIS QASNVKQSIS HTSDSPFSDT ARPPYDEEPY EHAMFLRGWR LSNICLVDYF WAHPNSFRIE GIPPELSIDD VAHTLAVNFR SGDGSPAAMR DYCLAEHITD LYAQDSYRFC ILRVPSSSTT STWTAGLAFK SQKDAMLLSG QTKKVHGIQI ASKHIPPLMK RNIDASIADL EEARALQARH PTLENRKKLT LAKNEFEKAK RGLCICSKGQ YKMHAGTDCE NLVWSSARVA YRFDDLESEK LISSLSAVRR DCSATIGETD LKIYEYRNRM QKLRSIAERK YSENVQAMSS FDALQALAKG RFNDTQCIVC LGHLGSGGLC GDDAPCRLCN SDAKSADIRG AITMTKCGHL YCVKCFEQWS RTQSTVACIV CRKKIDKNSD FVTIDPTDRE SPSFIADRRS EARLIIKKAS DMISESNGEL DPELWEQLYH SIEIPQNVDS SRHCKVSVLP GDFMGHLRSC TGLPVLCHPS QTPVTLSAQS QCLSSKIKAL LRDLPSEERS VVFSSSAKTL HHLEVVFKGI GIGYQSLFSG QATSTAERAV SEWHATKANA IPCPVLLVQA GAAASGLTLT KASKMFIMEP FVRHEEEQQA YARCHRYGQT NTVHCKCYYT PVSVESRLLE WRKRAVNAGV EALPLQEGEA PQIIYAPLLD LEDEAEIETS EVSQMNFLLG LGTEQIR
|
| |