Gene Information |
Locus tag | PHATRDRAFT_47693 |
Symbol | |
ID | 7202701 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 537867 |
End bp | 541959 |
Gene Length | 4093 bp |
Protein Length | 1297 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181930 |
Protein GI | 219123227 |
COG category | |
COG ID | |
TIGRFAM ID | |
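The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported gene length. It also assumes the protein is encoded by one codon per residue plus a single stop codon. The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 537867, 541959
protein_aa = 1297

length_bp = gene_length(start_bp, end_bp)

# Assumption: one codon per amino acid plus one stop codon.
coding_bp = (protein_aa + 1) * 3

# Bases in the gene span that are not accounted for by coding sequence.
non_coding_bp = length_bp - coding_bp

print(length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span matches the reported 4093 bp, and the bases left over after the coding sequence are presumably non-coding, which would fit the record marking the gene as spliced.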
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | AAAAACCTAC CGATTAAAAC AATCGGAGCC TCTACGAGGC TTTCCCCCCA ACTTCTACTC TTTCGTCGTA TTCTACAAAT ACACTCGTAA GCAGGTAGAA CTTATCAGTT TACTCAGACC CAACCCAGCT CTAATTTTTT GAGTTTTTGA AACCTCGACA ACGAATGCGT CGTCCGAGGA AGGTCACTCC CGCTGTTCCG GCCCCTGCCG CAGCGACGGA CTCACCGGCT GATGCCGCGT CCGCATCCAA AGAGGATGAG GAGTTCGGAG GATTCGACTC CTCCGACGGT GAGGAGCCTT CGGGCACCGC ACCGCCATCG CCAGCATCTT CGGATGATGA AGGTGATGGC AAAAAGACTG CAAAGCCTTT GGCTCGCAGC AAGAACACGT CTGACGAAGT CAGTGTGATC GAGAAAAGCG TCATCGACGC AGAGCCTCAC TTGTCTAAAG ACAGTGACGG TCTCGACTCC GTTCCTCGGC AAGACCGTGT TGAACGAAAG GCCTTGATGG TCGTCCTCCG TGACGTCATC TGTGTTCCAT TGTCAGTTGC GGCTGCAATG TTGAACAACG GCATTAAATC ATCTGATGAT TTCCGTCTTC TCACGAAGGA GGACATCAAT GATCTCTGCA TGCGGCTCAA AATGGGCTCC ATGCATACCA AGCGAATACT CGTCTTCGCC AAATGGATGC ATCACGCACC CAACTCAGTC GATGTCGCCA AAGAGTTCAC GGCTTCCGTG CTACGCTTTG AGATGATGAC TAGAGCCGCG GCGTCGTATG ATAATGTGAC TACGACGGCT GCAAAGGCTG AAAAATCGGC TACTAGCCTC TTGCCTGAAC CGTTTGATGG TTCGCAGAAA AAGTGGCTCA CTTTTCGTTA CGGTTTCGAA GCGTGGGCAG GCGCAAGTGG GTCCACTTTT ACCGCGTGCA TCGCGCACCA TTCGGATCGG TATTCGAAAG CCGACCCAAC CGGACCCCAT ACGTCGCCCC GTGACGTTTC AGATTTGTTT GCACTCTCCC CAGTTGTCAA CATCACCAGG AACGCAACAA TCTTCTATAC TCTCATGTCG CTAACCAGCG CTGGGGACGC CTGGGGACTT GTTGAGCCCC ACGAGCACAC TAAGGACGGA CGCAGTGCCT GGATTTCTCT ATGTGCCTTC TATGAAGGAA CGAGCCAAGT GGGTCTCACT ACCGAGCAGG CTCGCGCGAC AGTTATGGAG TCGGTGTATA CAGGACTGTC CAAACAGTTT TCCTTCACCA AATATGTCGC TCGGCATATT TCTGCCAACA ATGCCCTTTT GCGTAACAAG GAGGGCTATT CGGACGCTCA GAAAACGAAT TTCTTTCTTA AAGGGATTAC TGATCCGGCA CTCCTTCCTT ATAAGGCAAC TGCCGAAGCG CGACTCGATG ACTGGAATTT TAATCGGGTC GTCAACTACA TGCGTACGTC CGCGACGAAA CTCAGTTCCA AGGACAGAAG CGACTCACGG AACGTACGTC AGACAAAGAC CACTGGCAGA GCCACCGGCA ACCAACGTGG TAACGACAAC AAACGGCGTG GCTCGTCCAA CCGTCCGTCG AACAAGGGGG CTGAGAAACC TTCCCGCCCT CATAAACACG TCTTACCTCC TGAGCTGTGG GAAGCCCTAA CCCCAGCTAT CAGGGAGAGT ATCTTGAGCG CAAAACGCAG TATTGCACCC CCTGGCCGTG AGGCCAAAAG GGCTAAATCC TCAGATACAG ATAACTCTAG TTCAACCGTT GAATCTTATT CACAACTGCC TAGTAGTAAA AAACCTATTC 
GTAAACATAC ATGCGAAGAT CACGTCCAAG TAGATTCCAG TACCCCTGAA ACCCTACTTC GTGACGCACC CACAGACATT TCACCCCACG TCACCACCAA AAAAGTGACA TTTGGTGCAG GTGTCCTCTT TGGTCGGTAC GCTAATCGCG TATCGTTGAA TCGTATGGTC CGCTCCGGCA GTCATTTCGA TCAAGCCCCT TGGCGCAAGT CGGATTTCCG ACTTAACGAT GCGACACTAG TTCGTATTCG TCAGAACCGC TCACGCGGAA CAAAAACTCC CACCAATTAT GGTGAAGCGG TAATTGACAC TGGTGCAGAC ACCGTCTGCG TCGGTGCCGG GTACTCTGTA TTGTCATACA CGGGTCGATC AGTCAGCCTT CGCGGTTTTC ATGATGACGG TGAAACGTTT GAACGGATTC CGGTTGTCAC GGCGGCAACC GCCTATGATT ATGACGACGG AACAACCGTG ATTCTCATCT TTCATGAGGC ACTGAACCTC GGGCCCACAC AGACCACCTC GCTCATTAAT TTGAATCAAA TCCGACATGC CGGACATCAA ACCGATGACA TTCCAAAATT TTTGTCGCAA GGCAAATCCC TTCACGGCAT CGAAACTCTC GACGGTGATT ATATCCCGTT TGAGCTCAAA GGTCGTGCAT CCCTGTTGTA TTCTCGCGTA CCTACTCAAC ATGAGCTTGA CAACTGTCAG CACATTGATC TCACTTGCGA TCAACCATGG GACCCCAACA GTAAAGATTG GGAAGAAAAT GAAGCAAAGT ACACGCGACA CGATCGTTCT CGTCGTGCCT GCTACACCGA CAGCGTACCG GTTGACATTC TCCCGGATTG GCCTCCACTA CCCGTTTCCC CTGGATCCGT TGTACCGGAT TTCCATAACC GTGTCATGAA CCCTCGCGAC ATCGTTCGCG AAATCAAATA CGCCACTATC GGTGCGTCCA TATCCAGCCC TCGGGTGTTG GACGTCGACC GCGATAAATT ACGGCGAATT CTCGGACATG TGCCGATGGA AGTAGTTGAG CGTACTCTTA CCGCCACTAC ACAACTAGCG GAGCGCACGG GCGAAATGCC TTTGCATCGT CGTTATAAAA CCAAGTTTGA ACAACTTCGG TATCGACGTT TGAAATGCAC ACTTTATAGT GATACTTTTA AATCCTCTAT AAAATCCTCG CGTGGACATA CCCATACTCA GGGTTTTGTC TGTGGTGACT CTTACTTTAT CTATCATTAC CTAATGAAAG CAGAGTCCGC GGCAGACCAG GGTCTCGCCG AATTCATCCA CAACATCGGT ATTCCTGCAC AATTGCACAC CGATAACGCG AAAGTGGAAA CACTTAGCAA ATGGAAAAAA TTAACTTCCA GTCACTGGAT AAAGACGACG GTCACTGAAC CCTACTCTCC GTGGCAAAAT CGTTGCGAAC ACGAATTTGG TGCTGCGCGC ATTCACACGC GCCTCGTTCT CGAAACCACC AAGTGTCCCG AACAATTATG GGACTACGCC CTTGCCTACG TCATTTTCGT ACGTAACCAC ACGGCACGAA AAGCGCTGGC CTGGATCACG CCTATTACTG CGATGACTGG CGACACCCAT GATATTTCTG AAATCCTGGT TTTCGAATTT TTCGAACCAG TTCAGTATTT TGACAATCCT GATGTCAAAT TTCCACAGAA TAAAGCCAAA GTCGGCCGTT GGTTAGGTAT TGCCACCAAT GTGGGCCAAG CCTTGTGCTA CCATATTTTG ACGGACAAGG GTACTGTAAT CACTCGATCT ACTGTTACAC CTCTCCAAAA 
CCTCGATTCG TCTGCTCTGC AGACTGCCCT CGCTACTTTT GACGCCACCA TAAGGGAGAT TTATCAGCCT TCTGATTTCG CCCTCGGTAA CAAAATCAAA GCGCCGGCTT TCCGCCGTGA CGAAGCGATG AAAGTCGCTC GGCGATCCGA CGATCCCGGC GATGGCAACA CCCGTAACAG ACACGTGTTA TACGATCTGA ACGAAGGGGA TGACCATATT CAACTGGATC CCGGGCTCAC GGTTGACGAT TTCTTCGAGA ACGACTCACC GGATCAGGAC CCCACCTCCT TAATTATTGG TACTGACGTT CTACTCACTT CGGGTGCGGT TCAGCGCCAG GGCCGAGTTA CCAAGCGCGA TCGCGACGGT ACTCCAGTCC CTAACGACGA CCCTGGAAAT TTCGTCGTCG AATTCGACGA CGGTACCGAG GAAGTCCACG GTTACCAAGC TCTCCTTGAT GCTGTTTATA AGCAGGTCGA TGA
|
Protein sequence | MRRPRKVTPA VPAPAAATDS PADAASASKE DEEFGGFDSS DGEEPSGTAP PSPASSDDEG DGKKTAKPLA RSKNTSDEVS VIEKSVIDAE PHLSKDSDGL DSVPRQDRVE RKALMVVLRD VICVPLSVAA AMLNNGIKSS DDFRLLTKED INDLCMRLKM GSMHTKRILV FAKWMHHAPN SVDVAKEFTA SVLRFEMMTR AAASYDNVTT TAAKAEKSAT SLLPEPFDGS QKKWLTFRYG FEAWAGASGS TFTACIAHHS DRYSKADPTG PHTSPRDVSD LFALSPVVNI TRNATIFYTL MSLTSAGDAW GLVEPHEHTK DGRSAWISLC AFYEGTSQVG LTTEQARATV MESVYTGLSK QFSFTKYVAR HISANNALLR NKEGYSDAQK TNFFLKGITD PALLPYKATA EARLDDWNFN RVVNYMRTSA TKLSSKDRSD SRNVRQTKTT GRATGNQRGN DNKRRGSSNR PSNKGAEKPS RPHKHVLPPE LWEALTPAIR ESILSAKRSI APPGREAKRA KSSDTDNSSS TVESYSQLPS SKKPIRKHTC EDHVQVDSST PETLLRDAPT DISPHVTTKK VTFGAGVLFG RYANRVSLNR MVRSGSHFDQ APWRKSDFRL NDATLVRIRQ NRSRGTKTPT NYGEAVIDTG ADTVCVGAGY SVLSYTGRSV SLRGFHDDGE TFERIPVVTA ATAYDYDDGT TVILIFHEAL NLGPTQTTSL INLNQIRHAG HQTDDIPKFL SQGKSLHGIE TLDGDYIPFE LKGRASLLYS RVPTQHELDN CQHIDLTCDQ PWDPNSKDWE ENEAKYTRHD RSRRACYTDS VPVDILPDWP PLPVSPGSVV PDFHNRVMNP RDIVREIKYA TIGASISSPR VLDVDRDKLR RILGHVPMEV VERTLTATTQ LAERTGEMPL HRRYKTKFEQ LRYRRLKCTL YSDTFKSSIK SSRGHTHTQG FVCGDSYFIY HYLMKAESAA DQGLAEFIHN IGIPAQLHTD NAKVETLSKW KKLTSSHWIK TTVTEPYSPW QNRCEHEFGA ARIHTRLVLE TTKCPEQLWD YALAYVIFVR NHTARKALAW ITPITAMTGD THDISEILVF EFFEPVQYFD NPDVKFPQNK AKVGRWLGIA TNVGQALCYH ILTDKGTVIT RSTVTPLQNL DSSALQTALA TFDATIREIY QPSDFALGNK IKAPAFRRDE AMKVARRSDD PGDGNTRNRH VLYDLNEGDD HIQLDPGLTV DDFFENDSPD QDPTSLIIGT DVLLTSGAVQ RQGRVTKRDR DGTPVPNDDP GNFVVEFDDG TEEVHGR
|