Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41581 |
Symbol | |
ID | 7199410 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 247658 |
End bp | 251176 |
Gene Length | 3519 bp |
Protein Length | 1079 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185503 |
Protein GI | 219130713 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGAG TTTGCAAGGC AACCGGTCCT ACCCGGAAGG GAGCGACCGA AACGGTGCCG GAGGAGCGAG TGGAAGAAGA AACGCCCTTT GAGGCCGTTG AGTCGCCGTC CAAGGACAGT GACAATGAGA CGCAACCATC GTCCATGGGC GATGACAATG ACTCACAGTC TGAGATCGAG TCGTACAAGA TTGATACCGA CATTGATTTC AAGTACAACC CAAACTTTTT TGAGGACAAG AAAGCCCTTG AAAGTGTTCT AAGGAATACT ATGGGATTTG GAGATATCCA TGTGAAGTCA CTCCAAAACG AAGGTTTGAA GACCGCAAAT GATTTCTTGC TTATTTCTAT GAGTGACATC AATGATCTTT GCGACAAGCT TTTGTTTGCA ACAGTTTACA GGGCTCGCCT ACGGGCATCT GCTACATGGT TACGTAGTCA ACCCGACAAC GTAAATATTA CCCAAGAATG GACAATTCCA GTTATGCAAT TGGAAATGCA GATGAAGGCG CAAGCGTCTC CATTTGGAAT CTCCGAGACC AACAAAACAG ACAAGTCAGT CTCCAGTCTG GTGCCTGATC CCTTTGATGG TACACAGAAG AAGTGGCTCG CCTTTCAATA CAGTTTTGAG GCATGGGCCG GAGCAAGTGG GCAATCTTTT GATGCCTGCA TCTCACATGA CTCGGAGCGA TATTCCCGTT CAGAACCAAC AGCGACCTAC AATGACATCA ATGACGAACC TGATTCATTT AAATATGACT GGAACGTTAA GTCAGTTCGC AATTCAAACA TCTTTTTTTA TGCTCAAGTC GCTCACAAGC GGCGGAGATG CATGGGGCCT TATCGAACCT TACGAGGTTT CAAAAAATGG CCGTCATGCC TGGATCGCCT TGTGTGCATT CTATGAAGGG GCCAGTCAGG TGGGCTTAAC CACAGAAGAA GCTCGCACTA CAATTCTGAC ATTGAAGTAT ACCGGACAAT CCCGGAACTT CACTTTTACC AAGTATGTTC AAAAGCATCA TACTGGTAAC AACATATTGG CTCGCAACAA AGAGGCCTAC ACGGACTCAC AGAAAACAAA CTTTTTCCTA CGGGGAATTG TTGATCCTGA ACTTATGGCA TTCAAGGCAG CTGCTGAAGC TAACCTAAAT GAATGGAAGT TCGAATGCGT TATCACGTAC ATGCGTACTC AAGCCGCCAA GCTCACGAGC AAGGACGGTA AGGATTCCCG AAACATTCGT CAGGCTACAG GCTTGTCGAA AAACAGGAAC AACAAAAACA ACCGGCGCAA GCGCTCGGAA TACCAAAGCC AAGGCAAAGG TAACAAAGAG TCGGGCAAAG GAAACAATGC TCCTAGTACT CAACTCCGCA AGGACATCTG GGATGAATTG TCTCCCGGGA TAAAGGATGC CATCAAAGCG GCAAAGCGTA GAGCGTCTAC GGACCCGCGC ACGGCTAAAA GAGCCAAGAC TAGTAGTATG GATAACTCAA ACGCAAGCGT TGAGTCCTAC TCGCCTGATT TCAGGTCAAT GTCTACTGAA AACTCATCAA CTTGAATCAG ATTCGGCACG CCGGACATCA GACTGATGAC ATTCCGAAGT TTTTATCCCA AGGGAAATCT CTTCACGGAA TTGAAACAAT TGATGGCAAC TACATTCTTT TTGAATTGAA GGGACGCACA TCATTGTTGT ACTCACGAGT ACCTACTCGC CATGAGCTTG AGAACTGCCT GCACATTGAT CTTACATCTG ATCAACCCTG GGATCCAAAC AGCAAAGACT GGGAGGATAA TGAGCAGCGC TACACGCGTC ATGACCGACA ACGGAATGCA CGCTATACCG CAACTGATAA TGCGGATGAG GAGAACTTTT ACCATGGGTA TTTCTCTCTC CCTGACTCTA AGGAGTTCCC GGTTCTACCG GCAAACAATA ATGTTATGAA CCCACATGAT TTCGTACGCG AGATCAAATA TGCTACTGCA CGGGTTTCAA AATCTAGCCC ACGGGATCTA GATGTCGATC GAGACAAACT TCGCCGCATC CTGGGACATG TTCCTATGGA AGTAGTTGAC CAAACACTGG AAGCTACAAC ACAACTTGCG GAACGCTCTG GCAAAATGCC ACTGCATCGA CGTTTTAAAA CGAAGTTTGA ACAATTGCGA TACCGCCGGT TGAAGTGTAC GTTATATAGC GACACTTTCA AAACTACTGT TAAATCCTCC CGAGGACACA CGCATACCCA AGGGTTTGTA TGTGGTGATT CTTACTTTGT ATACCACTTT CTTATGAAAG CGGAATCCGA AGCAGACCAA GGTCTTGCGT CAATTATACA AGATATAGGA ATTCCGGCAC AAATTCACAC CGACAACGCA AAAGTGGAAA CCTTAAGCAA ATGGAAGAAA ATCACTTCCG GTCACTGGAT AAAAGTCACA GTCACGGAAC CATACTCACC GTGGCAAAAC CGTTGCGAAC ACGAATTCGG TGCGGTTCGG ATCCAGACAC GACTTGTTAT GGAAACGACA CAATGTCCAG AACAGCTTTG GGACTACGCG ATTACCTACG TGGTAATTGT GCGTAATAAT ACCGCTCGCA AAGCCTTAAA TTGGCAAACG CCCTTAACGG TTATGACAGG TGACACGAGC GATATTTCAG AATTGTTGGA TTTCGAGTTC TACGAACCGG TACAATATTT TGACAATCCT GAAATTAAAT ACCAACAAGC TAAGGCTAAA GTTGGTCGGT GGCTTGGTAT TGCAACAAAT GTTGGACAAG CTATGTGCTA CTATGTCCTA ACAGACAAAG GAACCGTGAT AACGCGTTCC ACAGTCACAC CACTTCACAA AGTTGATTTG ACTGCTTTGC AAACCTCTCT TACAGCTTTT GATGCTATGA TAAGGGAGAT TTATCAGCCT ACTGATTTTG CTCACAGCAC TAAAAAGCAA GCTGCCTCGT TACGACGAGA TGAAGCAATG AAGGTTGCCA GAAAAACTGG TGAACCTGAA GATCCAGGAG TCCGTAACAG ACATGTTCTG TATGACTTAA ATGAGGGAGC CGACCATGAC CAAGTGGAAC CAGGACTATC AGTTGATGAT TACTACGGTA ACGACGACGA AAAAGAGTCT GGTTCGTCGG ATCTCCTTGT CGGCAGCGAA GTACTCCTTA CTAAGGGAGG TATACAACAT CTAGGCAAAG TCACCAAGCG TGATAAAAAT GGCCAGCCCA AGGGCTCAAA CGAAACAACC AATTATGTTG TCGAGTTCAA TGATGGTACT GAAGAGATTC ATGGATACAA TGCTCTGCTT GACGCTGTGT ATAAGCAAGT TGATGATGAT GGTAATGAAT GGTATACTTT TGAAGATATT GTTGACCATC AAAGGCGCCC ACGTGGCGGC CGAGGACGAA CGAAAGGTTG GTTCCTCCGT GTTAAATGGG CCAATGGTGA ATACACCTGG GAGCCTCTTA CCTCTTTAAA GGAAAGCAAT CCTTACCCAG TTGCAAAATA TGCAGCGTCA ATGAATTAA
|
Protein sequence | MARVCKATGP TRKGATETVP EERVEEETPF EAVESPSKDS DNETQPSSMG DDNDSQSEIE SYKIDTDIDF KYNPNFFEDK KALESVLRNT MGFGDIHVKS LQNEGLKTAN DFLLISMSDI NDLCDKLLFA TVYRARLRAS ATWLRSQPDN VNITQEWTIP VMQLEMQMKA QASPFGISET NKTDNQFAIQ TSFFMLKSLT SGGDAWGLIE PYEVSKNGRH AWIALCAFYE GASQVGLTTE EARTTILTLK YTGQSRNFTF TKYVQKHHTG NNILARNKEA YTDSQKTNFF LRGIVDPELM AFKAAAEANL NEWKFECVIT YMRTQAAKLT SKDGKDSRNI RQATGLSKNR NNKNNRRKRS EYQSQGKGNK ESGKGNNAPS TQLRKDIWDE LSPGIKDAIK AAKRRASTDP RTAKRAKTSS MDNSNASIRH AGHQTDDIPK FLSQGKSLHG IETIDGNYIL FELKGRTSLL YSRVPTRHEL ENCLHIDLTS DQPWDPNSKD WEDNEQRYTR HDRQRNARYT ATDNADEENF YHGYFSLPDS KEFPVLPANN NVMNPHDFVR EIKYATARVS KSSPRDLDVD RDKLRRILGH VPMEVVDQTL EATTQLAERS GKMPLHRRFK TKFEQLRYRR LKCTLYSDTF KTTVKSSRGH THTQGFVCGD SYFVYHFLMK AESEADQGLA SIIQDIGIPA QIHTDNAKVE TLSKWKKITS GHWIKVTVTE PYSPWQNRCE HEFGAVRIQT RLVMETTQCP EQLWDYAITY VVIVRNNTAR KALNWQTPLT VMTGDTSDIS ELLDFEFYEP VQYFDNPEIK YQQAKAKVGR WLGIATNVGQ AMCYYVLTDK GTVITRSTVT PLHKVDLTAL QTSLTAFDAM IREIYQPTDF AHSTKKQAAS LRRDEAMKVA RKTGEPEDPG VRNRHVLYDL NEGADHDQVE PGLSVDDYYG NDDEKESGSS DLLVGSEVLL TKGGIQHLGK VTKRDKNGQP KGSNETTNYV VEFNDGTEEI HGYNALLDAV YKQVDDDGNE WYTFEDIVDH QRRPRGGRGR TKGWFLRVKW ANGEYTWEPL TSLKESNPYP VAKYAASMN
|
| |