Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46460 |
Symbol | |
ID | 7201556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 401549 |
End bp | 405917 |
Gene Length | 4369 bp |
Protein Length | 1448 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180821 |
Protein GI | 219120153 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0697935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGATA TGAACAGAAA TTTCGAAGAA AGCGTGGGTA CTTTTTACGA TCCAGGCTCC ATGTATGCCG CGGCTCCAAT CGACTTTACA TTATACTCCG GTTCCCAAGG GGTGGATCTG GGATACCGGC ACTACCAGTC TTCTGCTTGT GATTCGAATT TTGGGGTAGA ACCATTACCG TATGATGAAT GGAACGGAAG CGACGCAGCA GCTGACACCG AAGCTGGCTG TTATTCTCCT CTGGGAGCGT TTAAACCGCA GATATACGGA GCCCCCGTGA GCGCGATTGC ATATGACGGC ACGTTCGAAG CAATGTACGT TGCCTCCGTA ACTCAGTCGA TGTCTAGTCG TTTCAACGGC CATCGAGCGT CGGTTCTAGC AATGCATCAA ACGACAGATG GCTCTGTCTA TTCGTCAGTC GCTGGGCATC CCGAAGGACC AGGCCACATC TTGAACAATA TTTATGATTC GGTATACGGA ATGACGCCAT CAATGTCGAC GGCGGGCTTG CCAGGATACC GCCATGTTCC AAAGCATGCG TTCAAGCCGC CATATGGGAC AAACGACCCT ATGTTACCAG GTGTTGGAAG CCGAGGGAAC CATCACATCG GTATCAATTC TCTCGTACCC ACAATGAATG GATATGTGGC GTCTGTGTCT CCTGCAGGTG TTCGTGTACA TTCCCATGGC GGTTTGCAAG TAGCCGATTA TCATGTGGAA GGAATGATTT GCGGCACTCA ACATCCTAGC CACGATCCCA GTCTCGTTAC ACATATGACA GTTGGAGGGC TGGCGATTCC TCGTGAGAAA GGTAGCCATA CAAGCAATTA TCAGATGCTC TGTCTCGATT TATGGCAAGG CTTGCGTGTC GTTTCATCTT ATACCATCGC CCGGAATACG AAAAACGTGA GTTGTACTGC TATTGCTTGT AACAACAGCA AAGGATCGAT CTTGGCAGGT TGTTCAGACG GACACGTTCG TATGCTCGAC GGACAACTAC GAGAGGAGGC CCGTCTGAAA AGCCACTTTG GAGGAGTGGC AGACATTACG GTGTCGTCTG ATGGAAACCT TCTGGCGACA GTTGGTTTTG GGTCTCGTGT CGCGAAGAGC ACGAAAGGAC AGCTTTACGG TTTCCCGGAT CCAGCAGTTT TCATCTACGA TATCCGATAT TTGGGGCGAG GAGGAATCCC GCACCCATTT GCAGGCCTGA ATGGAGGTCC TAGGTATGTC ACGTTTCTGC CAGAAGTGCA AGGCCTTCCG TCGAATCGAT TGTTGGTTGC AAGCGGGCAA GTCGGTGGTG GTTTGCAGAT CATTGTTCCG TTTGAAGAGC CAAGCAACGC CGGTAGCAGT TTCATCCTAC CACATTTGAA TATGAACGAA TCAATCTCGT CTATGACTGT GGCAGACGAT AAATTAGCGT TAGGCACTTC CCAAGGAAGT ATTTTGCAGT ATCGTATGGC AGGTTTTCAT CATTCTATTG ACTCCCATGG TTTAAGAAGG GGTGCATTCG TGCCTTCCAC TTTACACGCG AAAAGTATAA AGGAGCCGAA TGACCTATTG CCTGAAGAAA CGAGGAAAAC GAAGCAAGTC CTGAGTCTAC CAAGTCTTGT GCCACCGCCG CCCGCAATTT CAGTGGATAC ATCTGTCTTA CTGAAAGGTA GTCCAAACGC TCGATCCGGT GAGTCAGAAC AGCTGAAAGC TTTATTTAGC ATTTACACAA TGGTTGCCGA TCCGATTTTG TCTATGATTG GTGATTCCCG GGATGAAGCC GCAACTTCGT TTGGCCGACT GGCTTCGACC GCGACGATAG AGCAAAGCAA GTTGAAAGTT TCTTCAATTT TATTGGATAG AGCAAGCCAC AACGTGGACT TCCTCGAGAC TATCCCTACG TCTGATTTAA AGCTGAATCT TTTTCACGAT CATCGCTCCG ACAACGTCAA AATTTCCCCG ACAAGGAAAG CACTATTGTC CAATCCAAAC AAATTTTTAT ATTCGAGCGT CCTCTTTACC ACCGCCTACG AGGAGAGTTT GAACCGGAGT AAGATTACAG GGCGCAAACA TCGCAAAGAA GAAAGTAGCA GGGACGAGAA AGAAAGTCTG ATGCGAATTC CTCCCCGATA TCAGTTGACG CTTCGCTCGT CTGGCAAGTT TGCGGCGTCA TTCAATTATG CGGAATTCAA CCAGTCCGGA ATTCTCCCTG GCTGGGACTA TCCGCCAACT ATGCCTAATG CATTCGTCCC GCCAGTATTG ATGGTTTTAT ATTTTATCCC AGAAGTGCGC TCATCGGTGG CTCAGTTGCG ATTGGAAAAG TTCACGTCTA AGCAACTGAG TCTATCCCCT GAGCTTGCCT TCTTGTTTCA TCGCATCGAA CGTATTTCAG CCTTTGCTAT GATATATCCC ACGTATTCCG ATACGACTTT ACCAACACGC CTCGGAATTT GGGCACCACT GAACTTCTTA TCGATCCTCC AGGAAATGCC GGAAGCTGAA CAACTGCAGA TTCTTGACGG GTCACCTGCT GCTGTCGACA CTTTTCGTCG TCCTGAAGCC TTCTATCGCT TCTTGCTCTA TCAGCTTGAT AAGGAAATGT GCCGGAAAAC AGAGCCCAAG CTTCTTGATC CTTTAGTAGG AATCAGCTTT ACGAGTGTCA ATGAGTTTTT ATCAGGAACC GTTTCTCCCA CGGCTTCATC GACTCGGTCA CTGACGGTTG ACCTATGTTA TGATCCTTTT GTGGGTATGA ACAAGAACGG AACTCGAAGT AGCGGGCCTG TCCGGTTCTC TACCGTCCTG CAGCATACTC TTTACCGCGA AACGAGACTG CGTGCGTGGA ACCAAATATC TAAATCTTAC GAAACTATTG TGCAAAGAAA GATGGCAACG TCGCTTCCGG CCATTCTATC GCTTTCTTCC GCATGTGCAG GGAGGGACGA GACCGAAGGC TTAGCTCTTT GGCGTGGCTC CAATAGCAGC AACGGGCACT GGCTCCCCGA AATGATCGAA ATTGAGCTTG AAGAGAGCGG CAACGTTGTC GTCTGGGAAC TCCTTGAAGA CAAAAGCACG GGAAAAGAAG AGTGGATCGA ATGCTCAGGT AGAAGTTCGA GCACACCTGC GCGGGCAAGT ACCATCTTGG AGCGGAGAAA GACTCGAGGC ACAAAGCGTC GCTATCGATT AGATGCAGTT CTCGCAATTG TTCGCGATGA TTTGGATAGT AGCTGCTCTG ATGAAGTTGC GGGCTTGATA AGTAATGGTC AATATGGCCA TCACGTCGTA TTCGCTAGGC TTGGAAAGGA TCACCACAGA AGTCTCGCCT CACAGCAGCT TGGAACACTT CAACAAATGG ATCACGATTG TGAAAAGAAC GACATTTTCA GAGACACACT CGCCGCGTCG AGGTTCGACA AAAGAATTTA CGAAAAGCGA GTAGAGCTTG CTGAAAAGAT GGTAGAAACC CTGTCCATTG ACTCATCATC TGAAGATGTT TGGCTTTTGT TTAACGGGTA TGTTGTATCA AAGACTAACA TCAAAGATGC CAAGGCTTTT GATGTCAAGT ATAAGGAGCC ATCTCTTATT GTATTTCGCG CCTTAGATTC CCCAATTTCA TCAATAAATG TCACCGACAT TTCCTTTGTG TCGCCTGATG TCCTTAGAAC ACATTCTATA ACAAACGGAT CGCTTTCAAG GCATTCGCTT TCCCAAAATG TGGACATGTT GCCTGGAAAA GGAGAATTGA TAGCGTTCGA TGCCGAGTTT GTGTCCGTTC AGGACGAGGA CTACTCGTTG ACAGAAAGCG GCTCCAAAGT CACTCTTCGT GAAACTCGGC ATGCTCTTGC CAGAATCTCC GCTCTGTGTC GGGATGAAGC TGTGCTTTTC GATGATTATG TGCTTCCCAT AGAACCGGTG GTGGATTACC TGTCGCGCTT CAGTGGCATT GTGGAAAACG ATCTGCATCC GAAGCATAGC CCACATCATT TGATAGGAAC GCGGGCGGCC TACCTGAAAT TGCGATGTCT GGTTGAGCGA GGATGCATAT TCGTGGGCCA CGGTTTGAGC GGGGATTTTT GGACAGCCAA TCTTGCTGTG CCTGAGAATC AAATCATTGA CACAGTCGAG ATATATCACA AGCCGGCGCA CCGTTACGTC TCTTTGAGAT TCTTGACCAA TTTCGTTCTG AAACGCGACA TGCAACAAGA TGTGCATGAC TCACTAGAAG ATGCGAGAGC AGCAATGGAG CTATACCAAA CGTCAATTGC GTTGAAGCAA CAGGGAAAAT TTGATATGCT GCTCGACGAT TTGTACGAGT ACGGTCAAAA GGCCGACTGG AAGCTTGGAA TAGAGCGCGG CAGGTAAAAA TGACATGTAT GGTTCAACC
|
Protein sequence | MNDMNRNFEE SVGTFYDPGS MYAAAPIDFT LYSGSQGVDL GYRHYQSSAC DSNFGVEPLP YDEWNGSDAA ADTEAGCYSP LGAFKPQIYG APVSAIAYDG TFEAMYVASV TQSMSSRFNG HRASVLAMHQ TTDGSVYSSV AGHPEGPGHI LNNIYDSVYG MTPSMSTAGL PGYRHVPKHA FKPPYGTNDP MLPGVGSRGN HHIGINSLVP TMNGYVASVS PAGVRVHSHG GLQVADYHVE GMICGTQHPS HDPSLVTHMT VGGLAIPREK GSHTSNYQML CLDLWQGLRV VSSYTIARNT KNVSCTAIAC NNSKGSILAG CSDGHVRMLD GQLREEARLK SHFGGVADIT VSSDGNLLAT VGFGSRVAKS TKGQLYGFPD PAVFIYDIRY LGRGGIPHPF AGLNGGPRYV TFLPEVQGLP SNRLLVASGQ VGGGLQIIVP FEEPSNAGSS FILPHLNMNE SISSMTVADD KLALGTSQGS ILQYRMAGFH HSIDSHGLRR GAFVPSTLHA KSIKEPNDLL PEETRKTKQV LSLPSLVPPP PAISVDTSVL LKGSPNARSG ESEQLKALFS IYTMVADPIL SMIGDSRDEA ATSFGRLAST ATIEQSKLKV SSILLDRASH NVDFLETIPT SDLKLNLFHD HRSDNVKISP TRKALLSNPN KFLYSSVLFT TAYEESLNRS KITGRKHRKE ESSRDEKESL MRIPPRYQLT LRSSGKFAAS FNYAEFNQSG ILPGWDYPPT MPNAFVPPVL MVLYFIPEVR SSVAQLRLEK FTSKQLSLSP ELAFLFHRIE RISAFAMIYP TYSDTTLPTR LGIWAPLNFL SILQEMPEAE QLQILDGSPA AVDTFRRPEA FYRFLLYQLD KEMCRKTEPK LLDPLVGISF TSVNEFLSGT VSPTASSTRS LTVDLCYDPF VGMNKNGTRS SGPVRFSTVL QHTLYRETRL RAWNQISKSY ETIVQRKMAT SLPAILSLSS ACAGRDETEG LALWRGSNSS NGHWLPEMIE IELEESGNVV VWELLEDKST GKEEWIECSG RSSSTPARAS TILERRKTRG TKRRYRLDAV LAIVRDDLDS SCSDEVAGLI SNGQYGHHVV FARLGKDHHR SLASQQLGTL QQMDHDCEKN DIFRDTLAAS RFDKRIYEKR VELAEKMVET LSIDSSSEDV WLLFNGYVVS KTNIKDAKAF DVKYKEPSLI VFRALDSPIS SINVTDISFV SPDVLRTHSI TNGSLSRHSL SQNVDMLPGK GELIAFDAEF VSVQDEDYSL TESGSKVTLR ETRHALARIS ALCRDEAVLF DDYVLPIEPV VDYLSRFSGI VENDLHPKHS PHHLIGTRAA YLKLRCLVER GCIFVGHGLS GDFWTANLAV PENQIIDTVE IYHKPAHRYV SLRFLTNFVL KRDMQQDVHD SLEDARAAME LYQTSIALKQ QGKFDMLLDD LYEYGQKADW KLGIERGR
|
| |