Gene Information |
Locus tag | PHATRDRAFT_48253 |
Symbol | |
ID | 7203388 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 646016 |
End bp | 651252 |
Gene Length | 5237 bp |
Protein Length | 1461 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182729 |
Protein GI | 219124895 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage Information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0799297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATGG AAACTCCCGC TTGTCTCGCC ATGATCCCGA CCTATCTCGC TCTAAACCCC TGTGACACGG CTCGCATTAC GGCATGGGAT GAATACCAGA ATGGCAACGC AGCAGCGAGA GAATTGGGAC TGCAAAAGCT CGATTACTAC GCGTATTCCG TTTGTGAACA GAGTACGTAT TCATGTGTGT GTGTGTGTGT GTGTGTGTGT TTAGTCACTG ACGTACCAAA TCACAAAACT CCTCCGAATG ATCTCACACG TACCAATATT CCTATATCTC TCCACATCGT ACCAACCAAT ATCTACTAAC TGTTAGGTTG CGATTGTATT CCGCAAGTGA ACGCTGTCCC TGACCAACGC ATTGTGCACA CGGGACGGGG AAACTGTCCA GCACACACCT TCTACGACAT TTGCAGAGTA CTCCCCAAAG TCAAGCTCAT CATTGGAGAA GGCACCACGA ACGATGATGT GACCTCGTAT CCCGAAGCCT GCGTCGGAGT GGGCAATTGG TTCCGTAGTC CGGCTTCCAA CAATTGGGGC CAAAACCCTA ACACGCCTTT GGAGCCAGCC ATTACATACT TCTTGGATCG CCTGGTGGAA TCGACGGAAC TCACCAAACC AGCTTCGACG CTTTGGGATG AGTGCTTTCA TCTGGAAACC AGTCAACGTC GCATTACCCA AGCCGAACCG GATTGGAATA CAATCGCCAC CAATGGAAAT CCCATTTCCC GGCACGAAGC CTGCTTTGTT TCCGTTGGTC AAAAAGGATA TCTACTGGGA GGGCGAAGTG CCGTGGGCGT GAGCGTATAC GACTTTCGGA CCAGATCGTG GTCGAATTTA CCCCGGACGT TGGACGTTGT GTTGCATCAT GCACAGTGTG TTGTATTGGA CGGAAAAATA TGGATGGTGG GGGCCTGGAC GGGGAGATTT CCGAGGGAAC CCAACGCGGA TAAGTTCTAC GTCTACGACC CCGCAACCAA CTCGTGGGAT GACACTCGTG CAATCATGTC GGACCAACGT AATCGGGGTA GTGCCGCTGT TGCCCTGTAC AATGGCGAGA TTTACGTGTC GCACGGAAAC CGAGGTACGC GTGTACAAAG AAGATTGGCG ACCGAATGAT GCTGCGTGTG GCCACGTACA CGTTGCGCGA GGACAGTCAC TTCGTGACTG ACGACCATAC GCTAATGTCT TCTCACCTTC AAAACACACT ATTTTCTCCT TTATGCAGGT GGTCACAACA CCAAAGCTCT CGTACTCGGT TTATTTGACG TGTACAATCC GGCGACCAAC ACGTGGAGGG CACTACCGGA CGCGCCTAAC CCACGCGATC ACACCGGCGG TGGTATCTTG ACGGACTCCT CCGGTGATAC ACTCTTTTGC GTGTCTGGTG GACGAGATTC TAGCCATCCT GTTTTCTTTA ACGAGTTCGT CCTACCGACC GACTGCTACA ATTTTCAATC GGAAGAATGG GAGGTACGCG CAAACATTCC GACACCCCGT GCAGGTTCAG CGTACGGCAT TACATGTGAC GGGCGGCTCA TGGTAGCAGG TGGGGAAGGG AATGATGTCG CGTGGGATAC AGTGGAGGTC TTCAACGGAG AAGTATGGGA AACATTTCCC AGTCTGAAAA CAGCAAGGCA CGGTACTGGG CTCGCTATGG ACTGTGGATG TTCCCAAATT CACATAGCCA GTGGCTCAGG TAAACAAGGT GCTTTCGATT TGGCCTCGAC GGAAACGTTT TTTCTCCAAG GTGTTGACGA ATCATGCAGT TCCTCACCAA TTCCGGTACC TACCAATTCC 
CCTCCACCTC CGACGCCGGC AGTACCAACG ATGCCGCCGA CAATATCGCA GGTTGTCAAC AACGAAACGA GTACAGTGCC CTCACCTATA GCGATAGCTC CAACCGCTGC CCCTCTTACT CAACCCACCA GTAACCCCGT CGAAGTATCG GATTTTTCAC TTTTTATCAA TGCGGGTCAG AACAATCAGG ACGTTACTGA CGGCAGCGGA AGTACCTGGG TTTCGGATCG TTACGCCAAC CGAGGAAACC TTTTTAAAAC GAAACTCAAC AATCCGATCC TCAGCATGCC CAACCCTGGC ATGGAAGAAG TGTACCGGAG CTCGCGCTGG TTTAGTGGTA CAGATTTTAC TTACACTATT CCGAGTCTCG AACCAGCTCT GTACCAAGTA CGTCTCCACT TTGCGGAAGT CTACCAGCCT GCACAATCCG TTGGATCCCG TCGATTCAAC GTCGAGATTC AAGGCAGTCC CGTGCTGACG GACTTTGATA TTTTTCTAGA GGCTGGTAGT TTTACAGCTC TGGTGCGAGA ATTTACTGTT TTGGTAGATT CGGGTCAGCT TGCAATCACC TTTGTCAAGG GACTTGCTGA GAACCCCATG ATTACAGCAA TTGAAATACG AGTAGCTCCG ACAACGCCAA ATCCAACAGA TGTTCCGTCC TGGTTTCCAA GCAATGCACC GAGTGAGACT CCCAGTATCT CATTCGAGCC AACGTCCCTG TCTAGCGCTC CAACCAGCGT CCCCTCGCTC GCTCCTAGTA TTCGGGGCGC ACCAATTTCG GTGTCGTCGA GTGCGTCACC GGTTGTGCCA TCGCAATTCC CCATTGAGTC CGGACCTGAG CCCTACCAAC TGTTTATAAA CGTTGGTGGA ACGGTTGACA CACTGGATGG ACTAGGAAGA CAATGGGTTC GAGACCAGTA CGGCACTCCA GCGCAATCTT TCACTATGCC AAATAGCGCG ATTTCAAATA TGGCAGCCTC GGGAATGGAG ACAATCTATA GACAACAGAA ATGGTTAAGT GGTGCGTTCA CATACACGAT CCCCGATCTA ATCAGCGGCT CATACTTAGT CAAGCTTCAC TTTGTGGAGG GCTTTTCTGG GGCTTTCAAA GTTGGTGCCC GCAAGTTCAA CGTCATTTTA CAAGGTACAA CAGTTTTGGC TGATTTTGAC GTCTTCAAAG AAGCAGGTGG AAGCTATGCT GCAGTGAGTC GACAGTTTGT TGCTGTCGTC AATGCCGGCA CAATTGCAAT CACTTTCGCC AAGGGGGGCA GCAATAACCC CACCGTCTCT GCAATTGAGA TTCAGTCACT ATCCACGACG CAGGCCCCCA TGGGCCTACC GACATCTATC CCATCTGGAG CGCCAACGCG TATTGATTCC GCTGTACCTA CAGCTGATAA CTCGAATAGG CCGAGCCAGC AACCTTCGAA ATCTACGTCC GTGTCGCCAT CGGTCAATCC ATCGATCTCA TCTGGGCCAA GTACCAACAG CGTCTCTCCT TCAGCTGTCG GATCAATATT GACACCGAAT ACTGTGTCCG CGGCACCAAC TACGACACCG AGCATCCCAA GGTTTCCTTC ACTGCAACCA AGCAATATCC CGATTTCTTC ACCCGAGTTT CAGATGCGAG TAAGTGTTGG CAGCAATATT GATGTTGTTG ACGGAAATGG AAATACATGG TTAGCGGACT CCCCCTATTG TACCGGCAAG GTGTATCGTG TTGGCGATGA AATTGCTGGC ACGCAACTCG GTCAACAAGA AGTTTACCGA AGCCAACGCT GGGAAAATGG TGCCATGTCC TGCGTCGTGC 
CGAATCTGAG CGCCGGAGCC TATGAAGTAC GACTACATTT CTCTGAGAAC TATGGCCCAA ACATGAACGC AGGCCGCCGT TTGTTTGATG TCTCGCTACA GGGACGGCTC GTACTGGGCA ATTTTGATAT TTTTACTGAG GCCGGGCTTG GGCTAACCGC TGTCATTCGT CAGTTTGACG TATCGGTTGA TCAAACTGGG ACTTTGACGG CCATATTTAG CAAAGGCATG ATTGAGAATC CGACGGTGTC TGCGATTGAA ATCCTTTCAA TTCGTGAGCC GGCCATCCAG ATGCCAAGTC TTTCGCCAAT TACCGCCGAT TCTTTCCCAA GCCAGGCTCC TTCTGAAGAA TTTTCTGCTT TGCCCTCTTC CCGTCCTTCT CTTTCCCCAA GTGCGGCTAT GAGTTTGTCC CCGACTGGAA GACCGTCGAA TGTCCCGGGT AACGGGGATT CTTCGATTGT TGATTCATCA GAGATTCCCA GTGAATCCCC CCCTACGAGT GCATCGGCAT CTACGACCTC TCCCTTGTCA AACAGCTCCA GCGCTTTACC ATCTGAAACA GAATCTCATT TTCCAAGCGT TGCACCTTCC AAAAGCCGTG GCGCATCTCC AGCAGTGAGC GAATCTTACA TTCCAAGTGT GGTACCTTCC GAAAACCCCA GCGTATCTCC AATAGTGAGG GAATCTTACA TTCCAAGTGT TGGACCTTCC GAGAGCCCTA ACGGATCGCC ATTTCGAATG GAATCTCACT TTCCAAGTGT GGTACCGTCC GAAGCCCCAA GCGTATCTCC AGCAGTGAGT GAATCTTTTA TTCCAAGTGT TGGACCTTCC AATAGTCCTA GCGGATCGCC GTCCGGAATG GAATCTCCGG TTCCAAGTGT TGGACCTTCC GATATTCCTA GCGGATCGCC ATCCGTAATG GACTCTCAGG TTCCAAGTGT TGCGCCTTCA AAAAACCCCA GTGTATCTCC ATCAGTGAGC GAATCTCACT TTCCAAGTGT CGCACCTTCC GATATTCCTA GCGGATCGCC GTCCGGAGCG GATTCTCTTT TACCGAGTGT TATGCCTTCC AGCAGCCCCA GCGAATCTCC ATCAGTGAGC ACAATTCCAA GTGTCGGACC TTCCGACAGC CCTACCGAAT CTTGATCAAA TGTAAAAGAA TAGACTTGAA GAGAAGTTTA AAAAAGTCGC CGGTTGGCTT TTGACCATCC AGTGTCATTG CGTATTGACA GTCGATTCAT CGAAGGGCAC CGTTTGGAAC AGATGACACT ATTTGTATCT TAAAGACTAC ATAAAAGCTG ACAGTCAATA GTTTGAGGAA TACCGACCCG TTGTTCTTCC CATTGTCTAC AAAATGATAT CTTCTTCAGT TCATTCGTAT TTGTCCCCAC TGTGACACTT TCTCTTCAGA TACAGTTGAC ATTATGTAAA GCCTGCAGAA CAGATTTGGT CGACTGGTCG ACTTTTTAGC ACAACTTGCC GTCATCTCTA TCAAACGTGT TTTCTGATTG AAAAGAATCA GCTGGCGTGC AGGGCGTGCG TAAACAGCGC AATTGCCATT TTCACCTCTT CCGGCCAACA AACTAGCGCA CTATTTGATA TGCCTATCTA CCACGCCTTT TTCGCTG
|
Protein sequence | MAMETPACLA MIPTYLALNP CDTARITAWD EYQNGNAAAR ELGLQKLDYY AYSVCEQSCD CIPQVNAVPD QRIVHTGRGN CPAHTFYDIC RVLPKVKLII GEGTTNDDVT SYPEACVGVG NWFRSPASNN WGQNPNTPLE PAITYFLDRL VESTELTKPA STLWDECFHL ETSQRRITQA EPDWNTIATN GNPISRHEAC FVSVGQKGYL LGGRSAVGVS VYDFRTRSWS NLPRTLDVVL HHAQCVVLDG KIWMVGAWTG RFPREPNADK FYVYDPATNS WDDTRAIMSD QRNRGSAAVA LYNGEIYVSH GNRGGHNTKA LVLGLFDVYN PATNTWRALP DAPNPRDHTG GGILTDSSGD TLFCVSGGRD SSHPVFFNEF VLPTDCYNFQ SEEWEVRANI PTPRAGSAYG ITCDGRLMVA GGEGNDVAWD TVEVFNGEVW ETFPSLKTAR HGTGLAMDCG CSQIHIASGS GKQGAFDLAS TETFFLQGVD ESCSSSPIPV PTNSPPPPTP AVPTMPPTIS QVVNNETSTV PSPIAIAPTA APLTQPTSNP VEVSDFSLFI NAGQNNQDVT DGSGSTWVSD RYANRGNLFK TKLNNPILSM PNPGMEEVYR SSRWFSGTDF TYTIPSLEPA LYQVRLHFAE VYQPAQSVGS RRFNVEIQGS PVLTDFDIFL EAGSFTALVR EFTVLVDSGQ LAITFVKGLA ENPMITAIEI RVAPTTPNPT DVPSWFPSNA PSETPSISFE PTSLSSAPTS VPSLAPSIRG APISVSSSAS PVVPSQFPIE SGPEPYQLFI NVGGTVDTLD GLGRQWVRDQ YGTPAQSFTM PNSAISNMAA SGMETIYRQQ KWLSGAFTYT IPDLISGSYL VKLHFVEGFS GAFKVGARKF NVILQGTTVL ADFDVFKEAG GSYAAVSRQF VAVVNAGTIA ITFAKGGSNN PTVSAIEIQS LSTTQAPMGL PTSIPSGAPT RIDSAAEPAT FEIYVRVAIG QSIDLIWAKY QQRLSFSCRI NIDTEYCVRG TNYDTEHPKV SFTATKQYPD FFTRVSDATD SPYCTGKVYR VGDEIAGTQL GQQEVYRSQR WENGAMSCVV PNLSAGAYEV RLHFSENYGP NMNAGRRLFD VSLQGRLVLG NFDIFTEAGL GLTAVIRQFD VSVDQTGTLT AIFSKGMIEN PTVSAIEILS IREPAIQMPS LSPITADSFP SQAPSEEFSA LPSSRPSLSP SAAMSLSPTG RPSNVPGNGD SSIVDSSEIP SESPPTSASA STTSPLSNSS SALPSETESH FPSVAPSKSR GASPAVSESY IPSVVPSENP SVSPIVRESY IPSVGPSESP NGSPFRMESH FPSVVPSEAP SVSPAVSESF IPSVGPSNSP SGSPSGMESP VPSVGPSDIP SGSPSVMDSQ VPSVAPSKNP SVSPSVSESH FPSVAPSDIP SGSPSGADSL LPSVMPSSSP SESPSVSTIP SVGPSDSPTE S