Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39253 |
Symbol | |
ID | 7195189 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 34555 |
End bp | 39299 |
Gene Length | 4745 bp |
Protein Length | 1538 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183412 |
Protein GI | 219126329 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGACG TAGAAGAAGT GGCAGAGGCA CTTCCGACGA TTCCCGAGGA CGTTCCGACC CTGACGACCG AGGAAGCGAC AACGCTACCG GTGCCTGTTG CAAACTCTGA CCTTGACGAA GTCACGGACG CTCCCGAATC CGTAGAGACG GAACAATCAC CGGAGATTGT CGAGGAAGAC ATTGACACCT CCGAACCAGC CTCGGTCACG AATAGTACCA TGGACGACTA CGATTCTGTT GCTAGTGAGC AATCCAACAA CGTGTCCAGG GACGAGGCCG ATTCTCACGA CTCGCATGAA GAGGTGCAGG GGTTGACGGT GGCTTCTGCA GTAGGTCCCG TCTCTTCCCC GTCGCATGTC AGTGGTCAGT TTGATGAAAT TGAAGTCGAC GAAAGCATGC TTCTACCAAC CGATGACGAA GAGGAACTGA TTGAAGAAGA AACGGAACAA GTTGACTTCA GGGAAGATTC CTCCGGAGGC TTTGAAGAAG TCGAGGTGCC TTCGGACAAC GAAGGCGACA TACTTATTGC GGATGACGAA CCTCTTGAAG ATGATGATAC GCATGCCTCC ATCGATCAGG TCAGTGAATC GGACGATAGT GCTCATAGCG GAGAGATTTC GCCAGAAGAT GAGGTACCGA ATGGCGACAC CTCGGACGAA TCGATCTATA GCGTGAGTGA TGCGCCCCGC GAAGAATCTG TTCCTAGTCT CGACTCGGCG GTCAGCTTGA GTACTGAAGA TAGGAGTTCG AATTCAGTGA GCTTCATGTT AGCCAACAAC GAAGTATTCT CTCCCTCAGA TTCGGCTGTG GTTAACCCCA ACTTGTCCAT GGAGCCGCTC ACGAAAACGG CTTCCGTCGC GCACACCGAG CCGCTCGAGT TTGAGCTAGA CGAAATGGGA CCAGTCACAG GCATTCAAAT CGAGGACTCC AGCAGCCGTA CGTCCGTCAA TAGTTCTTCC CACAGCGATC CATTCTCTCG TCAAACTGAC GACAGTAACG AGGTTGCCCT TATTCCATTT GTAGACTCGA ACGAAAGCAA CGCCTCACAA TGGACACCAC TTTCAGCTTC TACTCCGGTC ATGACTCCTT CAGGTAGGGT TTTTGCCCCT GCACCGCCAA CCATACGGAA TTCCACCATG GGCTTGTACG AGGATGAACC GACGGACGAA ATGGGTGGCA TGGATCGTCA CAATGAAAAC CCGACCGTGG TACCAAAACA CGAGGACTTA AATCGCAACA GCGGAGACAC GGCGAAGATT GGTAAAGTAA CCGCTGCGGG GACAGTGGCA ACAGCTCTGG CAGCATCGGA GCGATCGCCA CTGGATGCCA GCATGCCGAT GGAAGACGCC ATGTCAACGC ACCCAATAGA AACAAGCGGC TCGTACCACG AAGCGGGTCT TGAGGACAAA TCATCTTCGC ACAATCTGGG GGAAGAGTTT CACGACGAAT CAGCTTCCGG GGATGATAGC GTGTTCTCTG GTGCCGAAGA AGGTAGTGAT GGCGATATGG AAGGCGTTGC TATTGAGGAG ACGGACGACG AAGAGCCCGA TGGCGTCAGA GAAGAGTCCT TAGAAGAACT GGATCGCGAA GAATTCGAGT TCGATAGCGA CGATGGTGAT GACGCTCGTG TCAAGGGGCA GTCTTCACTA GTACTGACAG CATACGCTGG TCCAGAAAGC GATGAAGATT CCGTAAGCCT GGTACGCACG AGTGATGACG GAGAGGATAT CGAAATTGGT CTGGAAGGGG CCTCGGAGAA GGATTCCGCT TCCTACCGAG ATGAATACGG AAAAAGCATG GACGAGGAGG ACAGCCATTA TCGCGGGTGC TGTGCGGTTT TCTTGTCAAA TCCCTGTACG CAACTTTTGT GCTGCGTCAT ATCCGTCTTG ATCATTCTAG GGGTCATATT GGTGGTGGTG CTAGCGGGGG GCGATAGTCC GACGGATGGT GGTAGACCAT TCCCTACAGG TGCCCCTAAT GGAGGGAGTG TAACGTCGGC CCCCACACCA GATGGTTTTA CACGAATGGT AAGTTGCGAA TTGTGACACT TGACGAAAAT TTACATTAAA CGCCTCATAT TATATTGATT ACACAGCCGA CCACTTTGCC TACAAACTCT CCAACTCCCT TCAATATCGA AAGTGTTTTA CTGGCGAAGC TCAGCGGCAT TGCCGGATCG AGCTTTGGGT TTTCCGCCGC GATGACGCGA TCGTCTCGCT CTTTCATGGC AATTGGTGCC CCGGAGGGCG GTGCCGGTTC GGTGTCAATG TATAGTAACT CGACTGGTCA ATGGCAATCT TTTCAAGTTC TATCATCTCC ACAACAAGAA GGAAGATTCG GAGCTGCAGT CTCGTTAACC GACGACACGT CCTCTACTGG TTTGATGGTC GGCGCACCAA ATGTCCTCGC TACAGGCACA AACATACCTG TAGGAGCAGC CTACTATTAC GAGTACAACC GTGCCGATGC CAGATGGGCA CAGCTTGGAT CAGTGATACA AGGAGGTGCG TTAGCGGAAA ATGCCCAGGA GCACTTTGGT GCTGCAGTTG CAACCAGTGA TGTTTACAGG GTTGCCATTG GTGCTCCAAT GAACAACGCC GCCGCTGCTC GTGCAGGACG TGTCTACACC TACGAATATA GACAAGATGG AGCCATCTTT GACTGGGTCC CCATGTCTGA TTCGTCAGTG ACAGGCACAA CAACAGATGA ACGATTCGGT TCCTCGCTTG ACATGTCCAC TGATGGAAAC GTCATGGTAG TCGGTTCGCC CTTTCGAAGC CTTTTTTATA TTTACCAGTG GAACGGTAGC ACATGGTCCT TCATATTCTC GGGGTCGACC CCTGGTGGAA ATCTAGATGA GGAGTTTGGA TCTTCGGTAG CTGTCCTGTC CAATGACTTT ATCGCTGCTG GTGGTCCAGG TGCTAGTAAC GAGGCCGGGG TCGTTCGAGT CTATGAGCTC CAAAATAACG GTTTGTTCCG ACAGTTTGGA CCAGATATCG TTGGCAGAAA TGGCGAACGT ATTGGGGAGC TCAACAGTTT AACGGGAGCA ATCACTGAAC GCGGCCCGGC TGTCTCGGTG GGTACTGTTG ACGGTCTTAT TAGAAGATTT GACTACAATG CCAATCTCGG TGAATGGCAA GAACGCTTTG AAGCTGTCAG CACTGGATTT TCTGGACCAG TGTCTTCACT ATCCATGGTC ACCAATGGAC TAGCAAGTTC GATTGTTGCA GGGCACGCTG CAGGAAATGA AGCTGCCGTC TATGAAGCCT TCTTTGGACT CACGGCCAAC CCAGCTGGAG TCCCAGACGT TACATTGCCG CCGGTAGCCT CTGATTCTCC TTCACCGACT GCTGTTCCGA CCCCTCCGCC CGGGCCCGAG ACAATGACAC CTTCGCCTAC TGCTGTCCCG ATTCCTCAGC CCAGTGCCGA GAGCTTGGTA CCGAGTCCTG GTGTTTCATC CTCTGCACCT TCTCTTTTGC TGACGCTTTC ATCAGCTCCG TCGCCTGCTA CCGTCAATAT GACTGTTTTA CCGACTTTGG CCCCTTTATC TGGGACCCCT TCGTCGGCTC CCAGTGTAGC TTTGTTGGCC ATTTCGCTTG TTCGCCTTCT CCCAGGTGGA AGTCCGAACA CCGACACTGG CTCTTCCGTA GCTTTGACAG ATTCTTCCTT GGCCTTCGGT GCACCGTTGT TCCAGAATGG GGGAGGCATT GTTCAGCCTT ACCGAGGATC TGGAGGGGGC TATACTGCGC TGGATCCTAT CTTTGCCGTG AATGGTGGGC GTTTTGGAGA AGCTGTCGAT GCTACGGACG GTGGTGGTCG TAATGCACTT TTGGTCGGAG CCCCCACAGC TTTTGACGAG CAAGGCTTGA ATGTTCGCTT CGGCGCTGCC TACTACTATG AGCTTGTCGG TGATGTGTAC TCTATGATTG GATCGACAAT AGAACCACCG CTGACGCCTC AAGCTTCTGG CGGTCGATTT GGTACGTCTG TGGCCGTCGC CAGGAATATT AGACGAGTTG CAATTGGTGC TCCCTTCACG AGTACGTCGG CCACTGTCCA GCGTACCGGT CGTGTTTACA CATACGACTA TGACGGCACC AACTGGACAG CTTTGGAAAC GACTCCCGTA GAAGGCTTCG TGACGGACGA TTTCTTGGGT TTCGATGTGG ATCTTTCCGA TGACGGTTCT CGTCTTCTTG TAGGAGCACC AGGGAGCACT AGTGGTGTGG TGTTGTACTA TCAGCTTTCT GGACTTTTCT GGCAACCTAT TTTTTCGCTA CCTGGCTTTG AATCAGACGA GCTCTTTGGG ACCACGGTCA AGATTCTGAC TGCCGATGGC AATACCATAG CCATGGGAGG ACCAGGCTTT GGAGCCAGCA ACAATGGCGT GATTCGGGTT TACAGACGAA GTAGCGATGG AACTTTCGTC CAGCAAGGCC AAGATATTGT TGGCGCCGCC AACGAGCGAT TGGGAAACCG CAACACGCTC ACTGGTTCGA ATGGCAAGTT GCTTGTCGGG ACGGCCAGTG GTACAGTCAA GCGACTAGAC TATATGGTTT CCACGGATGA GTGGACTCAA GTTTCGGAGC AGGAGAACGT CAGCAGAATT TCGGCCCTCG ATACTACTGG ATCGGTAGAC ACCTTCTTGT TGGGCGACGT AAGTCAATCT CAAGTTTCGT TGTACACAAT CGGATAAAAC GTGCATAGTA ATTGTCAGTT CTTAATTGAA GCTTGCCCGG AATAG
|
Protein sequence | MEDVEEVAEA LPTIPEDVPT LTTEEATTLP VPVANSDLDE VTDAPESVET EQSPEIVEED IDTSEPASVT NSTMDDYDSV ASEQSNNVSR DEADSHDSHE EVQGLTVASA VGPVSSPSHV SGQFDEIEVD ESMLLPTDDE EELIEEETEQ VDFREDSSGG FEEVEVPSDN EGDILIADDE PLEDDDTHAS IDQVSESDDS AHSGEISPED EVPNGDTSDE SIYSVSDAPR EESVPSLDSA VSLSTEDRSS NSVSFMLANN EVFSPSDSAV VNPNLSMEPL TKTASVAHTE PLEFELDEMG PVTGIQIEDS SSRTSVNSSS HSDPFSRQTD DSNEVALIPF VDSNESNASQ WTPLSASTPV MTPSGRVFAP APPTIRNSTM GLYEDEPTDE MGGMDRHNEN PTVVPKHEDL NRNSGDTAKI GKVTAAGTVA TALAASERSP LDASMPMEDA MSTHPIETSG SYHEAGLEDK SSSHNLGEEF HDESASGDDS VFSGAEEGSD GDMEGVAIEE TDDEEPDGVR EESLEELDRE EFEFDSDDGD DARVKGQSSL VLTAYAGPES DEDSVSLVRT SDDGEDIEIG LEGASEKDSA SYRDEYGKSM DEEDSHYRGC CAVFLSNPCT QLLCCVISVL IILGVILVVV LAGGDSPTDG GRPFPTGAPN GGSVTSAPTP DGFTRMPTTL PTNSPTPFNI ESVLLAKLSG IAGSSFGFSA AMTRSSRSFM AIGAPEGGAG SVSMYSNSTG QWQSFQVLSS PQQEGRFGAA VSLTDDTSST GLMVGAPNVL ATGTNIPVGA AYYYEYNRAD ARWAQLGSVI QGGALAENAQ EHFGAAVATS DVYRVAIGAP MNNAAAARAG RVYTYEYRQD GAIFDWVPMS DSSVTGTTTD ERFGSSLDMS TDGNVMVVGS PFRSLFYIYQ WNGSTWSFIF SGSTPGGNLD EEFGSSVAVL SNDFIAAGGP GASNEAGVVR VYELQNNGLF RQFGPDIVGR NGERIGELNS LTGAITERGP AVSVGTVDGL IRRFDYNANL GEWQERFEAV STGFSGPVSS LSMVTNGLAS SIVAGHAAGN EAAVYEAFFG LTANPAGVPD VTLPPVASDS PSPTAVPTPP PGPETMTPSP TAVPIPQPSA ESLVPSPGVS SSAPSLLLTL SSAPSPATVN MTVLPTLAPL SGTPSSAPSV ALLAISLVRL LPGGSPNTDT GSSVALTDSS LAFGAPLFQN GGGIVQPYRG SGGGYTALDP IFAVNGGRFG EAVDATDGGG RNALLVGAPT AFDEQGLNVR FGAAYYYELV GDVYSMIGST IEPPLTPQAS GGRFGTSVAV ARNIRRVAIG APFTSTSATV QRTGRVYTYD YDGTNWTALE TTPVEGFVTD DFLGFDVDLS DDGSRLLVGA PGSTSGVVLY YQLSGLFWQP IFSLPGFESD ELFGTTVKIL TADGNTIAMG GPGFGASNNG VIRVYRRSSD GTFVQQGQDI VGAANERLGN RNTLTGSNGK LLVGTASGTV KRLDYMVSTD EWTQVSEQEN VSRISALDTT GSVDTFLLGD FLIEACPE
|
| |