Gene Information |
Locus tag | PHATR_18826 |
Symbol | |
ID | 7204139 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1401276 |
End bp | 1404967 |
Gene Length | 3692 bp |
Protein Length | 1135 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186524 |
Protein GI | 219114688 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0391599 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACACT GTGCCGTTTT TGTCCGCACT TCTGTCAGTA CTATGTTGTA TCGCAAAGCG CTACGAGTAT CCGCTTCCGG TCGGGCCAAA ACCTCCACCG GACAAGTAGT CAACATGATG AGCAACGACA CGGCACAGTT GCAGCGATTC TTGCAGTTCG TCGGTATGAC TTTGGTCGCC CCCTTGCAGA TCATTATCGC CTTGGTTCTG ATTTTTCAAC AAGTACGTTT CGCTCTGTGT TACAGTTAAT TGTCCTCGAA AATCTTGTCC ATGGATGTTC TTTCTCACTT TGAAATTCTG TTCTTTTCAT TGTCTTTCTC GGACAGGTCG GAAACGCCAC TTGGGTTGGT GTTGGATTCA TGTTTGCCTT GGCACCCATT AATACCGTTG TCTTCTCCAT AGTATCCAAG CAACGTCGCA AGGTACTCAA ATATTCCGAT TTGCGCGTCA AGATGATGAA CGAAATTCTT GCCGGTATTC GAATTATCAA ATTCTACGCC TGGGAGCGGC CGTTCGGAAA AGAGGTCGGG CGTATTCGTG GCTCCGAACT AAAGGCCTTG ACTAAACTTG CTTATACCTC GGCCATCGGA TTTTCGCTCA TCCTCATGTC AGCGCCTCTG ATACAACCCA TTCTAGTGTT CTTAACCTAC GTGTCTATCC AAAACGAACC GCTCGACGCG GCTACAGCCT TTACTACAGT CGCACTTTTC AACATTATGC GCTTTCCGTT CGCTTTTATG CCCATGGGAC TCTTGCAATA TATTCAGAGC AAGATTTCTT TAAAACGTCT GGAGCGTTAC CTGGCCTTGC CGGAATTGGA CGAATACGTC GAGCATACGC CGCCTCCGTC TGCGAGTATG GACGCAGCCG AGGCGCAATA CGGTAGCGTC ACAATACTGA ATGGTAGCTT TGCGTGGATG GATCCGGAAG GTAAAGAAAT CCGCCCGATT CAAGACGAAG AGCCGAAAAA GGAACGCCGG AAGTCGAAGC GTACCAAAGG CGACGAGACC TCGGATGTTG ACATGATGGC GAGTAACCAC AGTTCAGTCG CGGGTAGCAG CGTCTTGACT GAGAGTACCC AGAAAACTCC TCCCATCACG CTGCAAGAAT TGACATGCAC AATTCAAACC GGTAAGCTAG TCGCAATTGT CGGTGCCGTT GGTTCGGGCA AATCTAGCTT TCTGTCCGCT ATCTTAGGTG AAATGGAACC CGTCAAGGGA TGCAAAGTTT ACATGCCGCG TCCTGTAGAT GCGCCAACAG GCTTTGTGTC TTACTGTACG CAGACCCCGT GGGTCGTCAA CGACACTTTG AGAGGAAATG TTCTGTTTGG ACGTGATTTC AATCAAGAAC GATATGAACG CGTTCTGGAG GCGTGTGCGT TAGTGGACGA TCTCGCTATT TTACCTGCCG GTGATTTGAC GGAAATTGGC GAACGAGGCA TCAATTTGTC GGGTGGTCAA AAGGCCCGTG TTGCGTTAGC CCGGGCTTTG TACTCCGACG AGACTCGTCT GATGCTCATG GACGATCCCT TGTCGGCGGT GGATGCACAT GTTGGAGAGC ACATATTTTC TAACGCTATA GCCGGAGACA TGGCCAAAGG AATAACTCGA TTGTTAGTCA CACATCACGT TCACTTGCTG TCACGATGTG ATGACGTCAT TGTTATGGAG CACGGCCGTA TCAAGCACCA GGGCCGGTAC AGAGATTTGG TGGCCGCCGG TGTTGACTTT GCGGGCGCGG TGGACGTCTC AAAAATCAAG GCAGCCTCAA AGCAGGAGCC TGAAAAATTT GACGACGAAG 
TTACAGCCCA AAAGGAGGTC GAGCTGTCGG CTGAGAAGAA AGCAGCTCTA AAGAAGAGCG GGAAAAAGCT TGTGAGGGAC GAAGAGCGCG AAGAAGGAAG TGTTGACGGC TCGGCTTATA TGCATTACGC TAGAGCCGGT GGATTGTTGA CGGCGGCATC CGTTTTTGTC ATTCAGGCGC TCGGTCGAGC CTCAGAAGTG ACTGCTGGGT TCTGGCTGGC CTTGTGGGCA GAGCGCAGTC TTGAAGCATC GTTGAGTGGA GATCCGTTCT CGCAGACTAC AACAAATCGA TATCTTGGTG TGTACGCTTT GTTTGGTCTG GGCGGTGTCA TAGGGCTTAC CGCGCGCGCC ATCATTGTTG CGGTTCACCG CCTTCGGGCT TCAAAGAAGA TGCATGATGA CTTGACCGAG AGTATCTTAC GTGCACCTGT TTCTTTCTTT GACATCACCC CTACTGGACG GATTCTCAAT CGTTTTGCGG CCGACATGGA CAAGGTCGAT CTCGAGCTGA CGCAGAGCCT TTCACAGGGA GTGTCAACTG TCTTCAGTGT TCTCGGAGCA ATTGGGGCAA TTATCGCTGC CACAAACGGC ACATTTCTAG TTCCATTGAT TCCTATCGGC TATTTGTACT ACCTGATCCA GAAGTGGTTT CGAAAGACAT CAACTGAGCT GCAGCGCATC AACAGCATCG CAAATTCTCC TATATTTGCT GATTTTTCTC AAACCCTCTC CGGTACTTCG ACGATTCGAG CCTACGGCGA AGAAAAGCGG TTTTTCATCC AGTGCAAGAA GTCTTTCGAC AACATGAACA CATCATACAT CCTCGTTCAG TTAGTCAATT ACTGGCTTGG ACTTCGTCTT GATGTCTTGG GAGGACTTAT GGGGGCCTTT ATTGGAGGAG TCGCTGTTGC TACTTCGTCG TCTGGTTTCA TTTCAGCAGG GTGGCTTGGT CTCGCCCTGT CGTACAGCAT TGAAATGACA AATTACCTCA AGCATGGAGT TCGTATGATT GCTACAATCG AGGCGCAAAT GAATTCTGTC GAACGTATTC TCTTCTACAC CAACAACATA AAAGCGGAGG CACCTGAGTT TATCCCGGAA TGTGATCCTG AACCTGGTGT ATGGCCGATC AATGGCGAGA TCGAGCTCAG CCACGCCTCC ATGCGGTATC GCGATGGACC ACTAGTTTTG AAGGACTTAT CGCTAAAGGT CAAAGCCGGC GAGCGTGTGG GAGTTTGTGG GCGTACAGGA AGCGGTAAAA GTAGTCTTAT GATATGTTTG TTCCGTATCG CAGAACTGGA AGATGATGGC GGAAAAATCT TGATTGATGG GATTGACGCT TCCGAAATTG GAACTTCAGC TTTGCGTTTA AATCTTTCCA TCATTCCTCA GGACCCTGTG ATATTTTCCA ACACCGTTCG TTATAATCTG GATCCGTTCT CAGCAGCAAC AGACGAAGAA GTGTGGGAAT CTTTGACGAA GGTGCAAATG GCTGACACTA TCGCGGAGTT ACCAAATGGA CTCAGCGAGC AAGTTTCGGA GGGAGGAGAA AATTTTTCGC AGGGACAGCG CCAGCTTTTA TGTATTGCGC GATCGTTGAT TCGTAAGCCC AAGATTTTAG TAATGGACGA AGCGACGGCG AGTATCGACA ACGCTACGGA CTCGGCAATT CAGCGAATGA TTCGTGAAAA TTTTGAGAAT ACAACAGTGT TAACGATCGC TCATCGCTTG AATACGATTA TGGACAGCGA CCGCGTGTTG GTGTTAGATG ACGGAAGGAT AGCCGAGTTT GATACCCCAG AGGCTTTGTT 
GGCGAAAGAG ACTAGCTTGT TCCGTGCAAT GGTAGATAAA AGTCGGGCTG CCAAGTCAAA AACTCTCATC GAAGGAGAGT AG
|
Protein sequence | MAHCAVFVRT SVSTMLYRKA LRVSASGRAK TSTGQVVNMM SNDTAQLQRF LQFVGMTLVA PLQIIIALVL IFQQVGNATW VGVGFMFALA PINTVVFSIV SKQRRKVLKY SDLRVKMMNE ILAGIRIIKF YAWERPFGKE VGRIRGSELK ALTKLAYTSA IGFSLILMSA PLIQPILVFL TYVSIQNEPL DAATAFTTVA LFNIMRFPFA FMPMGLLQYI QSKISLKRLE RYLALPELDE YTSDVDMMAS NHSSVAGSSV LTESTQKTPP ITLQELTCTI QTGKLVAIVG AVGSGKSSFL SAILGEMEPV KGCKVYMPRP VDAPTGFVSY CTQTPWVVND TLRGNVLFGR DFNQERYERV LEACALVDDL AILPAGDLTE IGERGINLSG GQKARVALAR ALYSDETRLM LMDDPLSAVD AHVGEHIFSN AIAGDMAKGI TRLLVTHHVH LLSRCDDVIV MEHGRIKHQG RYRDLVAAGV DFAGAVDVSK IKAASKQEPE KFDDEVTAQK EVELSAEKKA ALKKSGKKLV RDEEREEGSV DGSAYMHYAR AGGLLTAASV FVIQALGRAS EVTAGFWLAL WAERSLEASL SGDPFSQTTT NRYLGVYALF GLGGVIGLTA RAIIVAVHRL RASKKMHDDL TESILRAPVS FFDITPTGRI LNRFAADMDK VDLELTQSLS QGVSTVFSVL GAIGAIIAAT NGTFLVPLIP IGYLYYLIQK WFRKTSTELQ RINSIANSPI FADFSQTLSG TSTIRAYGEE KRFFIQCKKS FDNMNTSYIL VQLVNYWLGL RLDVLGGLMG AFIGGVAVAT SSSGFISAGW LGLALSYSIE MTNYLKHGVR MIATIEAQMN SVERILFYTN NIKAEAPEFI PECDPEPGVW PINGEIELSH ASMRYRDGPL VLKDLSLKVK AGERVGVCGR TGSGKSSLMI CLFRIAELED DGGKILIDGI DASEIGTSAL RLNLSIIPQD PVIFSNTVRY NLDPFSAATD EEVWESLTKV QMADTIAELP NGLSEQVSEG GENFSQGQRQ LLCIARSLIR KPKILVMDEA TASIDNATDS AIQRMIRENF ENTTVLTIAH RLNTIMDSDR VLVLDDGRIA EFDTPEALLA KETSLFRAMV DKSRAAKSKT LIEGE
|
| |
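The coordinate and sequence fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the function names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which agrees with the reported Gene Length (1404967 - 1401276 + 1 = 3692). The GC calculation is shown only on the first two 10-base blocks of the gene sequence, so it does not reproduce the 49% reported for the full gene.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(blocked: str) -> str:
    """Join a sequence that is printed in whitespace-separated blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Coordinates taken from the record above.
print(gene_length(1401276, 1404967))  # prints 3692

# First two blocks of the gene sequence, exactly as printed in the record.
prefix = clean_sequence("ATGGCACACT GTGCCGTTTT")
print(len(prefix), gc_percent(prefix))  # prints 20 50.0
```

To check the whole record, pass the complete Gene sequence field to `clean_sequence` and compare its length and GC percentage against the Gene Length and GC content rows.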