Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39357 |
Symbol | |
ID | 7195067 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 275296 |
End bp | 278558 |
Gene Length | 3263 bp |
Protein Length | 1049 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183456 |
Protein GI | 219126421 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.807834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGG TGCATGCACT CCTTGTCAAG TCGTCGTCCT TGTTCACACA AACCAAGTCT CTACGGCTCC CACAGACTGT TCGCGACCAC CAACTCTCGA CATTGGTATC CACTCCACTT TAATCCTTTT TCAGTAATGT CGACTCCAGT CAAGTCACCG ATGAAGAGCG GCCACAAAGC AGCGACTCTG AATCATATTT CGCTGTTGAC GCCGTCGTCT ACGGAATGGT CCGTCGATGC CGTAGGAATT TTGCCACAAT CGCTCATGCC ATTATCTACG TCGTCGTCTT CGAATGCCGA GTCGGCTGTT TTGGTAGCGT CTCGCTACGG CGCGTGGTCC CTGATTGGTA AACGAGAACG GGACAGTAGC CAACACGTCA AGATGGTCTT GTTCGTCTGG TACTCGTCGC CGACGGCGTC CGTGAAAGAG CGAGCGTTGC AAAAGCAAGT ACTGAAACTT TCTCATCCGC ATTTGTCGAC TTTTGACAGC AGCAGCGGTA GTAGTTGTAG TAGTGCGGAT CCTCCACCAC TCGTACAACT CGCGACGAGT CCGACCAAAC CGGAAGAAGT CTACGTATAC GTTGCTAACG TGGTCTCGGG ACGCATCTTG ACCTGGAAAC TATCGCGCCC CGATCTCTCT CGGGTATTAG CACCGCCTCC TGCAGCCGTT ACATGGCTGG ATGAATTGCA ATCGGTAGAG GAAGGGGCAG AAACCTTGAC GTCCCTGTCG GCGACCTGGA ACTCCGGCAT GACAACCCCC TTCCTCTTGG TCGGTACCTC GGCGGGCCAC GTCTACAGCC TGCAGCAAAC ACACGTACCG CTGGCAATAC AGGTGCAACG TGTCGCAACC TCGTCGCTAT TCGGATCCAT CAACAGCAAC AGTAACGGTC TATGGGGCAG TATTCTTCAT TCGATGATCC CCGCAAGTCG GAGTGCAGCC ACTGGAGATG CCGTGGTCAC TACGCTGCTT CACCCTCCGG ATGATGCCAG CAGTAGCAGT AACTATACCC AAGGCATGTT TCATGTGTTG ACCAAGTCTG GTCGGTTACA AGGGTGGCGT ACGCACAAAC CTACCGAAAG TCCACATTGG ATCTTTGAGG CTGGTCCGAA GCCAATTGAC CTAGTCGCGC TATTACACGA CAAAATTAAG GCACCCGCAG TACGCAGTCT TCGCGTCTTG CAAGCAGCGA TTTCACCACA AGGACACCAT CTTCATTTAC TGGTGCTAAC GGAACACGAC GATGATGAAT ATCGTTTGTA CTGGAGCGTC TTGACCATGG GTAGTAAAGG CGGACACGGT GATTTTGCCA AGGAAGAACC CACCCTCACA TTGACCACCG CACACTGGTT GGATCGATTT TACGATCCTC GCAGTGTGAC GGTGGTCGGA TTGATCGTGG CTGCCAATAA TACTGCCTAC GCTGCCTTTC AAAGCCCAGG CAGTGCACCC ATAGCCATGG CTCTGTGCGG ACAAGATAAC ACATTCGATG CTATCCACGA ATGCGACCTC CCTGTCGGTA ATATTCCTGC ATTGATAGTC GATACCATGG TTCCCGATAA AGTGGTATAC GGCTGTGCCG TTTGGTCTAT TAGTGGTGCC AATGTACGTA TACAGTACCG TCCAACAACG GCATCACCGG CTCCGTCGAC TGTTCCACAC GGCTCTACCT CGCGAAGCCG TTCCGCGATT GCTACGTTGA CCACTCATCT ACAATCCGCC TTTTGGAACT ATTATCAACA TCCGGATCGG TCGGTTCGCC TCCCCCCTTC CCTGATTTCG GCCAACATTG CAGATTTGCA AGAAGCGGTA GTCGGCGTTG GTTGTGCCTT AGCATCTCGT CAACAAGCAC TGTCGAACGT CTTGGAGTGG CATTTGGCAT TTCTGAATTT GCTGCAACTA AGTGGCCTGT ACCGCAGCTT GCCCGAGGTC ACTCAGTGGC AGCTTTTGGC CATTGGACAA GAGTTGACCG TTGCAGACCG ACTAGGTTCC TTGCCCTCGG ACATGTCAAC CAGAATCCCA TCCTGGCAAT TGGAAGCGTT GGAGTCAAGA CCCTGGCAAG GTCTGGGGCC TTGGTTCGGA GATCTTGTGG CAAAACACTT TGGAGCTGGG CAAGATCGCC AGCAAGCACT TGTGGAATGG CTAGTAGCCA TGTTGGAGAC AGCCGAAGGC TACCGTGAAG AACACGGTCA AAAGACTTAT TACTTGACGT CAGGTAGCCG AATACCCAAA ATGTCGAGGA TTGAAGAAGT GCCAATTTGG ACGAGTCAGA TTGCTTTGCA AAGAACTCTA CTAGGGTTGT TGGAAAGTTG GCATGATGGT GGCTTCGTCG GGCACGGTGG GCAGGCTGTG GTCTTAGCAA CCGGTATTCT GCAGTCATTT GGCGACACGT ACGCCTCTGT GCCGACCGAG GAAACCAAAT CGACGTACGC AAATGCACAG CGAATGACGA TCGGACTGCT ACGTCGCCAA ATAAACCCAC CAAACGACGT CGTAGCTTTC CGTCTATCCG TCAAACACCA TTCCTTTAAA GGCGTGTGCC AGATTGCCTT TGATCACGAA AAGAAAGAGG ACGCGGAAGA ATTCTCCGTA GTTCCATTGT TTACCGAGTT AGCCCACGCG AAGGACATTT CTACCAGTAT GTTGTTTCCG GCATTTTTCC TGTATTGGCA TTCGCAACGC CAACATTTTG GCCATGTCCT CGACTATGGT CAATACTGTC CTGACACGTT TAGGGCATTT TTGGAAAGCA GCGAAGAATT GCGTCCGTAT CGGTGGATTC AGGCGGCGCG GGCTGGGGAT TTCGAAGGCG TAACGAACTC ATTACTTCGC AATGCGGAGA AACCGGAAAT TTTGTTGCAC GATGCCCACC TGTCCCTTAG TCTGGCGAAA CTTGCAAACT CTGTAGTAGA GTCTGAGTCC ATGGATAAGG AACTCGCGGC GAAGCGCGCT CGACACATTG ATCAAAAGCG AGAATTGGTT AATGCTCAAA ACGAACTCTT TGATGAGACC GCACCTAGCT CTTGCCTCTG GTCGGCCGAA CGTCTACTCA ACTACGCCTA TGACAGGGCG AATCAGGCCA AAGATGCGGA AGACAAATGT CGAATCTATT TCACGGCCCT AGCAATTTGT GCAACGATGG AAGAAATCGA TCAAGTAGAA AAGAACGCTT CTCACGTGTG GTTCCGTGTT TTGCAAACAG AACTGGACTG GTGGAACAAT CTGATTCAGA GTGCTACTGA TTTGACAGAT ACAGATATAC TGA
|
Protein sequence | MATVHALLVK SSSLFTQTKL FATTNSRHWY PLHFNPFSVM STPVKSPMKS GHKAATLNHI SLLTPSSTEW SVDAVGILPQ SLMPLSTSSS SNAESAVLVA SRYGAWSLIG KRERDSSQHV KMVLFVWYSS PTASVKERAL QKQVLKLSHP HLSTFDSSSG SSCSSADPPP LVQLATSPTK PEEVYVYVAN VVSGRILTWK LSRPDLSRVL APPPAAVTWL DELQSVEEGA ETLTSLSATW NSGMTTPFLL VGTSAGHVYS LQQTHVPLAI QVQRVATSSL FGSINSNSNG LWGSILHSMI PASRSAATGD AVVTTLLHPP DDASSSSNYT QGMFHVLTKS GRLQGWRTHK PTESPHWIFE AGPKPIDLVA LLHDKIKAPA VRSLRVLQAA ISPQGHHLHL LVLTEHDDDE YRLYWSVLTM GSKGGHGDFA KEEPTLTLTT AHWLDRFYDP RSVTVVGLIV AANNTAYAAF QSPGSAPIAM ALCGQDNTFD AIHECDLPVG NIPALIVDTM VPDKVVYGCA VWSISGANVR IQYRPTTASP APSTVPHGST SRSRSAIATL TTHLQSAFWN YYQHPDRSVR LPPSLISANI ADLQEAVVGV GCALASRQQA LSNVLEWHLA FLNLLQLSGL YRSLPEVTQW QLLAIGQELT VADRLGSLPS DMSTRIPSWQ LEALESRPWQ GLGPWFGDLV AKHFGAGQDR QQALVEWLVA MLETAEGYRE EHGQKTYYLT SGSRIPKMSR IEEVPIWTSQ IALQRTLLGL LESWHDGGFV GHGGQAVVLA TGILQSFGDT YASVPTEETK STYANAQRMT IGLLRRQINP PNDVVAFRLS VKHHSFKGVC QIAFDHEKKE DAEEFSVVPL FTELAHAKDI STSMLFPAFF LYWHSQRQHF GHVLDYGQYC PDTFRAFLES SEELRPYRWI QAARAGDFEG VTNSLLRNAE KPEILLHDAH LSLSLAKLAN SVVESESMDK ELAAKRARHI DQKRELVNAQ NELFDETAPS SCLWSAERLL NYAYDRANQA KDAEDKCRIY FTALAICATM EEIDQIQIY
|
| |