Gene Information |
Locus tag | PHATRDRAFT_41409 |
Symbol | |
ID | 7199306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 94583 |
End bp | 99103 |
Gene Length | 4521 bp |
Protein Length | 1506 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185377 |
Protein GI | 219130448 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
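The length fields above can be cross-checked against the coordinates. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; neither convention is stated in the record, but together they reproduce the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for PHATRDRAFT_41409.
length_bp = gene_length(94583, 99103)
print(length_bp, protein_length(length_bp))  # prints "4521 1506"
```

Both numbers agree with the Gene Length and Protein Length fields, which is consistent with the "Is gene spliced | No" entry.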
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.974379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTGT TCGTGGCGAC GAGTCGTCGT TTGGACGGTT GGACGGGTCC CTCCGCGGAT CCCGACGCGG AGTCCTGTGC CAGTTCCCCG GCATCCGCCT CGCCGCTGCG CACGACCCCG CGCCAGTACT TGTGGAGTCC CCGTGGCGTC ACACGCACCC CGGAACGAGA CCCGACCGCT ACAACCGAAT CGGGACACAA AGACGCAACC GTACACGATG ATGTAGAAAG ACGTTGGACG CCGATCCAGT CCACAACGGC ACGGACTCTG TTCCCTCCGA CCCACTGCGA CGTGGGCACG GTGTGGGACA CTCGTGCCGT CGAAACCCAC GCGCCGGTAG TTCCGGACAC CGTGTGCCGG ACCAAGACGT CCCGTTTCGC CGTGTGGCAC GACACGTCCG AACCCGTGGT GCCGCTTGCG GTCCGTTCGA CGACGTCAAC TTTGGTGGTG GTACAGCCGT CCGTGTCGTT CGCGATCGAG ACGTGTCGGA ATCATTCCGA CACGGTCGTC GCACGGGAAC GCTGCGATCC CGAATGGCTT CGGATCGATC ACTATCGTCG AGCGAACGAC ACGACCGCAG CACCCAAACG TACCACCACC GCGACTAGCA CAACCAGTAC AACCAGTACA ATTAGGACTA CTCCAAAGAC CGCAGCCACA AACACAACGA CCATGTCTCC TACCGTTCCT TCCGTATCCG CCACTTCCCG CCACACCACG CTTGGTCGTG CACTGGTGGG GCCTCGGGGA GTCGATCCCG TTACGGCTCC CACCCCGACT CGGACAACAC TCCCGAGTCT CACCCACACG AAGCCCCTAA CTCCCACCCC GAGCAACGCC CGTATACTCC GTGACGCCTC CGGCAATCAA CCCGGACGAC CGGCAGTTGC TCCGTCCGAA GAACCCACGC ACGACTTCCC TACCAAGATG CCGTCCCCAC CCGTCGCTTC GTCCATTGCC AAAGAAAACG CCCCCACGGT CTCTCCTTCG GTCCGTAGCG ATTCTCACGA CGTTGTTACC ATCACCTACA CCACCACTAC CGAGTCCGTT CCCGAAACGT TGTGGAGCCG GGTCGTCGAG AGTGATGTTT CCTTTTTGGG AGACATTACG GACACTCCCG ATCGCGACAC TAGTCACGTC CACGCCAACC CTTGCGATCC GATGGACCAC GCGATTGTCA CGGACGACGA AGATTCGGTC ACGTCCACCA GCGTGCACGC CGTGGGAAAC ACCAGCACCC GCATGCTGGA CGTTTCGACC ACGGAGGAGT CCCACATGCT GGACCTTTCG ATCGAGACGG GTCCCCAGTG TGCGCCGTCG GCTGAGCACG ATGGACACGG CTTGTTCGGA GCGGATCGTA CGGGTGACAC CGATAACGAC TCGGGGAAAG AGGAAGAAAC GATCGAGACT ATACTTGATG ATTCGGTCGA CACGGGATCC GTTCTACCGT TGCCCGACGC CAACGGCATG GACGAATCGT CGACTGCCGG TGAGGATGAT CCACGGCCGG CCCCACAAAA AAACGCACCC TTGTTGGAGA CGGCCACGGT CCAAACCGAA CAGTCCAACG AATACATGTC GGATCGCAAG TCATCAACGG TGGTTCCGCC AGAAACCGAG CACCCGGAGT CAGCGCCCGT CTTGTCGTGG GCGGCCGTCG CCAAAACAAC ACCTTCCAAA CCCAAGACTT TGAACAATAG AGCGACGTCG CCAAGCACAA TTCCGGTATT CCCGGCGCCC ACGCACAACA AAGTTACCAC CGCCCCCCTC GATACCTCGT CCAGTACCTG GGAGCACGAT 
GGGCTCCTCG TCTTGTGCTC GCATCAATCA TTGGACAGAT CCGTCACGGT CAACCAAATC AAAGTCTTTA CTATATTGGG AGGTCACAAC ATCTCCTACG AAACCATCGA CGGTGCCGAC CCGGTGCACA AACAAACCCG CAATCACCTG TTTGAAGTCT CGGGCTTGAA GGCGCAATAT CCCCAGTTCT TTTTGGTCCG GAATGGCGAG CTGACCTTCT GGGGTGATTT TCACCAGTTT GTGCAGGCCG ACCAAAACGG TGAATTGCAA GTGGCTTTGG CCGCAGGTGT GGAGCTAGCT CCCGTGATTC GTCCAATCGA AACGGCAATG GTCGACACCG AAGACACTTA TTCTATAGAG GCGGGGAACG TCCCGAAATT ATTGGTGCTC TGTTCGAATC AAACCATGAG CCGAAGAGTA ACGTCGCATC AAGAAAAAGC ATACACTATT CTGAAGGCTC ACCGTATCCC TTTTGAAACT CTGGACGGAG CCGACGCGTG GAACAAATCT CGACGCGAGG AGCTCTTTCA GATTTCGGGA CTGCGCGCGC AGTACCCCCA GTTCTTTCTG ATTGACGAGA CAAGCGGCAC CACGACGTTT TGGGGAGATT GGGAGCGTTT CATGTACGCC AACGAAGACG ACGTGCTCGT GAACGAGCTC GGTTTGACAC CTTCGTATGC TCCCGAACAG CTTCAGAACC ATGTGAACAC GGCCAACGAT ACTCACGCTA GCAAGACAGA ACGGATGGAG TCCTTCGACT CCAACGCTGG AAATTCGGAA GGAACAGACG CTAATGAAAA GCGGTTCCTG GTGCTGTACT CCAGTCAATC GCTTGATCGC CAGGTGACGG TCAATCAGGA GAAGGCATAT TCCGTACTGA CGAACAAAGG TATACCGTTC GAGACACTGG ACGGGATCGA TCCCGACAAT GTCCAGCGCC GCAACACGCT CTTTCGAATC TCGGGATTGC GTGGCACGTA TCCGCAGTTC TTTATCGTTG ACGGTCACAC CACCGAATTC TGGGGCGACT GGAATCGTTT CAACAAGGCG AGGAAAGCTG GAAAATTGAG TATATTATCT ACGCCGATCG AAAAGGAGAA AATTGCGAAG GGAATTGCCC AAGCATCTAC GGGACAATCG GCAGCCGGAA CGGGCAAGGC TCACACCGAG AATCAAAATA CCTCCGCTTC CAGTAGCGCA GCTACCTCTT CTGGAACCGG TGGGCCTTCC TCACACAGCG AATCGAATAG CAACACTCGA AGTAGCCAAG CAGTAGCTAC AGTACAAACA ACGGACCGGA TCTCTACCAA GACCGACATT ACTATTTATG GAGCTACAAG TTTTGTTGCA AAGCACGTCA TTACGTATAT AATGCAAACA AGCATTCATG GTGCCAATAC TCTGAAAGTG ACGCTGGCCG GACGCAGTTC ATCCAAGGTC CAAGCCCTAA CAGACGAATT CTCACAAAAG ATGAAGAATC TCTTTATTGT GAGCGAGAAA CCACAAGGCA AATGTGTCTT TGACTTTTTT ATTGCCGAGA GTTCAAACCC TTCCGCTCTT GGCAAAATGG CATCCCGAAC CAAAGTCGTC CTCAATTGCG CTGGCCCGTT CACTCGACTT GGCTCTAACG TCGTTGCGGC CTGTGCTAAG ACTGGTGCGG ACTACGTCGA CATAACGGGC GAGATAGAAT GGGCATCCGA GATGCGGCAG CTGTACAGTG CCGACGCTGC CAAGTCTGGC TCGCGTATCA TTTCATTCTG CGGATTCGAT TCCATACCTT CTGACCTGGC TGTTTACACT GCGATCAAGG 
TCATGAAAGA AAAGTTGAAA CAAAACGCGA AACCTATCGA AACAGCGTCG ACATATCACT CCAACTTTGG GCTTGCCAAC GGCGGCACGT TGCAAACGGT CTCGGAAATG TCGCTAAACC TACGGCATTG CCTTTTCCGT TGGGTCCCCT ACCTTCTCAA CGATCCTTTG GCGCTGACCC ATCCCAGCGA TCGCAGTGCG CAGTCGCCGC AAGAAACGCG AAACAGAATG GCCAAGGCAG AATGGATCAA CCAGTTGCCG TTCTTCCATT CAATTTTTCG AATGGGTGTG TCGTCGCCAT TCTTTATGGC TCCAGTCAAC ACCAAAGTTG TCAACGCCTC CGCCGTTGCC AACAACTACG GGCCGAGCTT CACTTACTAC GAACGATATG TGCCGACGGG CTTTCGTTTC ACAGTTCAGC TAGGAACACT GTCGCTGCTT CCCGCAGTCG TCACTCAATT TTTAATATAT CTCGCGGCGT TTCTGATCAA ATTTCCCATT CTTGGGCCGT TGCTGATCCA ATGGTTTATG CCACCAGGCT CTGGTGCATC GGATCTTTTC TGCAAAACGG GATATGCTGA AGTCTATGCG GACGTTGTTA CTTCCCCCAA TGCCGCGGGA AAGGTTGACA AGGCAAACTG CTTCCTGGCC TTTGAGGGAG ATCCCGGGAA TTGGGTGACA GCTCAAACAG TAGCCGAGTC GGCACTCGCC TTGGCACTGA ACAAGAAGGA TCTTCCGCCT CGTAGCCAGG ACGGCTTTGG TACGCCAACC GAAATTCTCG GTGGTGTTCT TCTGAAACGT TTGACCGAGA CCAAGATTCG CCCTGTGGTT GTTGTGACCG ACGTGCGTGC GGCGACATCA AAAATAGAGT GGTCTATGTT TCCCCTACAT GTGACAATGA GCGCCCACTA G
|
Protein sequence | MSLFVATSRR LDGWTGPSAD PDAESCASSP ASASPLRTTP RQYLWSPRGV TRTPERDPTA TTESGHKDAT VHDDVERRWT PIQSTTARTL FPPTHCDVGT VWDTRAVETH APVVPDTVCR TKTSRFAVWH DTSEPVVPLA VRSTTSTLVV VQPSVSFAIE TCRNHSDTVV ARERCDPEWL RIDHYRRAND TTAAPKRTTT ATSTTSTTST IRTTPKTAAT NTTTMSPTVP SVSATSRHTT LGRALVGPRG VDPVTAPTPT RTTLPSLTHT KPLTPTPSNA RILRDASGNQ PGRPAVAPSE EPTHDFPTKM PSPPVASSIA KENAPTVSPS VRSDSHDVVT ITYTTTTESV PETLWSRVVE SDVSFLGDIT DTPDRDTSHV HANPCDPMDH AIVTDDEDSV TSTSVHAVGN TSTRMLDVST TEESHMLDLS IETGPQCAPS AEHDGHGLFG ADRTGDTDND SGKEEETIET ILDDSVDTGS VLPLPDANGM DESSTAGEDD PRPAPQKNAP LLETATVQTE QSNEYMSDRK SSTVVPPETE HPESAPVLSW AAVAKTTPSK PKTLNNRATS PSTIPVFPAP THNKVTTAPL DTSSSTWEHD GLLVLCSHQS LDRSVTVNQI KVFTILGGHN ISYETIDGAD PVHKQTRNHL FEVSGLKAQY PQFFLVRNGE LTFWGDFHQF VQADQNGELQ VALAAGVELA PVIRPIETAM VDTEDTYSIE AGNVPKLLVL CSNQTMSRRV TSHQEKAYTI LKAHRIPFET LDGADAWNKS RREELFQISG LRAQYPQFFL IDETSGTTTF WGDWERFMYA NEDDVLVNEL GLTPSYAPEQ LQNHVNTAND THASKTERME SFDSNAGNSE GTDANEKRFL VLYSSQSLDR QVTVNQEKAY SVLTNKGIPF ETLDGIDPDN VQRRNTLFRI SGLRGTYPQF FIVDGHTTEF WGDWNRFNKA RKAGKLSILS TPIEKEKIAK GIAQASTGQS AAGTGKAHTE NQNTSASSSA ATSSGTGGPS SHSESNSNTR SSQAVATVQT TDRISTKTDI TIYGATSFVA KHVITYIMQT SIHGANTLKV TLAGRSSSKV QALTDEFSQK MKNLFIVSEK PQGKCVFDFF IAESSNPSAL GKMASRTKVV LNCAGPFTRL GSNVVAACAK TGADYVDITG EIEWASEMRQ LYSADAAKSG SRIISFCGFD SIPSDLAVYT AIKVMKEKLK QNAKPIETAS TYHSNFGLAN GGTLQTVSEM SLNLRHCLFR WVPYLLNDPL ALTHPSDRSA QSPQETRNRM AKAEWINQLP FFHSIFRMGV SSPFFMAPVN TKVVNASAVA NNYGPSFTYY ERYVPTGFRF TVQLGTLSLL PAVVTQFLIY LAAFLIKFPI LGPLLIQWFM PPGSGASDLF CKTGYAEVYA DVVTSPNAAG KVDKANCFLA FEGDPGNWVT AQTVAESALA LALNKKDLPP RSQDGFGTPT EILGGVLLKR LTETKIRPVV VVTDVRAATS KIEWSMFPLH VTMSAH
|
| |
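The gene and protein sequences above can be compared by translating the nucleotide sequence. The sketch below is illustrative only: the Translation table field is empty in this record, so the standard genetic code (NCBI table 1) is an assumption, and `clean`, `gc_fraction`, and `translate` are hypothetical helper names. The two test strings are the first 30 and last 21 bases of the gene sequence as listed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption here
# because the record leaves the "Translation table" field blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGAGTTTGT TCGTGGCGAC GAGTCGTCGT"
last_21 = "GTGACAATGA GCGCCCACTA G"
print(translate(first_30))  # prints "MSLFVATSRR"
print(translate(last_21))   # prints "VTMSAH*"
```

Under this assumption, both ends of the translation match the listed protein sequence, which begins with MSLFVATSRR and ends with VTMSAH. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.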