Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43299 |
Symbol | |
ID | 7197073 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 53807 |
End bp | 58770 |
Gene Length | 4964 bp |
Protein Length | 1416 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177546 |
Protein GI | 219111589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.545075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTCACAA AAACGTTGAA GCATTTGACC AACGCGACTT GTAAATACAG TCCACAATAC TTCTGCGGAG AGGCTTGCGA CTGCCGGGAC AGAGAACGCT ATCACGGTAG TCCTCCCCAC GAGTTTCGCA TTGACAGGAT GACGCAGACG CAGGGGCATT CTCCAAATGA AAATCAGAGA GAAGCAGAAT CCATGGGGCG ACACGGAGAC CAGTCATTTC AAGCCACAAA AAGTACCGAT GGGCAACAGA AAACACATCC TGACGAGGTC AACAGCCAAA CCATACATTC TTCAAATGTC GCCGACAAAA GCAAAAGTCC ATTTTTGCCT CCCATACACA CAGCCAAACC GTGGCCAGAA GACCGGCTAA CAGGAGATCA CGATCTACAG CGCTGCCTTG GAAGCTTCCC CTCTTCCCTA TGGAAGACGT TCCAGCGGTG GACTTACGCC TACATGCAAC CAATCTTGCT CAAAGGCCAA AGACAGTTTC GAGAGAAAGA TCACCTCACA GTGGAAGATG TGTATGTAAT TCCAGCCGAT ATGCAAGCAA GCGTTTTAGT CGAGCAATTT TGGTGAGTAG AGTTTTGTCA AACAAAGGCT CTCCGACCGC ATGTTCCCCT TTCTTACTAA TATGAGACTC ACCGCTTCCA TATACAGGGA CGACTATCAC AGAACTAAAA AGCTTGTACC GGTTCTGTTC CGATTGATCC AGCCAATCTT TGTCCCTGCC GGATTTTGGG AGCTCTTGGT CGTCTTGGCC AAAGTGTCTT TACCTTTGTC TCTCCGCCAA ATCCTGCTGG TGCTCGAAGC CAACCCGAGT GCCTCGGTCA TTTCCGAAGG CCTGCCTTTT GCTATCACCC TTTCCTTAGC TGGCGTGGTG TTGGCTTTGT CACAGAACCG TGTCGTCTTT TTGTGTACAG CCAGCGGAAT ACGAATTCGT GCGGCTCTTA CAACGGCTCT TTACGAGCAT GCTTTACGAC TTACTGCATC CGGCAAGATC GGGCTGACGA CTGGGCAAGT CACAAACTTG GTTGCGGTCG ATACGCAGAA GCTTTTTGAT GTATGCGTGG AAGGACACAA TTTGTGGAGT TGTCCGTTAC TGATTATTAT AGTGCTTGCG CTATTGGCGA CCCTGATTGG CACCGAATTG ATGATAGGAG TGCTGGTTTT GATTCTCTTC ATTCCGATTG TCCGATGGAT TGTCCAGCAG ATGTTGAAAA TTCGGAAAGC GCGATCGGCC CTCACTGATG TACGAATCAA TACACTGACA GCCATGTTGC ATGGCATTCA CGTCACCAAA CTCAACCACT ACGAGCGAAA CATTGAATCT CAGGTGGAAA CAATCCGTAG GCAAGAAATG GTGTTGCTCC GAAAAGAGCT GTGGATGTGG GGCTGGGTAC TGACGACGGC AGTGTGTTCT CCGCTAGTGG CGGTCATGGT TGCCTTTTCC TTCTATACCT TACTGGACGA AGGCAATCTG CTGACGCCGT CTACCGCATT TTCTACCCTC CTATTGTTTT CCATTTTGCG TTCTCCTATC AACATGGCGG CTCGTCTTGT CGGAAATATG TCACAAGCGG TAGAAAACGT TTCTCGGATA ACTCTTTTTT TAGAGCGCGA GGCACACCTT GCTGTAGATG AGGAAGCCTC GAATGAACCG GTCAAAACCT TGGTCAAGAC TAGCAAACGT GCTTTCCCAA AAAGGCAGCT TGTTCGTTTT TCTGCCTCAA GAGACAATTT GGTGTCTATA GAGGCGGGAT GCTTTTCGAT CAAGCCGCAA AACAGTATTA TTCAAGGTTC CTTTCTCGCT TCGTTTCGCG GTCTCACACC AGAAAGAAGT CTCGGAAAGG GTGACTTTAC AGCAGGGGTG AAAGGATTTT CCGTGAGCAA CTTGTCGTTC GAAGTCAAAC GGTCTGAGGT GATTGCAGTA GTTGGGAAGG TGGGATCGGG GAAGTCGCTT TTGCTACGAG CGTTGCTTGG TGAAGTTCCA ACGTTCTTTG GAGACAGAAT CTTCGTTTCG GGGCGCTCAT CGTATGCAGC ACAGCAAGCT TTTATACTGA ATGCTAGCTT GCGAGAAAAC ATCCTCTTTG GAAAAGACTA CAATGAACAG CTATACAAAA GAGTCCTTAA GGCTTGCTGC TTAACGGCTG ACATTCAATG GCTTGGCCCC GCGGGGGATT TGACGCAAAT TGGCGAGCGT GGTGTGACCT TGTCGGGTGG GCAAAAGCAG CGCGTTGCTT TAGCTCGAGC AGTGTACACC GATCCTGATC TCGCTTTCTT GGATGATTGC TTTTCGGCGT TAGATCCTAG CACAGCAAAT GCTGTTTACG AGGGCCTCTT TGGATTGACG CAGGGAGAAG GCCGCAATGG TATTCTTCGA TCAGCTGGAA CTATTCTTGT AACTCATTCG ATTCAATTTC TTTCCCGAGT GGACAAGATC CTTGTCCTGA GCGATGGAGC TCCTTCATTC TTCGGGACAT GGGCAGAATT GCAGTTATTT CAGGGCTCCA AGGGCAATTT GATTGAGTCT ATTCAACAGA ACTGTCAAGA AGTAGAGAAG AAGAGTAGGA GTGGGGAGCT AGAAGAGGGT CAAACAGCCA ATGAAGGTGG ATTGATAATG ACTGTTGAAG AGCGCAAGTA TGGAGGAGCT AGTTTTTCTG TCTGGACCCG ATGGTTTAGT AGTGCAGGAG GATGGTCATT CTTCTTGTCG CAAATGATTT TGTCAATAGT TGAGAATGGG CTTTTTGTCT CATCAGATTG GTAAGGACGC TGACTCACTA ATGAGTAAAG ATTAATGCAC CAAATGCCTA ATCGGTATAC TTTGTCGATT ATTAACAGGT GGTGTGCTAA ATGGTCTGAT TCAACTTTCA CCGGAACAGA TCTTTTCGGA ATGTCCTTTC CGCCGCAGAC GGAAGGGAGG TCTGTTCAAG TGCAGTATGC CGTTGTGCAT CTTTTAATTG TCGTTCTTTC CGTCTTTGCC ACCTCGATAC AGTTGCAGTT TGCAGGTATG TCCAAGCTTT GTTTTCCGCA ACAAGTTGCT CATGTTACTT ATAAGGTGTG TGGACCTCAC TATCAACTTT GCCAGTTGCT GGAGGCGCTA AATGTGCGGA GCGAATGTTT TTGGACATGA CCACTCGGGT TCTGCGCGCT CCCTCGTCGT ATTTTGAAAC CACACCGTTG GGTAGAGTGC TTAACCGTTT CACATACGAT GTTGAGGTTT TGGATGTTGA GCTATCCATT TCCATGGCTG GCTTGATGAT ATCATCAAGT CTGCTGATCT CTTCCATTGT TGTTATGGTA CGTCCGAGCA TTGCGTATCG AGTAAACACT TTTGTAGACA AAAATCTCAC GCTACGGTTT GACATCGACT GCCATAAAAA AGCTTGCTAT TTTGCCGTGG ATTGCCCTCT ATATTGTGCC TGTCGGAGTC GCATACACTT GCATTCAGCT TTATTATCGC AGGAGTGGCC CAGACCTGCA AAGGATTGAC GCAACATCTC GGAGCCCGAT TCAAGCAAAG CTCGCAGAAG GTATGTTTTT GAAGCGGCTC GCTTCCACCT TGGAAATAAA GAAACGTTGA ACAAGTGTTT GACTCAGCTT GCTCTTGGAA TACATAGGTA TGGATGGTGC GACGACTATT CGAGCTTTTC ACCAAGAGAA ACCGTTCATT GTTGGTTTCC AAAGGAACGT CGACTTCAAC AGCTCTGCCA TGCTAAACTT TGCAGCTGCC CAGCGCTGGT TGGCCTTCAG GATGGAAATT CTTGGTGCCA CTGTTGGTTT TGTCTTCAGC ACAATTGTTA TTTGCACAAA TGATCGCTTG AAAATCGATT CCGGAATGGT TGGTTTGGCC CTGCAATGGG CGACCATATT CTCAGCAGCA CTCAATTTTT TCTTTTTGAG ATTAACGGAG GCCGAGGCGA AAATTACTTC AATTGAACGT GTTCACCAAA CAACACTTCT TCCACAGGAA GCATCCTGGG AAACAGATCC ATTGATGAAT CTTGACAAAA ATTGGCCCAA GACTGGGATT TTGCAGTTTG ATAGTGTGTG TATGCGCTAT CGCTCTGACC TGCCCCTGGC CCTAAAAAAT GTTTCCTTTC AGTTAGCGCA CGGAATGCGA TGCGGCATTG TAGGCCGGAC AGGCTCTGGC AAGACGTCGC TAACGGCAAG TCTCTTTCGA TTGGTCGAAA TCGAGGCAGG TCAAATTGTT CTTGACGGAA TAGACCTTTC GAAGGTGGGC CTGGCGGATG TTCGCGGTCG TCGAAACGGA ATGCAGATCA TTCCCCAGGA TCCGGTTCTG TTTGCGGGAA TTTTACGGGA GTGTCTTGAT CCTTTCTTTC TCGAAAGCGA CGAAAAGGTG CTGCAGGCTC TTCAGGCAGT AAACCACAAG GGAGTCAATG AACGTGGCAA AGCTGTTCTG AATGATCCGG TGGACGAGGG GGGAAGCAAC TACAGTGTTG GTGAGCGGCA GCTGCTGTGC CTGGCTCGCG CTATTGTGCA AGAGCCTCGT GTACTGGTCC TGGACGAAGC GACTGCCAGC GTCGATGCGG CTACCGATGC CTTCATTCAA GACATGCTCC GCACTCGCTT CAAAAATACG ACGTTGTTGA CAATAGCTCA CCGTCTGAAC ACCATTATGG ATTACGACAT GGTAATTGTA CTGGACGACG GTCACTGCGT GGAGACGGGC TCACCGCTTT CCCTCTTGGC TGACCCTGAC GGTTGGTTTA CGGCATTGGT GGATGCCAGC GGACCGAACA TTGCGGCGGA GCTTCGACGG ATTGCGGCAG AAAAGGAGCC ATAGGACTGC CGTTGTCACC GTTGGAGGGT AGGAAGAAAT GCCCTGATTG GCGCACACTT TCCAGGTTGT ACCGATGGTG TGTTTGTTCC CATCTGTTCA CGGGTAACAA CAATAAATAT AGGCGACTGG TGATTAACTC GCTATAGAAG TCATTATCTT ATTC
|
Protein sequence | MTQTQGHSPN ENQREAESMG RHGDQSFQAT KSTDGQQKTH PDEVNSQTIH SSNVADKSKS PFLPPIHTAK PWPEDRLTGD HDLQRCLGSF PSSLWKTFQR WTYAYMQPIL LKGQRQFREK DHLTVEDVYV IPADMQASVL VEQFWDDYHR TKKLVPVLFR LIQPIFVPAG FWELLVVLAK VSLPLSLRQI LLVLEANPSA SVISEGLPFA ITLSLAGVVL ALSQNRVVFL CTASGIRIRA ALTTALYEHA LRLTASGKIG LTTGQVTNLV AVDTQKLFDV CVEGHNLWSC PLLIIIVLAL LATLIGTELM IGVLVLILFI PIVRWIVQQM LKIRKARSAL TDVRINTLTA MLHGIHVTKL NHYERNIESQ VETIRRQEMV LLRKELWMWG WVLTTAVCSP LVAVMVAFSF YTLLDEGNLL TPSTAFSTLL LFSILRSPIN MAARLVGNMS QAVENVSRIT LFLEREAHLA VDEEASNEPV KTLVKTSKRA FPKRQLVRFS ASRDNLVSIE AGCFSIKPQN SIIQGSFLAS FRGLTPERSL GKGDFTAGVK GFSVSNLSFE VKRSEVIAVV GKVGSGKSLL LRALLGEVPT FFGDRIFVSG RSSYAAQQAF ILNASLRENI LFGKDYNEQL YKRVLKACCL TADIQWLGPA GDLTQIGERG VTLSGGQKQR VALARAVYTD PDLAFLDDCF SALDPSTANA VYEGLFGLTQ GEGRNGILRS AGTILVTHSI QFLSRVDKIL VLSDGAPSFF GTWAELQLFQ GSKGNLIESI QQNCQEVEKK SRSGELEEGQ TANEGGLIMT VEERKYGGAS FSVWTRWFSS AGGWSFFLSQ MILSIVENGL FVSSDWWCAK WSDSTFTGTD LFGMSFPPQT EGRSVQVQYA VVHLLIVVLS VFATSIQLQF AVAGGAKCAE RMFLDMTTRV LRAPSSYFET TPLGRVLNRF TYDVEVLDVE LSISMAGLMI SSSLLISSIV VMLAILPWIA LYIVPVGVAY TCIQLYYRRS GPDLQRIDAT SRSPIQAKLA EGMDGATTIR AFHQEKPFIV GFQRNVDFNS SAMLNFAAAQ RWLAFRMEIL GATVGFVFST IVICTNDRLK IDSGMVGLAL QWATIFSAAL NFFFLRLTEA EAKITSIERV HQTTLLPQEA SWETDPLMNL DKNWPKTGIL QFDSVCMRYR SDLPLALKNV SFQLAHGMRC GIVGRTGSGK TSLTASLFRL VEIEAGQIVL DGIDLSKVGL ADVRGRRNGM QIIPQDPVLF AGILRECLDP FFLESDEKVL QALQAVNHKG VNERGKAVLN DPVDEGGSNY SVGERQLLCL ARAIVQEPRV LVLDEATASV DAATDAFIQD MLRTRFKNTT LLTIAHRLNT IMDYDMVIVL DDGHCVETGS PLSLLADPDG WFTALVDASG PNIAAELRRI AAEKEP
|
| |