Gene Information |
Locus tag | PHATRDRAFT_49932 |
Symbol | |
ID | 7198629 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 333529 |
End bp | 337027 |
Gene Length | 3499 bp |
Protein Length | 1102 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184783 |
Protein GI | 219129200 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGCTG CGAATTCCGC CAACGAGGCC GTGGTGGAAG TCGACGACGA TGCCGTTCGC TTCGAGGATT TGACCGTCGA CGACGATCTG CCGATTCTAG AACGGGTCGT CCGCTACAGT CGGTCGCAGA TTGCCCTACA GCGCCTAGTG CACGTTAAGA TGCTGGCGGA AACAGCCGAA ATTGTTGGGT ACGTGTGCAA CGAGCATTGT AGAAAGAACG TCAGGAATGG AAAATGCTCT AGAAAAGGGG AGAGAGGGTC TACGTTTGTA CTGTTGAAAG GGACGTGTCT TTCTGTAGAT ATGTATTGTT GCTTGTGTTA CTCACGACGC TCTTCTCTCC CCATTGATCA AACCCTCTAT CCTTTGAACC CGTTACAGGC AACGTTCGAC ACAAGAAGTG TTAATCCCCT TGCTGCGATC GCTGGTCAGC GATCCGGAAA GTATCATTCG GCAACACGTT TCTACACAGC TCGTCCCCGT CTGCATCGTT TGCATGGTCA AGAACGTCAG CAACGTCGCC GAACTCGTAC AAAACCCCGT CTTCTCTAAA GATTACGACG AAAAGGGTTA CACTCTCGTT ACCACAACCG TGCTCGGACA TATAAACACG CTCCTGGAGG ATTTTGATCT GGATGTCCGT AGAGCGGCCG CCGACGCTCT TTCGGGACTC GCTTTGCAAA TTCGACCCGC TGATGTTCCC CAGGCCTTGC TCCAAATCCC TCTCGCGCTC GCCGCAAAGT TACCGAAAAA TCCACACGCC AAGAAAAAGA CCGAAGCGGA TCAGCACGTT GAGGAATTGC GCATCACGGC GGGTAATCTG TTGGCCGAAC TTGGTGGTGC CGCCTCGGAA CACTCCACTA CCTTACTCGC ATCGTCCACG TACGTTTCTG GATTGATTCT CCCAGCCGTC CTGAAATTGT GTGACGACGT TTCCTTCCGT GTCCGACGAA GTGCGGCCCA AGCCTTGCCA CGCATTCTCG GAGCATGCTC GTTGAACGAT GTGGAGGAGA CTATTCTACC GGCGTTCGAT CAATTGAGTC GAGACGACTT GTATCGCGTT CGGAAATCCA CAGGCGAGTG TCTGGTGGAT ATGAGTAGAT CCATGATGTT ACTGGCGGCG AGTAACAAAA AGGCTGAAAG AACGCTTTAC AAATTAAGAC GCGAGACACT AATTCCTATT GCCGATCGCT TGATTCAAGA TTCGCATAAA ATGGTCCGTC AAGGTATGAT GCAGTTTCTG GGACCCTTCA TGGCTAGCTT TTATCCATAC CAGACGTCTG CTCTGCGCGA TTTATTGCCC GGCACTGTCG AATCGGACGG CAGCAATCAC ATGGGGATCG TGGCCCAATT CTTTCCGCAC GCCACATCCA TGGTGTCCCG TCTCAACTCG GCACAAAACA TTAGCATGTC GGCACCTACC CCGGTGAACG TACATTTGGA CGAAATTTTA CACCGTGTTC TATCCGAGAT GGATGTTTTG CACCAGGCAC TGCCGGCATT TTTGCAAGCC TCCCGCATGT CGGCGCTGTC ACTAGCCGCC GTCGCGACCC ACCGGAATCG TAACTTACCG GACTCGGAAG ACGTTGACGT ACTGATAGAC AAACTATTGG ATTACTTTGC CGCACTCGCA ATTGTATCGA CGGGCGACGA AAACACCGAC GCGGAAATGC GTGTATACTG TGCGTATAGC TTTCCCGCTC TTATATTGCT ACTGGGAGCG GACAATTGGG AGGGGGCGAT GCGTACATGC TTTTTTACAC TCATGAACCC GAACTATGCC AAAACTCAAC AACCGGAAGA 
AAAGCAGTCC GACCCGGCGG AAAATCTGAA CGTCGCCGAG CCACCCCTGC CAGTGAAACG CTGTTTGGCG AGTAGTTTAC ACACGGTTGC CAACATCCTG GGACCCGAGC TGGCCGCCTC CGATATTGTA CCTGTTCTGC AGGACTTCTT TTTGAAAGAT CCAGATGAAT CCGTACGACT GAACGTCATT CGCAACTTTC CAGCATTGTT GCAAGTGCTC TCCCCTTCTG ATCGCAAGGG CCCCTTTCTC ATGTGGAGCG AAATTGTCCG TGGTGAAGAG TTGCTGGGTA TCAAGAAGCG CAGTGCACAC AATCCGGTTG TGCTTAACTG GCGACAGCGT GATTATTTGG CGCGATCTCT GCCGGATCTG ATTGGTTTGG TGGAGCCGTC TCTGGTGCAC GAACATATCT GGCCCATTAT GAAAACGCTC TTAACCGATG CGGTCTCTAT AGTCCGAGAT GACGCCATTT GGTCGATTGC CATGATCCTT AAAGCTTATT GCTTGGAGTC ATTGCAGGCA TGGCCGAATG TGTCCAATTT TCGAGCTTTC GGTGCTCAAT CGTGTGCGGA AGTCATTGAT TGGTTGAAAG AAAGTATTCT CAAGCTGGGA GTACCTAGAG AAGCTCGTAT AAAACCCGTC AACTTTTCGG AGCGCCAGCT ATACTGTCGA ATTTGCGCCA CGCTAGGATT GGCCCTGCGC TTTAGTGAAC AGATTGAAGG GGACAAACAA GACCCAGTCT CGGTGTTGAG TGGCAAGTTT AAGACGTTTT TCTTCCCAAA GTCGAAAAAT CTGCAAGACG ACCTCCCAGG GCCGTATCAA GCCATGACCA AGTCGGAACA GAAACACCTT CGACGCCTTC TGCTGAACGA GCTATTGCCT CCAGCATTGG AAATGAAAGA AGATCGGATA TCTAACGTCC GCGTGTCTTT AATGAAAACT CTGCAACTCA TGCCAGCCGA AATTCGTGCA ACGCCTCTTG TCCAACCAGT TCTTCAGGGT TTGGTCGAGG AAGTGGAGAC ATGGGAAAAT TTTGCAATTT CTGATCAGCC AGTGCCCAAC CCGCTAAAGC AATCCGCATC ACAGGCTGCG CTGTATCAGC CCGCGCAACG CTCACTTGCC AGTGGTGCTG TGTCGCCACG AGCGCAACAG AGTGTCCCGG TGGATTTAGA TGCGGAGATG CCCGACGACC GACGGTCGTC TTCGGACGAC TCGAGTGCAT CAGCAGAGGA CGTTGCCGAA TCGTCAGATT GGAAGACGGT AGTGTTTCAA GCTGGACCGA TCGGCATGCA GCTGGAACCC ACTGCCGATG ATCGTGCTTG TCGGGTTTAT GGTTTTTTGG ATTCGGGTGA TGGAAAGCCG TCACCGGCGC GGCATTCGGG CAAGATTGAG TTGGGCGACG TCATCGTCAA GGTGAACGGT AAAGATGTAC ATTCCTACGA CGACACGATT GCGGTTCTCA AAGCAGGTGG CCGTCGGGAA ATCACTTTTC GACAGGGAAC AGCCGACGAC GATTACGACG ACGACGAAGA AGAAGAGAGT GTAGGAGGAT TTTCGTCTAC CGACGACGAA ACCGATCGAA AAGAACGGGA ACGCAAGGCA AAGAAGGAAG CCAAGAAGGC AAAAAAGGCC AAGAAGGAAA CAAAGAAGAA AAGCCGTGAC AAGAAAAAGG ACAAACGGAA AAAGGAGGAA ACAGGATGA
|
Protein sequence | MEAANSANEA VVEVDDDAVR FEDLTVDDDL PILERVVRYS RSQIALQRLV HVKMLAETAE IVGQRSTQEV LIPLLRSLVS DPESIIRQHV STQLVPVCIV CMVKNVSNVA ELVQNPVFSK DYDEKGYTLV TTTVLGHINT LLEDFDLDVR RAAADALSGL ALQIRPADVP QALLQIPLAL AAKLPKNPHA KKKTEADQHV EELRITAGNL LAELGGAASE HSTTLLASST YVSGLILPAV LKLCDDVSFR VRRSAAQALP RILGACSLND VEETILPAFD QLSRDDLYRV RKSTGECLVD MSRSMMLLAA SNKKAERTLY KLRRETLIPI ADRLIQDSHK MVRQGMMQFL GPFMASFYPY QTSALRDLLP GTVESDGSNH MGIVAQFFPH ATSMVSRLNS AQNISMSAPT PVNVHLDEIL HRVLSEMDVL HQALPAFLQA SRMSALSLAA VATHRNRNLP DSEDVDVLID KLLDYFAALA IVSTGDENTD AEMRVYCAYS FPALILLLGA DNWEGAMRTC FFTLMNPNYA KTQQPEEKQS DPAENLNVAE PPLPVKRCLA SSLHTVANIL GPELAASDIV PVLQDFFLKD PDESVRLNVI RNFPALLQVL SPSDRKGPFL MWSEIVRGEE LLGIKKRSAH NPVVLNWRQR DYLARSLPDL IGLVEPSLVH EHIWPIMKTL LTDAVSIVRD DAIWSIAMIL KAYCLESLQA WPNVSNFRAF GAQSCAEVID WLKESILKLG VPREARIKPV NFSERQLYCR ICATLGLALR FSEQIEGDKQ DPVSVLSGKF KTFFFPKSKN LQDDLPGPYQ AMTKSEQKHL RRLLLNELLP PALEMKEDRI SNVRVSLMKT LQLMPAEIRA TPLVQPVLQG LVEEVETWEN FAISDQPVPN PLKQSASQAA LYQPAQRSLA SGAVSPRAQQ SVPVDLDAEM PDDRRSSSDD SSASAEDVAE SSDWKTVVFQ AGPIGMQLEP TADDRACRVY GFLDSGDGKP SPARHSGKIE LGDVIVKVNG KDVHSYDDTI AVLKAGGRRE ITFRQGTADD DYDDDEEEES VGGFSSTDDE TDRKERERKA KKEAKKAKKA KKETKKKSRD KKKDKRKKEE TG
|