Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43891 |
Symbol | |
ID | 7204346 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 345095 |
End bp | 347823 |
Gene Length | 2729 bp |
Protein Length | 760 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186046 |
Protein GI | 219112925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.237524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCTT CTGAAGTGCA AAACATTGCC CCTTTCTCCG TTCCATCCTC GGACACTTCT TCTGTACCCT CTAGCATACC GTCGTCGGAG CCGTCGAGTG TCGCGTCAAG AATCCCCACC ATCTGACTGG AAATAAAACA GACATCCCAT CATGAGGTTT CTGGGCTACG CTAGCTTTTC AGTTGAGGAG GTTTGCGACA ACGGCAGATC TGTTGTCCGA TCTGGAAGAG GGAGATGTGT CGTCTTATAG ACCTAATCTT CTGTTTTACT GGAACAACAG TGTCAGGGAA CGATCCAGAA CGATACCAAC GTTGGAAGCA AGAATCTCTA CGGCATTCCA CGCCAATCAC GGAGAAATGG GGGTGGATCG GAGCAGAGTC TCTGTTTCTG TTGAGAGAAG CCAGTGGTAG TAGATTGAAT GGTCGTTCTC TATTTACACT TACAGTTCAT TCGACACAAA AAAAGACGCG AAGCGTGGTA AGGTCCACGA CACCAGCTGG TCGCGACTCG CCGCGCGGTT CGCGCGACGA TCCTCCCCTA TCGTGTATCG TATGGGTGTT TGGAGTGTGG TGGATTGGAC AGCTGCTCCT TTTTCCCACG GCAACGAGAC GACGAGTATT GACCAGCGAC AACCCACCCC GCCGCGAGCC GAACTGGGAT CGTACGGGGA ATTCCCCTCG ATCGACCTTG TCCTCCGTCA CGATCTATAG CAGCTGTCCA GCTACTATCT ATTACAACAG TGAGTGAGCT TGTGTCACTC ACTCACTACT TAGCCTCTTA CAATCATGGA AGACGCCCGG GCTAGATTCT TTTCGTCGAT GGAAGGGGAA ATGGAAGTCC ACGAGTCGGA ACGCACCAGC GACGGTTTGG ACATTGACTT TTCCATCACC GAATCTGGTC GTACCGTAGC TCATATGGTG CAGCAGGACC ACAAAACGCA TCACGACGAT ATGGACTTTT CTATTACGAG CTCGGGACGT ACGCTGGATC ACTTCGATGC CATCCAAGGA GAGAACAACA CTGCGCACAC GAACGATTTC AAAAACTTTC ATCTTCCCGA AGGCTTCTCC ACACAACACG GTCGTCACTA TCCAGCAAGT ACTGCATCGA ACCCTACACC ACCCGCACGC TCCATTCCTA TTCCTCAACC ACACCAAACG TACCCACCCG CGTCACTTTT GCAACAGATT CCGTCGTCGC AAGCCAGTTC GCAAGGAAGT TCCGGTTCGG CGTTTTTGAG TTCCATGCTC AATCAAGGCC CCTTGTCGCA TACACCGCCG GTGGCGTCTA GTTACGAAGT CTCCCATTTC GGCAAGCGCG CTCGGTCTGG ATCCGTTTCG GGACGCTTGC GATCCGCGTC CGATTATTTG GAAGAAAAGG GACTGTTGGA TCGACAAACC AAGGGAATTC TGAAGGATTT GATTATTATC GGAGACGAAG AACTCCAAGT CGCGTTGGAC CGGTACGAAG CAGGAGACCC GTCCACACTG GAGCGAATGA TATCGTCCGG TGCACTGGAA GAGCGATTAC CGAAAGACTT GGACATCTTG GGTGATCTGG ATCTGGACTT TTTGACCGTC CATGACGATG GGCTGGATTT GACCGGGGGC GACTCAATCG AACCCCTGTT GCCCAGCGCC GAATCCTATC ATCAACATGC GTCGCAATCC CAGTCCCTAT CAGCGAACGG GGGAAGTTAC CGACATCGTC AAGCTGTTGG GCCTGGCCAA GCACCGAATC TTGTATCACC AGCGTACGAT GATGGTATTG GAGATTTGGA TTTTAGCGGG GAGTTCGTGG AACAGGCTGA ATTTGGCTTC CAACCACAAT CCTACGGGTC GTCCAAGCAA GCGTCCGTTG CGGCGTCACC GACGGACCAT CCCAATAACA TGATGTCGGA ATACGAACGC CGTATGCGAT CTAATTCCCT ATTTTCGGCT TTATTGGACC CACGCGGCGG TGGCAACAGC AGCGTCAACA CTGCGACTGC GGCGGTCGCT TCGAAACACG GCAGTAGCGA AACAGACCGT ACCGACGGTG GCCTCGATTA CGGCCAGTGG ATGGACCGAA CATTGGCGAA CAACGCGAAA GCCACCGGCA TTCAAATTGG GCATCGAAGG TCTTCGGCTC CCGCTTCTTC GGTGCGCAGC GGCATTTCGG CCAGCTTGGA GCAGGCGGAT CGAAAGAAGC AAGATAAAAA GGACCGCAAA GAGCAAAAGG CTCTGGAAAA ACTCGAAAAG AAGCAACGAA AAGAAGAAGA AAAGGAGAAG AAGAAGCAAC AGCAACAACA AGCCGTCGAA GAGGAACAGA TGGAAGAGGA GCACGTTCCC GGTTCGGGTC GTCCCCGGGC ACTCAGCGAT CCTAACCTCC ATACGTCGCT GGATCAGCAT GGCTTGATGA ATGTAACGCG CCCAGATGGC TGGGTTGGTG CATACTCGCC GGAAAGCCGA AAAGTACGCG TCGACCGATT TATGGAAAAA CGAAATCACC GGGTTTGGAC CAAAACTGTA AAATATGACG TTCGAAAAAA CTTTGCCGAC AGCCGACTTC GTGTTAAAGG TCGCTTTGTC AAGAAGGAAG ATGAGCTGCT CATGAGGGAA TTAATGAGCA TGACCTAGGT ACATTTAGCA ACATTTTATA AACAGGATAG GCAGTGTTTG GGGGAGCGGC AAGTACCAGA TGTGATATCA TTAAACGATC AGACGCTTTT GCTCCTTTGG AGCGCCTAA
|
Protein sequence | MVPSEVQNIA PFSVPSSDTS SVPSSIPSSE PSMSGNDPER YQRWKQESLR HSTPITEKWG WIGADSFDTK KDAKRGKVHD TSWSRLAARF ARRSSPIVYR MGVWSVVDWT AAPFSHGNET TSIDQRQPTP PRAELGSYGE FPSIDLPLTI MEDARARFFS SMEGEMEVHE SERTSDGLDI DFSITESGRT VAHMVQQDHK THHDDMDFSI TSSGRTLDHF DAIQGENNTA HTNDFKNFHL PEGFSTQHGR HYPASTASNP TPPARSIPIP QPHQTYPPAS LLQQIPSSQA SSQGSSGSAF LSSMLNQGPL SHTPPVASSY EVSHFGKRAR SGSVSGRLRS ASDYLEEKGL LDRQTKGILK DLIIIGDEEL QVALDRYEAG DPSTLERMIS SGALEERLPK DLDILGDLDL DFLTVHDDGL DLTGGDSIEP LLPSAESYHQ HASQSQSLSA NGGSYRHRQA VGPGQAPNLV SPAYDDGIGD LDFSGEFVEQ AEFGFQPQSY GSSKQASVAA SPTDHPNNMM SEYERRMRSN SLFSALLDPR GGGNSSVNTA TAAVASKHGS SETDRTDGGL DYGQWMDRTL ANNAKATGIQ IGHRRSSAPA SSVRSGISAS LEQADRKKQD KKDRKEQKAL EKLEKKQRKE EEKEKKKQQQ QQAVEEEQME EEHVPGSGRP RALSDPNLHT SLDQHGLMNV TRPDGWVGAY SPESRKVRVD RFMEKRNHRV WTKTVKYDVR KNFADSRLRV KGRFVKKEDE LLMRELMSMT
|
| |