Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_28794 |
Symbol | |
ID | 7202589 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 738122 |
End bp | 741288 |
Gene Length | 3167 bp |
Protein Length | 818 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181620 |
Protein GI | 219122580 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAGATCGAC CTGCGAATCG TCGTTGCCGT TTCTTCTTGG TGAACCCAGC AAAAAGCACA AGTCATCATG GCGCGTTGGT TTCGTTCCGA ACCGATGGAG TACATCTCCC TCATTGTGAA CGAGGATGCC GCTCACGACT GTTTAGCGGA CCTCGGAAAG CTTGGGGTGA TCCAGTTCAC GGACGTAAGT TGCACGTAGA TCGTTGTCCA GCCACAGGCG GGAGTTTGAA AACGCGAGTT GGCTTCCCGC ATTCTGTACT ATAAAGATTC TTGAATCGAA GTACTGCTTT GTCGGAAGAA AAAGTCGGCA GATTTACTAC GATCCTCTCG GGTCGTGGTC ATCGTCGAAC TTCCATCCGG CGCCGCGGCT TTTCGAACTC CGTTTCTCTG GTTGAAAACG TCCTTCCGAT GTCGTATGCG GAGTTTTGTC CCCGTCAATT TTGGAATCGT CCTTTCTGCT ACCTGTTTCA CAGTCAGCGC GCAAACATTA TCTGTAGGTT TGTTCTTTAG TTTCTCACCC TCATACTCTA CATTCCTATA GTTGAACCCT GACTTGACTC CATTCCAGCG TCGCTACGTT TCTTACGTTA AGCGATGTGA TGAGTTGGAA CGCAAGCTTC GTTATTTCTC CAACGAGATT GAAAAGTTCG AGATTGACCT TGTTTCGGCT GGAACAGTCG ACAACTTTGT CATGTCTCCC ACGCTTGTGT CCAGTATGGG TAATGGCTCA AAAAAGAGCG GTGCTCAATT GCTCGAGAGT CTCGAAGTTG AACTTGAACA ATACGAGTCG CAGCTCAGGG AACTCAATTC TTACTCCGAA AAGCTTACCA CCGAGTACAA TGAAAAGGTC GAGCTCCAGG AAGTCCTCGA GAAGGCCCGT CGCTTTTTTA TGACCGACGC TCCCCGCCTT GCCGTTTCGG AACTTACCAG CGGGCCCATG GACATGACTG GAAAGGAAGA TGGGCTCCTT GACTCGGACG CTGCCCCTCG TCCCGACTTG GACATGCGAT TCTCTTCGAT TACCGGGGTC GTATCCACAG AAGAGAAGGT CCGCTTTGAA CGCATGATCT TTCGTGCCAC TCGAGGAAAC TGCTACATTC GATTCGCTCC TATTCAGCAG CCCATTACCG ATCCGGAATC TGGAAACTTG GTCGAAAAGT CCGTCTTTAT TATCTTTTAC AAGTCTGAGT CCATTGAAGG CAAGCTCAAG CGCATTTGTG ACGCATTCTC TGCTCACCGA TACTCTCTCC CTGATATGGA CGATGCCGGA TCAGTTGACA AGATGCTGAC GGAGAACGCA CAGGAACTCG TCGACTCTCG CACTGTTTTG CTCAAGAACC AGGATACGCG CTTCCGTCTC TGTCAGCTGC TTGCGAAGCA CACGGAGCGC TGGACGTGGA TCGTCCTCCG CGAAAAGGCT GTTTATCACT CTCTGAATAT GTTCAAGGCT GATGTTCAGG GTATGCTTCG TGGTGAAGGT TGGGTCATTG CTGAGTCCAC CGACGCTGTC CGTCAAGCAG TTGAACGTGC TCACTCCAAT ATGGACATGG CCATGCCTTC CTTGGTGGAC TTGGTTCCCC AACCATGGCC TACTCCTCCC ACGCACTTTA TCACCAACAA GTTTACCTAC GGATACCAGG AATTCGTCAA CACGTACGGT ATTCCACGTT ACCGGGAAGC CAACCCTGCG CTTTTCACAG CCGCCACATT CCCCTTCCTG TTCGGTGTCA TGTACGGAGA CATTGGTCAT GGTCTCTTCT TATTCTGCGC TGGTTGCTAC TTACTTTGGA ATGAGAAGGC TAACGAGAAT GCAAAACTTG GTGAGCTAGG CGACGGTATG CACTCTGGTC GATACATGAT TGTCATGATG GGCTTCTTTG CCGTGTACGC TGGTTTCATG TACAACGACG CATTTTCCCT CGGTCTCAAC CTTTTTGGAA CTCGCTACAA GTTCGAGGGC CAGGATTCTG GTACCGTCGA AGAAGGTGAT GTTGCCTATC AAACGTTCAG TTATGGTTCC GGTGAATCCG TGTATCCGTT CGGACTCGAT CCCATTTGGC ACGTTACCTC CAACGAATTG CTCTTCTTCA ACTCGTTCAA GATGAAACTT TCCGTCATTT TTGGTATCAT CCAGATGTTT TGTGGTACTT GCCTCAAGGG AGCGAATGCC GTCTACTTTG GCGAAAGACT CGACTTTTTG TTTGAGTTCC TTCCCATGGT TGCGTTTGCG TCTTCGATGT TTGTTTACAT GGTTATCCTC ATTGTTCTGA AGTGGTGCAT CAACTGGAAT AGCCGGATGC TTTCCGCCAC TTGCGTTGAT CCTAATGGCG CTGGATGGGG AGCGTCCAAT TACGTTGGAA CATGGAAGCA GTGCGATGGA GCTGTTGATG GCTGGGACGG AACCTGTACA CCATGGGGAA TGTCCTGCAC CGGATACGAT GATACGGCGA CGAAATGTCC TCTCAACTAT GGTGGTTCTG GTGATGGTTG CCAGCCTCCC AATCTTATCA CAACTTTGAT CAATATCGCC CTCAACCCGG GTGTTGTTGA TGAACCTTTG TACGCTGGAC AGGGACCAAT CCAGAACATT TTACTTTTGA TCGCCTTCGT CTCGGTTCCT ATTTTACTTT TGGCCAAGCC TTACTATCTA TCCCAGAAGA CGCATTCCCC CGTTGTGCAC CACTCGGACG ATCTCGAGAA TGGGCATGAC GAGGATGACC ACGAGGATGA TGACCATGGT TTCGGAGAGA TTGTTATCCA CCAGGCCATT GAAACGATCG AGTTCGTTCT CGGTATGGTT TCGAATACGG CGTCGTACCT TCGTCTCTGG GCTCTTTCCT TGGCGCACTC CGAACTTGCT ACTGTCTTTT GGGAGAAGGC CATGCTTTCT ACCTTGAACA TGAACTGGTT CGCCGCCTTT TTTGGATTCG GTATCTTTGC CGGCGTGACA TTCGGAGTGT TGCTCATGAT GGATGTATTG GAATGTTTCT TGCACGCCCT TCGTCTTCAC TGGGTCGAAT TTCAGAACAA GTTTTTTGCC GCTGATGGCG TACGCTTTTC GCCGTACTCG TTTAAGCAGG TGATTAAGGA TACCAGTGCC TAGAGAGAAA CTAATTGGAT AAGTATAGGG ACGAATTAAG CATATAACCA ATGTACTCGT CAAAAGC
|
Protein sequence | MARWFRSEPM EYISLIVNED AAHDCLADLG KLGVIQFTDL NPDLTPFQRR YVSYVKRCDE LERKLRYFSN EIEKFEIDLV SAGTVDNFVM SPTLVSSMGN GSKKSGAQLL ESLEVELEQY ESQLRELNSY SEKLTTEYNE KVELQEVLEK AHGLLDSDAA PRPDLDMRFS SITGVVSTEE KVRFERMIFR ATRGNCYIRF APIQQPITDP ESGNLVEKSV FIIFYKSESI EGKLKRICDA FSAHRYSLPD MDDAGSVDKM LTENAQELVD SRTVLLKNQD TRFRLCQLLA KHTERWTWIV LREKAVYHSL NMFKADVQGM LRGEGWVIAE STDAVRQAVE RAHSNMDMAM PSLVDLVPQP WPTPPTHFIT NKFTYGYQEF VNTYGIPRYR EANPALFTAA TFPFLFGVMY GDIGHGLFLF CAGCYLLWNE KANENAKLGE LGDGMHSGRY MIVMMGFFAV YAGFMYNDAF SLGLNLFGTR YKFEGQDSGT VEEGDVAYQT FSYGSGESVY PFGLDPIWHV TSNELLFFNS FKMKLSVIFG IIQMFCGTCL KGANAVYFGE RLDFLFEFLP MVAFASSMFV YMVILIVLKW CINWNSRMLS ATCVDPNGAG WGASNYPPNL ITTLINIALN PGVVDEPLYA GQGPIQNILL LIAFVSVPIL LLAKPYYLSQ KTHSPVVHHS DDLENGHDED DHEDDDHGFG EIVIHQAIET IEFVLGMVSN TASYLRLWAL SLAHSELATV FWEKAMLSTL NMNWFAAFFG FGIFAGVTFG VLLMMDVLEC FLHALRLHWV EFQNKFFAAD GVRFSPYSFK QVIKDTSA
|
| |