Gene Information |
Locus tag | PHATRDRAFT_45663 |
Symbol | |
ID | 7200447 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 855302 |
End bp | 856660 |
Gene Length | 1359 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179929 |
Protein GI | 219118303 |
COG category | |
COG ID | |
TIGRFAM ID | |
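The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the stated gene length) and that the coding sequence ends in a single stop codon. The function names `gene_length`, `coding_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein: three per residue plus one stop codon (assumed)."""
    return 3 * protein_aa + 3


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces and any other characters (such as the 10-base grouping used in
    the record) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Values copied from the record above.
    print(gene_length(855302, 856660))  # 1359
    print(coding_length(356))           # 1071
```

Under these assumptions the coordinates reproduce the listed gene length of 1359 bp, while a 356 aa protein needs only 1071 coding bases. The shorter coding length is what one would expect for a record marked as spliced, although the gap may also include flanking untranslated sequence. `gc_percent` can be applied to the gene sequence given in the Sequence section to compare against the listed GC content.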
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATGCAATT GAGGGTTGAC GAACGAAATG GACTGGCTGT GAAAATGCAG CCCAGGAATA CTTCGTGAAA TTAATCTTTT GACTGTGAAT CCGTCGCGTG GCGACTATTA CCGTGCGTTA AGCCCGACGT TACCAGCATC CTTACATATC TGATGTCGCG GAAAATAGTG TATATGAACA ATGCTGGTCA GGCGCAACTG GACCAGTCCG TGGTTGCCGC GGGTATCGCT TGCGTTCAGA AGCCAGCGTG GGAAATGGAT GCGGAAGAAG ATGCCAAAAT AGTTCGCACA CTTTTTGCAC AAATGATACA CGCAAGCCCG ACGAACATTG CAATCATGCC GAGTACCGCA TTTGCCATCA CTCTAGCGGC TCACAATCTG AAATTTCAAC TGGAAGGCCA AAAGGGGCGG ATTCTGGTAC TTCAAGATCA AATGTGCTCG GCGGTTTACC CCTGGGAGCA TCTGTGCAGT CGATCAGGAG GATACCTAGA ATTAGATATC GTGTCATATC CAAGTGCAAG TACTACATGG ACCGAAAAAG TGATTACGCA TTTAAAAAAC GGCCAAGACA AAATCTTGGT TGCTTGTTTG CCTCCGCTGC ATTGGGCTGA CGGCGCGCTT CTAGATCTTG TTGAGATCAG TAAAATTTGT AGAGATATTG GTGCTTTTTT GGTTGTGGAT GCCACGCAGG CTGTGGGGGT ATATCCTTGC GATATCAGTT TGTTTAGCCC AGCATTGTTA GCATGCAGCG TTCATAAATG GCTACGTGGA CCTTCGGGAG CTTCACTTGT TTACATCGAC CCTAAACTAC AAAATACATG GGAACCTCTT GACCAACACG GCCGGTCTCG AAAAGTGGCC GGAATGGCAA TTTGGAATGC CGCCAAGGAT GGAATGGGCC CTCAAGGCTA CCCCCAAGAG TTCGTGACAG ATGCTCGCAG ATTTGATAGT GGCGGAAAAC CAAGTCCTTT TCTTTTACCG ATGCTGCGGA AGTCCATGCT AAAAGTAATT GAGGTTGACA TTGAAAAAAT CCAATCACTG CTAAAGGATC GGATGATGCC GTTGCTCGAT TGGGCTGACA AACACGAACT TTGGACACCC AGTGTGCATG CTTACCATCT AATTGGCATT CGGGTGCGTC ACCTATCACC GGAGCAAATG ATTTTGATCA AGAACAAACT CGAAGGCGAA CACGATGTAC ATATTGCTGT CCGATGCGGG GCGTTTCGGA TTTCCCCGTA TCTCGACACC ACCGAGGAAA ACGTACAAAA ACTGATTGAA GCATTAAGCG CAACCGTTCT ACTTTAATTG CGACCCCCAA TTAACGTTTT GCGAAAACTA ACGAAACATG TTGACAAGG
|
Protein sequence | MSRKIVYMNN AGQAQLDQSV VAAGIACVQK PAWEMDAEED AKIVRTLFAQ MIHASPTNIA IMPSTAFAIT LAAHNLKFQL EGQKGRILVL QDQMCSAVYP WEHLCSRSGG YLELDIVSYP SASTTWTEKV ITHLKNGQDK ILVACLPPLH WADGALLDLV EISKICRDIG AFLVVDATQA VGVYPCDISL FSPALLACSV HKWLRGPSGA SLVYIDPKLQ NTWEPLDQHG RSRKVAGMAI WNAAKDGMGP QGYPQEFVTD ARRFDSGGKP SPFLLPMLRK SMLKVIEVDI EKIQSLLKDR MMPLLDWADK HELWTPSVHA YHLIGIRANT MYILLSDAGR FGFPRISTPP RKTYKN
|