Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33458 |
Symbol | |
ID | 7204033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 964109 |
End bp | 965395 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186162 |
Protein GI | 219113157 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0063707 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAG TCGTAGAAAA CGAGGATTTG TTTACCGACA TTTATTCAGA TAACTCGACC TGCGCTTCGT CATATTCTAC TTTGAAGAAC AGCAAAGATG GATGGCGAAT TATAGATTGG AACAAGGAAG CCGTGGGCGT ACTTTCTGAT GAAAGCAAGT GGACTCAACC ATCATCTGTC GATGTTACGG TGATCCAAGA CAGGCTTGTT TACGTGAAAA GAGACGACCA CCTTCGCCTT CAAGGTTCCC AGATTGCCGG CAACAAAGCA CGTAAGATGC TCGCTTTGAA CAATCTGAAG GATTTTCCGT TGTGCGTAGT AAGCTATGGT GGACCACAGA GTAATGCAAT GGTTGCCTTG GCTGCGGTTG TCAATTTCCA GAATATAAAG CAGGGTATAG ACGATCACCA CGATCCGCAT CGGTGTCGGT TCATATATTA CACAAAGAAA CTACCCAAGT TTCTCCGAAA CCAGCCAACC GGAAATCTTT TCAGGGCAAA GATGTTGGGA ATTGAGATGA TAGAGCTACC GCCTGAAGAG TACAGAACTT TATTTGGTGG CAAGTGGGGC GCGAATACGC ACGCTCCGCA GGGTTTAACT CCTCCTGTCC CTGGTGACTC ACTTTGGATC CCACAAGGGG GATCTTCTGG AATGGCTCAT GCCGGCACAA GGTTGCTGGC TCAAGAAATA TGTGAATTTT GGTCTCTGAA GGGAAATGGG CGTCCACTTT CTGTTGCTAT TCCCGGGGGA ACATGTTCAA CCGCTGTTTT GGTTCACACC GCAATTGAGA GCTTACAGTC CAAGCTTTCA AATGACAAAC AAATGGACAT TAAAGTTGTT GTAATCCCAT GTATTGGTGA TGACACCTAT GCTAGAAGGC AAATGATGGC ACTGAATACA CAGCTAGGCA ATGCCTCCAA TGATCTTCCC ACAATATTGA AGCCCTCGCC TTTTGACTTG GCCACCCAAC ATAATCACAA ACATTCTGAC AAATATTTCA CATTTGGTGA ACCAGAAAAG GATATTCTTG AAACATTTGT CTATATAAAG GAGAAGTGTG ATATAACTTT GGACTTGCTG TATGGAGCGC CGGCATGGGC GGTCTTGCTC AGGCACTGGA AAGGAAAACA GACTTCCCCA TCAGTGTTTG ACGCAAATGC GCCATTTGCA GATCGCTCAG TCATGTATGT GCATAGTGGT GGCATTGAAG GCGTTAACAC TCAACTATTA CGCTACCGGT ACAAAGGCCT GCTGAAAACC AAAGATGTTC AACTTCCAAA CCACTGA
|
Protein sequence | MTQVVENEDL FTDIYSDNST CASSYSTLKN SKDGWRIIDW NKEAVGVLSD ESKWTQPSSV DVTVIQDRLV YVKRDDHLRL QGSQIAGNKA RKMLALNNLK DFPLCVVSYG GPQSNAMVAL AAVVNFQNIK QGIDDHHDPH RCRFIYYTKK LPKFLRNQPT GNLFRAKMLG IEMIELPPEE YRTLFGGKWG ANTHAPQGLT PPVPGDSLWI PQGGSSGMAH AGTRLLAQEI CEFWSLKGNG RPLSVAIPGG TCSTAVLVHT AIESLQSKLS NDKQMDIKVV VIPCIGDDTY ARRQMMALNT QLGNASNDLP TILKPSPFDL ATQHNHKHSD KYFTFGEPEK DILETFVYIK EKCDITLDLL YGAPAWAVLL RHWKGKQTSP SVFDANAPFA DRSVMYVHSG GIEGVNTQLL RYRYKGLLKT KDVQLPNH
|
| |