Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_14553 |
Symbol | |
ID | 7203114 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 420065 |
End bp | 421225 |
Gene Length | 1161 bp |
Protein Length | 321 aa |
Translation table | |
GC content | 43% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182223 |
Protein GI | 219123837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.65279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACGACGAGA ATTATGACTA TATTGTAACA AGTGGCGAAG TCTTCTTTGG ACGGTACAAC ATCAAAGAAC GTATTGGTAA AGGATCCTTC GGTCAAGTAG TTCGTGCGGA GGACATTGAA ACAAACCAGG AAGTGGCAAT AAAAATTATC AAATCGAAGA AACCCTTTGC ACTACAGGCA AAAACGGAAA TCGAGCTTTT GACACACTTG CTGGATAAGG ATGTTGAAGA TCAGCACAAC GTTGGTATGT TCACAAATTT ACGCAGTGTT TCATTGTTTT GCTTTTTTTT TGCTCCTTCT CACTCTGTCA TGTGCACCCC AGTGCGACTC TTAACCCATT TTGTTTATCG TGGCCACCAA TGTCTTGTTT TTGAGATGCT TTCTCTTAAT CTATATGAAT TGTTGAAGAA TACGCAGTTT AGCGGTGTAT CATTGAATCT AATTCGAAAG TTCGCCAAGC AAGTTTTGAA GGCGCTCTCA TTTCTTGCCC GACCGGACGT GGATGTTATT CATTGTGATT TGAAACCAGA AAATATTTTA CTGCGGCATC CAAAGAAGAG CGGCGTGAAA GTTATTGATT TTGGATCATC ATGTCGGTCA AACAAACGGA TGTATTCTTA TATTCAAAGT CGCTTCTATC GGTCACCTGA GGTCATACTT GGACTGCCTT ACGGCGTGGC GATTGACATG TGGAGTCTTG GATGTATATT GGCAGAGATG CACACTGGTG AACCCGTTTT TTCCGGCTCC GATCAGTTTG ATCAGATGCA AAAGATAGTT AAAATACTTG GCATGATCCC CAACAGCATG CTCAACCGAT CGAGCAGTCA AACCCGGAAT CAGTTCTTTC AGCGAAAACA ATCGACTGTT ACCGGGCGAG AAGAATGGAC GATACGTCAA GTAAAAAAGT CCTCATCCTC TTTGTCGTCA AAAGCAGAGC CTGAACCTCC GGAAGTTGTT GTCGTTCCGA GTGTCAATCC CGTATCGTCT TTGACGGAAG TAATCACAGC AGGAGCAAAC CAGAAAAAGA AATTCCCTCA GAGCGAGGCG TACAACACCC AGCGAAATTA CGAACTTTTC GTCGACCTTG TATACAAGAT GCTAGCCTAC GAGCCTGATC AAAGAATTAC GCCTGCAGAA GCGATGGCGC ATCCTTTCAT T
|
Protein sequence | DDENYDYIVT SGEVFFGRYN IKERIGKGSF GQVVRAEDIE TNQEVAIKII KSKKPFALQA KTEIELLTHL LDKDVEDQHN VVRLLTHFVY RGHQCLVFEM LSLNLYELLK NTQFSGVSLN LIRKFAKQVL KALSFLARPD VDVIHCDLKP ENILLRHPKK SGVKVIDFGS SCRSNKRMYS YIQSRFYRSP EVILGLPYGV AIDMWSLGCI LAEMHTGEPV FSGSDQFDQM QKIVKILGMI PNSMLNRSSS QTRNQFFQRK QSTVTGREEW TIRQVKKSSS SFEAYNTQRN YELFVDLVYK MLAYEPDQRI TPAEAMAHPF I
|
| |