Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45446 |
Symbol | HY2 |
ID | 7200684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 173938 |
End bp | 174966 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179595 |
Protein GI | 219117606 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.505627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTGG TTCCTGCGTG GACCATGATG ACACGCAATG CTATATTCTC TGCTCGAAAC CCTTCACATT TCTTGGAGGT CACGGCGCTC TTCTGTATAC TACTAGCGAG CGGAAGGGGC GCCCGAACCA ACACAGCTTT TGTCAATCCA CAGAATCCGC GCAACGGATT GACTTTCACT TCATCGTTCT CCGGTACCAG CCAACATGCC GGCAGAGTGA CGGTTAATGA GGAGAATTTG GCTTTACCCA CTGGTTCCGG ACTATACCGG GAATTTTCCC GACACGCTTG GGAAAAATTA CAATCATCGG GGCTTTTTGT GCAGGAACCG GTGGAGGAGG AATGTGCGTC GAACACGGCA CCTGCCCGTG GCCTTCCGTC CGGCAGTGTC GTGCAAATGG AGGTCCAGGC ATTGCGCGGT TCCCTCCCCC AAGTCGCCTA CGCTAGGTAC GCCTTACTGG AAACTTTGAA AGCCGGAAAT GACGCGAATG TCACGCATCA CGACGGAATC CAAGTCTTGA ATTTGGTGAT CTTTCCCTCA ACCACCACGG ACCTACCAGT GTTCGGCGCC GATTTCGTTT CTTTACCGGG GGGAAAGCAT TTGTTGCTAC TGGATGCCCA GCCAATGACA GAAGATGTAC ACTACGAACA CTATTGGAGT GATTGGTACC AGTCGAACCA CATTTCCGAG ATTTTCCCCT GGGGTGGGGA TATGCCAGAG GCGGTCCAAC GCTACGTGTC GAGCAAGGCA CTGTGGACAC GTATGGCATC CAGCGAAGAC AACGCCGGAG AGAAAAGAAA TCCTGTAATC ATTATTCAAA CTGATTTGAT GGCAGCTTTC CAAGCCCACT TGGAGGTGTA TATACAGCTG CTGCACGATT ACACTGACTT GGAGTCGAAG GACAACTGGA GTGCGGAATA CCTCGACTAT CGACTTACCA ATGATCCCGC TCGACCTATG CTGAAGTCTT TGTACGGCGA GGAATGGACC GAGCGCGTTC TGAAAACCGT GCTATTCCCA TCGCCATAG
|
Protein sequence | MKLVPAWTMM TRNAIFSARN PSHFLEVTAL FCILLASGRG ARTNTAFVNP QNPRNGLTFT SSFSGTSQHA GRVTVNEENL ALPTGSGLYR EFSRHAWEKL QSSGLFVQEP VEEECASNTA PARGLPSGSV VQMEVQALRG SLPQVAYARY ALLETLKAGN DANVTHHDGI QVLNLVIFPS TTTDLPVFGA DFVSLPGGKH LLLLDAQPMT EDVHYEHYWS DWYQSNHISE IFPWGGDMPE AVQRYVSSKA LWTRMASSED NAGEKRNPVI IIQTDLMAAF QAHLEVYIQL LHDYTDLESK DNWSAEYLDY RLTNDPARPM LKSLYGEEWT ERVLKTVLFP SP
|
| |