Gene Information |
Locus tag | PHATRDRAFT_48608 |
Symbol | |
ID | 7194875 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 355108 |
End bp | 356395 |
Gene Length | 1288 bp |
Protein Length | 293 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183079 |
Protein GI | 219125631 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
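The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the 1288 bp listed), and the helper names `gene_length` and `max_coding_bp` are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def max_coding_bp(protein_aa: int) -> int:
    """Bases needed to encode a protein: one codon per residue plus a stop codon."""
    return 3 * (protein_aa + 1)


length = gene_length(355108, 356395)
coding = max_coding_bp(293)

print(length)           # 1288, matching the Gene Length field
print(coding)           # 882
print(length - coding)  # 406 bp not accounted for by the coding sequence
```

The 406 bp difference is consistent with the record marking the gene as spliced: the gene span presumably includes non-coding sequence such as introns or untranslated regions, although the record itself does not list exon boundaries.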
Plasmid Coverage Information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCATGGCT ACAAGGACTT GTAGTTGATC CCTCTTCCAT TTCCAATAGC ATTTCCCGGT AAAGCCATGG CGGAAGAGAC GACTACTCGC AGCTTGGGTA CGGTCAAGGC CACCCTAACG GCATACCAGA CGCTGAACAA CATTGGCAAG AATCCACTCA ACGACGACGG AACGGTTGGT GGCGAAGAGT CGGACAGTCT CGTCACCCGA CCGCATCTCG GCATTGTCGG AAGCGTCGCC GTGGCGGGGT TACTCGCAGC GATTGCCAGT TTTGTCCTCG TGACGGCGCA TTTGGTAGAT TTAGCCTCCG TCACGCTCAT GCTCATCGCC CCGCTCGTCG TCGTACAAAA GCACATGCTT CGCAAGCTGG GAGGAATGCG AGGATTACAA AATTCCCTGA GACAGAGCGC TAATCGTTTC ATGCAGCAAA ACTTGACTCT CCACATGTCT GTCAGTAAGC TTTCGCACAA TGTCGAAAGG TAAGTGGGGC TCACTGGCAT AATTCCGAGC ATGCAGGATG CTGCGTTCTA GGATATCACC CTCATTTGCT TATTCTTTGA CAGCCTCTCT AAAGTTGAAG GGAAACTCCA GGGTATTGCC GAAAAGTCTG GCACAAAGGT CGACCATCTG TTTGACACGG TTAAGGAGAA CGGTGAAATT CAGAAACATA TAAAAAAAGC TTTGGAAACG AAGGTTATGC AGCAGATAAT GACGGCTTGT CTTCAAGTCG ACCGCGATCG CAATTTCACT CTCGGGCCGC AAGAAGTTAA GATCCTCGAG ATGCGTTTGA GCAACATCCC TGGTGTCATT TTTGATAAGA CCCGGTTCGA CGCATTTTTG CAATCCGACC AGGGGGAACT GGCATTGTCC GACGTTTGTG GTATCGCCCG TGTATTGCGG GACAACTCTG TCCCGGAAGC GCAACGGATT TTTCGGTTCG CTCCCCAAAA GGTGCTCCAG CAGGGGAAAG CCTCGTCGCC GCGGAGGGGG CTATTTGGTA TGGGCAGTGC CAGACACGAT CTGGTGCCTT AGTCGAGCCC ACCCGTGGGC ATCAGCATCG TGCTTGGACC ATACCATGAC AATTGAGCGG CTCAGCATGC TATAACTATC TATTTAAAGA GGCGATGAAT GATTGTTCTG TCCCTCTCGT ATGGTTGTGG ATGATTTAAA AGCTCATCTC ATTTCGATGG TTCAAGGGAC ACAAACTAGT TAACTCTTGT TCGAGGGACT CGCATTTTCC GATCCTATTG CATAGCATTA GTAAGAATAG AGTGATCTAC TTATCTCT
|
Protein sequence | MAEETTTRSL GTVKATLTAY QTLNNIGKNP LNDDGTVGGE ESDSLVTRPH LGIVGSVAVA GLLAAIASFV LVTAHLVDLA SVTLMLIAPL VVVQKHMLRK LGGMRGLQNS LRQSANRFMQ QNLTLHMSVS KLSHNVESLS KVEGKLQGIA EKSGTKVDHL FDTVKENGEI QKHIKKALET KVMQQIMTAC LQVDRDRNFT LGPQEVKILE MRLSNIPGVI FDKTRFDAFL QSDQGELALS DVCGIARVLR DNSVPEAQRI FRFAPQKVLQ QGKASSPRRG LFGMGSARHD LVP
|
| |
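The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up before analysis is shown below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example string is just the first two blocks of the gene sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two ten-base blocks of the gene sequence field.
example = "TTTCATGGCT ACAAGGACTT"
seq = clean_sequence(example)

print(seq)               # TTTCATGGCTACAAGGACTT
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.4
```

Applying the same two functions to the full gene sequence field would give a value to compare against the GC content listed in the record, and `len(clean_sequence(...))` on the protein field can be compared against the listed protein length.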