Gene Information |
Locus tag | PHATRDRAFT_49849 |
Symbol | |
ID | 7198671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 87018 |
End bp | 88427 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184643 |
Protein GI | 219128908 |
COG category | |
COG ID | |
TIGRFAM ID | |
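The coordinate and length fields above are consistent with one another, and the short sketch below checks that arithmetic. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly. The function and variable names are illustrative only.

```python
# Values copied from the record; names are illustrative, not part of any database API.
start_bp = 87018
end_bp = 88427


def gene_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1410 469
```

Both results match the Gene Length (1410 bp) and Protein Length (469 aa) rows.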
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.725945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCGTCAC TTTTGGTAGT GAAATTTGTG TCACGTGAAG AGAAGCCCTC GTTGCAGAGT TTTTCTACTG GGTTCGAAAA TAATTGCCGC TCGTCACGAA ACGATACTGC TATGGATGTC AATCAAATCC GAAGGTATCA TGTGCAGCGT AAACGACTTT ATCGAACAGC GACAGCAATG AAAGTTTCCC GACTTTTACT GGCACTTGCT AGCTTCCTTA CAAGTTGGCC ATGCACGGGG GCGTTTCAAC CGAGGCGTCC TCCTTTCGCA AAGTCAGTCC ATGTTTCATC TAACTCTGCA CGCAAAACAA TCCGGGATCG GATAGCGCGG CGTGCCATTG CAAGAAACAT TGCAACGCGT CCAAAAACAT ATACCTCAAG ACCAAAAACT GCACTCCTTG CTGTTCCCGC TATTTCTGTG CCCGCTCCGG GAACGTTCTA CATGATATTG TTGGCCGTGC AATTTGCCAG TCAACCATTG CTCACCAAGC GCTACGCACC TCCCACCATC ATTAGAAGTA CCTACGTACT CGCCCAAGAC CTCTTTCGGA TGTTCACTTG TGTTACGCTA TTGATTATTA CGGGGAGCTG GCATTCCGCG ACCGCTTCAT GGAAGTGGTC GTCGGCAGCT GTAGCGGCGG GTCTCCCCGC CCTGTTGTAC GCCGTCCAAA ACTACTGCAG TTTGGTGGCC TACCAAAATT TGCCCCCAAT CACGTACAAC GTCCTGAATC AAACCAAAAC ACTATCGGCA GCGGTATGTT GCTACTTTTT GTTGCGTCAA AGACAATCTC CGTACCAAAT CGTTGCCCTG GGAGTTTTGC TGGTGGCAGC GCTCGTAATG GAATCCATCC TACCTTTGCC AGGGATCGGC AAGCCCCAAG ATCCTACGTT GGCCGGTACC GCTACCGAAA AGCACAAGGA TCATACCGCC AGCATTGATA CAGACCAGAA AGGAGTGCAC TGGGCATCCG GAGTCTTGCC GGTGTTGGCT GCCAGTGGCA TTTCGGGTTT GGCTGGGGCC TTGGCCCAAA AATCACTGCA AGTACAGGAG CGCAACTCGT TTTTGTTCTC GGGTGAGCTT GCGGCAATTA GCGCTGTCAG TCTGCTGATC AGCTCCTTGC TAGGATCCCC CGACGGGCGG CGGATTCGAA AAGAGGGCTG GACTAAAGGA TGGACGTGGC AAACATGGAT TCCGTTGGCC ACAAACGCGG CGGGGGGTAT CTTAGTCGGA TTAGTGACCA AGCACGCTGG CAGCGTGCGG AAGGGCTTTG CCCTCATTAT TGGCATGTTT CTGAGTGGTG TACTCCAGAA TGTTGTAGGC AGTGAACGTC AAGTCACGAG CCAGCAATGG GCTGGTGGAT CCTTGGCGGC ACTCTCGCTG TGGCTCTATA CAGCGTATCC AATGGTCTAA
Protein sequence | MSSLLVVKFV SREEKPSLQS FSTGFENNCR SSRNDTAMDV NQIRRYHVQR KRLYRTATAM KVSRLLLALA SFLTSWPCTG AFQPRRPPFA KSVHVSSNSA RKTIRDRIAR RAIARNIATR PKTYTSRPKT ALLAVPAISV PAPGTFYMIL LAVQFASQPL LTKRYAPPTI IRSTYVLAQD LFRMFTCVTL LIITGSWHSA TASWKWSSAA VAAGLPALLY AVQNYCSLVA YQNLPPITYN VLNQTKTLSA AVCCYFLLRQ RQSPYQIVAL GVLLVAALVM ESILPLPGIG KPQDPTLAGT ATEKHKDHTA SIDTDQKGVH WASGVLPVLA ASGISGLAGA LAQKSLQVQE RNSFLFSGEL AAISAVSLLI SSLLGSPDGR RIRKEGWTKG WTWQTWIPLA TNAAGGILVG LVTKHAGSVR KGFALIIGMF LSGVLQNVVG SERQVTSQQW AGGSLAALSL WLYTAYPMV
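The two sequence rows can be related to each other and to the GC content field with a small amount of code. The sketch below strips the 10-character grouping, computes GC percentage, and translates a nucleotide string. The Translation table field of this record is blank, so the standard nuclear genetic code (NCBI table 1) is assumed here; all names are illustrative.

```python
# Standard genetic code (NCBI translation table 1), assumed because the
# record leaves the "Translation table" field empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates 10-character groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence row.
prefix = "ATGTCGTCAC TTTTGGTAGT GAAATTTGTG"
print(translate(prefix))  # MSSLLVVKFV
```

The translated prefix matches the first ten residues of the Protein sequence row. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length rows to be checked in the same way.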