Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40692 |
Symbol | |
ID | 7198591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 160868 |
End bp | 162088 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184660 |
Protein GI | 219128943 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.388438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTTT CTATTTTTGC GTTGCTCGGC GCCATTTCGG CCTCGTCCCT TCTGCCTGAA CTGACGGGGG CTCTTCCAAA CGGCGATACC ACTTACGAAC AGCAAATCGG ACGCCTGGAC AATGCGGCTC GCTCATCCGG CCGTATGTTG ATGAAGAACC TGAGGATGGA TTTAAGCGAC AGGCAGGTTA GTACAAAGTC GCCTGCCCCA CTTGTGACTT CTCCTCCAAC AGCCCAGCCG ACATCATCTA TTAAGCAAGC TTCCTCCAAA GTCCCGTCCG ACGGTTCCAG TAGCCCTAGT TCGAAATCGT CTCCAACCTC ACCGGAGACA GACATGCCAG CATCTAATGC TTCCAAGAAA AGCAGCAAAT CCAGATCTCC GACATCGCCG AATCCGAAGA GCGCGAAAGC TACGGTATCT CCCGATCAAA CTCCGTCTGA CTTCCCCTCC GAAGTTCCTA CTGGTCCCAG TGCGAAGTCC TCTGTGGCAC CGGCAGCCAC CAAGAAGAGC GGCAAGGGAG GGTCCCCGAC ATTGCCAAAT CCGAAGAGCG CGAAAGCTAC GGTATCTCCT GATCAGACGC CGTCCGACTT CCCCTCCGAG GTTCCTAGTA GTCCCAGCGC AAAGTCTTCC GATATATCAA AGGTGAGCGC TTCCAAGAAG AGCGGCAAAG GCGGGTCCAC GACATCGCCG AATCCGAAGA GCGCAAAAGC TACGGTATCT CCGGATCAGG CTCCGTCCGA TGTCCCCTCT CCGGTTCCGA GTAGTCCGAG TGTGGAACCT TCTTCTGACG CACCGGTGGC CGATACTTCC ATCACTTCCA AGAACGGTAG CAAGACCGGG ACTCCGACCG TATCGAAAAA TCCAAATCAA GCTCCGACCG CAGTCCCTAC GAGTCCCGCT ACTATTCCGC CGTTTACCTT CTCGTTACCA CCAGTGTTGA CTTTGCCGCC AGTCAGCGAA TTGCCAAAGA TCCCTACGCA GAAAGCTGAT CAATTACCAC TTCCGAAACT ACCCAGAACA AAAAAGGGAA CCAAGAAAAG CTTGACTAAA AGCACCGCTG ATTTGCCTCC AGTGATGGCT CTACCGCCAG TGATCGATTC GCCGAACATC CCCAGCAAGA ATAGTAACGT GATAACCGTT CCGAAAGTAC CCGAAACGAA AAAAGCCACG ACCAAAGGCT TGACCAAGAC CACCAACTCG TTGGAAATTC CCGCCTTCTA A
|
Protein sequence | MRFSIFALLG AISASSLLPE LTGALPNGDT TYEQQIGRLD NAARSSGRML MKNLRMDLSD RQVSTKSPAP LVTSPPTAQP TSSIKQASSK VPSDGSSSPS SKSSPTSPET DMPASNASKK SSKSRSPTSP NPKSAKATVS PDQTPSDFPS EVPTGPSAKS SVAPAATKKS GKGGSPTLPN PKSAKATVSP DQTPSDFPSE VPSSPSAKSS DISKVSASKK SGKGGSTTSP NPKSAKATVS PDQAPSDVPS PVPSSPSVEP SSDAPVADTS ITSKNGSKTG TPTVSKNPNQ APTAVPTSPA TIPPFTFSLP PVLTLPPVSE LPKIPTQKAD QLPLPKLPRT KKGTKKSLTK STADLPPVMA LPPVIDSPNI PSKNSNVITV PKVPETKKAT TKGLTKTTNS LEIPAF
|
| |