Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46164 |
Symbol | |
ID | 7201372 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 454922 |
End bp | 456667 |
Gene Length | 1746 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180639 |
Protein GI | 219119772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCTGGTAGGC GAGTCAAGTA TTCGTCAGCC TTTGGTAGTC TCGAAACTCT TATTCTTTGT GATAACTGTT GATAAGGGAC AGAATAACAG CCAGGCTACC ATGTTCAAAG TCAAACGGCG TTTGTCGAAG CTATCCAAAA ACAAACAACC AGATTTGGCT GTATCACAAC AGTCTCTGCC AGTCCAGCAT CCAGCGATTT TGTCTCCTCC TTTACACAGT GGATCGGACG TTTCTTCCTT GCAATCTTCG AATGCAGCAC TACGCAGTTG CTTGCGTGCT TCGGAAAGTA GCGGTGGCAG TCACGATGCA TCCTCCGCTC GCGATAGTCA AATTCAACAA AAGCAGCCGT TGCAATCCGC TGAAGATCAA TCAGGGCTCA TTTATGGCCG GCCACGCAAT GCGAACGGAA GCAGCCAACA TAGTGGTGAT AGTAGTTCTG GAAGATTGCG AAAAGTTCGA TTTCGTCATG TCCAGGTTCG CGAGTTTGAA CGTATAATTG GAGATAATCC GTCGTGTTCC AGTGGAGCAC CCGTAGCGTA AGTTTCAATC AATTAGTTGA TCGCATCGCG TGTTTTACGC TGGTAGAACA ATGGTTTGGC GACTTTACAG CTCGATTGAA ACTTGTGAAT TCACTTTCGA TGTTTTACTC TTGCCTACAC ATACAGGTTG GGCTGGGCCC ATAGTCGGGA TCGCACGATG CGTTTGGACG ACTACGAATC GGTACGGCCA TCGCGTCGGT CGCAGTTGGA CTTGATTTTA ACCCGGCAAG ATCGGGAGGA GCTCTTACTG GAATGGGGCT CAACCTTTCA GCAGATTATT GACGCCATCC GATCCAACAT TCGGGTAAAG AATCAACGCC GGCGGACGGT CAACAACATA GGCACCTATG ATCGTTGGGA GGAAGCCATG GAGAATGCTG GTCGGAAAAT CAAACGTACG CTACTCCTCA AGAAATCTAC TAAGCAGCGT GTTGAAGAAA TGACTGCGCA ATCGAATACA ATTCGCGTTG TATCGCAGCA CGAAGTGCAA GGTAACCGCA ACGCTAGTGA AGTTAACGCA ATCACGCCGC GACGTCGGCA TTCCGAGGAC GATTCAGAAA AATCCCCTCA ATTGACATCG CAATCTGATA TTTCGAGCAA TGTTGATCCC GCAACGATCC TGGAACCAGA AGACGCTGCG TTTCTGGTTG GAGTTGAAGA GGGGAAATCC TTAGGAGGCT CATCGTTCTT AAGTGTAGAC AGCAACAGAC CAACATCTGT TATTGAAGTC GGCATTGTGG AGCGCACGAC GCTCACAGAA GATTTCTATT CTTACATGGA AGAAATGACA GCTACTTCCG GACTTACGGG CATTTCAGGA GCAACACACG AAGACCAATT TTCAAATGAT GGCTTTGAAA TGCTGGACCG GGACAATTCT TTCTGGGAGG TTGATGAAGA TCGTTCCGAT TTCCCCCGTA TACGCCGCAT GGTTACCCCT ATGGTCATCT CCGAAGATGG AGCAGCGTTT GATGTCCTGA ATCAGTACGA GCAATGGGAC AGCAATGGTG GCTTTTCGGG TGCTCAGCAA CCGCCGCCAT ATTCTAACTC CATCATTAAC AAGTGGGAGT AGTGGACTAC GCATGTTATT CATTGTCAGT CCATGGATGT TTACAACAGA TTTCGAAGTG CAAGGGTTCT CACACACAAC GTGGAAGCAC AGCGTTACGT TGTACAGTCG ATGGCACGCT CATAATGGGA TTGACT
|
Protein sequence | MFKVKRRLSK LSKNKQPDLA VSQQSLPVQH PAILSPPLHS GSDVSSLQSS NAALRSCLRA SESSGGSHDA SSARDSQIQQ KQPLQSAEDQ SGLIYGRPRN ANGSSQHSGD SSSGRLRKVR FRHVQVREFE RIIGDNPSCS SGAPVALGWA HSRDRTMRLD DYESVRPSRR SQLDLILTRQ DREELLLEWG STFQQIIDAI RSNIRVKNQR RRTVNNIGTY DRWEEAMENA GRKIKRTLLL KKSTKQRVEE MTAQSNTIRV VSQHEVQGNR NASEVNAITP RRRHSEDDSE KSPQLTSQSD ISSNVDPATI LEPEDAAFLV GVEEGKSLGG SSFLSVDSNR PTSVIEVGIV ERTTLTEDFY SYMEEMTATS GLTGISGATH EDQFSNDGFE MLDRDNSFWE VDEDRSDFPR IRRMVTPMVI SEDGAAFDVL NQYEQWDSNG GFSGAQQPPP YSNSIINKWE
|
| |