Gene Information |
Locus tag | PHATRDRAFT_50464 |
Symbol | |
ID | 7199314 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 138516 |
End bp | 140247 |
Gene Length | 1732 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185434 |
Protein GI | 219130567 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
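The Gene Length value can be reproduced from the Start bp and End bp fields, assuming the coordinates are 1-based and inclusive (the record does not state its convention, so this is an assumption). The sketch below is illustrative only; `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
gene_length = span_length(138516, 140247)

# Bases needed to encode the listed 486 aa protein (stop codon not counted).
coding_bases = 486 * 3

print(gene_length)                  # 1732
print(gene_length - coding_bases)   # 274
```

The genomic span is longer than the bases needed to encode the protein, which is consistent with the record marking the gene as spliced, although part of the difference may also be a stop codon or untranslated sequence.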
Plasmid Coverage Information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGCCAATC AATCGCTTCA ACCTCAAGGA GAGGACACGC ACAGTTACAG TTCACCCTCC GTCTCTGTGA TCCCTCTCAA ACCATTCCAA ATCACTGTTG CTGCTGCTGT GAGAGTGACT AGTGGGGGAA ATTCATCCAT CACTCCGTCA GCCGTATGCA ATTGCTTGGG ACGAAAAATA CGCTGCTGTT GCTGGCGGCG GCGGCGGCGA CCGAGTCCCC CGTGTGGGCC TTTACGTCAC CCGTTTGTTG GACTCGGAAT CGTCGTATAC GGTTGGTGTC CCCATCCCGA GCGACACTGA CGGTCAGCAG TAGCAGTAGT AGTCGACGCC TCGGCAACGA CCCTAGCGAG TTCGACTACT TACTCCAGGA ACAGTCGTCA TTGTCGCGGG TAGCGTCGAC CACACACGGA CGTCGTCGGG CCGTACAACT CCCGTCCACG CATTCCGGAC AACCCGCTAC CGTCCTCGTG AGTAGCTTTT CCGCCGCTCC CGGAGCCGTC ACCGAATCGA CGGACGAATT CACGACGGAC TCCAATGCGG CAGTGTCCCC CATTGACGGG GACGATCCCT TTGCCTTCGA GTCCGCCGGA CTCTACAAGG TACAAACCAT GACGACGCAA AAACCGTCAC TCGAAGCGCG TCTCAAAAAC ATGGATCTGC AGGACATTAT CGCCACGCTC ATACTACCTT CCGTTGTCGT CTTTGCCGCC GGACGCTGGG GATTCAACAA GGTCTACGGC AAGGTACAGG TCCGCACCGA AGCCCTCCTC GACAGCTTTG CCAAGGAAAT GCTCTACCAC GACGGTGACT ACAACGAAAT GAAACTCTGT ATACAAGATT ACAACCAAAA ACTCATATAC TTGCCCAACC GCCGCGACGT CATGTTGAAA CGCTATCTCG CGGCCTACGC CAAGAAGAAA ACCGTCAGTC CCCTCGCCAT TTCCTCCCTC AGTTTCGTAC TGACGCAGTT CCAAATGAGT GAAGAAGCCG CCGCTGCCTT GCTCGTCAGT TTGTGCCGAC AAATGGGCAC GGATAAGATC GCCTCGGCTG GGAAACTACT CTTTCTCGGT TCCTGCATTC TCAAGTCACC GGAAGGACAG GCCGCCCTCA CCCCCATTAA AGATCTCATC AAAAGTACCT ACCGCGAAGT ATCCGTCGCC GAAGCCATGG TCGAGACCTC TCAACAGTAA GTGGGCACTG AGTGTCCTAT TCCGTGTCAC GCAGAAAATG CTGCACACGC TCACACATTT CTCTCGTCGA CAGAGCCATT GCGGAAGCCT CGTACCGATC GGTAGTTCTG GCTGCGGGCA AACAACAGAA ATCCCTCACG CCCGGTTGGG ACGTCCTGGG TTTGGAGCGG GACGTGGCAC AACGAATTTA CGACGAAGAA GCCAAGGAAG GATTCCGGAC CGAGCGCGAA ACCATGTATG GCGGACAAAC CACCCGCTAC GACAAGAAGG GACGCATCGT GGACCGCGCC GGCAAATTGA AGAATCCTGC CGATGCGGAC GAAGACGATG ACGACGAGCC GCAAGGTGGC GTTAGTAACG TGTACGAATG TGGAGAATGT GGATACACAC TGTTTGTCGC TCAAGGGAGA GAATCCAAAT TCTTTGGCAC CGGCTTTAAA TGTCCCGAAT GTGGTGCCGC TAAGAAACAG TTCAAGGCAC GGGACGATAT GGACGAAGAA TAACTTACCT TACCACCATT AGCTAACCTT ACGGAACCAG GG
|
Protein sequence | MQLLGTKNTL LLLAAAAATE SPVWAFTSPV CWTRNRRIRL VSPSRATLTV SSSSSSRRLG NDPSEFDYLL QEQSSLSRVA STTHGRRRAV QLPSTHSGQP ATVLVSSFSA APGAVTESTD EFTTDSNAAV SPIDGDDPFA FESAGLYKVQ TMTTQKPSLE ARLKNMDLQD IIATLILPSV VVFAAGRWGF NKVYGKVQVR TEALLDSFAK EMLYHDGDYN EMKLCIQDYN QKLIYLPNRR DVMLKRYLAA YAKKKTVSPL AISSLSFVLT QFQMSEEAAA ALLVSLCRQM GTDKIASAGK LLFLGSCILK SPEGQAALTP IKDLIKSTYR EVSVAEAMVE TSQQAIAEAS YRSVVLAAGK QQKSLTPGWD VLGLERDVAQ RIYDEEAKEG FRTERETMYG GQTTRYDKKG RIVDRAGKLK NPADADEDDD DEPQGGVSNV YECGECGYTL FVAQGRESKF FGTGFKCPEC GAAKKQFKAR DDMDEE
|
| |
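The gene and protein sequences above are displayed in space-separated blocks of ten characters. Before checking their length or base composition, the spaces have to be removed. The following is a minimal sketch under that assumption; `ungroup` and `gc_fraction` are hypothetical helper names introduced here for illustration.

```python
def ungroup(display_seq: str) -> str:
    """Remove the whitespace used to display a sequence in blocks of ten."""
    return "".join(display_seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record, used as a short demo.
demo = ungroup("CTTGCCAATC AATCGCTTCA ACCTCAAGGA")
print(len(demo))                    # 30
print(round(gc_fraction(demo), 3))  # 0.467
```

Applying the same two functions to the full Gene sequence field would give values to compare against the Gene Length and GC content fields; the record does not say how its GC percentage was rounded.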