Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_13511 |
Symbol | |
ID | 7204604 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 199703 |
End bp | 202713 |
Gene Length | 3011 bp |
Protein Length | 939 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185839 |
Protein GI | 219121222 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCTGGAACA TGTCGTTGAA ACTGCGAGAT CTTATCCGTA AAGTGCGGCA ATGCAAGACT GCTGCGGAAG AACGGGCGGT CATTGCGAAA GAGTCGGCAA TGATTCGGAC AGCTATCCGT GAGGAGCAAG CGCACTACCG CCACCGGAAC GTGGCGAAGC TGCTTTTTAT GCACATGTTA GGTATGTGAC GCTTTCATGA GCGGATAGTA CGCTGGCACG TAGAAGCACA CCATGCGTTC TGTGAGTTTT CTGATTACTG ACGGCCTGAC TTTGCATTGC TTATTTGTTC GAATCAGGCT ATCCAACGCA TTTCGGTCAG CTGGAATGTA TGAAGCTCAC GGCCTCCCCT CATTTTCCGG AGAAGCGCAT CGGTTATCTC GGAATGATGC TGCTCCTCTC CGAGGATGCT GACGTTCTGA TGTTGTCTAC GAATGCCTTG AAGAACGATC TTACGTCGTC CAATAAGTTC GTTGCGGGTT TGGCACTCTG TGCCATTGGA AATTTGGCCA CGGCTGACAT GTCTCGAGAT TTGGCTCCCG AAGTAGACAA GCATCTCAAG TCTCCCATGC CATACATTCG CAAGAAGGCC TGTTTGGCCA TGTCGCGTTG CCTCTCCAAG TGCCCTGACA TGGTGGAAGA CTTTATCGAT CGCGTCATTA CCTTGCTCAA AGACAAATCT CACGGAGTTT TGATTACCGT GGCGCAGCTC ATGACACAGA TTCTCATGAT CGATTTTCGG AATGCCGAAG AAGAAGGTGA AGATCCTTTT GCAACGCCTT GCCGTCAAGC TTTTTTGCGA TTGGTCCCAA CGCTTGTCAA AATGTTACGC AACTTGTTGA GCACAGGCTA CTCTCCCGAA CACGATATAG GTGGTATATC GGATCCATTC TTGCAAGTTC AGCTTTTGAC TTTGCTTCGC TTGTTGGGTG CCAATAATGA AGAGGCTTCA GAAGAAATGA ATGATGTGCT GGCACAGGTC GCGACCAATA CCGAAACATC TAAGAATGCG GGCAATGCCA TTCTCTACGA GTGCGTACAG ACAATTATGG GGATCGAAAG CGAAGACGGT CTCCGTATTT TGGCGGTCAA CATTCTTGGA CGCTTCCTCC TTAACCGTGA CAACAATATT CGTTACGTTG CACTCAATAC CTTGGCACGA TGCATTATCG AGCAGAAACG CTCCGGAGAC ATGATTGAAA CGGGTGACGG TGAAGAGACC AACAGTGCCA TGTCCGCTTT GCAGCGTCAT CGTACTACCG TCGTGGAATG TCTAAAGGAT CCTGATGTCA GTATTCGCCA GCGCGCGCTA GAGCTCATTT ACCATCTCGT CAACGACGAC AACGTGGAAT CGTTAACAGC TGAACTTCTG AACTACCTTG TTCTTTGTCC ACGTGAACAT CGAGGCGATA TCTGTAGCCG CATCCTTCGT GTCGTCGATC GATACAGCCC TGACGACAGG TGGAGAGTGG ATACGCTCAT CACTACCTTG ACCATTGCAG GACGAGAAGC GGCGAGAGAT GTCCAGTCTT CAGCAGTTGT CTACATTTCT CGAGGAGGTG AAGACATCCA TTCTTTTGCT ACACACAAAT TGATCAAAGC AATCAGAGAC GACGACGGGT CGCAACACGG ACTGTTAGCC GTTGGTATCT GGTGTATTGG TGAGTACGGC GACCTGCTTT TGAAGCCATA CACCTATACA CATCAGGCTT CCGATGTAGC TAATTTTTCT TCCAACGGTG GGCTAATCAC TTTCCATGCA CTCGACTCGT CTTCGGTCAT TGACACGATT GAGCACGTAG CAAAGCGTCA CGCCTGTCCA GAAATGGTAA AGCAACGAGC GTTGACTGCG TATGTAAAAC TGAGCCAGAG ACTAGCCAAC AGTGGTGATC AGGCAGCACT GGACCGACTG CGGCAATTGC TGAAGAACCA AAACATGTCA CACTCGTTGG AGTTGCAATT GAGATCCTGT GAGTATAGTG CTCTGGTTAA TGCCAGCAGA GGCGTGACAG CGTCTGCTCC TGCTCCTGTA ACGGACGATA TATTTGGCAT GACCAACGAT AACGCAGGTG GATCTGTAAG CGACGGTGTT ATTAATGCAG CAAAGGAAGC TCTTGCCCGA ATGCCAGTTA TAGACATGAA GGTTCTTCAA AAGCGGCTTT CTACCTCAGA TTGGGACGAT GACAGCACGC CTCGGATTCC ACGAGGTGCT GCCAAGAAGG ATACAAGCGG TGGTGATTTA TTGGATCTTA ACGATATTTT TGGAGCTGCA CCAACGCCAG AGACTACACA GAATGGAGCA ACCTCTGTTT CGGGAGCTGG AGAAACAGGA AAGAGTGATT TGGATCTGTT GTCTGACATC TTCGCTGTGC AGGCAGCCAC TGGTTCGGCC GCGGCACCAG TGAGCAATGG TACCTTTGAC TTGTTTGCAG CTCCCGTTTC ACAACCTGCC CCGGCTCCCG TCGACCCGAT GGATTTGTTT GGTGCGCCTC CGGCTTCGAC GAATATTCCT GCCTCGACCA ACAATGTGAT GGACCTTTTT GGTTCTTCCG CTCAAGGCCA AAGTATCCCT GCCCCACCCC ATATGTCCCA ACCGATGGAC GACTTGTTTG GATCAGCGCC AGCTCTTGCG GGACCTAGTG GTGTGAAAGT TTCTGGACCT AGCCACGCAG GCTTGTCGAT TGAATTTGAG TGCACGAAAC CAGATACATT TAATCAGCAA AAGTCGGTGT TAACTGCCCA CTTCAAGAAT ACGTCAGGTG ATACAATATA TGGAATGAAT TTACAGTGCG CTGTTCCGAA ATACGTGACA ATGGAAATGG AGCCACCGAC CTCCACTACG ATTCCCGTCT CCGGTGGTTC AAGTGAGGAA GTTACCCAAA AGATTACGGT AACAAATTCG ATGCTAGGCA CAAAGAACTT GATGCTCAAA CTCAAATTGA GTTTCACTGC CACGGGAGAG AGGATAGAGC ACATGGCCAC TGCTTCTGGA TTCCCAGCTG GTCAATACTA G
|
Protein sequence | PWNMSLKLRD LIRKVRQCKT AAEERAVIAK ESAMIRTAIR EEQAHYRHRN VAKLLFMHML GYPTHFGQLE CMKLTASPHF PEKRIGYLGM MLLLSEDADV LMLSTNALKN DLTSSNKFVA GLALCAIGNL ATADMSRDLA PEVDKHLKSP MPYIRKKACL AMSRCLSKCP DMVEDFIDRV ITLLKDKSHG VLITVAQLMT QILMIDFRNA EEEGEDPFAT PCRQAFLRLV PTLVKMLRNL LSTGYSPEHD IGGISDPFLQ VQLLTLLRLL GANNEEASEE MNDVLAQVAT NTETSKNAGN AILYECVQTI MGIESEDGLR ILAVNILGRF LLNRDNNIRY VALNTLARCI IEQKRSGDMI ETGDGEETNS AMSALQRHRT TVVECLKDPD VSIRQRALEL IYHLVNDDNV ESLTAELLNY LVLCPREHRG DICSRILRVV DRYSPDDRWR VDTLITTLTI AGREAARDVQ SSAVVYISRG GEDIHSFATH KLIKAIRDDD GSQHGLLAVG IWCIGEYGDL LLKPYTYTHQ ASDVANFSSN GGLITFHALD SSSVIDTIEH VAKRHACPEM VKQRALTAYV KLSQRLANSG DQAALDRLRQ LLKNQNMSHS LELQLRSCEY SALVNASRGV TASAPAPVTD DIFGMTNDNA GGSVSDGVIN AAKEALARMP VIDMKVLQKR LSTSDWDDDS TPRIPRGAAK KDTSGGDLLD LNDIFGAAPT PETTQNGATS VSGAGETGKS DLDLLSDIFA VQAATGSAAA PVSNGTFDLF AAPVSQPAPA PVDPMDLFGQ SIPAPPHMSQ PMDDLFGSAP ALAGPSGVKV SGPSHAGLSI EFECTKPDTF NQQKSVLTAH FKNTSGDTIY GMNLQCAVPK YVTMEMEPPT STTIPVSGGS SEEVTQKITV TNSMLGTKNL MLKLKLSFTA TGERIEHMAT ASGFPAGQY
|
| |