Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44109 |
Symbol | |
ID | 7203870 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1007276 |
End bp | 1008506 |
Gene Length | 1231 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | annexin |
Protein accession | XP_002186168 |
Protein GI | 219113169 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCAGCGTT CTTACAGTCA ACGAGCCAGG CTCTTTGTAT CCAACTATTA TGAGCATCGA TCTGTATCCT GCCATCATTC ACGAAGGAGA CCTTTCTCCT GACTCCTTTG GGCCAGAAGT CGACGAAATT TGCAATGAAA GTAAGTTTGA ATCCGGCCCT CTTTTTGTAG TTTAGGTAAC TTTAATCCGT GTTCTCATGT GGCGTTCTTC TCCGCCGCGA AGTTGATGCT TCTTGCAAGG GTTTTGGAAC CAACGAGAAG CGCCTGCTTA AGGCTATGGG ATCCCAGTCT CCCGAGACGC GCTGCAAGGT GCCACTCCGC TACAAAGAAC TTCGTGGCAA GGAGCTTAAG AACGTAATGA AGTCCGAATG CGGCAAGCGC AATTTCGGGA CCGCGTTGCA ATTCTTGGCC GTTTCGCCCA TTGAAGCAGA CTGCGACATG ATCAAGGCCG CTTGCAAGGG TGTGGGTACG AATGAGCTCT TGCTGTTTAC CATCTTGTGC GGTCGCTCCA ACACGGAGGT CGAACTACTC AAGAAGAAGT ATTTTGAAAT TCACACGGCG GATCTTGGTC GTGTTCTCGA TGGAGAATTG GGCGGTGATT TAGAAAAGAT GATTTTCAAT ACCTTGCAGG CTGCAGAAGA AATATTTGAT TCGGACTACC ATACAGAGGC GCTCATGAAA GAGGACGCTG CGAAATTGTA CGAGATGGGA CAAGGCAAGC GATTCGGAAC GAACGAAGCA GGACTTTTCA AGATCCTCTG CGCCCGCCCG CCTGAGTATT TGAAACAGAT GAACCTCGTC TACGCGGAGA AGTACGGCTT CACTCTTCCC AAGGCATTGG AGACCGAGCT AAACGGGCAT GTCAAGGATG CAGCATTGTA CATGATCGGT ATGAAACTCA AACCCTACGA AACGGTGGCC AACTTGATTC ACCGCGCTTG CAAGGGTTTC GGTACAAATG AACTTCTTTT GACGTCGGCT CTGATTCGCT ACCAACCCAT AATGAAGAAT GTTATGGAAG CATACATCGA ACTCTATGGC GAGACGATTG AGGACCGTAT CAAGTCTGAA TGTGGTGGCG ACTACGAACG TATTTTGTTG GAAGTGTTGG GCGCCGCTGA ATGATTTTAA CGTGTTCTGG CCCGTTTGTT TTTATGTCCG GTGTTTTAGC CCTTGTTAAA CTATTTAATG AAAAAGTGTG TAGGGTGATC TCCAGTCACT TGATGCAGAA G
|
Protein sequence | MSIDLYPAII HEGDLSPDSF GPEVDEICNE IDASCKGFGT NEKRLLKAMG SQSPETRCKV PLRYKELRGK ELKNVMKSEC GKRNFGTALQ FLAVSPIEAD CDMIKAACKG VGTNELLLFT ILCGRSNTEV ELLKKKYFEI HTADLGRVLD GELGGDLEKM IFNTLQAAEE IFDSDYHTEA LMKEDAAKLY EMGQGKRFGT NEAGLFKILC ARPPEYLKQM NLVYAEKYGF TLPKALETEL NGHVKDAALY MIGMKLKPYE TVANLIHRAC KGFGTNELLL TSALIRYQPI MKNVMEAYIE LYGETIEDRI KSECGGDYER ILLEVLGAAE
|
| |