Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46470 |
Symbol | |
ID | 7201564 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 430815 |
End bp | 432840 |
Gene Length | 2026 bp |
Protein Length | 516 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181008 |
Protein GI | 219120544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGAGTACA ATACAACAAG ACACTCGGGT CAACCAATCT AGTAGAGCGT GGCACGGCAC TGTTGTACAG ACTGTTTCAA TTGCACTACT TCCATTGACT GTCTGTCTCC TGTATCTATC TACATCTGGA TCAAGCTTGG TGGGAAACAC AGTCGAACTG AACGGATAGA TAATATTTTA TTACTGACGC GAACAGAGAA CTCACTCACG TCCCGTCCCT GCTGCTGCCG CTACTACTGT ATCCCTCCTT TTTCCACATC ATGGTGAAAA CCAAGACCCG TGTGGGTGTG TTGATGCGTG AATCGCTCAA GGGCAAGACG GCGAGCAGTA ATGCTTTCCC CAACCTCGTG ACCCTCTCGG AGCGCTACGA TACTTGGTGC AAGCAGGTCC GGGGGCTCAT TGTCGCACTG CAGCAACACC ACGCCGTCAT GGGACAGATC GAAAAGACCC GCGCCAATGT ACGTACCACC GGAACTACGA CGGAACGAAC TCGGTGTAAC TCGTTCACAC GGTGAACACA AAAAACGAAT GAACCAGAAA AGGACTCACA CATGTTGGTC TTTGTTCTTA CCGCAGCTCT CCAAACACTT TGCGGCCTTG TCCGTCAAGA CGCCGATTCA CGAAGCGACA GGCATGTTGC CTTCCGCGGA TCGTCCTTCC AGTACGGTCA ATTCGTACGC TTCCATTCAC GACACGCTGT CCGCAAAGAC ACAGTCTTAC GTCGCCAAGT ACCAACAGTT CGTGATTGAT TACGCCGTGG AATGGGAAAA AGTCGTCGTC ACGCGGGTTG GCAACGGCTT GAAGGTCGTG CAAGATTTGC GTCGTGATTT GGATCACTAC CAAAAGAAGG TTGAAGCCAT GCGCTTGAGC GTTAACCAGG CCATGAGCAA GGGAAAAAAC GTCAAAGCGG ACACCGCGGA AAGACTCAAG CGCAATGAAG AAAAACTCAT CTCGGCCAAG CAAACTTTCA ACAAGTCTGC AACCGATTTG TGCATCCTTA TGGAAGAAGT TACGGAGCGT TCGTGGCGAG ACTTGCATCC ACTTTTGCTC AAGTGCGCCC AGTTCGATAT GACCCTGGCT TCGGACGAGT CCAGCATTTT GTCTGGATTG AATGCGGTCG TGTCGGCTCT GAAAGAGGTC GCGACCGCCC AAGGATTGTC ACCCCAACCC CGTCTCAAGG ACCTGGCCGG AGTCAAGCCA GAACTCTTGT CGACCCGACC CGGAGGTGTC AGTGGATTGA TGATTGAAGC GGGCGCTGCC ATGGGAGGCG AACTGGGTTC GGGTACGGCA TTCGGTGGAT ACGATAGTAT GGCGCAGCCT CCAGGATCCG TGGCTCTGCA GGGGATGGGT GGCTACCCCG TGTCCATAAC GGATTCCATG TCCCAACCAG GATTTCCTAG TGATCACTCT GCCCCAATGC GCTCCGATTC CATTGGAAGT TTTGCTTCGG CCCCGCCCAT GTCCAATCCC TCCCTTTACA ACGGCAGCAA CGGCTACAAC AACAGCGGCA GTGTCGATCC ACTCTCGACC TTGGGTATGT TGACTCTTTC CGGCTCCGCC GCCCCACCTC CGACGATGGA AGACGTCTAC GCCGCGTCCC GCTCGGCACC CAGCAGCGGA AACCTACCAC CCCTCGGACC GTCCTACAGT AATTACAGTG TCCGTAGTGC CAGCTACAAC GACATGGATT CCATGTCCAA TTACAGTGCT CCGGCGCCAA TGGGATCGCC TCCACCTCCA CCCAACATGC CACCTCCTCC ACCGCCGGGA CCCGCGTACG ACGCCTACCC ACCGCAGCAA GCGGCTTGGT CGGCCGAGCC TCCGCCGGCC TATGCCTACG CCCCGGCTCC CGTGGCGTAC CAACAGTACC CGCCACCGCT GCCCAACTCG TACGGACCGC CTCCTACCGC GGCCAATCCG TTCGGTTAAA CAGGAACAAG CACTGCAATA CTACGGCGTC CTGCCAATCT CACGCGCCGG TCCAGGAATT TTTACTCTCT CAATTTGTAA AGATAGACGG AATTTT
|
Protein sequence | MVKTKTRVGV LMRESLKGKT ASSNAFPNLV TLSERYDTWC KQVRGLIVAL QQHHAVMGQI EKTRANLSKH FAALSVKTPI HEATGMLPSA DRPSSTVNSY ASIHDTLSAK TQSYVAKYQQ FVIDYAVEWE KVVVTRVGNG LKVVQDLRRD LDHYQKKVEA MRLSVNQAMS KGKNVKADTA ERLKRNEEKL ISAKQTFNKS ATDLCILMEE VTERSWRDLH PLLLKCAQFD MTLASDESSI LSGLNAVVSA LKEVATAQGL SPQPRLKDLA GVKPELLSTR PGGVSGLMIE AGAAMGGELG SGTAFGGYDS MAQPPGSVAL QGMGGYPVSI TDSMSQPGFP SDHSAPMRSD SIGSFASAPP MSNPSLYNGS NGYNNSGSVD PLSTLGMLTL SGSAAPPPTM EDVYAASRSA PSSGNLPPLG PSYSNYSVRS ASYNDMDSMS NYSAPAPMGS PPPPPNMPPP PPPGPAYDAY PPQQAAWSAE PPPAYAYAPA PVAYQQYPPP LPNSYGPPPT AANPFG
|
| |