Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49658 |
Symbol | |
ID | 7198302 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 331820 |
End bp | 333209 |
Gene Length | 1390 bp |
Protein Length | 339 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184346 |
Protein GI | 219128283 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACGAAACAC GCTAAAGCTT CACGCCGAAT AATTCTTCCA TGGTCAAGAA AATTACGCCG TTCTTTGTTT CTCCAGCTAT CCGAATTTTT ATTCTGGTTT CGTGTGCCAC TGCGTTTCAA CCGAAGGCTT TTCATGCCAT TGACATCAAC CCGACCAATC ACTTGTTTGG CAGGAGTAAC GGGACGCCGA ATCGCAACGA GTGCGCCATG TCGTTCAACT CCGCAGACCC CAAAAAGGAA TTGTCGGTAG GCTTTATCGG CTGCGGGACG ATTGCCAGTG CCATTGCGAC CGGCTTGGCC CTGCAAGACA AGGTATCGGT CACAAACATC GCCGTTACCA AACGATCGGA AGCCAAGTCT TCTGCCTTGC AAAAGTCCTT CGGTGACCTC GTTTCGATAC ACGAGGATGC GCAAGAGTTG GTGGATCAGT CGGATGTTGT ATTCGTAACT GTTCTACCTG AGCAGGCATC GCAAGTTCTG CAAGAAGTTA CCTTCGACAG CACTCGACAT TCGCTGGTTT CTCTGGTGTC CACCAGTACA CTGGACGACT TGATCAGTGA CTCTGGATTG CCCGCCGAAA ATGTATCCAA GATGATATGT AAGTCAACAT CCACAAAAAT CGCTTCGAAA TTGTCGAACC TGGTCGCCTG ACACGTAAAT GTCTATTTCG ACCGATTCCT ACCGTCAGGT CTCCCCGCCG TGGCTAAACT CAAGGGCGTT TCACTTGTAG TACCGAAACA AAACCACAAT CCCATTCTAC TACAAATGCT GGAAAGCTTA GGGGGCTACG TTGAGTGCGA AACACTACAC CAGATGAACG CAATGATGGT CCCCACCGGC ATGATGGGGA GCTTTTACGG TCTGTTACGG AACAATCGTG ACTGGCTTGT GCAGCAGGGT GTGCAGGCCA GCGATGCTTC CTACTTTGTT GCGAAACAGT ACATGAGCAT GATGGAAGAT GCCGTCGAAT CTTGCGTGGA TCCGTCACGT TTTGACGATT TGGTAGAAGA ACAGACCCCT GGAGGTTTGA ACGAACAGGG TCTGGCGAAT CTATCGCAAC AAGGAGTCTT TCAATCGTAC AATCAAATTA TGGATGCTCT TTTGTCTCGC TTAGAAGGTC GATCGGACGG ATCGTTAACT GAGAAGTAAA GCCTGTAACA CCGGCTGCTC CCTCATATTC GATTCGTATA CCGAGGGCAT TCATCCGGTG AGAATGCTTG AAATATGCAA ACGACCTACG CCTTGCTTGT TTATACTTTT GGGAGACACA TAAACACTGC AGCAGCTGAC TGTGAAATCT CGATTGCCTG ATCTTTGATT GTGAATGAGG TGCAAGGCAA TTGCTTTGAT AGTTAACAGA AAGGATTTAG CTTTTTCTAT GAGCCTAGCT
|
Protein sequence | MVKKITPFFV SPAIRIFILV SCATAFQPKA FHAIDINPTN HLFGRSNGTP NRNECAMSFN SADPKKELSV GFIGCGTIAS AIATGLALQD KVSVTNIAVT KRSEAKSSAL QKSFGDLVSI HEDAQELVDQ SDVVFVTVLP EQASQVLQEV TFDSTRHSLV SLVSTSTLDD LISDSGLPAE NVSKMICLPA VAKLKGVSLV VPKQNHNPIL LQMLESLGGY VECETLHQMN AMMVPTGMMG SFYGLLRNNR DWLVQQGVQA SDASYFVAKQ YMSMMEDAVE SCVDPSRFDD LVEEQTPGGL NEQGLANLSQ QGVFQSYNQI MDALLSRLEG RSDGSLTEK
|
| |