Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50330 |
Symbol | |
ID | 7198988 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 328935 |
End bp | 330178 |
Gene Length | 1244 bp |
Protein Length | 234 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185174 |
Protein GI | 219130022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.162709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACCAG ATGACGCAGA CGGTGCGTGT GTCATCCGTC GCTACTTTCC TTTGTCTGTG CAACTATGGA AGCCACTCGT TGGTTCAAGT AACATGTAAA GGATAACTAC TCTAAGAAAT ACTGCGGGAG CTCTTTTCCC ACTTCTGGTT GGTAATCACC GATTGGAAAA GAGCATCTCG TAGGGCACTA ACAGGCCAAC GATTGTTCTA GGTATAACAT ACTTTTATTA ATTTGATTCC TTCAATCCGC AGGATGACAA CAATTATCAA GGCAATTTCG GCGCTTTCTT CTGTGCGAGG CTACTATCGT GTCGCCTCGT CCGCTTTTCC CATACGTATG GGTGCAACGG ACTCAGACAT GTTCCGTATT GACTTTGTCG ATGAGGACAA TACCTTGAGC GTGTCGCTGC AAGACTTTCA CCTCGCTTTT CTGACGTCGC CTTTGTTCCA ATTCGAGCTG TGGATTTTAT CGTTCGCGAC GGTGTCCGAC CCGGCCACTA CCACAACGGA ACATTTGGCA GCGGTAACAA GTGGTGACAA GAGCAGTTTC GGACCGTGGA CTGCGTGGGC AGTCGAGGGT AAACGCAGTG CCCCACTTAC AGCCTCTTCT GAACCGGCCT CAGCGTGCCA AATCATGCGG TGCTACATCC AAGGTTCGTT CTTGACTGTT TCTACGCGAC CTTTTTGATA ATTTGTATTG CTGTGTGTTC TAATGCAGTG TTTTACGCTT TTACATTAGG CAAAACCTTT TGTGATACGT GGTGGGCTGT TGAAAAAGTG GCTGACCGAC CTAATCCGGA GCTCGTTTTT GGATCGGCCT TTAAATTTTG CGACGATCAT AAGCCTCTAA TGTTTCGCGT GCTGGACCCT TTGCACCGCT TGTACAGTCG CCTATTGTTA GCGTCAGCCA TGATCAACCT GATCCAACAA AAGCAAAAGT GCGAGTGATT TACTGTTAGC GATAAATTTG TGGTCTCACC TTTCAGATAG ATCAGCAATA TCATACCGGA TGACCATCTT TTATTGATTT TGCATGCCAT TGGTCCCCAG TCGACGGCCG ATTGCCAGCT CGTGTCTCCT TGGAAGATTG ACGGATGTGA CCGGTTGTCT CGGATGGAAC ATACCGATAC AAAATTGTTT GATGTGACAT CCGTGACAGT GAATAATTTC CCTACGTTTG TGCAGCATGA GATTTGTAAC ATAGATTGTA AACAAAAGGA ATACGTAGCT CTTTCTTTGG AAGG
|
Protein sequence | MSPDDADGAC VIRRYFPLSV QLWKPLVGSS NMMTTIIKAI SALSSVRGYY RVASSAFPIR MGATDSDMFR IDFVDEDNTL SVSLQDFHLA FLTSPLFQFE LWILSFATVS DPATTTTEHL AAVTSGDKSS FGPWTAWAVE GKRSAPLTAS SEPASACQIM RCYIQGKTFC DTWWAVEKVA DRPNPELVFG SAFKFCDDHK PLMFRVLDPL HRLYSRLLLA SAMINLIQQK QKCE
|
| |