Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41130 |
Symbol | |
ID | 7199089 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 38555 |
End bp | 39562 |
Gene Length | 1008 bp |
Protein Length | 304 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185108 |
Protein GI | 219129885 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.156284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACTA CCGAGAATGA TGGACCAAAA GCAGCTCGCG ACTCTCCTCG AGGGCGAACA ACAATAAAAC TTGTTATTGG TCTTTTGTTA CTTGGCTTTG TTGCCTTTGT CATCCTTGAT TCACTAACAA ACAGATACGT GCGTGACGGA ATCGATTCAT TCCTCGACTG GATTGAGGAG AATTCCGTTG AAGGAATCTT CCTTTTTGTG CTCGGTGCGT ATGTATTGCG CGCTATTAGC AAGCTGCCTC TTGAAATATC ACACATGGTC CAGGAAATGA TCTCACTATG TCTTTTTGAC TTTTTAGTTT ACTTCGCTGC AACAATTTTG TTTATTCCCG GCTCAATTTT AACGCTTGGT GCTGGTTTCG TTTTCGCATC CTCCTTTGGC CTTGGACTGG GACTTGTTAT TGGAGTGTTT GCTGTTTTTC TTGGAGCAAG CTTGGGCGCC ACTGCATCTT TCTTTATTGG ACGTTACTTG CTCCGGGACC AAGCAACAAA GCTGACGAAG AAGTACGCAG TTTTCGAGGC CTTGGACGTC GCTCTTCAAG AAAACGGCTT GAAGATTTTG GTCCTGCTTA GATTGTCTCC AATTGTTCCT TTCAATGCTA TCAACTACAT ATGCGGTGTA ACGGCTGTAT CAATTCGCGA TTACATCCTG GCACTGTTTG CTATTTTACC TGGTACAACC TTGTACGTGT TTCTCGGCGC TTCAGCTGGA AGTCTTAGTG ATAGTGCTTC AAGTGGCGAC GACTCTACTG TTACAATCAC TGTAGTAGTC CTCGGAATTG TCTTGGGTTT TATTGCCATT TGGATTTCAG CACGTTACGC GAGAAAGGAA CTGAACAGGG TTCTTGAACA AAGACGTGCA GAATCTGAGC AGTCCGAGGA AACAGCTGAG AGTAATATTG AACAAGGTGT GGTGAATCAT CCTCGAGATA GAGATTTGGA CTGTTCTGAG TGTGAACAAG TTGGGGATGT CGGACCAGTG GAACTGTCGA TCAAATGA
|
Protein sequence | MSTTENDGPK AARDSPRGRT TIKLVIGLLL LGFVAFVILD SLTNRYVRDG IDSFLDWIEE NSVEGIFLFV LVYFAATILF IPGSILTLGA GFVFASSFGL GLGLVIGVFA VFLGASLGAT ASFFIGRYLL RDQATKLTKK YAVFEALDVA LQENGLKILV LLRLSPIVPF NAINYICGVT AVSIRDYILA LFAILPGTTL YVFLGASAGS LSDSASSGDD STVTITVVVL GIVLGFIAIW ISARYARKEL NRVLEQRRAE SEQSEETAES NIEQGVVNHP RDRDLDCSEC EQVGDVGPVE LSIK
|
| |