Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43068 |
Symbol | |
ID | 7196857 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1951702 |
End bp | 1952912 |
Gene Length | 1211 bp |
Protein Length | 349 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176879 |
Protein GI | 219110255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.150717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTTGTAGA GCTTCGGATG GCCTCGTAGA AAGAAGAAAG CGTCAGACGG ACCGCTCGAT GACGGGCCAT CGGTCAGCAT CTGCTCCAGA ACAAGAACAC ACACATTCAC TTTCAGTCCC AATCTCGCTC GGTAGACCTT CTTTACGCCT CAGACTTGAA GCATATTATG CATTGATTTC GCCCGACCAA ATATCTAACA ACGCTTCTTG GAAAGAACGC TTTAACCAAA TTTGGAGAAA ATTCGGAGGA TCTGTGGAAG GCGAACGGAA TCTTGCTACT AAGTTGTCCA AAAAGTATGG CACCGCTATC CGATTACAAA CCGTAGACTT GACACGAACA ACTGCCGCGG ATACGGGGAC GTCGAAGTCG GCGTGGAAAA GGAAGGAAGC CTATTACGAT CCAAAGCAAT CGCAACAAAA GAGTGGAGTT TTGGACTTTG GCTCGCAGAG ATTTGATCCC AATGCAACCC TAGCAGCGCC TTTATCACAT GTTGAGCGCG ATAATGTGTT TGTCGTGTCT TGTCCTTTGC TTGACCGTGT CGAACTTTGT AAAGGGCTCC TTCCTTGCTC AGATCCTCTA TATCAACGAT CAAAAAAGAG AGAACGATCG GGCATCTCTG TGCAAGTAAA AAATGATGAA TCGACGAAGA AGGTTAAACC TCCCTCCTGT TTCGCAGAGA TAGCCCAGGA TCTCCAAGAA GGTCCATTTG CACTGATGTA TAACACCTTT GTCCACAAGC AGCGCGTTCG CGTTGTTCTT CGGTACGTGA ACGGTATACG AGGCATAGTT ACGGGCTATT TGGTGGCGTT TGACAAGCAT TTCAACCTGA TCCTTCGAGA TGTCGAAGAA GTCTATTCGA AGCGAGCTGA ACGAGGATTT GAGCAGTCCA ATGCTGAAAT GGAGTTGCGT CGTCGACGAA CAAATCTTTA CCGAGCAACG GATCATCTCG ATTGGTGTAG TAGGCGTCGC TGTATGCGTC AAATTATGGT TCGGGGGGAT AACGTCGTCA TTGTTTACCG TGCAAAAGAT GAGCGCTCTG CCTTGCACGA AAATTCAAGG TGCCCGCAAG AAAGTCGTTA TAGAAGAAAG AGCATAAAAA GAGTGCCGAC AGAAGAACGC ATTGGGACAC CCGGGTCATT GATATATTGT GTACAACGAC AGCCTTTTCC AAGGTGACCA ACACGTCAGT GCGAATGATG CGCCTCAGTA A
|
Protein sequence | MTGHRSASAP EQEHTHSLSV PISLGRPSLR LRLEAYYALI SPDQISNNAS WKERFNQIWR KFGGSVEGER NLATKLSKKY GTAIRLQTVD LTRTTAADTG TSKSAWKRKE AYYDPKQSQQ KSGVLDFGSQ RFDPNATLAA PLSHVERDNV FVVSCPLLDR VELCKGLLPC SDPLYQRSKK RERSGISVQV KNDESTKKVK PPSCFAEIAQ DLQEGPFALM YNTFVHKQRV RVVLRYVNGI RGIVTGYLVA FDKHFNLILR DVEEVYSKRA ERGFEQSNAE MELRRRRTNL YRATDHLDWC SRRRCMRQIM VRGDNVVIVY RAKDERSALH ENSSLFQGDQ HVSANDAPQ
|
| |