Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16944 |
Symbol | |
ID | 7199259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 99257 |
End bp | 100570 |
Gene Length | 1314 bp |
Protein Length | 438 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185426 |
Protein GI | 219130551 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 0.994947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAATTGCCA AAGGTACCCG GGACTACCTA CCGGAACAAA TGATGATTCG TCAGGAAGCC TTCAACATTA TTCGACGCGT TTTCGAATCG CACGGTGCCG TGGAGATTGA CACACCCGTA TTCGAACTCA AGGATACCTT GACGGGCAAG TACGGCGAAG ACTCCAAACT CATTTACGAT TTGGCCGATC AAGGTGGGGA GCTCTTGGCC CTGCGGTACG ATCTGACCGT ACCCTTTGCC CGGTTCTTGT CCCTTAACAG TGTCGGTAAC ATTAAACGTT TCCATATTGG TAAGGTGTAC CGTCGGGATC AACCCCAGTT GAATCGTGGG CGGTACCGGG AATTCTATCA GTGCGATTTC GATATAGCCG GAACGTACGG ACGCATGGTG CCGGACTCGG AATGTCTCTG TGTGGCCTGT GATATTCTCG ACGCCTTGCC CATTGGAGAC TTTGGGATCA AACTCAATCA CCGAAGACTG CTGGACGCTA TTCTCGATTT GTGTGGCGTA CCAGCCGACA AGTTTCGGAC CATTTGTTCC GCCGTGGACA AGCTTGATAA AGAAGCATGG TCCGAAGTCA AACGGGAAAT GGTGGAGGAC AAGGGTCTGC CAGAAAGCGT AGCGGACAAG ATTGGAACCT TTGTGTTAAA CAAGGGACCA CCTTGGGATA TGTACAAATC CTTGATGGAC GGAAACCGTT TTGGCAACCA CAAGGGTGCC AACGAAGCCA TGGAAGACTT ACGCATTCTG TTTGAATACC TCGAAGCCAT GGACAAACTC AAATTTATTT CCTTCGACCT GAGTCTAGCG CGCGGTCTCG ATTACTACAC TGGGGTCATT TACGAAGCCG TCTGTATGAG CGGTGAAGCG CAAGTCGGCA GTATTGGTGG AGGTGGGCGT TACGACAATT TGGTTTCCAT GTTTCAGGAA GCCGGCAAGC AGACACCGTG CGTTGGAGTA AGTGTAGGGA TCGAGCGCGT GTTTACCCTG ATGGAGGCTC GATTGCGCGA GCAGCAAGGG GGATCTATCA AGCGCGCGAA CGTCAATATT TTGATCGCGG CTGCTGGCGG AACCATGATG AAGGAAAAGA TGCGCATTGC ACGAATTTTG TGGGACAATA AACTCAGTGC AGAATTTAGT CAACAAGAGA ACGCGAAACT GAAAAAGGAA TTACAGAATG CTTTGGATCG TGACATACCC TTTATGGTAA TCGTGGGAGA AGAGGAGCTG GCGGAGAGCA AAGTTACCGT CAAGGATCTG AAGGCCAAGA CGGAGCACAA GGTGCCGATT GACGAGCTCG TTTCGACTTT GCGT
|
Protein sequence | KIAKGTRDYL PEQMMIRQEA FNIIRRVFES HGAVEIDTPV FELKDTLTGK YGEDSKLIYD LADQGGELLA LRYDLTVPFA RFLSLNSVGN IKRFHIGKVY RRDQPQLNRG RYREFYQCDF DIAGTYGRMV PDSECLCVAC DILDALPIGD FGIKLNHRRL LDAILDLCGV PADKFRTICS AVDKLDKEAW SEVKREMVED KGLPESVADK IGTFVLNKGP PWDMYKSLMD GNRFGNHKGA NEAMEDLRIL FEYLEAMDKL KFISFDLSLA RGLDYYTGVI YEAVCMSGEA QVGSIGGGGR YDNLVSMFQE AGKQTPCVGV SVGIERVFTL MEARLREQQG GSIKRANVNI LIAAAGGTMM KEKMRIARIL WDNKLSAEFS QQENAKLKKE LQNALDRDIP FMVIVGEEEL AESKVTVKDL KAKTEHKVPI DELVSTLR
|
| |