Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31068 |
Symbol | |
ID | 7199055 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 252089 |
End bp | 253900 |
Gene Length | 1812 bp |
Protein Length | 353 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185242 |
Protein GI | 219130165 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTACTGTT GGTGTTGATG CTTGGACTGT GCATAAGACC GTCGCGACAA CTCGTTCTTC GTATAACAAG GATTGAAAGA CTAGTAGTGG GCTAGTGGGC GTGTGGCGTG TTTTTCCTGG AAAGCCAGCG TGTGTCGGTT TCGTCAGTCA GTCAGCACCC AAGTTAGATT CCTGAAGCCA GCAACCACGA TGGAATCCGC ATCGACGACT ACCGCTCCCC CGTTCGGAGA GACCCCCAGT GGTCAATTGC CACCGATCGT GCAACGCTTA GAACGTGGCG AAAAGACACC TATATGTGTC ATTATGGTGG GAATGGCGGG GTCGGGAAAG ACAACTCTCT TGACGCAATT GCAACGATCT CTGGAAACGC CGTCCGTACC GCCCACGCCT GATGATTTCG GTAAGGATGT ACACGGCGAC ACCAGCCGCG CACCCGACAG TGACGTCGAA CCACACTCCG ACTTAAAAGT CGCCGCTCAT GTACCAGTCG CGGCCGATAC CGCCGCTGAC GCCAAAATGG CATCGTACGT TGTGAATCTG GATCCAGCTA CGCTATCGGT ACCCTACGAA GTGTCCATCG ATATTCGCGA TACGGTCGAC TACAAGCAAG TCATGCAACA GCACAAACTG GGACCCAATG GAGCAATCAT GACTTCGCTG AATCTCTTTG CCACCAAATT TGATCAAGTC ATGACGTTGT TGGAAAAGAG GGCGTACGAA GACGCTTCCG AACACGACCA GGAACAAGAT GGTACCACGA GTACGCCGCC CGAACCGCTG CCACCACAGT CACAAATAGG GATGGACTAT ATACTGGTGG ATACACCCGG GCAAATTGAA GCGTTCACCT GGTCCGCGTC GGGAGCCATA ATGAGCGAAG CGCTAGCCTC CGCCTTTCCG ACCGTGCTCT GTTTCGTGGT CGACACGGTT CGCTGCGCGT CGTCCCCCAA TACCTTCATG AGTAACATGC TGTACGCCTG CAGTATGATG TACCGTACCA GACTGCCCCT GATCGTCTGC TTCAACAAGA CGGACGTGGT GTCGCACGAG TTTTGCCTGG AATGGATGCG GGATCATGAT GCCTTTCAAG AAGCTCTAGA CGACGTCTCC GAGTCGGCCG GGTTTTACGG ATCTCTGACC AGAAGTTTGG CCCTGGTATT GGACGAGTTC TATTCCAGTT TCGCCAACGC CGTTGGGGTG TCGGCAGTGA CTGGAGACGG AATGGATGAC TTTTGGAAGA CGGTGGAAAA GGCGGGACGT CAAGACTTTG TGTTGGACTA CATAGAAGAT TTGAAGAATC GGATAGAGGA GCAGCAAGCT CGCACTCAAG CCATGGCTCG AGTGAGTTTA TCGAGGTTAC AGCGAGACGT GGATGCGGCG GACTAACTGT AAAGGATGCT CGGAACGTGG ATATCATTGA ACCCTGGTAA TCCCTTTTTG ATTTCGTGTC CACGTAGTCA TCAATACGAT ATTCTCTTGT GTTCGGGACT TATACAAGCT GCCCTGCCCT CCCTGTCCCA TAGCTGACAC TGAAAATGAG CAAACAAACG GATTTCACTG CTACAGCACC CATCCTCTGC AACCCCTTCT ATTGGAGCAT AAGTGGCACA GCTGGTGAAA TTATCTAATA ATAGTGTCCT TTTGATGTCG TATGGACGCC AACTATATTT GGACGACAAC CACATGGACA TTTTGAAGCT CGTCTGGCAG CGTTTTCGGC CACACCTAGC TTGAAAAGTT GACTTCAAAA GAGAAACAAA GCCAGGGCGG CAACCTTTGG TAGCTAGTGC CAAGAGTATC CTTTCTTCCT GC
|
Protein sequence | MESASTTTAP PFGETPSGQL PPIVQRLERG EKTPICVIMV GMAGSGKTTL LTQLQRSLET PSVPPTPDDF VAADTAADAK MASYVVNLDP ATLSVPYEVS IDIRDTVDYK QVMQQHKLGP NGAIMTSLNL FATKFDQVMT LLEKRATPPE PLPPQSQIGM DYILVDTPGQ IEAFTWSASG AIMSEALASA FPTVLCFVVD TVRCASSPNT FMSNMLYACS MMYRTRLPLI VCFNKTDVVS HEFCLEWMRD HDAFQEALDD VSESAGFYGS LTRSLALVLD EFYSSFANAV GVSAVTGDGM DDFWKTVEKA GRQDFVLDYI EDLKNRIEEQ QARTQAMARV SLSRLQRDVD AAD
|
| |