Gene Information |
Locus tag | PHATRDRAFT_43034 |
Symbol | dsCYC4 |
ID | 7196837 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1856612 |
End bp | 1858383 |
Gene Length | 1772 bp |
Protein Length | 493 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177388 |
Protein GI | 219111273 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGGC CCTGGGAGAT TAATTTTGGC GAAGACCATA AGTAAATGGG GAGTGTACAC TCTGTGTAGT TCGTTCTTGG CCGCTTCCGT CGATGAAATT TCTGGACGCG CCAAGTTAGG CAAATGTGAG TGAGTCTTGA GAATTGGTGA AGTCTTTTCG CGAGACACGC CAATCATCTA TACCTATTTA CAACGTAGTT TAATATTAGC ATAGAGATTT TTACTTCTGC AGGCCAGGTA AATCCTGCCT GTCTCCCCAG CAAATAAATC GCAGCACCAA AAGAACTCCC ATATCTTTTC TGCCGTGAGT TTTTTGCGTA GAGCAGTGGA GTTTGCGCCA GGCGCACCAT CAAATTTTGT GAGAGCACGG TCACAGACCA ACTCTCCGCT ACCAGAGAGA CTTTTCTTCT CACACAGCCA GCATCCAATT CCGCTCACAA CTCGGCCGCC TGGGGAGTTA TGGCAGATTG TTCCGTACGG CATACAAGTC TCCATCTTTT GCCGTATACG GGGAAATACG ACAACAGGGT CGACCATGAT GCTCAAGCTT ACAATGAACC CGCGTAGAAG ATCCAAGAGG CATCGATCGA TCCCCCACCG ACTGCGTTTG CCCACCAACC CCACAGCAAT GGAGACCGGT GATGATGAGG AAGATCGGGA AAAGGACATG CTGGCAGAAA ATATCGGTGT GATAATTAAA CAAGAGGGTG CAGCCAAATA CAGCTGCGTT GACTATTTGA GTCTAACTGT TTGGCAACAA AGTGTGTATC AGTTGATGAA GAAATCCAAG GCGGATCCCA TAACCCACCA TGCTAACACA ATGATCGATG AATACTGCCG CGAGCAAATT GTTGAGTGGT CATTTCGGGT CGTTGATTAC TTTCGCATAG ACCGTGAAGT GGTAGCGCTT TCAATGTCCT TTTTGGATCG ATTTCTGGCG ACTTGCCGAT GCGATCGGAC CAGTTTCAAA TTGGCGGCAA CAACAACATT GCACTTGGCT GTTAAACTCT TGTATCCATG CAAACTAGCC GATTTGGGTA TCTTGAGTGA TCTCAGCCGA GGCGAGTTCG ACATGCACGA CGTGACCGAA ATGGAAAGCC ATATCCTTCA CGCACTTGAG TGGAACCTGC ATCCGCCTAC ATCCGCGGCA TTCACCTCAC TTTTCCTGGA CTACTTTTTC GCCACCCGCG CAGTTCATGT GTCGAACGCT GATCTTGACG ATATCTACGA CGTATCGTCC TTTTTTTGTG AGTTGGCTAT TTGTGATTAC TTCTTTGTTC CTACCCGAGC GAGCGCGATT TCTCTTTCCG CTATTCTGAA CTCCCTGGAA GGTATGTACG GTCCCGACAA TCGCCTATCT CATGCCATAT TGGAAGCAGC TCTCGAGCTG CAGGTTTGCG GCAGCGGCCT TATCGACTTA TCCGCCGCTC GCAACCGTCT ATGGGAGCTA TACGAGCGTA GCGAAGAATG TGCATTGCAC AACGACAAGC CTGCGCAGGA AGATATTCGG CAACATGGAA GCTGTACATA TATCAAGAAG CTCTCGGCAA CGGCGTCGCC AGTATCAGTC TCGAAACCGT GCCATTCGTC CACCGACTTT TCACGTACGA GTCATAGCTC GGCCCTACGT AACGAAAGCT GGTGAATCTA CGTACAGTGC GTGTCTAAAA CGTACGCTTA GCCCAGAGCT TTGCTACTAC CGCTTGAATA TATTTTCCCG GAATGGGGAC AACTCCGTCC GCATAACCTA ATGTATTCGA TGCTATTTCT TC
Protein sequence | MTGPWEINFG EDHNSFLAAS VDEISGRAKL GKSNKSQHQK NSHIFSAVSF LRRAVEFAPG APSNFVRARS QTNSPLPERL FFSHSQHPIP LTTRPPGELW QIVPYGIQVS IFCRIRGNTT TGSTMMLKLT MNPRRRSKRH RSIPHRLRLP TNPTAMETGD DEEDREKDML AENIGVIIKQ EGAAKYSCVD YLSLTVWQQS VYQLMKKSKA DPITHHANTM IDEYCREQIV EWSFRVVDYF RIDREVVALS MSFLDRFLAT CRCDRTSFKL AATTTLHLAV KLLYPCKLAD LGILSDLSRG EFDMHDVTEM ESHILHALEW NLHPPTSAAF TSLFLDYFFA TRAVHVSNAD LDDIYDVSSF FCELAICDYF FVPTRASAIS LSAILNSLEG MYGPDNRLSH AILEAALELQ VCGSGLIDLS AARNRLWELY ERSEECALHN DKPAQEDIRQ HGSCTYIKKL SATASPVSVS KPCHSSTDFS RTSHSSALRN ESW
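The coordinate and sequence fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes 1-based inclusive coordinates, which fits the record because 1858383 - 1856612 + 1 = 1772 bp. It also assumes the spaces in the sequence fields are display spacing only (blocks of 10 characters).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the display spacing between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        return 0.0
    gc_count = cleaned.count("G") + cleaned.count("C")
    return gc_count / len(cleaned)


if __name__ == "__main__":
    # Start bp and End bp as listed in the record above.
    print(gene_length(1856612, 1858383))  # 1772
```

To check the listed GC content, paste the full "Gene sequence" value into `gc_fraction` and compare the rounded percentage with the record.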