Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34924 |
Symbol | |
ID | 7200134 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 668946 |
End bp | 671546 |
Gene Length | 2601 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179265 |
Protein GI | 219116941 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0285995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAC TGATAGCTTC CGATACAGTG GCGGAAGGGG ACACTTTCGT GTGTCGACGA CGACGGCTTC GTCAGAGTTA CTCCTCCTCT GGCAGCGCAT CGGCCGCAAT AGGACCCATA GTAGCAATAC TGCTGCTTCT GTCTCATCAG GGAGAAGCCT TTTTGCACTC GCAAGAAAGC TGTCTTCGCC GGGTCTCCCC GCGCTACATG GTGAGCTCTT CCCAAAGAAT ACGAAGCAAC AAATCCGTCG TCTTGAGGAA TCGTCTACAG GCGACGTCAA TCGACTACGA GGATGGCAGC CAAAGCCGGC TTATACCAGA AGATGTGTCG GATATGCAAG CACTACGCCA GAAGATATGG GATCTCGAGC AACAAACCCA TATGCTAATA AAGGCGAACG ACTTGCAAGG TGCCTTGGAG GGAATACAAA GACTACTGGA AGTCATCCGC GTTTCTTCTG CCAATGCTGC GACGGCCAGT GATAATACGA TGATGCGCTC TATGAGTCAA TGCCTGGACC GAAGCGTTCA AACTTTCGCC GCCAGGACAT TTAATGTAAA AATGGAATCA CCATCCCGAG CACGAAAGCA TGTCATGATG GGCGTCGAGG CTTTACAACT TCAGCTTTCG TCGCAGTTCC TGTCGGAACC GTACAACTTG CTGCCCAAGA TGACCTTTTT GAATGCCTTA AAGGCACTCA CACAACTAAT TGAGGTGGGA CGAGGTGAAC AGCACGACCC ACTTCTTTCA AACATGTCCG CAGCTGCCTT TAGGATTTTG CAACGTCTGG TAACCGGTGT GGGCATTCGG AACAAATCTT CCCCTTTGGT GGTATACGAA AAGGATTTTT GCATGGTTCT CAACGCCTTC ACCGAGTCAG GAAGGATGGA CATGGCGCAT CGGATTATTG CTTTGCAAGA GCGGACCGAG CATGCGCCGC CACTATCGCC AGTGGCCTTT TCGATTCTAC TTAAAGGATA CGGTAGATTG AAGGATTTGC AGCAGGTAGA GATGGTCCTC CAACATTCCG AAAGAAGCAA AATTACTCCG GATACGGTCA TGTTCAATAG CCTGATTGAT GCGTACGTCA ACTGCAACGC TATCGACAAG GCCCGTGGCG TATTTGATCG AATGCAACGT CCACAGGATA TGCTCAAGGA CGCAATTGCC ACATCCTTTA CTTGTCCACC TCCAAACAAG AGAACTTACA ATACCATGCT CAAAGGCTAT GCTAACTTGG GTATGCTCGG TGCGGCATTA GAACTGTGTG AACAAATGCG GAGGCGGCGC ATGTGTGACG CTGTGACCAC CAACACTTTG GTCCACGCAG CGGTAGTAGC GGGTGACTTT GGTATGGCCG AACGCGTCTT GTCGGAACAG ACTGAACGAC AACCTAAAGA AGCAGGCTCA CAGCATCCAA ATGTGGAAGC TTATACAGAG CTACTGGACG CATATGCGAA GTCTGAGCAA CTAGATAAAG CAGTTTCAAT CCTTCCACTC ATGCAGTCCC GTGGAGTAGA AGCGAATGAG TATACTTACA CGTGCTTGAT TGCAGGCTTT GGACGGGCCA AGCGTATGGA AGAAGCAAAG AAAATGATGG CTTACATGAG AAAGATTGGA ATGCAACCTA GCGTCATCAC GTACAATGCA CTCATTTCAG CTGTGTTGGA GCTGGAAGCC TCTAACGATG ACTTGGATAG ATGGGTTGAT CTTGGGCTGA AAATATTACG CGAGATGATT CACGCACAAG TTCGTCCCAA TGCCGTGACG GTATCTGCGT TGGTGGAAGC TCTTGGTCGC TGTGACGAGC CTCGTGTCAA AGAAGCATGT ACGCTTGTGA GCAAGCTCGA GAAAGAGAGA ATCATTTCGA AAGGAACTCC CCGTGTGGTG ACTGCACTTG TTCAGACTTG CGGTGTGGGC GGAGATATCA AGGCATCTCT GGAGGCATTT AGAACGCTGA GAAAACCAGA CACAATTGCA GTGAATGCGT TTCTTGATGC ATGCTACCGT TGTTGTCAGG ATCGGTTAGC TTTGGAGACG TTCAAATACT ACTTTCACAA ACGAAACGGC CAAGCTAAAT TAAAGCCTGA TGTAGTTTCT TTCTCGACGC TGATATCTGC ACTCCTGAAA AAGAACACAA GCGACAGTCG GGGAAGCGCA CTGCATTTAT ACAATGAAAT GCAATTGAAG GCTTTGATAA AACCTGACAA TGCTCTCGTC GACATAGTCT TGAAAGCTTT GCTGAAAACG GCACAAACAA ACTGGCTTAC TGACAGTGAC GTTCGATTCG TTGCCAATGT CCTTCGAGAC GCCGAAAACT TAGGATGGGC GGATGGCCAG CTTTATCGTC GAAAGCGCGC TGTCCGGGCT GTGCTCGCCG ATCGGTTGCG GGAAACGTTT AATCAGGACG ACGATCTCTA CAGATTAGTT TCTCCGGATG TTGGAGTCGA TGAGTTATTT CAGAAGCACG GGTGGAATCA GGTGGACTCC GGATTTCGAT TATGGGGGAG AAACAATGAC GTGGCCGATG GAGAAGGAGT CGACAAGTTC CTTCAGTCCA AAGGCTGGAA TAACGTCGAT TCAGGATTCC GAATATTCTA A
|
Protein sequence | MTQLIASDTV AEGDTFVCRR RRLRQSYSSS GSASAAIGPI VAILLLLSHQ GEAFLHSQES CLRRVSPRYM VSSSQRIRSN KSVVLRNRLQ ATSIDYEDGS QSRLIPEDVS DMQALRQKIW DLEQQTHMLI KANDLQGALE GIQRLLEVIR VSSANAATAS DNTMMRSMSQ CLDRSVQTFA ARTFNVKMES PSRARKHVMM GVEALQLQLS SQFLSEPYNL LPKMTFLNAL KALTQLIEVG RGEQHDPLLS NMSAAAFRIL QRLVTGVGIR NKSSPLVVYE KDFCMVLNAF TESGRMDMAH RIIALQERTE HAPPLSPVAF SILLKGYGRL KDLQQDRLAL ETFKYYFHKR NGQAKLKPDV VSFSTLISAL LKKNTSDSRG SALHLYNEMQ LKALIKPDNA LVDIVLKALL KTAQTNWLTD SDVRFVANVL RDAENLGWAD GQLYRRKRAV RAVLADRLRE TFNQDDDLYR LVSPDVGVDE LFQKHGWNQV DSGFRLWGRN NDVADGEGVD KFLQSKGWNN VDSGFRIF
|
| |