Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45690 |
Symbol | |
ID | 7200469 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 946578 |
End bp | 949899 |
Gene Length | 3322 bp |
Protein Length | 1015 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179943 |
Protein GI | 219118332 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.136558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCCG ATTTCGACGA GGATGATCTC ATCAATGACT ACATTGAGGA ATCTTACGAG CCGCCGACGG CCGAATACGA TGTAGACTTT TTCGAAGAAA TGATGGCCAG CGGTGGCGTT ACCAAAACGA CGACGGGTAG TAGGACCAAC GATGCGACGA CAATGGGCAA GCAGTCAGTA CTCGTGCCGG TCGAGAATAC GGGTGCGGTA GTCAATCCTG CGCCTGTAGA TCCGCACGTT AGTGTTCGTG ATGCATTCGA ACAACGCGCC GAAAAGCCTA CCGAGAATCT CTTCACCTTT GAGCGGTACG TGACGTCGCA TTACGAAACG AACAATCGGC ATACCGACGA ACGATGATTC CAAAACTCAC GAGCGATAAT GCCAAACGTA CTTTTTCAGG TACAACTACA ATATGGATTG GAGGGCTCCT CGCCAAGCCA ATTCGCCGAG CGGCAATACC ATGCAAGCCA AGGAATGGAA AAAGTCGGAA CCGGGAAGGA GACGGAATCG AGACTTGTTC GGAGCATACG ACGACGACGA CAATCCGGAA ATCACGCTAT CCGTGGGAAT ATCCAGACAA GTGAGCAATA GATTGCCGTC GGCACCGGAT GCACAATTGT TGGAGTTTGG CACCAAATCA GCCCGATCCC AAATGGGTCG GAAACGACCT TGCTACCGAT CGCACTCCAT GCCCACACGA CCACAGGTGG GACGGCACCA GCAAATACCC ATGACGCTGG GTGACGGGAC CCGTGTGCAT CTCAACGTGA AGGTTCCCGC TAGTGACGGC AAGCTAGATG GTGGTGAAGG TATCGGACAC AAAAACGATA CCCACAACTC TTTGGGAATT TCTGTTGCGG AACTCATGGA GCGGGTACAA GCAATTCGCC GTAGGCAAGA ACACGACAAA CAGCAGCACT GTAACACTGA TCATGACGAC CCAGTCCACG GAGATAGTCA CCGTCATTTG GGGACGGAGG ACCATCGTTT GTGGGTAGAT AAGCATGCCC CCACGTCCTT TGCGCATCTT CTTTCCGACG AACGTACCAA TCGCGAAGTA GTGCGAGCTC TGCGCGCCTG GGATCCGTAC GTCTTTCGGC GAGATCCCCC ACCGCGGCCC GATTTTGGGT ACTCCGCCAA ACCATCGGAT TTTCATTCGG ATCGCAAAAA CGAACATGGC AGTGGCAAGA GTAGGGATAG CAGTCGGCAA GATCGCCGCC CGGAAGAGTC GTGTAGAGTC ATTTTGTTAT CGGGACCGCC AGGTGTCGGC AAGACTACTC TGGCGCACAT TGTCGCCCGG CATGCTGGTT ATCGTCCATT GGAAGTCAAC GGATCGGACG AACGTTCGGC TTCTGCTTTA ACGGAACGAA TCGTTCGAGC CATGGAATCT ACAACTCTCC ACACTGCAAA GATGCGAAGA AGTGTACACA ACCACGAGTG CAAAGACGAT TCCCTACCAA AGCCCAACTG TGTCATTCTG GACGAGATTG ATGGTGCGGA TGCCAAAGGT TCCATACAGG CCATTGTGAA CATAATTCGA GCCGATATTC CAGCCAAGTC ACAGGCTTCC AAAGCACAAT ATTTGCGGCG CCCCTTAATT TTGATATGCA ACAACAAATA TGCGCCGACG CTGCGGGCTT TACTGCCTTA CGCAAAAGCC TTTCACGTCA ATCCGCCGTC GCCAGCTCGC TTGGTTGCTC GCCTCCGATC CGTGCTGACG GCGGAAAACC TAACAGCGGG AGGTGGCAGC TCGTTGCTAA ATCAATTAGT ATCGGTCGCA TCCGGTGACA TTCGCTCCTG TTTGCACACC TTGCAATTTG CGTCCTCGCG GTCCAAGGAG CTGGCCACCC ACGCGGAAGA AGCCCCGTCC GTTATCGATT TGTCCGACAG CTTGCGCGGT GCCATGTCGG GGGATGGCCT CAAGGATGAA CGAAACGATA TGGCTGGTAC AATCACGAGC GTATTTCGGA AGAGAAAGGA TCGAACCTTT CTTGATAGCA AGCGTGTCAT GCAAGACAAG CGCCCGAGCT CAACGCGCAT TTTCGAGGCT GTGCAGGTAT GTAGACTGTA GATTGGTGAT AGAATTCACA TTGTACATTT GTCGTTACTG ACACCATTTC GTGGGTGGCT ATAGAATTTT GGGGACAATC TTCGTATCCT GGATGTTTTG TTTCTCAATG TACTCCGCGT TTCGTACATC GACCCAACCT TGGACCGCTG TGCAGCAGCT CACGAATGGT TGTCGAGTTC GGATCTGTGT CCGCGGCAGG TTCCGTCCAC CGCCGGTGCG ATTCATTTAC TGTGTCGCGT CGAACAGCGC CCGGACTTAT CCTTTTCAAC ACGGGAGCTT ATGGACAGTC GCTACCAGTT TGAAGCGAAT CAGTCTCTGG CGCAAAAGTT TGCTGAAGGG CTTTCAATGC AGACACGAAG TCGGTCAACG AGCTTATTGG CGACGGAAAC CATCCCGTAC AGCTTATGGG TACTTTCCGC AGGGGAAGGT AGCAGTGGGG CCCTAGATCG GGCCGCTACG TCTCTGCAAA TTCTGAATAA GGCAGAACTT GGTTCCTTTC ACAGACACGT TATGTCTTTG CGATGTTTGG GCCTTAGTTA TGTCGCGGAG CAGGAGGAAG CCGCACCGGG CGAGTTCAAA GGGATCACTG GCAGCGTTCT CCGTCTCGAG CCACCAATCG ATCGCCTCGC ACACTTTATG GACCTGACGC GAGCCAAGAG TCAAAAACGA ATTGAGATTC CCATAGCGGT ACGTTGTGTC GTGGGTGCCG GAACCGTTTG CGGCGGTGTC TCGTGTATTG GTTCACTCAC TCATCCTTTT GCTCGCCCTC GCTGTCTTGA CTACTTATAG ATGAAAGAGT TGCTGGCACA AAGTGTGCTT CACGAAAATA TGCGCCATCT CGGAGCCCAA GCGCAGTCCA AGGCGATCTC CACGAAAGTT CGGTCCAAGC TGCCGGCGCC CTCCGTGGTG GCCGCCCCGA TGGAAGCAGC GCCTGAATTG TCCGCGATGA ACACTTCCTC TCCCGACAAA CGGGAAGCGG CTAGTAGTTC CGACGCACCA CTCGCCAAGC GCCGCAAAAC ACCATCACCA ACCAAAGCTA CGGCCCACAA TTTTTTGGGA CTACAGGCTC GCAAAGTCAA GCAGCAGCGG TCAGCCCGTA CGGCGGCCCG CGTGGGAGTC GAGCGTTCCC ACAAACATCA AACGTCTCAT ACGGGCAGTG GTGTTCCGTT GACCCAAATT GTCCGCCTCC GGTACATCAA GGGTTTTACA CAGGCCGTTC GGGCACCGTG TCGATTGGAA GACTTGGCGT AA
|
Protein sequence | MEPDFDEDDL INDYIEESYE PPTAEYDVDF FEEMMASGGV TKTTTGSRTN DATTMGKQSV LVPVENTGAV VNPAPVDPHV SVRDAFEQRA EKPTENLFTF ERYNYNMDWR APRQANSPSG NTMQAKEWKK SEPGRRRNRD LFGAYDDDDN PEITLSVGIS RQVSNRLPSA PDAQLLEFGT KSARSQMGRK RPCYRSHSMP TRPQVGRHQQ IPMTLGDGTR VHLNVKVPAS DGKLDGGEGI GHKNDTHNSL GISVAELMER VQAIRRRQEH DKQQHCNTDH DDPVHGDSHR HLGTEDHRLW VDKHAPTSFA HLLSDERTNR EVVRALRAWD PYVFRRDPPP RPDFGYSAKP SDFHSDRKNE HGSGKSRDSS RQDRRPEESC RVILLSGPPG VGKTTLAHIV ARHAGYRPLE VNGSDERSAS ALTERIVRAM ESTTLHTAKM RRSVHNHECK DDSLPKPNCV ILDEIDGADA KGSIQAIVNI IRADIPAKSQ ASKAQYLRRP LILICNNKYA PTLRALLPYA KAFHVNPPSP ARLVARLRSV LTAENLTAGG GSSLLNQLVS VASGDIRSCL HTLQFASSRS KELATHAEEA PSVIDLSDSL RGAMSGDGLK DERNDMAGTI TSVFRKRKDR TFLDSKRVMQ DKRPSSTRIF EAVQNFGDNL RILDVLFLNV LRVSYIDPTL DRCAAAHEWL SSSDLCPRQV PSTAGAIHLL CRVEQRPDLS FSTRELMDSR YQFEANQSLA QKFAEGLSMQ TRSRSTSLLA TETIPYSLWV LSAGEGSSGA LDRAATSLQI LNKAELGSFH RHVMSLRCLG LSYVAEQEEA APGEFKGITG SVLRLEPPID RLAHFMDLTR AKSQKRIEIP IAMKELLAQS VLHENMRHLG AQAQSKAIST KVRSKLPAPS VVAAPMEAAP ELSAMNTSSP DKREAASSSD APLAKRRKTP SPTKATAHNF LGLQARKVKQ QRSARTAARV GVERSHKHQT SHTGSGVPLT QIVRLRYIKG FTQAVRAPCR LEDLA
|
| |