Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41214 |
Symbol | |
ID | 7199048 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 214558 |
End bp | 217391 |
Gene Length | 2834 bp |
Protein Length | 914 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185151 |
Protein GI | 219129974 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACCA CACCAGTCCC CCTGCAGTTG ACGTCCTTGG TGAACGTGTC GGCATCCCAA CAACAAGAAC GATGCGACGG GGAACCAGCG TACCATCCCG AGGATTCCTA CCAAACCCTC CAACGACTCT GTACACGACG CCCGTCGACT ACCAACGACA CCACCATTGC ATCCTCCAGT GACTACGTGC ACCACGGGAG TATCGAAACC CTCTGGCAAC GTCGACGGGA CGGTACCGGA TGGCGTGTCG TGGTACAGAG TGGACTCGAC GACGGTCGCG TCCTCGACCA ACAACACACC AACGCCCGCG CACGTAAGAC CACCAGCACC ACACACGCGT CCTTTGTCAT TCCACAAGAT GCCGCCGCTG TTTCTACATA CGGAGGAGAT TGGCAACCCA ATCCAGCACT TGTATGGACC TCCTTTGCGA CCGATACTAC GCGTGATCCA GCGCCACACA CCGTCTTGTG TGTACTCGTA CATCCAACAA CCCTCGTGCT CTACGACGTC TACCCCGATC AATCCGCTAC CACCATACTG ACGGGAGGAG AAGGATACAC CATATCCTTG CCGTTTGAAT GTGCCGGTCT GTACAGTCTC GATCAAGTGG GGGGACTCTT TCTGCAGCGG ATGCCCGACG CCGAAGATAC TCAAATCGAT GGACCGCACC ACCACGATGA ATATGACCAT AACGACGACA CACACGAACA AACCCTACCA CACACCGTCG ACGATGGTTT TATCCTCCAG GCTCCACCCC GTCGTCACCG GAATCCACCC CGGTTCCCCA ACGACGAACT GGACGATAGT GTGTCACCGC GTATTTCTCG ACAATCCCCC CGTCCCGCAC CGCACACTTC CCCCCACACA CCGTCCTGGC TGCCTCACCT GCCACCCGTC GCCAGTTTGT TTACCCTGCA ACATCCTCTC GCGGACGTGT TGCCCGTCCG GCAAGTATCG ACGAGCAGTG TGTTGCACAA CAATCTGGTC ACGGATGTGC ACGAAACGGT CTTGTGGGTG GGGACGGCCA AGTGGAGTGA CGACAACAAC ATTACTCGGA ATTCATCGTC ACCGCAGGCA CAGAAACAAG TGCTCATGGT CACCTTCCAT CAAGTGCACC AACGACACGC CGTATGGGTC GTGCGCGAGG CGCCGCCGGC GCCTTCCCAA ACCATCCCTC TCCACCAAAT GTCCCACGCA TGGCACCAAA CGAACCATAC CGCACAGGAG TGGGAAGTAG GAGGTGACGA CGTCGACCTC TGGCTTGAGA GTGTGGAAAC TGCGGCAGGA ACCGGTCTTC TGGTCGGAGG CAATGCCGTG CATTTGCCGC CCACCGTATC GCGTGATCAG GCTCTGGCCG AAGCACTCGG CGTCCGGCGC ACGCCCCGTC AAGCCATCGA CGCCGCGGTC AATACTTCCG CGGCCCGCTC CCGACCGCGT ACCAACGACC CTCGATTAGC GCCGAACTCG TCTTTTCTGT CCCCTCATAA TCGTTCCACG CTATCCGTCA CCCAGGAAAC GTCAATCCTC CACGAATCCA ATGGTCGATC GCCGGGGGGC AATGCAGTGG CCAGTAGTCC CTTTGCCTCC ATGGAGCCCA AGATTGCCGT TGAGTGCGTC TATCGGGAAG AGCCCAGCGC GGTGAGAGTT CCCGCCACCA GTGTCTTTTT GGCATCTAAC GTGTCCGGCT CGGGCACGCT GACGCTATGT CTCCTTGCTG CCTCTAGCAC AGACACGCAA GAATTGAGCT TGTTGGCGTT GCAGCCCAAC GCCCAAGATG GTTTTCAAGT CACTTTGATA GAGCGGCGTG CCTGTTCCGG GGCCCAGCCA ATTCAGTCCA CCCCGGTCCC GGCCTGTTTT GCACCACCCA TGGTCAATCG CCGGACGGAA ATGGCCACGG ATATATTGGT GGCGGATCCA GACGGAGCCT TGGTACTGTA CCGCGCTAAT TTACCGATTG CGTCCGTCAC CGCTCCCTAC CGTGGCCCAG TGGTCAATGT ACAGGATGGA TTGGGCGATC GCGTGTCGAT AGTGTTCGGG GACGGGGGCG GGCAACAGAT CGTACGCGCC ACTCTTTCGT TGGCGTTGGA ATCTAGTCCT TTGACCCGGG ATGCCTTGGC GGTTTGGGAC GCGGCCCTTT TTCCACGACC GGAAAACCAA GCGTTCGCGT TCGCGTTGCG GGCAGATACG GTCCGCCTGG CACAGGCGTT GGCAACGGAA GCGACGGGCC GTGTGCGATG CGACATGGCG TGGACGGCGT TTGCTGCCGT TTTCGAACAC ATCGTGGAAC TAGCCTTGTG GGGGACAAGG TCTGAAACAA ATCCGTCCGC CAAGGTACAG TCTACGACTT CGTGGGAAGC TTTGGTGGGT TCCGACTACC ATCGACAGTA CAAGGATCAG GATCAAGGTT TGCTTTTCAA AACAACAACG CCATCTATAA CACTTTCACG AGATGACGCG CCACGAGCGC TACTAGATTC CATCTCGTCG TTGGCAGCCA CAGTACTGCG TTCTCGGGAA AATGGACTCC CCATTGTTCC GCCTTTATTT GATGCCCTCC TGTTTTTATA CGAAGAAAAC AAGCTTTCGG TCGCGCATCG AGCAAAGCGA CTAGTCCCTC TGTCGAAACT GCTTGGGTAC GTCTGTCACG CATTCTCCTG GGCCGAGAAA ATGACCCCCA CAAGTCCATT CTTAGAGTTC TTGGCCACTG ACGTTGGCAC TGAAGTTCTA AGCTGTACAA ATACAATTTT CCCCCAGTCC GCTCCCAACA ACATTTCGTT TACGTCGTTG CCATTGCCAC CATCTATTCT TACGTGGATC GGGCTGAGAA TTGA
|
Protein sequence | MSTTPVPLQL TSLVNVSASQ QQERCDGEPA YHPEDSYQTL QRLCTRRPST TNDTTIASSS DYVHHGSIET LWQRRRDGTG WRVVVQSGLD DGRVLDQQHT NARARKTTST THASFVIPQD AAAVSTYGGD WQPNPALVWT SFATDTTRDP APHTVLCVLV HPTTLVLYDV YPDQSATTIL TGGEGYTISL PFECAGLYSL DQVGGLFLQR MPDAEDTQID GPHHHDEYDH NDDTHEQTLP HTVDDGFILQ APPRRHRNPP RFPNDELDDS VSPRISRQSP RPAPHTSPHT PSWLPHLPPV ASLFTLQHPL ADVLPVRQVS TSSVLHNNLV TDVHETVLWV GTAKWSDDNN ITRNSSSPQA QKQVLMVTFH QVHQRHAVWV VREAPPAPSQ TIPLHQMSHA WHQTNHTAQE WEVGGDDVDL WLESVETAAG TGLLVGGNAV HLPPTVSRDQ ALAEALGVRR TPRQAIDAAV NTSAARSRPR TNDPRLAPNS SFLSPHNRST LSVTQETSIL HESNGRSPGG NAVASSPFAS MEPKIAVECV YREEPSAVRV PATSVFLASN VSGSGTLTLC LLAASSTDTQ ELSLLALQPN AQDGFQVTLI ERRACSGAQP IQSTPVPACF APPMVNRRTE MATDILVADP DGALVLYRAN LPIASVTAPY RGPVVNVQDG LGDRVSIVFG DGGGQQIVRA TLSLALESSP LTRDALAVWD AALFPRPENQ AFAFALRADT VRLAQALATE ATGRVRCDMA WTAFAAVFEH IVELALWGTR SETNPSAKVQ STTSWEALVG SDYHRQYKDQ DQGLLFKTTT PSITLSRDDA PRALLDSISS LAATVLRSRE NGLPIVPPLF DALLFLYEEN KLSVAHRAKR LVPLSKLLGS KLYKYNFPPV RSQQHFVYVV AIATIYSYVD RAEN
|
| |