Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43224 |
Symbol | |
ID | 7196585 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2376385 |
End bp | 2379590 |
Gene Length | 3206 bp |
Protein Length | 835 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177501 |
Protein GI | 219111499 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTTGCCACAA CAGCAACACG ACTCTCTATC GTGGGCAACA ACGATTTCAT TCGGTAGACT CGGCGTACTT CGTTGGTGAA GGGTGCATAC TTCTTCGACT ACCGAGAGAC TTGCGCCTTT TGCCATCTTC TCTCGTCTCT TTGTTGGTTT CCTACATCCA GTGAAGATTG CTTGTCGGTG ATAGTCTTTA ACAACAACAA CACCATCCAA CAACATTCAT CAGCACTATG GGGGAAAAAT ACCGATCGTG GTACACGAAC ACTTGGATCG CGTGGAGCTG TGGGGTCGCT CTATGGCTGA GTAGAACGAC GATTGCTGCC GGATCTACCG AAGAATTCTA CCTTTATTAC GGCGACACGG ATGCCTCGAC GTACGTGCAA GAGTACCGGG CGACCAAAGA CGGGATTCCT CTCGACGGAT TGTTCGAACA ATCCGAACAT CCCCGCATTG TCGAGTTCTA CTCACCGTAT TGTGTAAGTG AAGCGAAAGC GCTTAGCTAG CCCTTGTGCA TTGCGGAGAC GACTCTTGCT GTGTACCGTT GTACATTGGT AGGAGGTCTT TAGTCGACTA CCGTGCGATG GACTGCTTTT AAACTCACAA TGTCACTCAC ACCTGGCCAC TCCTACACGC ATTGTTTCGT GCCATGCTGT AATTGACTTA CTTTTTTTGT GCGTTAGCCG CACTGTCGTC ACTTCAAACC CAAATACGTC CGATTGGCGC GAGACGTGGG ACAGAAGTAT CCCGACGTGG AGTTTTACGC GGTTTCCTGT GTCGCTCACA GTGACATGTG TCAGAAATAC AACGGTACGT CTCTGTGTGT GTGTGTGTGT GTGTGTGCGA AGTTGACGTG GACGTGGGGG TATATGGGAA TGCGAGTGGA ATCACTCTGA ACGTTTCGCT TTCGTCTTGC TCCATTGACT ACCTTGCGTT ACTTCGTTCT GCCACTCACG TCTTTTCCGT CTACTTTCAA TTGGTTTTCG ACTGCAATGT GGTGTTAAAC AGTTCGAGGG TTTCCGCAGA TCTTGGCCCT GTCGGCCTCT ACTGCCGAAC CGACAGTGTT GTCCAAGAAC TCTTACACGG TGCGAGGCAT TGCCTCCGCC TTGAAGCTCG GACCGGTCCT TGGCGGTACG TCCACCGGGA CCGTCGGTCG GACGGCCCGA CTTCTGGAAG ACGGTACCGA AGCAGGAGAC TCGGACGAGC CACCCAACGA TGACTCGGAG CAAACGGTGG ACCAACCCGA CGAAAACTCC AACGACCCGA GTCTGGACGG CAAAGTATCT TTCGAACTTC ACGTTCCTAT TCCGGATCTG CGACCCCACC CCACCGGTGA CGACGTGCCC AATTCCAACG ATCCGAGTGC CGACGAAAAA ACAGAAGACG AACCCGCCGA CTCGCAAGAT ACACGGAAGC ACTACTCGGG CGAGGACCCC ACCGTCGGGA ACGGTCTTTT TAACGTGGAT ATTGATGATT CCCCCGATGA AGGCTTTGAT CAATCCGCCG ACGATCGAGA CGACCCTGAC GCGGAAGAAT ACTACAGTGA CAACCGCGAC GACGAAGATC TCGACGAAAG CGAAGACCAG GAGAGTGAAC AAGACCCCTA CGGAATGGAC CTATTCTTCG AGGCTCAGAA ACGTGGCGGT GACGAGTCGG AGTCCTCCGA CCTTCCCGGT CCCGTCCAGC GCAAACCCGG AGCCGTCACC GCGCAGTTTG CCCCTCGAGA TATGGACAAG TACCGCGACG TCCTACGCCA GAAACGGGCC GCCGAACAGA AGAAACGTAA GTTTGTCGGA CTGCGAAAGG ATCGTCAAGT AAAGCCACTG GGTCCTGTAG TCCGGGACGG TGCCTCGAAG GCTATGAGGG CCAATACGCC GGGAACACTG GAATACAAAC AACGCCAGAA AGAGTCCTCG GAACGTTTGG CAAAGTTGCT GGAGAAAAAG TTTGGCAAGA AAAGGGCCGC CAAGTACGTT TCGTCTAAAG GTATCGGAAA CCGTATCGGG AATTACACGT CGCTGCCCTT CAAAAAGGAA GTGGCCAAGC CGCGCTTGGT GGAGCAGCTT CCAATACTCA AAAGAGTAGT ACGCATGGAT GACGAGGAGA TGCTCATCCT CGACTCCTCC CTGTCGTTTC TCCGAGGGCT CCGCTATGGA GTCTTTCAAG ATCCAAAACC ATTGACAGGC AAAAAGAAGC GTGCGCTCAA AGACTGGTTG GATTTACTGA GTGGCTCGCT GCCACAGGAA TGGGGACTCC ACGAGATCAT TGACGATCTT TTGGACAACC TGGATTATAT TGCCCAAGGG AGCAAGAACT TGCACGATAT TCTCGACAAA CACCCGATTC AACGCAGGAA CTGGAGTCGC TCTTGCACCA AAGGTGGTCG AAACGTCAAC GGTTTTACTT GCGGATTCTG GAAGTTGCTG CACGTCATGA CTGTTGGTGT GGCAGAACAC CGAGGTGGCA AGAACCTTGT GGCTACGGGA CTGCGTCGGG ACATCCGTGT CTTCGCGCCC ATGGAAGCTG CGGATACTCT GCGCGAGTAC ATGGCGCATT TCTTTAGCTG TACCGAATGC TCGAAGCACT TTCTGGTCCA GTACGATCAG TGTGACATGA ATCGCCGCTG CGGTCGTCTC GCTACGGATG CGCACGATGC TACCGATTCG GACTGGAAGG AATTGGCGAA ATGGCTCTGG GAATTCCACA ACGACGTGAG TGTCCATGTT TTGAACGAAC GCACGGACAA CAAACGCAAA CAAATGCAGC AACGCACATG GCGTCGTGCG GAGTCCGGTC CCGGAGCAGC AGGACTGTTT GAACAGGTCA GCGTGGTTTG GCCTTCGACC TTGTCATGTA CAGAATGCAT CAAGGCGGAC GGTACGTTCG ACGAGGATGC CGTCTTTACG TACTTGGAAC AAACCTACTG GCCTGGTTTG GAAGATTCCA TTGATCGTGT AATACAGTTT TACGACGAAC ACGAGTCCGG ATCCAAGGTC TTGACACTCA TCCTGTTGTG CATTGGTGCG TACTTGGCTT TCGTGATGCG TAAGAGTCTC GGTCCGAAAA GTCTCCAACA ATCCCTGATT ATGGCACGGA AAATGAGGCC GAAAGGCTCC GTCGGCGTGG ACAAGCGTTC GGTTTGAGAT CTCCGTTTAG CCTTTGTTTT AGATAACAAA CCGTCAATCT ACTTAGTTTC ACAGTCAAGC AGCTCA
|
Protein sequence | MGEKYRSWYT NTWIAWSCGV ALWLSRTTIA AGSTEEFYLY YGDTDASTYV QEYRATKDGI PLDGLFEQSE HPRIVEFYSP YCPHCRHFKP KYVRLARDVG QKYPDVEFYA VSCVAHSDMC QKYNVRGFPQ ILALSASTAE PTVLSKNSYT VRGIASALKL GPVLGGTSTG TVGRTARLLE DGTEAGDSDE PPNDDSEQTV DQPDENSNDP SLDGKVSFEL HVPIPDLRPH PTGDDVPNSN DPSADEKTED EPADSQDTRK HYSGEDPTVG NGLFNVDIDD SPDEGFDQSA DDRDDPDAEE YYSDNRDDED LDESEDQESE QDPYGMDLFF EAQKRGGDES ESSDLPGPVQ RKPGAVTAQF APRDMDKYRD VLRQKRAAEQ KKRKFVGLRK DRQVKPLGPV VRDGASKAMR ANTPGTLEYK QRQKESSERL AKLLEKKFGK KRAAKYVSSK GIGNRIGNYT SLPFKKEVAK PRLVEQLPIL KRVVRMDDEE MLILDSSLSF LRGLRYGVFQ DPKPLTGKKK RALKDWLDLL SGSLPQEWGL HEIIDDLLDN LDYIAQGSKN LHDILDKHPI QRRNWSRSCT KGGRNVNGFT CGFWKLLHVM TVGVAEHRGG KNLVATGLRR DIRVFAPMEA ADTLREYMAH FFSCTECSKH FLVQYDQCDM NRRCGRLATD AHDATDSDWK ELAKWLWEFH NDVSVHVLNE RTDNKRKQMQ QRTWRRAESG PGAAGLFEQV SVVWPSTLSC TECIKADGTF DEDAVFTYLE QTYWPGLEDS IDRVIQFYDE HESGSKVLTL ILLCIGAYLA FVMRKSLGPK SLQQSLIMAR KMRPKGSVGV DKRSV
|
| |