Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_19122 |
Symbol | |
ID | 7198098 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1007017 |
End bp | 1008212 |
Gene Length | 1196 bp |
Protein Length | 322 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178344 |
Protein GI | 219115097 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.398188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGTGCAAGA CTCTCCATTT GCGTAAATTA GATCACAAGA CTACATTTTG GATTAACAGC TGCAATTTTT TAGCATGCCG AAGGCATTGA AAAAGCGGGG TCAGACCCAT ATTGCATCTG CTGCGTCGAA ACCGTACCAG AAAGCTTCAC CAGCATCGAC CAACTCGGCT AGCACCAATC TTATTTCACC GAACACTTCT CTGGGGCAGC ATTTTCTCAA GAACCCTGCT GTAATAAGTT CGATTATAGA CAAGGCTGGA TTAAAGGCAA CGGATGTTGT GCTTGAGATC GGACCAGGAA CAGGAAACAT GACTGTTCCG ATGTTGCAAC GGGCCAAGAA GGTTGTCGCC CTGGAGTTCG ATTCTAGAAT GGTTCGCGAG GTTCTGAAAA GAACCGAGGG CACGGATTTG GCACATAAAC TTCAGGTTAT ACAGGGTGAC GCTATGAAGA CAGCTTGGCC ATTTTTTGAT TGTATGATTG CAAACTTGCC CTATCAGATT AGTTCTCAAG TCGTCTTTAA ACTTCTTTCT CATCGGCCCA TGTTTCGTTG TGCCGTTCTC ATGTTTCAAG AGGAGTTCGC TTTGCGTCTC TCTGCTCGCC CCGGGGAGGC ACTGTACTGC CGCCTTTCTG TGAATACGCA GCTACTGGCA AAGGTAGATC AGCTATTGAA AGTTGGGTAA GACTCTTCTG GATTAGTGTT TGTAGATTGG AGCAGTTCTC TAACGTCATG ACCACATGCA CTTTATCAGG AAACAAAATT TTCGCCCGCC ACCGAAGGTA GAGTCCCGTG TTGTGCGTAT TGAGCTAAAA AATCCTCCTC CTCCTGTGAA TTTTACTGAA TGGGACGGAA TGGTACGTTC GAATGTATTC AAGCAGTGGC CAACTGGTCT AGCAGATTGA AATTCCTAAT TGCGTTGGAC TTGCTTCTAC AGATTCGATT GTTATTTAAT CGAAAAAACA AAACGCTTCG CTCGGTTCTA AATACAAAGT CAGTTATGAA ATTGCTGGAG GATAATCGGA GAACAGTTCA GTCACTTCAT CCCGAAAAGA TGGTCGACGG TCGACCCGCT CAAGTTATCG TGGAAGAGAT TTTGGAAAGA GATTCATGGA AAGGACAGCG TGCAGCAAAA CTTGATCTAG ACGACTTCTT ACAGCTGCTT GCTGAGTTCA ACGAGGCTGG AATTCATTTT AATTGA
|
Protein sequence | MPKALKKRGQ THIASAASKP YQKASPASTN SASTNLISPN TSLGQHFLKN PAVISSIIDK AGLKATDVVL EIGPGTGNMT VPMLQRAKKV VALEFDSRMV REVLKRTEGT DLAHKLQVIQ GDAMKTAWPF FDCMIANLPY QISSQVVFKL LSHRPMFRCA VLMFQEEFAL RLSARPGEAL YCRLSVNTQL LAKVDQLLKV GKQNFRPPPK VESRVVRIEL KNPPPPVNFT EWDGMIRLLF NRKNKTLRSV LNTKSVMKLL EDNRRTVQSL HPEKMVDGRP AQVIVEEILE RDSWKGQRAA KLDLDDFLQL LAEFNEAGIH FN
|
| |