Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49502 |
Symbol | |
ID | 7195727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 436554 |
End bp | 438238 |
Gene Length | 1685 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184136 |
Protein GI | 219127842 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.304466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGGAGCTGG ACCGGACCCT ATCTCACTAA TCGCCGACTC GCATTCCGTA GTAACATTTG TATTGTGTGC GATTTCGCTC ACAGTACAAA CAGTCATGCC GTTCATGGTA TCTCCTGGCA CTGTGCTTTT GCTGTTGGGT GTTCTGAGTC GAAATGTCTC CCTAGCGTCG TTGGATCCGA GCCACTTTCA ATCCTACTCC TTCCATGTTG CGCAAGAATC TCCAGCAACG GAGACGCCGA CGGACAGCAC CGATTCGGAT GAGACTGTTC GGAAACAGCA GACTCCGTCG CTGGCTAGAC ACCAGATTAA TGATGGAGAG GATGACAGCC GGCAACCAAA TGTAGTGGCA AGCTCTTTAC TCATGTACGA AAAAGGTTGC GTCGTGTCGT CTAGTGAAGT GTACCCAGGG TCTACGCCAT TGGACACTGC TGACGATACG GAAACCAGTT TGCACAAACA GGCCTACACG GAAATTTGGA GTCTGGAGGA TGAAGAATTG TGGAGAGAGT ATGAAGACGA TGAAGATGAA GATGATGACA ACGACGATGG ACATTCTTCG ATGCTTTCCA ACGCGTCAGT CAAACGCACA CGTACACGTG CACGTATATC CACCAATAGA GGGACGAGAG AAGCGCTCGC ATCGTCTACC AACACACATC TGCCGTCTCC TGTGACCTCT GCTACGCTCT CCGCCCTCCG AGGAGGCAGT AAGGGCACAG TCTCCGGCAT CTTGGGATCG GAAGTCGTCA AACGACTTTA CGTCACCGCT CTCGTTACAC TCGTGTTTGA GGCATTGGTG GGGCACATTC TGGAATTTTT CAAAATTGTC ATGCAGACCC GAGACGACGG TTCTTCTTAC GGCACGGTTG TCCGAGAAAT TACGCTCGAA AAGGGTATTC TGGGACTCTG GGACGGCTTT GTCCCCTGGG GCGTCGTACA GGCCGTGGGC AAGGGTGCCG TGTTTGGACT CGCGCACGCC GCCGCCAAAG ATTTACTCGT ACCGCTCGCA CAAGACGGTA TGCTCCCCAT GGCGGCCGCC CTCACATTGG CTGGCGGCAT TGGAGGTGGA TTTCAAGGCT ACGTGCTCTC GCCGACGCTA CTATTGAAGA CCAGGGTCAT GACGAATCCA GTGTTCCGTG AACCCATGAG TCTGTGGCGC ACAGTCTATC TCAGTATGCG CATTGGATTT GACGTCGTCC AAACCGAGGG GTTTTGGACT CTCATGAAAG GGGCCAACGT GTTTGCGACC AAGCGAGTGT TTGACTGGGC GAGTCGCTAC TTCTTTGCCG ATTGGCTAGA GCAGGTTTTT ATTCAACTCA AGAACGGACA GCCTTTGACG ATCGCGGAAA AGAGTGCGGC TTCACTTTTG GGTGGCGTAG CGTCAACGTG TGTCACTCTC CCTTTGGACG TTCTCGTGGC AAAGACACAG GACGCCAAAA AAGCCGGCGT CAAGGTGTCG GCTTGGACAC TTTTTCGAGA CGAACTCAGA GAGAAGGGCC TAAGTGGACT CAAGGACTCG TACATGCGCG GATTCGAGGC CCGGCTGCTC CACGTTTGCT TGACAACACT TGGTACGTAC TCATGGTGGA ATGTTTCTTG GAAGGATGAT TTGTCGCGTC TGTTTGCCAT TGCAATGAAC AAAATCTCAA ACCTGGGTAT ATCAATTGCT TCGTTGGTTG TATAG
|
Protein sequence | MPFMVSPGTV LLLLGVLSRN VSLASLDPSH FQSYSFHVAQ ESPATETPTD STDSDETVRK QQTPSLARHQ INDGEDDSRQ PNVVASSLLM YEKGCVVSSS EVYPGSTPLD TADDTETSLH KQAYTEIWSL EDEELWREYE DDEDEDDDND DGHSSMLSNA SVKRTRTRAR ISTNRGTREA LASSTNTHLP SPVTSATLSA LRGGSKGTVS GILGSEVVKR LYVTALVTLV FEALVGHILE FFKIVMQTRD DGSSYGTVVR EITLEKGILG LWDGFVPWGV VQAVGKGAVF GLAHAAAKDL LVPLAQDGML PMAAALTLAG GIGGGFQGYV LSPTLLLKTR VMTNPVFREP MSLWRTVYLS MRIGFDVVQT EGFWTLMKGA NVFATKRVFD WASRYFFADW LEQVFIQLKN GQPLTIAEKS AASLLGGVAS TCVTLPLDVL VAKTQDAKKA GVKVSAWTLF RDELREKGLS GLKDSYMRGF EARLLHVCLT TLGTYSWWNV SWKDDLSRLF AIAMNKISNL GISIASLVV
|
| |