Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50136 |
Symbol | |
ID | 7198840 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 158252 |
End bp | 159602 |
Gene Length | 1351 bp |
Protein Length | 374 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184977 |
Protein GI | 219129610 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCCCCCCTG ACCAAAACGT CGCGTGAGAA AAGGAGCGAT ACCCATCACC ACGAATACAA AGTAGGAACA CCAATCTTCA TCATGGCATC CGAAACTGTT CCATCCAACA CTGTAGTTAG TGACTCGGAA GACATCCTTA CAAAGCCACT CGCGATGGAG GACGAAGTAA AGGCAGAAGA GAAGGACGTC AAGGAACAAG ATGACGAAAG TAAAGCAGCA GTGGCGCTGA CTAGTCTTGT CGATACGAGC TTGCCAATAG AGGCGAGCAC CAGTGGAAAT GATGAAGAAA GAATGGACGA CGAAGAAGAA GAGGAGGAGG AATGCGACGA AAGAGACTTC GAAATTCCTC AGAGGTTCAC ACGCAGCGGA CGTCGCCGTG CTACTCCTTT CCCCCTCAAG GTAAGTCGTG AGAGGAAGGA TGAAGCGTGC GTGTTCGTGT TTCGAATAAT TTACTTTAAC ATGAGGTTCC ATTACAGTTG ATGAAGGTTT TGTCCACTAA GGAGTTTTCT GAGATCATCG GTTGGCTGCC TTCAGGAAAA TCGTTCTCCA TCATCAAACC TAAGGTCTTC ACTGTAGAAA TTCTTCCAAC CTATTTCAAA AGTGCCAAAT ACTCTTCCTT TACTCGCAAG CTCCACCGTT GGGGGTTTAT GCGTCACTAC AGGGGACCAG AAGCGGGGGC ATTTTACCAC AAGGACTTTC AGAAGGGGCG TCTTGACCTT GTTGAACAGA TGACATGCCA CAAGATAGAA CCTCCTAAGT CTTCCGTTAC GAAGGCAAAG CAAGTCAGGA AGTCTGTACC ACGCGATACG ATTGCCAATG CCGTCGGTCG ATCACCCAGG TCAATTGATA TGCATTCGCT GGCTCGTGGT CCTAGTATGC AGTCGGCTTC GTTTCCTCCT CACCCGATGG ACGCTGATTT GGCAATCGAG CTCGAAGTTG CTCGACGTCT TCGTCAGCGC ATGGATGCTG CGGCTTACAA TCAACAGGCC TTACTCGCCA TGCAGCAGCA ACAGAAACTC CAAGCTCTAC GAGTCCAGAA TCTTGGAAGT CATTCTTTGA TGGGATGGGA TGCGTCACCG GCGACCTCCT TGCCTGGCTA TGCTTCTTCC CTTTCAAGCT TGAATATGCG AGGGTACGGA GCTTCATCAC TGCCTCGGTC GGGCTTCGAT CAGGGCTATT CCATGGCCGT GAATAAGGCT CAATACGATC ACGCCTTCAA CTACTCCGCT TACGGTGATC TACGGTCGTA CGAATCCAAT ATCCATGGGG CAAAGACGGC GTGAGATTCA TTGATGCACA GGCCTTTATC TATTGTTCGA TGTAAGAGAC TTAGGTTTCC TTTTACTATC T
|
Protein sequence | MASETVPSNT VVSDSEDILT KPLAMEDEVK AEEKDVKEQD DESKAAVALT SLVDTSLPIE ASTSGNDEER MDDEEEEEEE CDERDFEIPQ RFTRSGRRRA TPFPLKLMKV LSTKEFSEII GWLPSGKSFS IIKPKVFTVE ILPTYFKSAK YSSFTRKLHR WGFMRHYRGP EAGAFYHKDF QKGRLDLVEQ MTCHKIEPPK SSVTKAKQVR KSVPRDTIAN AVGRSPRSID MHSLARGPSM QSASFPPHPM DADLAIELEV ARRLRQRMDA AAYNQQALLA MQQQQKLQAL RVQNLGSHSL MGWDASPATS LPGYASSLSS LNMRGYGASS LPRSGFDQGY SMAVNKAQYD HAFNYSAYGD LRSYESNIHG AKTA
|
| |