Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29608 |
Symbol | PAF1 |
ID | 7203769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 643380 |
End bp | 644606 |
Gene Length | 1227 bp |
Protein Length | 402 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | PolyAdenylation factor subunit 1 |
Protein accession | XP_002183001 |
Protein GI | 219125463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.275441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTATTGTAG CCCCAGCCTG CATGTCGCAC TACTCCCCGA ATAATGCTCG TCCCAAAGTC TCCAACGCTG GTGGAGGCGG CGGCGCTGGG GGCATGTCGG CGGACGATTT GGAGCAAGAA GCCGCTGAAC GCAACTTGAT CCGACGCTGC GTCGACTACC ACGGCCCCGC CATTTTGGAT TTGCAGAACC GTCTCTACCG TAAAGCCTGT CGGAGTCACC ACACTCGACA CTCGGCGCTG TACTTGCAGC CACACTCGTC GTATATGCGA CTTATGGGTA TGCCGCTGGC CTCCGTGGCG GCGCCCATTC CCTCCGAGTG CTACCTCACG TACATGGCTC ACGTGACGAG GGCCAAAAAC TCGACGCCAG TCATGTGCCT CAGTTGGACT CCGGGAGGTC GAAGGCTATT GACAGGAAAT CAAGAAGGGG AGTTCACTCT CTGGGACGGC ATCACCTTCA GCTTTGAACT TATCATGAGC GCACACGACG CTTCCTTCCG AAGCATGGCC TGGTCACACA ACCGGAACTA CCTGCTCACC TCGGACGCCA GCGGCAACAT CAAATACTGG AGTCCTAGCA TCGCTCCCGT ACAGTCCATC GACAGTCACA ACAAGCAACC CATTCACGGC CTTTCCATTT CTCCCTCGGA CACGAAATTT GTGAGCTGTG GTGACGACGC CGCCGTGCGC GTATGGGACT GGGCCTCACA CAGCGAAGAG CGGACTCTGG AGGGGCATGG ATGGGACGTC AAAACGGTTG CTTGGCATCC ACGGTCCTCG GTCATTGCCT CTGGATCCAA GGACAATCTC GTCAAATTGT GGGACCCTCG GGCGGGTAGC TGCCTGAGTA CGCTCTACGG ACACAAAAAT ACCGTCACCA AAGTGGCCTG GAACGACAAC GGCAACTGGC TGTTGACGGC ATCACGCGAT CAATTAATCA AACTATACGA CATTCGCGCC ATGAAGGAAT TGGTCTCGCT CAAAGGACAC CACAAGGAAG TTACCAGTCT CGCTTGGCAC CCGTTGCAGG AAACGGTGTT TGCGTCTGGC GGAATGGACG GTACGCTGAT TTATTGGAAC GTGGGTGCCA AGGGATCGGA GGAACCCGCC GCCAAAATCC CGTACGCCCA CGACATGGCT ATTTGGGATC TGCAGTGGCA TCCAGCCGGC CACATGCTGG CAACGGGTAG CAACGACCGG CAAACCAAGT TCTGGGCCCG CAACCGG
|
Protein sequence | MSHYSPNNAR PKVSNAGGGG GAGGMSADDL EQEAAERNLI RRCVDYHGPA ILDLQNRLYR KACRSHHTRH SALYLQPHSS YMRLMGMPLA SVAAPIPSEC YLTYMAHVTR AKNSTPVMCL SWTPGGRRLL TGNQEGEFTL WDGITFSFEL IMSAHDASFR SMAWSHNRNY LLTSDASGNI KYWSPSIAPV QSIDSHNKQP IHGLSISPSD TKFVSCGDDA AVRVWDWASH SEERTLEGHG WDVKTVAWHP RSSVIASGSK DNLVKLWDPR AGSCLSTLYG HKNTVTKVAW NDNGNWLLTA SRDQLIKLYD IRAMKELVSL KGHHKEVTSL AWHPLQETVF ASGGMDGTLI YWNVGAKGSE EPAAKIPYAH DMAIWDLQWH PAGHMLATGS NDRQTKFWAR NR
|
| |