Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44089 |
Symbol | ARP1 |
ID | 7203860 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 933100 |
End bp | 934548 |
Gene Length | 1449 bp |
Protein Length | 391 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | actin related protein |
Protein accession | XP_002186156 |
Protein GI | 219113145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.549451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCCACGCGT GAGGACAAAG ATTGACCATG AGCGCCGAAG GGTCTGGTCC GGAAGGCGGG TCACTCCTAC TCAACCAACC TGTTGTCATC GATAATGGCA CAGCTAGCAT CAAAGCTGGG TTTGCGGGAT CGTCCAAACC AAAGGTACGG ACCGAACGCA CCGAAAGTAC CGCAACGAAG CGTCCGAGGG ACGTCGCTAT GCCCAAAGAT CAGGATCCAG TGTCGGTACA ATTGCAGACT CCATTGTTTC TACTATACTA ACCAGTCTTC AATTGTTGCA GGTTGTAGTT GGTACCAAAG TTGGACGGCC TAAGCATATG CGAATCATGC CGGGAGGTGC TCTCGAGCTT GATCAAGGCA GTAGCATTTT TGTTGGGCCG AAACTGGACG AACACCGTGG CGCCTTTGTT CTGGAACACC CTATGGACAA AGGTATGGTG ACGGACGGAG GATGGGACGC AATGGAACAC CTTTGGGAGG TACGTCGAGT GTCAAAGCGA CTTAAATTGT TCGTCATTGG CTACAGTCAC CTAGAAATTG GTAATGTTCA CCGTAACTCA ACCGTGTGTC TTTTTTTACA CTGCCTAGCA CGTGTATTCG AAACAGTGCC TGAACGCAAA GCAAGAGGAA CATCCTGTTC TCTTGACGGA AGCGCCGCTT AACCAACGCA AAAATCGTGA CCAAATTGCG GAAATCTTTT TCGAATCCAT TCGGGCACCC GCACTGTTTT TTACACCGCC CTCAGTTCTC AGCTTGTACG CTTCTGGTCG CACTACTGGT GTTGTCTTGG ACGTTGGTGA AGGTGTCACT CACGCTGTAC CAGTCTATGA AGGATTTGCT TTGCAGCATT CGGTAGTGAG GAGCGATGTT GCGGGTCGGG ACGTCACCAA ACAACTGCAG ATGCAACTGA GACGAGCTGG CCTGTCGTTC ACCACTACGG CAGAGTCGGA ACTGGTCAAG AAAATTAAGG AAGAGTCGTG CTATTTGACG CCATCTCGGC TGGCAAATGA AAATACCATC AAAGAATCCA ATACGCCGTA CCAACTACCC GATGGACAAA CTGTGAACCT GTCCACGGAA CGGTACCAGG CGCCCAACGT TCTTTTTGAC CCGTCCCTAA TTGGATCCGA GGAACCTGGA GCCGCCGAAG TTTTGGTCAA CTCGATTATG AAAAGTGACA TTGACTTGCG ATCAAAGTTG TTCTCGCAAA TCGTGTTGGC CGGCGGTTCT TCGCTCACAC CAGGCTTTGG CGACAGAATG TTGTACGAAG TTCGTTCTCG AGCTCCCTCA CATTTACGCA TACGTATCTC GGCACCGCCA GAAAGATTGC ACTCAGCCTA CGTAGGAGGA TCAATCTTGG CAAGTCTGTC GACGTTCAAG AGCATGTGGG TTTCCCGGTC CGAGTATGAG GAACACGGAA GCAACATTCT ACACCGCCGA GAGTTGTAG
|
Protein sequence | MSAEGSGPEG GSLLLNQPVV IDNGTASIKA GFAGSSKPKV VVGTKVGRPK HMRIMPGGAL ELDQGSSIFV GPKLDEHRGA FVLEHPMDKG MVTDGGWDAM EHLWEHVYSK QCLNAKQEEH PVLLTEAPLN QRKNRDQIAE IFFESIRAPA LFFTPPSVLS LYASGRTTGV VLDVGEGVTH AVPVYEGFAL QHSVVRSDVA GRDVTKQLQM QLRRAGLSFT TTAESELVKK IKEESCYLTP SRLANENTIK ESNTPYQLPD GQTVNLSTER YQAPNVLFDP SLIGSEEPGA AEVLVNSIMK SDIDLRSKLF SQIVLAGGSS LTPGFGDRML YEVRSRAPSH LRIRISAPPE RLHSAYVGGS ILASLSTFKS MWVSRSEYEE HGSNILHRRE L
|
| |