Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37941 |
Symbol | |
ID | 7202858 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 473737 |
End bp | 474906 |
Gene Length | 1170 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182072 |
Protein GI | 219123522 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.149432 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAT GGAGACGGGT TTATGCATCC TTGTGCGTCG CGGTAGCTAC TACGGTGGTG CTTTCCGCTC TCTCTAGGGA TGTCCGCTCG ACCCTCATGC CGTTCTACAA TGAAACCCAG AAGACGTCGC TACTTTTTGA TGACAGCAGT GATCGCGTTG GGCATTTCGA TGGGCTATTT TCGGATGCCC GGGCTTATCA AAACATTGAG GAGTCAAATC AACACAGACA AAAATATGAA GCGAAGTGCG CAGACGAAAG CTCAACCCAA GAGGAGATAG TTGAAATTTT GGGTAGTTGG TATAGGCCAA GTCTTGATGG AGAAGTCGCA ACTCAACAGT CCAAGCTGCC GGTTGAACCG TGCCGGTTTA CCTTTCTAGA TTTTGGCGCC AATGTTGGAG ATTCGATGGG CAAACTAGTG GACGCTGGCA TTCCACCTTG TTCGAAGAAA GGCATTTTAG CTCCACGAAT AGATCTGGAA CATGGATTTC TACAACCTCT TCAAAAGGGA AAGGGTTTTA GAAAACTCAT CACCTGGATA CGTACTCAAA TGGAGGAGGT GAGCCGGCAA CTTTCGGGCC CGGTTCAACC AGAGAATTAT TGTTACTTTG GTATCGAGGG AAATCCAATC TTTACAAATC ATCTCAATAG ATTACAGCAA CGTCTCATGC TTACTTCGCC GAGACCACTT CGAAGAGTCC ACTTCTTCAC CGAGACGGTG GGCGCTGCAA AAGACGAGAC TACGGTTCTG TTCTTGGACA CAGTCAATGA GAAAGAGAAT TTTTGGGGTT CGTCTACACT CTCTGGACAT AGGGATGTTC AAAGCTCCCT CTTGAGTGGG AATGACAAGC GTGAGGTGTC TGTGCAAGGT TTCACTTTGA CTCGCCTTCT TCACGAAACA GTCAAGATGA TGCCTGGTGC ACATGTTATG GTGAAAATGG ATATAGAGGG TGCCGAGTAT GCATTGCTCA ATGAAGCATT TGACTCGGGT GCACTGTGCA ACACGACTGC TCGTGCTGTC AGGGTCGATA TAATTGTTGA AGTTCACGGC GAGGTGAGTG AAAATCGTTC GTATATGAAT AGATATCCTA TTTCTACTCG AAGTCACATT CTCTCTGTAG ACCTTAATAG GAAGAAACTT ACACGCCGAT AGATTCAGAA GCAAAGTTAA
|
Protein sequence | MRKWRRVYAS LCVAVATTVV LSALSRDVRS TLMPFYNETQ KTSLLFDDSS DRVGHFDGLF SDARAYQNIE ESNQHRQKYE AKCADESSTQ EEIVEILGSW YRPSLDGEVA TQQSKLPVEP CRFTFLDFGA NVGDSMGKLV DAGIPPCSKK GILAPRIDLE HGFLQPLQKG KGFRKLITWI RTQMEEVSRQ LSGPVQPENY CYFGIEGNPI FTNHLNRLQQ RLMLTSPRPL RRVHFFTETV GAAKDETTVL FLDTVNEKEN FWGSSTLSGH RDVQSSLLSG NDKREVSVQG FTLTRLLHET VKMMPGAHVM VKMDIEGAEY ALLNEAFDSG ALCNTTARAV RVDIIVEVHG EIQKQS
|
| |