Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54688 |
Symbol | myoA3 |
ID | 7202004 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 353352 |
End bp | 356620 |
Gene Length | 3269 bp |
Protein Length | 1027 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181360 |
Protein GI | 219122035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGCT CGCACGATTC ACACGTATGG GTCCGAACCG ACCTGGTCGA AAGCGTCTTG GGTAACGACG GGACTTTACC CAAGGGTTGG CGTCCTCGGA AACGCACACG GGTAAGCGGT GATGAGTGGG GATGGGTTCG GGCGGTTGTC CACCAAACGA CCAACTCGAC GAATACCGTT GATGAGGCAA CGGCTACGAG TGCCTCGGAA GCAACTTCGC CGTTTGCACA CGTAAAATTA CGAAAGACGA ACGCATTATC GCCGGCACAA AAGTCGCCCT CGTCCAGATG GACCGGATCT CCGACACGAC TCGCCAAGAC GGTACAGATT ACGCTCACGG TAGATGATCC CCTGCTGGCC AATGGAAAAC TCAATGGTGA AACCGTTACG TTCAGCTATA ATACGGGGGA ACAATCCAAA GTCTGCGTCG CTAACGCCTG GTGGCGCGAG GGGCAACCGC CACCGGAAGA CTTGACCAGT TTGGAACAAT TGCACGAGCC AGCGGTAGTC TTTTGCCTCC TCCAACGTTA CCAGTTAGAC CACGTTTACA CATACACGGG CAAGATTCTT CTGGCATTGA ATCCATTCCA AACGTTACCC ATTTACGGGG AAGAGATTAT GCGATTGTAC TGGCACACAA CCGGCTCGTC GTCGCCGAAA GCCCAATACG AACGCCCACC ACCTCACATT TACGCTATTG CGGAGGACGC TTACAGATCC ATGATGCGCT CACTCCAAAT CAATGCCTCG CGTGGAGAAA ATCAATCCAT TCTAGTATCG GGCGAAAGTG GTGCTGGAAA AACCGTCACC ACGAAAATCA TTATGCGCTA CCTGGCGACT CTGTCGGAGC AACGCTCCCA CACTTCCAGG GTAGGCATCG AGTCGCAAGT ACTTCAGAGT AATCCCATTT TGGAATCATT CGGCAACGCC CGTACCGTTC GAAACGATAA TTCGTCACGC TTTGGCAAAT TCATTGAAAT ATCCTTCCGG GACGGGTCTC TCGTATCGGC ATCGGTTGAA ACGTACCTAC TGGAAAAGGT TCGGCTGATT TCGCAGTCGC CGGGTGAACG GAATTATCAC ATTTTCTACG AAGCCCTGGT AGGTTTATCT TCAAAAGATG CTCAAAGCTT GGGTATTGCC GACTCATCAC CACGAGACTT CCGCATGACG GCCGTGTCCG GTACCTTCGA TCGCCGCGAT CAGGTACGCG ATGTTGATAC ATACCGCGAT TTGCGACAGG CCTTAGATAC AGTTGGTTTT TCGACAGAAG AGCAGCACGG CCTATTCGTG GTAGTATGCG CATTGTTGCA CGCGTCGAAT TTAACTCTGA CCGAGTACGG TCACGATGCG AGTGCATTGG ATGAATCGAA CCCTAGTTTG CCTGCGACAA TTGCTTTGCT CGGAGTCGAT CCCGAGGATT TAAACAATGC CGTCTGTAGC TGCGCTATCG AAGCTGGGGG GGAAATCTTG TTCAAGAATT TACCTGTGGA GAAGGCACAT AAGGCAATGG AAGCTTTGAT CAAGGCCACC TATGGTGCGC TCTTCACGTT TATTGTGCGC AAGATCAACT CAAAGATACA AGCACAACAC GATACAAGCG GATTATGGCA AGCTTCGATT GGCGTTTTGG ATATCTTTGG TTTTGAAAGC TTCGAAGTGA ACTCCTTCGA GCAGTTGTGC ATCAATTACT GTAACGAGGC GTTACAGCAA CAATTCAACA GGTTTGTATT CAAGTTGGAA CAGCAAGAAT ATCACAAGGA GGGAATTGAC TGGTCCTTCA TTGCGTTTCC TGACAATCAG GATGTTCTTG ATTTGATTGA AAAGCGTCAC GATGGAATAT TGTCCGTACT TGACGAACAA TCCCGGCTGG GCCGATGTAC GGACAAGTCT TTCGCTCAAG CTATTTATGA GAAGTGCGGT GCCCACCCTC GTTTTGAATC TTCCAAATCA CAGCAAGCCA TACTAGCATT TGGAATTCAG CACTATGCTG GCTCCGTTGA ATACAACACG GCTAACTTTT TGGAAAAGAA CAGGGACGAC TTGCCGAAAG AGACAACAGA ACTGCTTATG TCGAGTTCCA ATCCGTTTTT GGTTGGCCTT GGAAAGATAC TTTGTGAAAA ATCAGTGGCA TTGAATGCTT CAAACTCAGC CATGTCGAGG GGAAACCGGA AACAGTTGCA ACGCGCCGCC AGTTCCATCT TACGGGACAG CGTCGGCAGC CAGTTCAGCT CACAATTACA GTTGCTACGG AAACGTATAG AATCAACAGC TCCGCATTAC GTCCGGTGTC TTAAACCCAA TGACGATTTG GTACCAAATA GCTTTGATCC TTTGGTGATT GCCGATCAAT TACGCTGTGC TGGTGTTCTA GAGGCGATTC GAGTGTCTCG AGTCGGATTT CCGCACCGAT ATTTTCACGA TCACTTTGTG CAGCGCTACA GTTTACTGGT AGCTAAGCGG CTGACCAAGC GAGGGCGAGG GCTGAACGGT TGTGACTCTT GCGGAAGTTT AGTGGAAGAG TTACTCCCTC AGATTTCGAG TATTCTGGAT GATGAGGCAG TCTCCCCTTC CAAGAATCAT CGTCCTACCG CGTAAGTCAT CCCTGATTTG CTGGTTACAG TTATTGCATC ACGCTCTCAC AATGCTTTTC AAACAGAATC TCTCTTCTGG GAATGCAAAT GGGCAAAACA AAAGTTTTCC TTCGTCGTCG TGCATTTGAA GCCTTGGAAC ACCTACGAGG ACTCAAAATG GAAAAGGCCG CCTCAAAAAT TCAAGCATTT GGACGAATGA TCGTCGCGAA ACTCAATTAT GATATATCTG TGTACGCTGC CGTTTTAATA CAAAACTTCT TTCGACAAAT CGGTGCATTC CGTCTTGAAC GTGCGCAGAG AATCGAAGAT GCTGCCGAGA GAATTCAGTG CAGCTGGAGA AGTTACGATG CACGAAGGAC AATGCAAGCT GCGCGTTACG TTGCCTGGTG GTGTCAGAGT ACTTATAGGG GAAGTGTCGC CCGTCAGTTA TGTGCCTATT TATTTTTGGA CCGTAAGGTG TTGACGATCC AACATGCTTG GAAATATTAT GCATCAACTC GAACTTTTCG TAAGTTACGC AAAGCGGTGG TCCTTCTACA GTGTCGACAC CGTGGTCGTG TTGCCTATCG CGACTTGTGC AGACTGCGCC GCGAAGCTCG AGACCTGTCT ACCGTTGCTG CTGAACGCGA TCAGCTTCGC CAGGAATCTC AGCGTCTTCG TCGAGCGCTT GAGCACGCG
|
Protein sequence | MASSHDSHVW VRTDLVESVL GNDGTLPKGW RPRKRTRVSG DEWGWVRAVV HQTTNSTNTV DEATATSASE ATSPFAHVKL RKTNALSPAQ KSPSSRWTGS PTRLAKTVQI TLTVDDPLLA NGKLNGETVT FSYNTGEQSK VCVANAWWRE GQPPPEDLTS LEQLHEPAVV FCLLQRYQLD HVYTYTGKIL LALNPFQTLP IYGEEIMRLY WHTTGSSSPK AQYERPPPHI YAIAEDAYRS MMRSLQINAS RGENQSILVS GESGAGKTVT TKIIMRYLAT LSEQRSHTSR VGIESQVLQS NPILESFGNA RTVRNDNSSR FGKFIEISFR DGSLVSASVE TYLLEKVRLI SQSPGERNYH IFYEALVGLS SKDAQSLGIA DSSPRDFRMT AVSGTFDRRD QVRDVDTYRD LRQALDTVGF STEEQHGLFV VVCALLHASN LTLTEYGHDA SALDESNPSL PATIALLGVD PEDLNNAVCS CAIEAGGEIL FKNLPVEKAH KAMEALIKAT YGALFTFIVR KINSKIQAQH DTSGLWQASI GVLDIFGFES FEVNSFEQLC INYCNEALQQ QFNRFVFKLE QQEYHKEGID WSFIAFPDNQ DVLDLIEKRH DGILSVLDEQ SRLGRCTDKS FAQAIYEKCG AHPRFESSKS QQAILAFGIQ HYAGSVEYNT ANFLEKNRDD LPKETTELLM SSSNPFLVGL GKILCEKSLQ RAASSILRDS VGSQFSSQLQ LLRKRIESTA PHYVRCLKPN DDLVPNSFDP LVIADQLRCA GVLEAIRVSR VGFPHRYFHD HFVQRYSLLV AKRLTKRGRG LNGCDSCGSL VEEISLLGMQ MGKTKVFLRR RAFEALEHLR GLKMEKAASK IQAFGRMIVA KLNYDISVYA AVLIQNFFRQ IGAFRLERAQ RIEDAAERIQ CSWRSYDARR TMQAARYVAW WCQSTYRGSV ARQLCAYLFL DRKVLTIQHA WKYYASTRTF RKLRKAVVLL QCRHRGRVAY RDLCRLRREA RDLSTVAAER DQLRQESQRL RRALEHA
|
| |