Gene Information |
Locus tag | PHATRDRAFT_16379 |
Symbol | myoA2 |
ID | 7198514 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 187375 |
End bp | 190269 |
Gene Length | 2895 bp |
Protein Length | 769 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184750 |
Protein GI | 219129131 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.191953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGATTTGA TTGCGCTCAC ACACTTACAC GAACCTGCGG TGGTCGAATC ACTCCAAGTC CGCTACGCGC AGAACCGCAT TTACACCGCA ACTGGACCAG TTTTACTCGC CCTTAATCCC TTCCGCAACC TACGAGGTCT TTACGGAGAA GCCGTCATGA AGTCCTATTG GCAGAGCGAT GATAGTGGCG CACCCACGAC ACGCGACACG TCGACCTCGC TGCCTCCACA CGTCTACAAC ATTGCCAACC AAAGCTTTCG CAATTTGATG CGCTGTTTGG AAGACGGGAC AAACCCGCAT CAATCTATTC TTGTGTCGGG AGAATCCGGT GCCGGGAAGA CCGTCAGTAC CAAGTTAGTT ATGAAATACC TGGCGGCCCT TTCCCATTAT AGATCCATGG GAAGTACGCG CAGCATGTCG GTCCCGAGAG TATACAAAAG GGCTCGGGAG GCCACGCAAC GACATCAACG AAACCACTCT AATTCGGTGT CTTCGGAATT CCCTGAACGT GACAACGTCC TCCTCTCTAC AATTGATAAA TCTTCCACTA GAGCACTGCC CTCAATGGCT TGCGAGGAGT CGACCACTCT TGACGCGGAA ACAAACCTCG GTCTCGATAC GATCGAAGCC CAAGTCCTTC AATCGAATCC TATTTTGGAA TCCTTTGGCA ACGCCCGCAC AACTCGAAAT GACAATTCCT CTCGTTTTGG CAAATTTGTT GAACTTCAAT TTTCAGCCCG AGGCAATCTC GTCGGCGCCC AATTAGAAAC GTACCTTTTG GAAAAGGTCC GTGTTGTGCA CGTTGGTACC GGCGAACGCA ACTACCATAT TTTCTATGAA TTACTGCAGG CTCACATTGA TAACAGACGC ACCGGGAACA CCCTTGGTAG CTACAGCAAA AATTATGATA TGTCTCAGTT GCATTTGGCG CCAACGGCGA CGAGCCGTGA TTTTAAACTC ACCTGTGACG GAGAACGATC TCGAAGGGAT GGTGTGGCCG ACGGTCAAAA CTTTGCCGCG CTGCTACAGG CAATGCAGAC CTTGCGCTTT AGTGAGCAGG AAATTCGAGA AATTTGGCGT GTCACCTCGT CGTTACTGCA CGCGTCAAAT TTGAATTTTG TCCCCGTGCG GGATCTCACC ACCGGTGAGA CGGATCCTGA CGACGCCTGC ATCTTGGACA GCACCAACGT TCACTTGAGC TCCGTGTGCG CCTTGCTAGG TGTGACGGAA AAGGAGTTGA ATCAAGCTCT TTGCGAGGTT GTACTGAGAG CTGGGAAGGA AGAAGCGCGA AAGCGTATGA CTGCCGCCCA AGCTCAACGA GGATTAGAGG CCCTAATCAA AGCAACGTAC GGAGCGCTCT TTGAATATCT TGTCTCCAGA ATTAACGATG CCATCCAAGG TGAGAACAGC AATGAGTCAA ATGCGGGATC GGCCGCCACC ATCGGTGTCC TCGATATTTT TGGTTTCGAA AGTTTTCAAG TTAACTCGTT CGAGCAGTTG TGCATCAACT ACTGCAATGA AGCGCTTCAG CAACAATTCA ACGCATTCGT GTTGCGGAAT GAGCAAGCTG AATACGAAAA AGAAGGTATT TTGTGGAAGT TTATAGAATT CCCAGAGAAT CAGGATGTGC TTGATCTAAT TGACAAGCGT GGGATTGGTA TTCTGCATAT ATTGGATGAT CAATGCCGAG CTCCTGGGCC ATCTGATACC TCTTTCGGTC TACAGGTGTA CCAAATGTGT GCGAATTCGT CCCGATTTTC AGCTACTAGA GCTCAAAAGG CCCATTTGCA ATTCGCCGTT 
CACCATTATG CTGGTCCGGT CACGTATACT GCGAAAGGCT TCACTGAAAA GAATCGAGAT GAGCTACCAA CATCGACAAG GACATTGTTC CTATCCTCCG AAAGCACTTT CGTGCAGCAG TTGGCACACA TACTGGAATG GGCACCGAAT TCTAACGCTT CCGCATCTGC TTTTTCGGTG CGCTCTTTTC AACGGTCAGA TTCCACCGTG GGACGTCCGA CTGTAGCGGG ACAGTTTCAG CGACAACTGA AAAAATTGCG ATCCAAAATT GATCAAACAT CTCCACATTA CATTCGATGT CTGAAACCAA ATGATTGTTT GCTACCGGAT AAATTTGACG TTGCAGTCGT TGCGGAACAA TTGCGTTGCG GTGGAATTCT TGAAGCCGTT CGGGTCGCCA GGGCCGGCTT TACCCAACAC TACCCTCATT CGGACTTTTA TCGTCGGTAT AGGGTTTTGG CATGGCGAGA AATGAACAAA GTAGGGGGGC GATTGAGACA CTCTAGCAGT GGCAACCCTC TCTCTGCGCC TTCTGCCATA AGGCCAGGAC GGCTTTTTAA AGTATCAAAC ACAGATGGTA CAGCGATAAC AAAGAACAAA GACGAGGCCT CGCGACGATG CAAAGAATTA CTCCAAATTT TGCAGGTAAA GATACAACAC CAGCAAATTG AAGATGGTAT CGAAATTGCC AAGCCGACGG ATCCTAAAAC CTCCGTTCAA ATACACAGCA CCCGTAAATC AATACCTTCT TCCTGTTCCC AGTCAATAAA GAGTAGTCAT AGCAGTAAGC CTTGCTTGGC TCCAGTTCGC GGTCACATGA CTCGTCAAAA ATTGGTGGAG AAGAAGGAGT TTGGCAAGCA AAAGGAGATT ACCATGTCAA GGAGCAGCAG GTCAGCGGCT TGGAAATACG GGGGAATGCA AGAGGATGTA TGCACCAATT TAGGTATTCA GATGGGGAAA AGCAAGGTGT TTCTCCGCCA TTCGGCCTTT GAAGCGCTGG AGCGTATCAG AACCTGTGAA CAATACAAGG CTGCAACAAG CCTGAACGCT ACCTTTCGGA TGTACCTGGC ACGTATCGCC TACGTACCCT ACCGG
|
Protein sequence | DDLIALTHLH EPAVVESLQV RYAQNRIYTA TGPVLLALNP FRNLRGLYGE AVMKSYWQSD DSGAPTTRDT STSLPPHVYN IANQSFRNLM RCLEDGTNPH QSILVSGESG AGKTVSTKLV MKYLAALSHY RSMGTQVLQS NPILESFGNA RTTRNDNSSR FGKFVELQFS ARGNLVGAQL ETYLLEKVRV VHVGTGERNY HIFYELLQAH IDNRRTGNTL GSYSKNYDMS QLHLAPTATS RDFKLTCDGE RSRRDGVADG QNFAALLQAM QTLRFSEQEI REIWRVTSSL LHASNLNFVP VRDLTTGETD PDDACILDST NVHLSSVCAL LGVTEKELNQ ALCEVVLRAG KEEARKRMTA AQAQRGLEAL IKATYGALFE YLVSRINDAI QGENSNESNA GSAATIGVLD IFGFESFQVN SFEQLCINYC NEALQQQFNA FVLRNEQAEY EKEGILWKFI EFPENQDVLD LIDKRGIGIL HILDDQCRAP GPSDTSFGLQ VYQMCANSSR FSATRAQKAH LQFAVHHYAG PVTYTAKGFT EKNRDELPTS TRTLFLSSES TFVQQLAHIL EWAPNSNASA SAFSVRSFQR SDSTVGRPTV AGQFQRQLKK LRSKIDQTSP HYIRCLKPND CLLPDKFDVA VVAEQLRCGG ILEAVRVARA GFTQHYPHSD FYRRYRVLAW REMNKVGGRL RHSSSGNPLS APSAIRPGRL FKDVCTNLGI QMGKSKVFLR HSAFEALERI RTCEQYKAAT SLNATFRMYL ARIAYVPYR