Gene Information |
Locus tag | PHATRDRAFT_54442 |
Symbol | AP1alpha |
ID | 7200376 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 643878 |
End bp | 647468 |
Gene Length | 3591 bp |
Protein Length | 1019 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179889 |
Protein GI | 219118219 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
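The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the reported 3591 bp. It also assumes the gene span runs from the start codon through the stop codon, so the estimate of non-coding (intronic) sequence is only indicative. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


length = gene_length(643878, 647468)   # values from the record above
coding = coding_nucleotides(1019)      # Protein Length field
non_coding = length - coding           # rough intron estimate (assumption)

print(length)      # 3591
print(coding)      # 3060
print(non_coding)  # 531
```

A positive remainder agrees with the "Is gene spliced | Yes" field, although the exact intron boundaries cannot be derived from these numbers alone.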
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.595032 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAACTACAG TACTTTGATA GTGGTCCAGT TCACTTCCGC AAGGCGATTC CACCATGGCT TCTCAAGCGC GGGGGCTCCA AAACTTCATT TCCGACCTCC GTAATGCCAA AGGAAAGGTA AGCAACTAGA TTGTGATCGG TGCTGTCCTT ATAGGACTCC GCAGTATCGC TTCCGATTGT TGTCTGAGAC CATTGTCGAA ACAGGCCAAC TTGGGAGCGT GGGACGACAT GACTGATCCT TTTCAACGGA CCAATGAGAC CTTTTTCTTC CAAGTACTTC AAATTTTTGT ATGTTTTTCC GCATAAGTGA ACGCACGGAT TCATTTTTTA CTTAGAAAAA CGCGAGGCTG CCTGTCTTGT TTCGCTGGAG AAGCCTTGTT TACGTGACCA ACGACCATGG AGATCTTTCC TCTTTTGGAC TTCTAGATCC CGCCGATGAT CCGATCTAAG GAGTGACTAT TTTCCATACC GACCATTTTA CTGATAGAAG TGTTCGCTTT TCATAGGATG ACGAGAAGAA GCGCGTCGAT ATTGAACTTG CCAACATTCG TAAGCAGTTC TCGCCGAAGG GTGGTAAAGT GGCCGAGGAC GGCAGTAATC CCAACTTGTC ATCTTACCAG CGAAAGAAGT ATGTCTGGAA GCTAGTTTAT ATCCACGTCT TAGGTTACGA AGTTGACTTT GGTCATGCGG AAGTTCTTGT CTTGGTACGA TCCCCAAAAT ACTCCGAAAA AGTTGTCGGC TACGCTGCCC TTAGCTTGTT GATCCGCAGC GACGATCCGG TGATCAATTC GATCCGCAGC ACAATTTCGA AAGACCTAAC GCAGCCGACT ATCACAGGAG GAAAGAATAG TGCACCTCCG GACGCGGCCC AGGCTTTGGC CCTATGTGCT GCAGCAAACA TTTCGGGTCT CGAGCTGGTA CAGTCTTTGC ACACGGAAAT TCAACAGACG TTGGTAGCCC AGTCATCCTC TCCTTGCGTA AAAAAGAAGG CAGCTCTTTG CTTGCTGCGA CTCATCCGAA CAAGCCCGCG TCTTTTGTCG GGACGAGAGT TTGCTTCACA GATGGCGCAG CTTTTGCAAG ATCGTCACTT GGGAGTTCTG ACAAGCGCAA TGAATCTGCT TTACGGACTC GCGTTGCAAG TGCCACACGA GTATGAGAGC CTAATTCCGT ACGCTGTCCA CATCCTTGGA ATGCTGGTGT TGAAAAAGGC TTGTGCGCGG GATTACCTTT ACTACCGTAC TCCTAGCCCA TGGTTGCAGA TTAAATTGCT TAAATTCCTA CAGCTATATC CCCACGCTCT AACCAAGGCC AGCCAGAATG GACAAGCTCA AGAAACGTCG CCTGCTAGCA ACGACGCTCA TATTTCGCAG CTGACAAGTA TTATTTCCAA AATCTTGACC GAAACAGATG TATCGGACTC GATCAACAAA TCGAATGCTG ACCACGCTAT ATTGTTTGAA GCCGTCAATT TGATTGTGTG CTGGGGATCT TCAGGTCCAA CGCAACTGCG GGATGGCGCA ATGAAACTGC TGGGAAAATT CATTTCGGTC CGAGAACCAA ATATTCGTTA CCTTGGGTTG ATGACAATGG CGAAGCTTGC TCAATTGGAA GGGAGCGCTG AATCGATCAA AAAACATCAG GCAACGGTTC TTGTTTCATT GAAGGATGCT GATATAAGTG TGCGCCGGCG GGCGTTGGAT CTGTTGTTTG TAATGTGTGA CACAGATAAC GCGGAACTGA TTGTCGATGA GCTCATAGGA CACCTTGCGC TCGCTGACGC CGCCATTCGG GAGGAGATGG 
TCTTAAAGAT TGCTATTCTG GCCGAGAAGT ACGCTACAGA CCTTCGTTGG TATGTGGACT CTATCCTGAA ACTTATCTCT ATCAGTGGCG ACAATGTCAG TGACGCAATT TGGCATCGAG TCGTCCAAAT TGTTACGAAC CACCCTCAGG GAGATTTGCA GGCTTATACA GCGGCTACCT TACTGGTAGC TGTCAGTCCG CGTCGATGCC ACGAAACCGC CGTTCGTGTC GCTTCCTACA TTCTTGGCGA ATTTGGGTTT TTGATCGCCG AGCGACCAGG CATGTCTGGA GAAGACCAGT TTCGAATTCT GCATCAACAT TGGGCAACAA GCGATCATGT GACGCGTGGC ATCTTGATAT CTACTTATGC GAAACTCGCA AATCTTTACG AGGAATGTCG CCCACTCGTT GCGCCAGTCT TTGCCCGGTG CACAAACAGT GTTGACGTGG AGATCCAACA GCGTGCAGCA GAATACTCCT CAATGCGGGA AGCCTTTACT CCAGAGGCTG TTGAAGACTT GCTTCGGGAG ATGCCTCCGT TTGAAGACAA TAAAACCAGT GCTTTAGAAG AACGTCTACG AGAGAAAGAA GGTGAAGAGA GCGCTGCATA CAAGAAGACT GCTAGGCCAA GCGCTGCTCA GCGGCAACGG GCAGCGCAGA GTGCGGCGGC GGCCCAAGCA GTAGAAGAAG TAGCGCAACA AGCGCAGACT ACTGATCCTG ACGAAGAAGA TCCTGTCAGC CCAATGAGTG GTAAGTAACA ATTGTATTTC TCTTACGCAA ACACCAGATT TTGCCTCATA TCCCATCTTT TGTAATTGCA GATGCTAGTC CAGGTGGATC TCGCCCTAAC TTCGATCGTT CGAAAAAAGT CGGGATCCCT AAGGAAGTCA TTCCTGCGAT GCGTAAAGCC TTTTCCAATC TTTGTACTTC TCCTTCGGGA GTGCTGTTTG AAAACTCACT GCTACAGGTG GGCGTTAAAC AAAGCTATGT TGGCTTTCAA GGTCAAATCT CTATCTTTTT CGGTAATCTG AGCAAGAAAC CACTGACCAA CTTCCGAGCT ATTATTGAAG ATGTTGATCA CTTGCGAATG CAGAAACAGG GCACGGAAGG CATCTTGGAC GATGAAGAAG ATGGCGGATG CACTGTCGCC ATCCGTACAC AAGCGAAGCT TTTGCTAAAA GTCGAGGTTA CTGCGCCGTT TGACGATGCC CCGGCAATGA GAATCTGCTT TCAAACTGGT GACGGGGAAT GTCATGAATA TCCTTTGCGT CTTCCAATCG TCGCTACTTG CTTCATGGAA CCTGTAACTC TTGAGTCGAA TGCGTTTCTT CAAAGATGGA AAAGCCTAGA AGGCCAAGAT CGCGAATGCC AGGAGATTGT CAAGGCACCC CCCACCTCTC CGCCGATCGA CGAAGCGTAC ATGGAACGTA TTGTTCATAT AGTAACAGAC GGTTTGAAGT TTGGTCGATG CCCTGGATGT GACCCAACAA TTTGGACAGT TTCCGGCGCC GCAACATTCC GGACAGGCGC CAGAGACATG AATGGAAACC ATATCAATGT GGGCTGCTTA GTTCGCATTG AAGCAAATCC GGAGGCAGGC GCTTTTCGTG TGACAACCAG AACTTTACAT CCTCTTTGTT CGAAAGCTGT TAAAAATGTC GCGCTGGTGA GCATCAAGAT GGGAAAGTAA ACGGCTGACC TTTCTGGATG AAAGCTTTTG TCTTGCTGGT TATGTTTGTT CATGTCCGCT TAATCTCGAT TACCTTAGTT TATGACATTG TAGAAAGAAT T
|
Protein sequence | MASQARGLQN FISDLRNAKG KANLGAWDDM TDPFQRTNET FFFQVLQIFD DEKKRVDIEL ANIRKQFSPK GGKVAEDGSN PNLSSYQRKK YVWKLVYIHV LGYEVDFGHA EVLVLVRSPK YSEKVVGYAA LSLLIRSDDP VINSIRSTIS KDLTQPTITG GKNSAPPDAA QALALCAAAN ISGLELVQSL HTEIQQTLVA QSSSPCVKKK AALCLLRLIR TSPRLLSGRE FASQMAQLLQ DRHLGVLTSA MNLLYGLALQ VPHEYESLIP YAVHILGMLV LKKACARDYL YYRTPSPWLQ IKLLKFLQLY PHALTKASQN GQAQETSPAS NDAHISQLTS IISKILTETD VSDSINKSNA DHAILFEAVN LIVCWGSSGP TQLRDGAMKL LGKFISVREP NIRYLGLMTM AKLAQLEGSA ESIKKHQATV LVSLKDADIS VRRRALDLLF VMCDTDNAEL IVDELIGHLA LADAAIREEM VLKIAILAEK YATDLRWYVD SILKLISISG DNVSDAIWHR VVQIVTNHPQ GDLQAYTAAT LLVAVSPRRC HETAVRVASY ILGEFGFLIA ERPGMSGEDQ FRILHQHWAT SDHVTRGILI STYAKLANLY EECRPLVAPV FARCTNSVDV EIQQRAAEYS SMREAFTPEA VEDLLREMPP FEDNKTSALE ERLREKEGEE SAAYKKTARP SAAQRQRAAQ SAAAAQAVEE VAQQAQTTDP DEEDPVSPMS DASPGGSRPN FDRSKKVGIP KEVIPAMRKA FSNLCTSPSG VLFENSLLQV GVKQSYVGFQ GQISIFFGNL SKKPLTNFRA IIEDVDHLRM QKQGTEGILD DEEDGGCTVA IRTQAKLLLK VEVTAPFDDA PAMRICFQTG DGECHEYPLR LPIVATCFME PVTLESNAFL QRWKSLEGQD RECQEIVKAP PTSPPIDEAY MERIVHIVTD GLKFGRCPGC DPTIWTVSGA ATFRTGARDM NGNHINVGCL VRIEANPEAG AFRVTTRTLH PLCSKAVKNV ALVSIKMGK
|
| |
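The sequences above are printed in whitespace-separated blocks of ten. A minimal sketch for stripping that layout and counting residues is shown below, for example to compare the protein sequence against the Protein Length field. The helper name is hypothetical, and the sample string is only the first three blocks of the protein sequence, not the full record.

```python
def residue_count(blocked: str) -> int:
    """Count residues in a sequence printed in whitespace-separated blocks."""
    # str.split() with no arguments splits on any run of whitespace,
    # so spaces and line breaks inside the sequence are both removed.
    return len("".join(blocked.split()))


sample = "MASQARGLQN FISDLRNAKG KANLGAWDDM"  # first three blocks only
print(residue_count(sample))  # 30
```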