Gene PHATRDRAFT_54442 details

Gene Information

Locus tag: PHATRDRAFT_54442
Symbol: AP1alpha
ID: 7200376
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 643878
End bp: 647468
Gene Length: 3591 bp
Protein Length: 1019 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002179889
Protein GI: 219118219
COG category:
COG ID:
TIGRFAM ID:
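
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so, but this is consistent with the stated gene length), and that a coding sequence includes one stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 643878
END_BP = 647468
PROTEIN_LENGTH_AA = 1019


def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_cds_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_length_aa + 1)


span = gene_length(START_BP, END_BP)      # matches "Gene Length" above
cds = min_cds_length(PROTEIN_LENGTH_AA)   # coding nucleotides implied by 1019 aa
non_coding = span - cds                   # remainder of the gene span

print(f"gene span: {span} bp")
print(f"coding sequence (with stop): {cds} bp")
print(f"not accounted for by coding sequence: {non_coding} bp")
```

Under these assumptions the span is 3591 bp, of which 3060 bp are needed to encode the protein. The remaining 531 bp would correspond to introns and any untranslated regions, which is consistent with the gene being flagged as spliced.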


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.595032
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAAACTACAG TACTTTGATA GTGGTCCAGT TCACTTCCGC AAGGCGATTC CACCATGGCT 
TCTCAAGCGC GGGGGCTCCA AAACTTCATT TCCGACCTCC GTAATGCCAA AGGAAAGGTA
AGCAACTAGA TTGTGATCGG TGCTGTCCTT ATAGGACTCC GCAGTATCGC TTCCGATTGT
TGTCTGAGAC CATTGTCGAA ACAGGCCAAC TTGGGAGCGT GGGACGACAT GACTGATCCT
TTTCAACGGA CCAATGAGAC CTTTTTCTTC CAAGTACTTC AAATTTTTGT ATGTTTTTCC
GCATAAGTGA ACGCACGGAT TCATTTTTTA CTTAGAAAAA CGCGAGGCTG CCTGTCTTGT
TTCGCTGGAG AAGCCTTGTT TACGTGACCA ACGACCATGG AGATCTTTCC TCTTTTGGAC
TTCTAGATCC CGCCGATGAT CCGATCTAAG GAGTGACTAT TTTCCATACC GACCATTTTA
CTGATAGAAG TGTTCGCTTT TCATAGGATG ACGAGAAGAA GCGCGTCGAT ATTGAACTTG
CCAACATTCG TAAGCAGTTC TCGCCGAAGG GTGGTAAAGT GGCCGAGGAC GGCAGTAATC
CCAACTTGTC ATCTTACCAG CGAAAGAAGT ATGTCTGGAA GCTAGTTTAT ATCCACGTCT
TAGGTTACGA AGTTGACTTT GGTCATGCGG AAGTTCTTGT CTTGGTACGA TCCCCAAAAT
ACTCCGAAAA AGTTGTCGGC TACGCTGCCC TTAGCTTGTT GATCCGCAGC GACGATCCGG
TGATCAATTC GATCCGCAGC ACAATTTCGA AAGACCTAAC GCAGCCGACT ATCACAGGAG
GAAAGAATAG TGCACCTCCG GACGCGGCCC AGGCTTTGGC CCTATGTGCT GCAGCAAACA
TTTCGGGTCT CGAGCTGGTA CAGTCTTTGC ACACGGAAAT TCAACAGACG TTGGTAGCCC
AGTCATCCTC TCCTTGCGTA AAAAAGAAGG CAGCTCTTTG CTTGCTGCGA CTCATCCGAA
CAAGCCCGCG TCTTTTGTCG GGACGAGAGT TTGCTTCACA GATGGCGCAG CTTTTGCAAG
ATCGTCACTT GGGAGTTCTG ACAAGCGCAA TGAATCTGCT TTACGGACTC GCGTTGCAAG
TGCCACACGA GTATGAGAGC CTAATTCCGT ACGCTGTCCA CATCCTTGGA ATGCTGGTGT
TGAAAAAGGC TTGTGCGCGG GATTACCTTT ACTACCGTAC TCCTAGCCCA TGGTTGCAGA
TTAAATTGCT TAAATTCCTA CAGCTATATC CCCACGCTCT AACCAAGGCC AGCCAGAATG
GACAAGCTCA AGAAACGTCG CCTGCTAGCA ACGACGCTCA TATTTCGCAG CTGACAAGTA
TTATTTCCAA AATCTTGACC GAAACAGATG TATCGGACTC GATCAACAAA TCGAATGCTG
ACCACGCTAT ATTGTTTGAA GCCGTCAATT TGATTGTGTG CTGGGGATCT TCAGGTCCAA
CGCAACTGCG GGATGGCGCA ATGAAACTGC TGGGAAAATT CATTTCGGTC CGAGAACCAA
ATATTCGTTA CCTTGGGTTG ATGACAATGG CGAAGCTTGC TCAATTGGAA GGGAGCGCTG
AATCGATCAA AAAACATCAG GCAACGGTTC TTGTTTCATT GAAGGATGCT GATATAAGTG
TGCGCCGGCG GGCGTTGGAT CTGTTGTTTG TAATGTGTGA CACAGATAAC GCGGAACTGA
TTGTCGATGA GCTCATAGGA CACCTTGCGC TCGCTGACGC CGCCATTCGG GAGGAGATGG
TCTTAAAGAT TGCTATTCTG GCCGAGAAGT ACGCTACAGA CCTTCGTTGG TATGTGGACT
CTATCCTGAA ACTTATCTCT ATCAGTGGCG ACAATGTCAG TGACGCAATT TGGCATCGAG
TCGTCCAAAT TGTTACGAAC CACCCTCAGG GAGATTTGCA GGCTTATACA GCGGCTACCT
TACTGGTAGC TGTCAGTCCG CGTCGATGCC ACGAAACCGC CGTTCGTGTC GCTTCCTACA
TTCTTGGCGA ATTTGGGTTT TTGATCGCCG AGCGACCAGG CATGTCTGGA GAAGACCAGT
TTCGAATTCT GCATCAACAT TGGGCAACAA GCGATCATGT GACGCGTGGC ATCTTGATAT
CTACTTATGC GAAACTCGCA AATCTTTACG AGGAATGTCG CCCACTCGTT GCGCCAGTCT
TTGCCCGGTG CACAAACAGT GTTGACGTGG AGATCCAACA GCGTGCAGCA GAATACTCCT
CAATGCGGGA AGCCTTTACT CCAGAGGCTG TTGAAGACTT GCTTCGGGAG ATGCCTCCGT
TTGAAGACAA TAAAACCAGT GCTTTAGAAG AACGTCTACG AGAGAAAGAA GGTGAAGAGA
GCGCTGCATA CAAGAAGACT GCTAGGCCAA GCGCTGCTCA GCGGCAACGG GCAGCGCAGA
GTGCGGCGGC GGCCCAAGCA GTAGAAGAAG TAGCGCAACA AGCGCAGACT ACTGATCCTG
ACGAAGAAGA TCCTGTCAGC CCAATGAGTG GTAAGTAACA ATTGTATTTC TCTTACGCAA
ACACCAGATT TTGCCTCATA TCCCATCTTT TGTAATTGCA GATGCTAGTC CAGGTGGATC
TCGCCCTAAC TTCGATCGTT CGAAAAAAGT CGGGATCCCT AAGGAAGTCA TTCCTGCGAT
GCGTAAAGCC TTTTCCAATC TTTGTACTTC TCCTTCGGGA GTGCTGTTTG AAAACTCACT
GCTACAGGTG GGCGTTAAAC AAAGCTATGT TGGCTTTCAA GGTCAAATCT CTATCTTTTT
CGGTAATCTG AGCAAGAAAC CACTGACCAA CTTCCGAGCT ATTATTGAAG ATGTTGATCA
CTTGCGAATG CAGAAACAGG GCACGGAAGG CATCTTGGAC GATGAAGAAG ATGGCGGATG
CACTGTCGCC ATCCGTACAC AAGCGAAGCT TTTGCTAAAA GTCGAGGTTA CTGCGCCGTT
TGACGATGCC CCGGCAATGA GAATCTGCTT TCAAACTGGT GACGGGGAAT GTCATGAATA
TCCTTTGCGT CTTCCAATCG TCGCTACTTG CTTCATGGAA CCTGTAACTC TTGAGTCGAA
TGCGTTTCTT CAAAGATGGA AAAGCCTAGA AGGCCAAGAT CGCGAATGCC AGGAGATTGT
CAAGGCACCC CCCACCTCTC CGCCGATCGA CGAAGCGTAC ATGGAACGTA TTGTTCATAT
AGTAACAGAC GGTTTGAAGT TTGGTCGATG CCCTGGATGT GACCCAACAA TTTGGACAGT
TTCCGGCGCC GCAACATTCC GGACAGGCGC CAGAGACATG AATGGAAACC ATATCAATGT
GGGCTGCTTA GTTCGCATTG AAGCAAATCC GGAGGCAGGC GCTTTTCGTG TGACAACCAG
AACTTTACAT CCTCTTTGTT CGAAAGCTGT TAAAAATGTC GCGCTGGTGA GCATCAAGAT
GGGAAAGTAA ACGGCTGACC TTTCTGGATG AAAGCTTTTG TCTTGCTGGT TATGTTTGTT
CATGTCCGCT TAATCTCGAT TACCTTAGTT TATGACATTG TAGAAAGAAT T
 
Protein sequence
MASQARGLQN FISDLRNAKG KANLGAWDDM TDPFQRTNET FFFQVLQIFD DEKKRVDIEL 
ANIRKQFSPK GGKVAEDGSN PNLSSYQRKK YVWKLVYIHV LGYEVDFGHA EVLVLVRSPK
YSEKVVGYAA LSLLIRSDDP VINSIRSTIS KDLTQPTITG GKNSAPPDAA QALALCAAAN
ISGLELVQSL HTEIQQTLVA QSSSPCVKKK AALCLLRLIR TSPRLLSGRE FASQMAQLLQ
DRHLGVLTSA MNLLYGLALQ VPHEYESLIP YAVHILGMLV LKKACARDYL YYRTPSPWLQ
IKLLKFLQLY PHALTKASQN GQAQETSPAS NDAHISQLTS IISKILTETD VSDSINKSNA
DHAILFEAVN LIVCWGSSGP TQLRDGAMKL LGKFISVREP NIRYLGLMTM AKLAQLEGSA
ESIKKHQATV LVSLKDADIS VRRRALDLLF VMCDTDNAEL IVDELIGHLA LADAAIREEM
VLKIAILAEK YATDLRWYVD SILKLISISG DNVSDAIWHR VVQIVTNHPQ GDLQAYTAAT
LLVAVSPRRC HETAVRVASY ILGEFGFLIA ERPGMSGEDQ FRILHQHWAT SDHVTRGILI
STYAKLANLY EECRPLVAPV FARCTNSVDV EIQQRAAEYS SMREAFTPEA VEDLLREMPP
FEDNKTSALE ERLREKEGEE SAAYKKTARP SAAQRQRAAQ SAAAAQAVEE VAQQAQTTDP
DEEDPVSPMS DASPGGSRPN FDRSKKVGIP KEVIPAMRKA FSNLCTSPSG VLFENSLLQV
GVKQSYVGFQ GQISIFFGNL SKKPLTNFRA IIEDVDHLRM QKQGTEGILD DEEDGGCTVA
IRTQAKLLLK VEVTAPFDDA PAMRICFQTG DGECHEYPLR LPIVATCFME PVTLESNAFL
QRWKSLEGQD RECQEIVKAP PTSPPIDEAY MERIVHIVTD GLKFGRCPGC DPTIWTVSGA
ATFRTGARDM NGNHINVGCL VRIEANPEAG AFRVTTRTLH PLCSKAVKNV ALVSIKMGK
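
The sequences above are laid out in blocks of 10 characters, 60 per line, so the whitespace has to be removed before they can be measured or analysed. The following is a minimal sketch of that clean-up with a length and GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as shown in this record.
first_line = "AAAACTACAG TACTTTGATA GTGGTCCAGT TCACTTCCGC AAGGCGATTC CACCATGGCT"

seq = clean_sequence(first_line)
print(len(seq))                      # number of bases on the line
print(round(gc_fraction(seq), 3))    # GC fraction for this line only
```

Applied to the full gene sequence, the same two helpers could be used to compare the cleaned length against the Gene Length field and the GC fraction against the GC content field.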