Gene PHATRDRAFT_47243 details


Gene Information

Locus tag: PHATRDRAFT_47243
Symbol:
ID: 7202335
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 10094
End bp: 14513
Gene Length: 4420 bp
Protein Length: 850 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181469
Protein GI: 219122266
COG category:
COG ID:
TIGRFAM ID:
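
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, although it reproduces the listed gene length); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 10094
end_bp = 14513
protein_length_aa = 850

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Nucleotides needed to encode the protein, not counting a stop codon.
coding_bp = protein_length_aa * 3

print(gene_length_bp)              # 4420
print(coding_bp)                   # 2550
print(gene_length_bp - coding_bp)  # 1870
```

The genomic span is longer than the coding requirement, which is consistent with the record marking the gene as spliced.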


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.24929
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GACAAGTTTA TGCTGTAGCC AAGAAGCACC TCCCACCTTG TGTGTAGGAC GTACTCTGTG 
CGGTCAATCA TTGGGGACCG GAGTACGCAC TAGTACCAAC GCACCTCCAT TACAAAGGAA
AGCATATCAT TCGGCAAAGA TATCGTAGAG GCTGGTTTGT TGTTCCGGAA GGAAATCGAC
CGCTATTGAC GGCCGAAGAC AAGTCTCCAT GGACTTCGCG AATGGTTTTG GAGACAGTAG
TGATGACCTG ATGGAAAACG CAAGGATCGT CAGCCCTTAT GGCGATGGAG AAACCCAAGC
GTTTCCTGCT GATCCGTGGG CAAATTTTCA GTACGCACTG CCTGAAGGGT CCTCGCAGTA
TAAATATCGT GCTTCTAGAC AAAGACCAAC GATTGGAGGA TCACGATCTC CCTCGGAGCC
AATTCCCACC CCACGAGAAA GGATGGTGAC ATCTTCTAAG ATCTCAACTC CTCACCGACA
CGGTATGGTA AATCGAGGCC ATGCTAGTCG ATACGAATAT TCTAGTGACA AGCGATGCGA
TGACGTACAC GTAATATTTA CAGATCCCTC CGACGGTGTT GGGATCAGCG TTGAAAAAAG
GAGCCAACAG GACTTGTTGC GACAAAAGGT TCGGGAGGAC ACCAGTCGTA GGAGCTTACA
ACATTCAACC AGCGCACAAA GTATGTACCG GAAACGTAGC GAAAACAAAG GTGCGTTTCC
TTCATTTCCA CAATTAGCGC CGAGCAAAGA GAAAGCACTG GTACGTACTC CGAACACCTA
AACTTGAATG CATTGACCAA TTTTGAAAAG TGAACTACAC TCCTCTAAAC TACGCCTTCA
ATTCGTATTC TTTTGTCTGA AGCGAGGCCA GCGTCCATCT AAACATACTC GGGACCATAT
CTCACGGCAT ACTTCGCCAA AATCATCAGT GGAAATAACT TCAGAAAGCA CAAACGCAGA
GACCGATGGT CTAAATTTGA ATTGGGCATT CTCGCGTGTT CAAGTTCGAG CGAATAGTGA
AACGGAGTCA ACGACGGTTT GGCTAAATAC TGATCACAGG CAGGATTTAG AAGGAGAAGA
CATTTGGTTA CAAGAAGAAC ATGCATTCCC TTTGTTGTCA ACGGGGGCTT CAAGCAGATT
TTCGGGCCCA TCTGGTAAAG ATGGCCTCCG AAAACGTACT GATAAGAAGT TAAATCGTGT
CCGTTTTGCC GAACCGATAC AGAAGCCGCG ATCTGTCCCT TTTCTGAAAA CTGTTTCACG
GGAAACTATA CGGGAAGAAT CCTCAGATCC TCGTTCCAAA AAGATCACAC ATAGTGACAA
TCGCGCGTGG TTCGTGGCCC AACCGAAGTC AATTCTCCGT CGGCGGCGTT TCGCTGGCGA
AACCGTGTCC CATGACCCAC AGTACCCCCA GAAGAATCGC CCATCCTCTC ATCGCGCAGC
TCCACAACGG AAGTCTGCTA CATCGTTTTT GGATACACAA GGATCCCTAC TCTCTCCCAT
TCATTCAGAT AGGCGGCCTT GGGACCGCAT CTCTGAAACA GGCTCGGAGT CACTCAGTCC
TTCGTACAGT GATGTTGAGC GAGAGAAACG CGTTAGTCTT GGTCCCTACC ACCTGCAGGA
GCTAAATGAG ATGTATCCCG ATCCTCCTCT TGAGTTGCAG GTAAGACATT TTTGACTTTA
CAAAGCCCTA TCTTTTGTAG CCGACCTACT CACCGACATA CATTTTCTTT TAGTTCGACG
ACGAGTCAAC TGTAGTACCA GCACGCGCTT CTTTCATCGA CACTGTCGCT GCTGTTGTCG
TTCAAGCCGC TGTTCGTAGA TTTCTTGCCC AAAAAGTGAT GCATGAGATG GTCGGCAAAG
CATATTCCTT TCCGCATTTA GAATCTGACG ATAAGAAATA TCGACCATTG TCGTCGCGAA
AGGTAACTCC TGAAAAAAGG TCGTCCCGAA AAAGTATGGT AGGAATTTGG GAGGAGCAGT
GTTCATAGAA GTCATGGCTG CGATAAAAAT CCAATCTGCC TTCCGAGGCT TTTGGGTTCG
AGATTCGTTG AATGTGGATC ATTTTTGCGC GACTATGATC CAGAAATGGT ATCGACGACA
TCATCAGAGG CACCACTATT TTGCAGATCT TTCTCGGATC ATACTGGTCC AGTCCATTTG
GAGGCGCAGT ATAGCCAGGG AGCACGCTGC CTTTTTCCTT GGGAGCGTAA TTACAGTTCA
GTCGCTGTTT CGCTCGTACA GCGCTCGCAA AAAGCTCTAC TCAGGACTCA CTTGCCTACG
AAAGGATACT ATGGCAGCTG TAGTGATCCA ATCGCACTGG CGTACATATG CTTGCGAATG
CAACTTTATT CGCGATCTTG TCGATATTTT GATCGTTCAA AGTGTTGTGA GAACTTGGTT
AGCAAGACGA CACCTGTCAT CACTACGCTC CAGGGCCCAA AGTATTTCCG GCAAAAAGTC
ACCAACAGTA TCAAAAAAAT ACGCGAATCA AGTGGCGGCG CAACCTACTG GAAGTCCTCG
ACCTGGAGAG GCCAATCGTA ACTCGGCGAC AGGGCAATGC TACTCCTCGT ATAGGTCTGT
CGAAGAGAGT TCGTTCAGCG CTATTCTTGG CAATATAAAG AGCAAGGAGA ACAATCACCT
CATTGTGTTG ATTACATCTC AGTCTCTCTC GCGCAATCAA GCTTCCACAA GAAGTAATAT
TGGTACAATC TTACGCGTCC ATAATGTCTC ATTCGAGGAA GTGGATGGAG CAAATCCGCT
AACCCGAGGA CGACGCGACG AACTCTTTGC TATATCACAA ATGCGCGGCG TGTACCCGCA
GTTCTTTGTG GTAGACTATG AAACAGGGCT CACGTTATTT TTCTGCAACA GTGATTCTTT
TTTCGGTGCC AATGAAGAAG GCTCTCTACC CAGGATACTC AATATTGCTG GTGTTGTGCA
GAGCGCGATC GGAGGACATC AAGAAAGAAA TAGTACCATA GACGAAGCTC CTAAAGCAAA
CAAGCACCTG TTTGAGCCAA AGAGGCAAAG CTCACATACT ACGGTTTCAA TTGACAGTGA
AACTTCGGAG CCCTCTGTAG GACGGAACAG TTTGCTTTCG ATGTGGAAAA ATCTTGACAA
GAAGAACACA TTAGTATTAA ATGGACACAG GAATTGACAA CAACCTGATT GTGAACGCGG
AGAGAAGTTG TTAGCTGTTA GTCTGTACAT GTTAGTAGCA ATTTCTACAC TAATGCGTAC
GACATGAGTT CATGCATGCA ATTTAAAGGA ACCCCTTCCA TTCGCCGCCT CAAGAAAGAA
AACCGTTACT CTTTCTGTTT TATCAAGGCA GACAACTCTG AGGCAAACCC ATGCATTGTA
GCTGTCTGCA AGCGCGAACA AGCTTTTGAC ACACTTTTGA TTATTGCAGC TGATGGCAGA
TGCGACAGGG CACCGTCGTG GTGTCTCGTA ACAATTGTCT TAATAAGACG AGCAGCTCGC
TCAACGCAAT GTGCATCACG ACTATTGAGA AGGGCATTGA GACAGGCATC GTAGCACCTT
TCATCCGGCT CTATATCGGG ACGACCATGA ATGTGCAAAT CTAACATGTA GGACAATACC
ACGCAAGCAT TTTCCGCAGA CTCCTTGGTT CTCGCATTTC TGAGAGACGT CAGTATTGCT
GCAAAGACAA GTCTATCGAG CTTCAGCTTG TTTGAGGGAT CGGAGTCTAA AACCTGCATT
TCTCGAAACA AATCGAAGGC AACCCTGCCA GATTTTACGT TCCGAAGGGA AGCGAAAGCA
AAAATAACAG CAGTATACGC TCGACTTGTC GGTTGAACAC CCAAATTCTT CTGGTGTCCA
AGTGTCTGCA ATGCTAAATC AGCGGACTCC TTGGATCTAA AGTTCGCCAT TGCTGAAATT
ACAGCGGTGT ACAACCCAAC AGATTTCGGA TCCCATTGAA AAGTTCCTTT CTTAACCCCC
GCCTCAATAC GATGTAATAA GGTAGAAGCG GTCACAGTGT CTTCGTACTT TTTCGTTTTT
ACCAAAGCAT TGATGAGAAC CTGGTATGGA AACATGTCTA CAGTAAGCCG GTTCTCACCA
TCTATCAAGA CGTCGAAAGC CTCTTTGAGC GAGTTTTCCT TACAAAGCAA GCCAATGCAC
ATGTTGAAGG TGCCCGAATT TGGAGCACAT GGAAATCCGT GTTGCTCCGA GAGATTTATC
ATCGAATTCA GAAGGGATAA GACATATTTC CCCCCGCTCC CTTGAGACAG GCGGAGCCAG
CCATGAATAA CAAAATTGAA TGTGTCTGTT GTTGGAGGCG GCCCAGCTTG GCCAATAGCT
GCTTCGACCA TAGCGACAAC ATGAGCATGC GCTTGTAGCA TTCTTCCACA ACGGACTTGA
GCTTCCATGT AGGACATGTG TGTTGCTAAG TCCGGTCGAA
 
Protein sequence
MDFANGFGDS SDDLMENARI VSPYGDGETQ AFPADPWANF QYALPEGSSQ YKYRASRQRP 
TIGGSRSPSE PIPTPRERMV TSSKISTPHR HGMVNRGHAS RYEYSSDKRC DDVHVIFTDP
SDGVGISVEK RSQQDLLRQK VREDTSRRSL QHSTSAQSMY RKRSENKVEI TSESTNAETD
GLNLNWAFSR VQVRANSETE STTVWLNTDH RQDLEGEDIW LQEEHAFPLL STGASSRFSG
PSGKDGLRKR TDKKLNRVRF AEPIQKPRSV PFLKTVSRET IREESSDPRS KKITHSDNRA
WFVAQPKSIL RRRRFAGETV SHDPQYPQKN RPSSHRAAPQ RKSATSFLDT QGSLLSPIHS
DRRPWDRISE TGSESLSPSY SDVEREKRVS LGPYHLQELN EMYPDPPLEL QFDDESTVVP
ARASFIDTVA AVVVQAAVRR FLAQKVVPKK YGRNLGGAVF IEVMAAIKIQ SAFRGFWVRD
SLNVDHFCAT MIQKWYRRHH QRHHYFADLS RIILVQSIWR RSIAREHAAF FLGSVITVQS
LFRSYSARKK LYSGLTCLRK DTMAAVVIQS HWRTYACECN FIRDLVDILI VQSVVRTWLA
RRHLSSLRSR AQSISGKKSP TVSKKYANQV AAQPTGSPRP GEANRNSATG QCYSSYRSVE
ESSFSAILGN IKSKENNHLI VLITSQSLSR NQASTRSNIG TILRVHNVSF EEVDGANPLT
RGRRDELFAI SQMRGVYPQF FVVDYETGLT LFFCNSDSFF GANEEGSLPR ILNIAGVVQS
AIGGHQERNS TIDEAPKANK HLFEPKRQSS HTTVSIDSET SEPSVGRNSL LSMWKNLDKK
NTLVLNGHRN
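
Both sequences above are printed in space-separated blocks of 10 residues, so they need cleaning before any length or composition check. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a short example.
first_line = "GACAAGTTTA TGCTGTAGCC AAGAAGCACC TCCCACCTTG TGTGTAGGAC GTACTCTGTG "

dna = clean_sequence(first_line)
print(len(dna))          # 60
print(gc_fraction(dna))  # 0.5
```

The 0.5 figure applies only to this 60-base fragment. Running the same functions on the full gene sequence would be the way to compare against the 47% GC content listed in the record.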