Gene PHATRDRAFT_21667 details


Gene Information

Locus tag: PHATRDRAFT_21667
Symbol:
ID: 7202588
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 733423
End bp: 736847
Gene Length: 3425 bp
Protein Length: 976 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181619
Protein GI: 219122578
COG category:
COG ID:
TIGRFAM ID:
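
As a quick consistency check on the coordinates above, the sketch below recomputes the gene length from the start and end positions. It assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the reported 3425 bp. The function names `span_length` and `min_cds_length` are illustrative only and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: three per residue, plus a stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


length = span_length(733423, 736847)
cds = min_cds_length(976)

print(length)        # 3425
print(cds)           # 2931
print(length - cds)  # 494
```

The 494 bases left over are not accounted for by the coding sequence. That is what one would expect for a gene flagged as spliced, where the genomic span also covers intronic and possibly untranslated sequence.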


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0681414
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACAAGAA ATACTTTGCG ACCATGGCGA CCCAGCAGCT TCAACAAATT ATTTCCGAGA 
CGCTGTCTCC TTACGCGGAA ACGCGAAAGA CTGGTACGAT TTGAGGGTGT CGAAAAGATC
GACGTAGTTG GTGTCGCGAA AGAATTAACC GACGGACCGG ACCGGGCCGT CACTGCTCAC
AAGCACTGTT TGGTTCCTTT TTCTCACCTC TTCTTTTGTC TCCCTGCACC AGCCGAAGAT
CATCTGAAAG CTGCCAAATC TAGTCCTAGT CATCCGCTGC AAGTCCTGGA AATTGTCGCC
AAGGCGGACG GTAACGACGC GGCGGTGCGG CAAGCGGCTG CGGTGCACTT TAAAAATGTT
GTCAAAAAGG GCTGGGACGT TCAACGGGAG GAGGGTAACG AGGGGATCGT CATCAACGAT
CAAGACCGTA TCACCATCAA GTCACATTTG GTTCAACTCA TGTGTACAAC GCCGCCACAG
ATTCAAGTAC AGCTCAGCGA AGCCATCTCC TTGATTGCGG CCGTCGACTA CCCAAAAGCC
TGGGACAATC TACTGCCCGA ACTCGTAAAG CAATTTCAGT CTCCCGATCA GACGGTGGTT
AACGGTGTAC TGAAAACTGC CAACGGAATT TTCAAGTCGT TCCGATTCGT CCAACGATCC
GACGATTTGT ACGGGATTAT CCTTTACTCT CTCAATATTG TGCAAGGACC ACTTTTGGCT
CTTTTCAAGT CCACCGGCCA AAAGGTGGAC GCCGTCGCCA ACAATACGGC TCAGCTCAAA
CCACTCATGC AGTCGCTACG CCTCATGTGT CGCATTTTTT ACTCGCTCAA CTACCAGGAC
TTGCCTGAAT TCTTTGAAGA TCACATGACG GACTGGATGT CCGAATTTGC CAAGTACCTC
ACGTACCAAA ATCCGGCCTT GGTGGATACC GACGAAGAAC TCGAACCCAG TCCGATTGAT
ACTTTGCAAG CGGCTATTAT TGAAAATTTG GCCCTTTACG CGGACAAGGA TGAAGAGCCA
TTTATGGAAT ACCTACCCAA CTTTACTCGA CTCGTTTGGA ACCTGCTCAT GACGATCAGT
GCCTTCCCGA AACACGACAG TCTCGCTACC ACCAGCATTC GTTTCCTGTC CAGTCTCGTC
CAAAAACGAA TGCACCACCA CCTTTTTCAG GAAGAAGCCA CCCTCCGCGA AATTGTTTTA
AAAATCGTCA TTCCCAATTT GCTTTTTCGC GAGTCCGACG AAGAACGATT CGAAGACGAT
CCGAGGGAAT TCATTGTCAC CGAAGTCGAA GGTTCTGACA GTGAATCTCG CCGTCGATGC
AGTCAAGACT TGCTCAGGGC CATGTGTCGC CAGTTCGAAA CGCAAACCAC CACAATCTGC
TCCGAACACG TTGCCAGTAT GCTGCTCGAG TTCACCAATA ACCCTAATGG TAAATGGGCA
TCCAAAGATG CCGCGGTACG TACCCATTAT TGGGTTCATG TGCGAACGTA CGTAATTTGA
TCCGCATTGG CTTACCCTTG CGCACTCATT TTTTGTCCTT GCACAGATTC ATCTCATGAT
GGGCATTGCC ATTCGACGAG AGAGTTCATT GGGGGTTTCT GAGCTCAACG ATGCGGTCAA
CTTGATGGAC TTTTTCCAAT CGCAAATCTT GCCAGAACTA CAGGATCCGA ACCATTCGAA
TCGACCAGTG GTCAAAGCGA CTGCAATCAA GTTCGTCAGT GTATTTCGCC AACAGTTTAC
GAGGGAGCAC TTGACTCAGA TCATGCCCAT GCTGATTGCG CAACTCGGCT CACCAGCGGT
TGTAGTCCAC ACCTTTGCCG CGTATGCGAT TGAACGCATT TTGTATACGA AAGAGACCAT
CAACGGAAAA AAGCATCCCA AGTTTGGCGC GGCCGATCTC CAACCCTTTT TGGAACCCCT
CTTCAATGGA CTGTTTGCGA TTGTAGACAA CGTGGAGCAC AACGAAAATG ACTACGTCAT
GAAGTGCATC ATGCGATCTT TGGCGACGCA AGGCGAGGGT ATCATTCCCG TGACACAGAT
TGTTCTCACC AAACTGACTG CGGCATTGGG TCGCGTCGCC AAGAATCCTC GCAACCCACA
GTTCAACCAC TTCTTGTTTG AGTCCATTGC CGTCTTGGTT CAATCGGTTT GCTCCGTAGA
CCGCAATGCC ACTGCACTAT TCGAACCGCT TTTGTTCGAA CCATTCAATA TTGTGTTGCA
AATGGATATT GCGGAATTTA CACCTTATGT CTTCCAAATC TTGGCGCAGC TACTAGAGTA
TCGCCCGACT GGCTCGGGTT TGGGGACGGC CTACCAAGCA CTCTTTTCCC CGTTGCTGAC
CCCGGGTCTT TGGGACAAGC GTGGAAATGT TCCAGCGTTG TCACGTTTGA TGCAAGCCTA
CATTCGTAAG GCGGCACCGG AACTGGTGGG ACAACTTAAC CAGATACTGG GTGTTTTCCA
AAAGCTGCTT TCATCGAGAG CTACAGAGGC CAATGCGTTT GACTTGCTGT CGTCAGCAAT
TCTTCACTTT CCACAAGAAG AAATGGAAAC GCGCATTGCT ACAATTTTTC AGCTTGTGTT
GACACGGTTG CAAGCGGGCA AAACGCCCAA ATATGTCCGG CTTTGCACGC ATTTCTTTGC
CCTCTTCATT GGCAAGTATA GCGCGAATGT GTTTATGGAT CGTATGAACG CAATCCAGAA
TGGCTTGTCA TTGAATTTGT TGGAGCATGT ATGGATCCCA CGCGTGACGA CGGATCCCCC
GGTCCAGCGT ACGGAAGCCA AAGTGCAGGT TGTTGGGCTC ACCAAGCTGC TTTGTGAATA
CCCCACCCTG TTGAACGATG CCCATGGGCA AGCCATTTGG TCAAAAGCAG TCGTTGCCAC
AATCACTATC CTTACATCAT CATCATTTAA AGCCACGGAA GAAACAGGTT TAGATGAGGA
AGAGATCGAA ATCGGGTATG ATGCCCAATT TTCACAGCTC AAATTTGCGA GAAAGGCCGC
AGAAGATCCC TTCCCAGAAG TTGCGGACCC TACACTTGGT TTTGCCCAGG CTCTTCATCA
AGTTTCGAGT GCACATCCGG GACGTATATT GCCCTTAATC CAGCAGGGGC TGAACGGGGC
GGACCCAAAG TTGTCGGTTG GTCTGGAATC CATGCTACAA GCCGCCAACG TGCAACTATC
GTAAAATCTC ATGTGTGTAT CGGATACGAC TCCATGAATG CTTCGACTTT TATTTGATAA
AGCTCTCCAC TTTAGAACAC ACACATAGTT GTGCTTCTAT AATTTCTGAA TAGGCAAGCT
GACCGAGAGA TCAACATCCT TTTTACGTCA TCCAAGGCCA AAATGGGCTG CTCATGACTG
TGGCTTACAA AACAAGTAAC GGATTTGAAA AGCCGGTCTA AAGAAGTATG TTTATAAATG
TTTTA
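
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be checked against the table. The following is a minimal sketch of that clean-up, assuming the text contains only bases and whitespace. The helpers `clean_sequence` and `gc_fraction` are hypothetical names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as it appears in the record.
first_line = "TTGACAAGAA ATACTTTGCG ACCATGGCGA CCCAGCAGCT TCAACAAATT ATTTCCGAGA"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Passing the complete sequence text to the same two functions should give a length that can be compared with the reported 3425 bp and a GC fraction that can be compared with the reported 49%.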
 
Protein sequence
MATQQLQQII SETLSPYAET RKTAEDHLKA AKSSPSHPLQ VLEIVAKADG NDAAVRQAAA 
VHFKNVVKKG WDVQREEGNE GIVINDQDRI TIKSHLVQLM CTTPPQIQVQ LSEAISLIAA
VDYPKAWDNL LPELVKQFQS PDQTVVNGVL KTANGIFKSF RFVQRSDDLY GIILYSLNIV
QGPLLALFKS TGQKVDAVAN NTAQLKPLMQ SLRLMCRIFY SLNYQDLPEF FEDHMTDWMS
EFAKYLTYQN PALVDTDEEL EPSPIDTLQA AIIENLALYA DKDEEPFMEY LPNFTRLVWN
LLMTISAFPK HDSLATTSIR FLSSLVQKRM HHHLFQEEAT LREIVLKIVI PNLLFRESDE
ERFEDDPREF IVTEVEGSDS ESRRRCSQDL LRAMCRQFET QTTTICSEHV ASMLLEFTNN
PNGKWASKDA AIHLMMGIAI RRESSLGVSE LNDAVNLMDF FQSQILPELQ DPNHSNRPVV
KATAIKFVSV FRQQFTREHL TQIMPMLIAQ LGSPAVVVHT FAAYAIERIL YTKETINGKK
HPKFGAADLQ PFLEPLFNGL FAIVDNVEHN ENDYVMKCIM RSLATQGEGI IPVTQIVLTK
LTAALGRVAK NPRNPQFNHF LFESIAVLVQ SVCSVDRNAT ALFEPLLFEP FNIVLQMDIA
EFTPYVFQIL AQLLEYRPTG SGLGTAYQAL FSPLLTPGLW DKRGNVPALS RLMQAYIRKA
APELVGQLNQ ILGVFQKLLS SRATEANAFD LLSSAILHFP QEEMETRIAT IFQLVLTRLQ
AGKTPKYVRL CTHFFALFIG KYSANVFMDR MNAIQNGLSL NLLEHVWIPR VTTDPPVQRT
EAKVQVVGLT KLLCEYPTLL NDAHGQAIWS KAVVATITIL TSSSFKATEE TGLDEEEIEI
GYDAQFSQLK FARKAAEDPF PEVADPTLGF AQALHQVSSA HPGRILPLIQ QGLNGADPKL
SVGLESMLQA ANVQLS