Gene PHATR_33295 details


Gene Information

Locus tag: PHATR_33295
Symbol:
ID: 7204380
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 542624
End bp: 546324
Gene Length: 3701 bp
Protein Length: 1186 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002186367
Protein GI: 219113567
COG category:
COG ID:
TIGRFAM ID:
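
The Gene Length field can be cross-checked against the Start bp and End bp fields. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the values listed in this record); the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of this record.
print(gene_length(542624, 546324))  # 3701
```

Because the gene is marked as spliced, this span may include intron sequence, so it is not expected to equal three times the protein length.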


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCACA ATCCGAGGTC GCGCAAAGGT GATATGGTTT TCCCTCGACC GTGAATGCAG 
CGCCCTCCGA CAATGAGGAT CAGAAGAGAT CCTCGTGCAA GCATCTGCAA CAAACTTGCG
GCGAAACCTG TTCTCTCTAC ACCTACCTCG CAGCTGTATT CATCGTGAAC GTGCTGTTGT
GAATTATCTG GCAGGCCGAG ACTCTCGTAT CATCAGCATG GTGGTTGCGG AAGTTCCCCT
GTGGGTTATG CACAGCAGCA ACGCGAGTTC CAATACTACT CTGGACGGTG CCAGCGGACT
CGCAACTGGT GGTCCTTTCA ACAAGCTTTC GGATGGACAT TTTTCCCGTC ACCATCATTT
CGAGCACAAG GCGGCTTCTT CCTTGGCCTT GTTGGACTCG GCGTCGGGAA GGGCAAGAAA
GTCTGCCATT TACGCCGTAG ACGTCCATCC CGACGGCCGA ATCTTTGCGA CAGCGGGAGG
CGACTGTGCG GTCCGAATTT GGAATACACA GGCCTTGTTT GCTCCCAAAA ACAAGGGCGG
TAGCTTTGCT GTGGCAGGTT CTTCGGATCC GACCGGCAAA CCCAACAATG CTACCACAAC
CTACGTGAGC ACGAGTGCCT CGTCGGGACC GGAAGCCTCG GAAAGTAGCG CTGGGGAACA
AGAAAGTGAC GTTGGTCCCC GGGACGAAAT GGTTCACGAT CTTAATAGTT TCGTCCGTCG
CAAAAAAGAC CCCACAGTGA ACCAATCCTC GGCGGTACCC GCCGAGACGA CGAAATCATC
TACTGCATCC GGGCAGTCGG TAGCGGACTC GTCTCCTGTT CGCCCCTCGA CGCACAAAAG
ATCGCACCAT CAGCATCGTT TACTATGTAC ATTGTCCGCA CATTCCGGTT CATCGGTACT
GGCGGTGCGA TTTTCAAGTA CCGGAACCTA TTTGGCTTCG GCGGGGGATG ACGGTTGTGT
GTGTATCTAC ACTCACAACG AGGACACGGA GGGTAACCTG ACCCAAGAAC CGTCACCGCA
CGACGAGCAC TGGTCTCGGA TCAAACTTTG CAGGGGACAT GGCTTGGATG TTGTGGATTT
GGCTTGGGCA CCGGACGATT CATACCTAGT TTCCTGTTCG CTGGATTCCG AAACACCGAT
CATTGTGTGG AAAACGACTC ACTTGGGATC TTCTCGTCGA GCCAACGCAA CGAGTATGAT
ACTGAATCCA TTCAAGGTTC TCGGTAGAAA AGAACACACG AGTACGGTCA AGGGAGTTTC
TTTCGATCCG GCAGGATCCT ATTTTGCGTC CTCGGGTGAT GATCCAGCCG TCTGCGTCTG
GCGCGCGCAC GATGATTGGG GTTTGGAAAC CAAGATTGAC GCTAGCAGCG GTATCTTTCG
GCGCTGGAAA GAAGACGATA CTATGGCCCT GTCTTCGCAA AGTCTTTTCC GCCGAATTAG
TTGGTCAACT GATGGAGCAT TCTTGTGTTC CACTAATTCG GTAGTGAAGA ACAAGCACGT
GGCGTCCACT ATTAGTCGCG ATGGGTGGAG TGTGAGTAGT GCGTCTTCGG CGGCAGCGGG
AGCGGCCAAC CTAGTGGGGC ATAAGCAACC AGTGGTCGTC AGCCGCCACG CGTCGCAGCT
GCTGAGTGCA CGCAAAGCCA ATGTTTCAGG AGGCCAAAAC GGTGACGACG ACGAAGAACC
CGATTACGCA ACTCTGTTGG CACTCGGCGA CAAGCGAGGC TTCGTAACGA TCTGGAGTAC
CAAGAAATCA CGTCCGATTT TTAAACTTCA ATGCAGTGAA AGTCGCAGTA CGGTCACTGA
CATGGCCTGG GGTTCCTTAC CAAGAGGGGA TCTGATGCTT TTAGTAACAT TTTTGGACGG
TCAGGTTGTG GCCCTGCGCT TTGAAGTACC CAGTGAGTTG GGAAATTTGT TGAGTAAATC
TGAACGGGCA CGCGTATTTC AGCTCAGGTA CGGTATCGAT GTGAACGATG TGGAAACTTT
TGGACAGCGC CATCTATTTA CAGGAGCAAG CTCAGGTCCT AACCTGATCG AAAACGCTTT
GCAAATGACG TTGGAGCACA CTCATACTGG AATAGACGAT ATGGACGATG ATACCTCGAC
ACCAGGTCCG GAACCAGAAG AGAGATTGAA TGACCTGCAA GCGGTTTCAA TTCGTTCGAA
ACAGAAGGAA AGCCTCTCGA AAGGAAAGAA GCGCATTCAA CCAGTTCTTA TGGCTGTTAC
CAGCAAGAAA ACGAAGCCAG GAGCCGAACT TCTCAAGACC AAAGCCGTCG AGCCAAACCA
ATCCATCGAT CCTTTGCAGA ATGCTATTGA TGCAGCCAGC AAGGCCTCAG CCGCCATGGC
AACCGCCGAT TCCACCAAGC GCGACATTAC TGTAAACGCT GCTCCCATGG ATGGCCTGTC
CGGAACAGCG CAGTCAAATT CCGCTCGGCC ATCAGTGCGA CCCAATGGAC CTAGCTCTTG
GATGGGAACA ATTCTGCCGC ACAGCTCCGA ACGGATCCAT TCTCTGGATC TCCCGCTTCT
TGGTTTACAG TCAATGGATG TCACCACAGG ATTCGCAGAG CCATGCGTCG CCGAATGCAC
CAACTCGGTC AAATTACCGG TGGGGTCACG GACAACATCA ATTCCATGTG TCGATGTAGC
ATTATCTCGT GATGGCAAAA TATCGTGGAA GGATCAAATC CCTGGTACAT CATGCTCGGC
CATTGCGGCC AGTACCACGT TAATGGCTGT CGGAACAACG GATGGATGCC TACAATTGTA
TGGCACATCT CCTACGATCG GTTGGACATG TGGTCAGAGC TTTCGTTCTC ATCCGTCGTT
AGTTCTTGGG CATCCCATTG TCTCATTGCA ACTTCAAGAG ACGCAGGGGG AAGACGACGA
AATCTTTGCT ACTTTGCTCA CCTTAACGGG AGATGGTACA TTTGCCGTCT ACTCTGTTCT
TCCAGTATTG CAACTACAAT TCAAAGGATC TGTTATGCCG GCGATGTCAC ACATGGCTTT
GGGTACATCG TTGACGTCGG AGCAACATTC GATAAAAATT TCCAGAATAC AAATTACTGA
AACCAATCGT GTATTACTGC TTCTATCGCT ACAGACAGTT GACAACGCAC AGCTTCGAGG
CGGATTGCGG GGGACTACTC AAATTGACGC AGGGGTTGGT GGCTCGTTGC AAGCGTTTGT
CTTCGACCAA AAGGCCGAAC TTTGGATGAA GGCGGCGGAC AACCGCTTTG TCCTTTCCGA
CTTTTACAGT GCTCTACCGT CTGCGAAATT TAGCCCCAAT GGAGAGCTGT CAAGGTTGGA
AGATGCTGTT CGAATCGGCG CACTCCAAGC GAGCATGAAG CCAGCTCAGC GCGGTCGTCT
ACGTGATACC GACCGCCATG CGGACGAGAT GTTTTCCAGA GCCGATTTGG AATCTGGGAA
TTTTATTCCG ACTCGGGCAC ATTGCGAGGA TCGAATGGCT TGCGCTATTG CACTCGAATC
TGCGGACGAA TTCAAAAAAT GGTTATCATT GTATATAAAA GTATTGTGTG TGGTGGGGCA
CACTGATTTT CTCAGAGTTT TGGTGGATAT CTTAATGAAC GAACCCAAGG ACAAACGAGA
AACGATTCCT GATGGCATGT GCTGGTGGAT GTCGATCGCC CCTACAGTGG TGGGTTTGGA
CAAAAGGACA CTTGTCAGGT CGCTAGTTAT TCCCGAGATG A
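
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a rough sketch of that check; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGAATCACA ATCCGAGGTC GCGCAAAGGT GATATGGTTT TCCCTCGACC GTGAATGCAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the whole listing to `clean_sequence` should give a length that matches the Gene Length field, provided the listing is complete.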
 
Protein sequence
MNHNPRSRKG DMVFPRPCIH RERAVVNYLA GRDSRIISMV VAEVPLWVMH SSNASSNTTL 
DGASGLATGG PFNKLSDGHF SRHHHFEHKA ASSLALLDSA SGRARKSAIY AVDVHPDGRI
FATAGGDCAV RIWNTQALFA PKNKGGSFAV AGSSDPTGKP NNATTTYVST SASSGPEASE
SSAGEQESDV GPRDEMVHDL NSFVRRKKDP TVNQSSAVPA ETTKSSTASG QSVADSSPVR
PSTHKRSHHQ HRLLCTLSAH SGSSVLAVRF SSTGTYLASA GDDGCVCIYT HNEDTEGNLT
QEPSPHDEHW SRIKLCRGHG LDVVDLAWAP DDSYLVSCSL DSETPIIVWK TTHLGSSRRA
NATSMILNPF KVLGRKEHTS TVKGVSFDPA GSYFASSGDD PAVCVWRAHD DWGLETKIDA
SSGIFRRWKE DDTMALSSQS LFRRISWSTD GAFLCSTNSV VKNKHVASTI SRDGWSVSSA
SSAAAGAANL VGHKQPVVVS RHASQLLSAR KANVSGGQNG DDDEEPDYAT LLALGDKRGF
VTIWSTKKSR PIFKLQCSES RSTVTDMAWG SLPRGDLMLL VTFLDGQVVA LRFEVPSELG
NLLSKSERAR VFQLRYGIDV NDVETFGQRH LFTGASSGPN LIENALQMTL EHTHTGIDDM
DDDTSTPGPE PEERLNDLQA VSIRSKQKES LSKGKKRIQP VLMAVTSKKT KPGAELLKTK
AVEPNQSIDP LQNAIDAASK ASAAMATADS TKRDITVNAA PMDGLSGTAQ SNSARPSVRP
NGPSSWMGTI LPHSSERIHS LDLPLLGLQS MDVTTGFAEP CVAECTNSVK LPVGSRTTSI
PCVDVALSRD GKISWKDQIP GTSCSAIAAS TTLMAVGTTD GCLQLYGTSP TIGWTCGQSF
RSHPSLVLGH PIVSLQLQET QGEDDEIFAT LLTLTGDGTF AVYSVLPVLQ LQFKGSVMPA
MSHMALGTSL TSEQHSIKIS RIQITETNRV LLLLSLQTVD NAQLRGGLRG TTQIDAGVGG
SLQAFVFDQK AELWMKAADN RFVLSDFYSA LPSAKFSPNG ELSRLEDAVR IGALQASMKP
AQRGRLRDTD RHADEMFSRA DLESGNFIPT RAHCEDRMAC AIALESADEF KKWLSLYIKV
LCVVGHTDFL RVLVDILMNE PKDKRETIPD GMCWWMSIAP TVLFPR