Gene PHATR_46935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46935 
Symbol 
ID7204759 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp861823 
End bp866768 
Gene Length4946 bp 
Protein Length689 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185798 
Protein GI219121136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0448198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGAA CTGCGCCCAA TTGTTAGTAA ATCAATCCTC ATGATGGGGT TTCCCGAGCT 
CACAACCCGA TGAGGGCTCT TCTCCATAGC GTGTTGACTG CGACTTCGAC TGTGACTAAC
AACAGTAACT GTAAATTGGC ACCTTCCCCG GGTGCTCACT AACATGTAAC TGCTAAAGTC
TGTTCATACG ACCAAAATAG TATCTACAGG CGTTAGCACT TCTGACAATG AAAGGTCTAT
GATCTGCTTT GCTTTGAAGT GTCGGGGGAG AGTAACTATT GAACATGATT CGTAGGCGGA
GAATTTCGAG CAATGATCCA GGAAAAGACG AGGTGGGCGT CCAAGACACA ATTACCTTTT
TCACCACTGA CATCACAGCT CTCAATACTT TGGGCGCTTC GATTTGGACG TATTTGGCTA
GAGCCGCTGG GAAATTGCAG GCTACGATTC GTATAGCTAG CTTTCTATTT ATGGGGTACG
GCTTTTTTCT CTCGCAAACT CTTTTGTTCA CTTCGGAAGA ATGCGGCATG ACTTATTCCT
GGCGCCGTTT TCTTGAGCTG GATATATCCT CCATTCATCC TGTAGGGCGT TCTCCATATC
GACTGTACAA ATTCTATGAT CAGCGCGACC CCCGACATGA ACGCTTTTTA CAGCAAGAGA
GCGTGACGAC TTCAAGAAAG GCTTCCACGG ACTGGTGCCT AAACGCCGCC TTCCCGACTG
CTGTTGTGTA TATTCCAGGT CACGGCGGAA GTTATCAGCA AAGTCGAAGT TTGGGTGCGC
ATGGAATACA GCTCACGCGA CAGCGGGATG TGACGCAAAA CTACGTTGTG CAAGCGTTAC
AAAAGGGAAT GTGGCATGGA AACGCGACGC AGCTGGAAAA CTTTGTTTAT GACGTGTATG
CTTTGGATTT TGCTGAAGAA GGTGGTGGTA TGCATGGAGA TTTTTTGGTG GATCAGAGTC
GGTTCGTGTC GAAAGCGATT CATTTTTTGA GCGAAGCATG TGGCTTTTCC AGTATCACAG
TTGTCGCCCA CTCCATTGGT GGCATTTCGA TCCGCTTAGC TTTAGTTCGT GATGAAAAGC
TGCGCCTTTT GGTTACAAAT GTTATTCTAC TAGGATCACC TCAAGCACGC ACCGTTCTAG
CCTGGGATCC CTCTTTGGAA AAAATTCAGA CAGAAATTGT TGAAAATCAC GTAAATGGTA
CTGCTTTTGT TGCCATATCA GGCGGCCTAC GCGACGAAAT GATTCCTCCC GCAGCTTGTG
AACTCGTTCC TAAAGATAAT AACACCTTGA CACTTTTGGC TGTTGATATC ATGCCTAAGG
AGGCGTCAAG CCCTTCGTTT GGAATGGACC ATCGCGCAAT CGTGTGGTGC CACAATGTTT
TGGTACCACT GCGGAAAATA ATTTTTGCTC TAGTCAGGTC GGAACGCGAT GGAGAGGCTG
CACCAGCAAG AATAGGAGCA GTACAATCGC TGTTTGATCG AAGTAAGACG CAAAACTATA
ACACTGCACT TCAACGTATG ATGACGACGT TTCGGGTAAG AATTGCTTTG AGGTCCGTTG
TTCTTTTAAG GCGTGGTCCT CTCATTCAAA AGCTGCTTTT ATTCTTAAAA GAAAGTGCAC
GGACCAGTCG CCAGTTTAGC CATGGTAACT GGTCTCCTTC ACAATGCCGA ATTGCTACTG
GGTTTATTTG CTTACATCTC CCTGTGGAGG TACGTATTCC GCTTTTCGGC AATGCTGCCA
ATAACTTTTC CTTTCGGGTG CGGCTTGTTT TGCTGGGTAA CAGCAAAGCT GGATTTGCCT
CTTGCTTCAA TTCTCATTTT GGCGTTTTCA GCGGACGCAA TCCGAGCTAC TTTGTTGTGG
ACAGCACATC AGTCATCAGC ACTGAAGCCG ACGACGTTCC AAAGCAACGG GATAAGTTGG
CGCTGGGCGG TATGCTCCAT CGTTACCTCA GTCAGCATTG TTCATGTCAT CTTTGGCGTC
GTCCGTCTCT TACGTCCCAA TGATTTTGCC ATTGAAATGT CAAATTCAAT CAATATTGCA
TTGATCGCCT CGATCTATCC ACTGGCTCTC CGACGCATCC ATAAGTTTGC ACAGAAGGTT
GGTAGCTCCC GCTTTTCTTT CATTGACCTT GATCTATTGA CGATTGTAGT GGTCCCGTTT
TTGGGCGCTG GAGAATTTGC TTATGTGCTG TCTAAAGGCT CTGTGCAAAG GTCAACACTA
CCGATGCTAG CAGCGCCTTT CCTCATTCGA TTGGTCTTAA CCTCGAGCGA CCCAAGCATT
CCACCGCATT CGTCTCGAAA ACGGTATATC TCAGATGTCA TCCGCACACT TCAGGTATGC
ATTCTCTTGG TGGTTGGTCC TAGAGTTCTA CAAACGGGAT CAGGCTTGGC GTATAGTTTT
AATTTACCAC TCGGCGGACT GGTGGGTATG ATGATGTGGA CGGATACGTT ATGGTCATTA
ACGATTAGCG GACTAGGTTA GTTATTGTCA ATGCAATTGT TTTTGCGTAG CACATCCCTG
GATTGTATCC GAATATGTTC CATCCGTATA GTAGTAACCG CAGAATCAAT CACTTCATGG
GAACATGGAA GCACTTGGGT CCACTTATCG TACACCTATG GCATCAAATC GTTTCCAAAA
GCACCGAGGT CTTGGTTAAA ATCTTTGTCG TACTTTATTA CTGGTGTGCC GTCCTTCATT
GGCTGCTATG CTCAGCTGTC GTTTAACTGC CCTCTCATCC AAATGGTCTG GAATTTGTTC
TGTATATTCC TGGGAATGAA CTGATTCAGC AAGGTTGAAT GGGTAAAGGG AGCGCCGCTT
CGAAAAAATA GATCCCTCGA CACAGTACGG AATAACCCCA TACTGGTTAC TATCGATGAA
TCCTACCCAT CCCAGACTGG CAAAAGAAAC GTCCATTACA TATCTTCCGG ATGCAGAAAC
AAATTCCTTA TAACCCGCCA CGAATCTCCC ATCCGGGGCA GTCTCAGTCA AAAATGGTAG
TAAGGGAAGC GAATATTCAT CGGCCAGCCC CCTCACTTCG TTTCTAGTTG CTTCAAAAAT
TCGTTCTTTT ACTCTTGCAA TGTGAAATGA TGGGATGGTG GCTCGATCCG GAGCTCTTGA
TGTTGGAACA ACCCGAAGTC TCAGAGATGG ATGTAAAAAA GCCTGTGCAT TTATATGATG
CTTCGCCTGC ACTACATCTA TACGGCCCAA AACGCAGGTC TCCTCGTCTT CGTCCCATAT
GCCTTTCGTG TTTTCTTCAT CCTTGCCCAT CCAGCTGGCT TCGATTAAGA GGCTCTGTCC
CTCCCGTAAC GATACTTTCA ACCCATTCCT TGAAGCCGGA ATAGGAATTG CTTCTGGGCG
AGTGAGTGGT TCCATCAAGT GTGCAGGGAA AATTTTGTAT TGAAGCGCAC GTGGACTAAT
AACACCGGGT GTGTCCCACA AAGCGTGAGA GTCCGAAGGA AAACATGGCA CGCGAACTGC
TTGCAGCGTG GTACCGGGTA GATTCGATCC GGTGACTTTC AAATTCTTGA TCGTGGCTCT
CCGTTTTACA GCGAATCGAT TTTGTCCCTT TAAATACACC GATTCAGCAA TTAAAGGTGA
CAATGTTTTC ACCAAACTTG ATTTTCCGAC GTTGGCAGTG CCGATGACGA ACACATCTCT
ACCTCCTAGC TGCAGGAGTA TGCTTTCAGC CAACCGCACC AATCCGACGC CATTTGTAGC
ACTAACATCG AAGACGGATG TAAATCGGAC GCCGGACATT GCCTCAATTC TCCGAGTTAT
ATTCATCACA TCACTTTCGC TGCAACGAGG CAACAGATCA ATTTTGTTTA TCACCAATAT
CACCGGAATG CTTCCAATAG TTCTACGCAG ATGCTTAACG ACAGTGTGTT CCGGATCAGT
GGCATCCACC ACCATTATAC ACATTCCAAA CTTGCGTCGG GCTACAATGA AGCGTAGCTG
CTCGCTAAAG ACTTTGGGTT CAATATCGCG CAAGGCATCG TAGGCTCCCC AAATATCATT
TCTTTGTAAC GATTGACAGC GACTACAGAG AAAACTATCC ATTGGGCGTG TCGCATAATC
TCCAACATCC ATGTAACGAG TTTTCTTCTG TATTCTTTTG CTTAAAGATG ACGTATGCTC
CATGGTATCT TCGCCACCGA CAAGGCGAGT TCCTGTTATG TTCGCTGAGT CTGTATTGTT
CGACCTTCTA CCGGATACCT TGGCTGAAAC AACTTGTGTC CCGCAGCCAG AGCAAAGTTT
CGGTACAGCA TTTGATATTC GTTTGCCAGC ACTGTGCTGC TGGAGAACTA CCCGGCCTTT
CACTCCTCTT GGAGGCGAGC CGCCAGCTTT GTTCGGTTTG CGCGTGGCGA TTTTGCTCGA
GGACGAAATG GTTCCGGGAG CACCCGGGTG TTTCTTCCCT TTGGAGCTGC TCGGTTGTTT
CTGTACCGGC GACTGCTGTT TCTTTTTACT GCTGCCCTTC TTTTTTGACT TTGGGCTTTT
TGCAGCAACG GCAAAACGGC GAGATACTGT TGCTGTGGGG GCTAGTTTGG ACGATAGGAA
CGTATACGAC CTTCCTAATT CCGCTAGCAA AAAGGGACCA GCGGAAAGAT GTTGCTGGGC
GGACCAAGAA AACGCTGCCG TTGCTGTTTG GTAAAGGTAT TGCTGATTGT TGTAGCGATC
TATCGTTGTT ACGACACTGC TCGCTGGGAA CACAGGAGTC GGCTGTTGTA ATTTCATCGC
GGTCAAGCCT TTTCGCGGAG AGGCGCCTAG TCGTATCCCT GTTCTAATCG ATCGCAACAT
GCTTGCATAA CAGCTACCCT GGAAAAAGGT GATGGGAGTG CAAGGAAAGG ACGCTAGACT
GTGTCACTTG AGCTTCTCTA GTTGTGCATG AGGAGAGAAG CCAGATCAAC AAAAGAAACT
TGTTTGGTAG TAAGGGTGAT TGAAGG
 
Protein sequence
MSGTAPNLST GVSTSDNERR RISSNDPGKD EVGVQDTITF FTTDITALNT LGASIWTYLA 
RAAGKLQATI RIASFLFMGY GFFLSQTLLF TSEECGMTYS WRRFLELDIS SIHPVGRSPY
RLYKFYDQRD PRHERFLQQE SVTTSRKAST DWCLNAAFPT AVVYIPGHGG SYQQSRSLGA
HGIQLTRQRD VTQNYVVQAL QKGMWHGNAT QLENFVYDVY ALDFAEEGGG MHGDFLVDQS
RFVSKAIHFL SEACGFSSIT VVAHSIGGIS IRLALVRDEK LRLLVTNVIL LGSPQARTVL
AWDPSLEKIQ TEIVENHVNG TAFVAISGGL RDEMIPPAAC ELVPKDNNTL TLLAVDIMPK
EASSPSFGMD HRAIVWCHNV LVPLRKIIFA LVRSERDGEA APARIGAVQS LFDRSKTQNY
NTALQRMMTT FRKVHGPVAS LAMVTGLLHN AELLLGLFAY ISLWSKAGFA SCFNSHFGVF
SGRNPSYFVV DSTSVISTEA DDVPKQRDKL ALGVSIVHVI FGVVRLLRPN DFAIEMSNSI
NIALIASIYP LALRRIHKFA QKVGSSRFSF IDLDLLTIVV VPFLGAGEFA YVLSKGSVQR
STLPMLAAPF LIRLVLTSSD PSIPPHSSRK RYISDVIRTL QVCILLVVGP RVLQTGSGLA
YSFNLPLGGL VGMMMWTDTL WSLTISGLG