Gene PHATRDRAFT_45690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45690 
Symbol 
ID7200469 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp946578 
End bp949899 
Gene Length3322 bp 
Protein Length1015 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179943 
Protein GI219118332 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.136558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCCG ATTTCGACGA GGATGATCTC ATCAATGACT ACATTGAGGA ATCTTACGAG 
CCGCCGACGG CCGAATACGA TGTAGACTTT TTCGAAGAAA TGATGGCCAG CGGTGGCGTT
ACCAAAACGA CGACGGGTAG TAGGACCAAC GATGCGACGA CAATGGGCAA GCAGTCAGTA
CTCGTGCCGG TCGAGAATAC GGGTGCGGTA GTCAATCCTG CGCCTGTAGA TCCGCACGTT
AGTGTTCGTG ATGCATTCGA ACAACGCGCC GAAAAGCCTA CCGAGAATCT CTTCACCTTT
GAGCGGTACG TGACGTCGCA TTACGAAACG AACAATCGGC ATACCGACGA ACGATGATTC
CAAAACTCAC GAGCGATAAT GCCAAACGTA CTTTTTCAGG TACAACTACA ATATGGATTG
GAGGGCTCCT CGCCAAGCCA ATTCGCCGAG CGGCAATACC ATGCAAGCCA AGGAATGGAA
AAAGTCGGAA CCGGGAAGGA GACGGAATCG AGACTTGTTC GGAGCATACG ACGACGACGA
CAATCCGGAA ATCACGCTAT CCGTGGGAAT ATCCAGACAA GTGAGCAATA GATTGCCGTC
GGCACCGGAT GCACAATTGT TGGAGTTTGG CACCAAATCA GCCCGATCCC AAATGGGTCG
GAAACGACCT TGCTACCGAT CGCACTCCAT GCCCACACGA CCACAGGTGG GACGGCACCA
GCAAATACCC ATGACGCTGG GTGACGGGAC CCGTGTGCAT CTCAACGTGA AGGTTCCCGC
TAGTGACGGC AAGCTAGATG GTGGTGAAGG TATCGGACAC AAAAACGATA CCCACAACTC
TTTGGGAATT TCTGTTGCGG AACTCATGGA GCGGGTACAA GCAATTCGCC GTAGGCAAGA
ACACGACAAA CAGCAGCACT GTAACACTGA TCATGACGAC CCAGTCCACG GAGATAGTCA
CCGTCATTTG GGGACGGAGG ACCATCGTTT GTGGGTAGAT AAGCATGCCC CCACGTCCTT
TGCGCATCTT CTTTCCGACG AACGTACCAA TCGCGAAGTA GTGCGAGCTC TGCGCGCCTG
GGATCCGTAC GTCTTTCGGC GAGATCCCCC ACCGCGGCCC GATTTTGGGT ACTCCGCCAA
ACCATCGGAT TTTCATTCGG ATCGCAAAAA CGAACATGGC AGTGGCAAGA GTAGGGATAG
CAGTCGGCAA GATCGCCGCC CGGAAGAGTC GTGTAGAGTC ATTTTGTTAT CGGGACCGCC
AGGTGTCGGC AAGACTACTC TGGCGCACAT TGTCGCCCGG CATGCTGGTT ATCGTCCATT
GGAAGTCAAC GGATCGGACG AACGTTCGGC TTCTGCTTTA ACGGAACGAA TCGTTCGAGC
CATGGAATCT ACAACTCTCC ACACTGCAAA GATGCGAAGA AGTGTACACA ACCACGAGTG
CAAAGACGAT TCCCTACCAA AGCCCAACTG TGTCATTCTG GACGAGATTG ATGGTGCGGA
TGCCAAAGGT TCCATACAGG CCATTGTGAA CATAATTCGA GCCGATATTC CAGCCAAGTC
ACAGGCTTCC AAAGCACAAT ATTTGCGGCG CCCCTTAATT TTGATATGCA ACAACAAATA
TGCGCCGACG CTGCGGGCTT TACTGCCTTA CGCAAAAGCC TTTCACGTCA ATCCGCCGTC
GCCAGCTCGC TTGGTTGCTC GCCTCCGATC CGTGCTGACG GCGGAAAACC TAACAGCGGG
AGGTGGCAGC TCGTTGCTAA ATCAATTAGT ATCGGTCGCA TCCGGTGACA TTCGCTCCTG
TTTGCACACC TTGCAATTTG CGTCCTCGCG GTCCAAGGAG CTGGCCACCC ACGCGGAAGA
AGCCCCGTCC GTTATCGATT TGTCCGACAG CTTGCGCGGT GCCATGTCGG GGGATGGCCT
CAAGGATGAA CGAAACGATA TGGCTGGTAC AATCACGAGC GTATTTCGGA AGAGAAAGGA
TCGAACCTTT CTTGATAGCA AGCGTGTCAT GCAAGACAAG CGCCCGAGCT CAACGCGCAT
TTTCGAGGCT GTGCAGGTAT GTAGACTGTA GATTGGTGAT AGAATTCACA TTGTACATTT
GTCGTTACTG ACACCATTTC GTGGGTGGCT ATAGAATTTT GGGGACAATC TTCGTATCCT
GGATGTTTTG TTTCTCAATG TACTCCGCGT TTCGTACATC GACCCAACCT TGGACCGCTG
TGCAGCAGCT CACGAATGGT TGTCGAGTTC GGATCTGTGT CCGCGGCAGG TTCCGTCCAC
CGCCGGTGCG ATTCATTTAC TGTGTCGCGT CGAACAGCGC CCGGACTTAT CCTTTTCAAC
ACGGGAGCTT ATGGACAGTC GCTACCAGTT TGAAGCGAAT CAGTCTCTGG CGCAAAAGTT
TGCTGAAGGG CTTTCAATGC AGACACGAAG TCGGTCAACG AGCTTATTGG CGACGGAAAC
CATCCCGTAC AGCTTATGGG TACTTTCCGC AGGGGAAGGT AGCAGTGGGG CCCTAGATCG
GGCCGCTACG TCTCTGCAAA TTCTGAATAA GGCAGAACTT GGTTCCTTTC ACAGACACGT
TATGTCTTTG CGATGTTTGG GCCTTAGTTA TGTCGCGGAG CAGGAGGAAG CCGCACCGGG
CGAGTTCAAA GGGATCACTG GCAGCGTTCT CCGTCTCGAG CCACCAATCG ATCGCCTCGC
ACACTTTATG GACCTGACGC GAGCCAAGAG TCAAAAACGA ATTGAGATTC CCATAGCGGT
ACGTTGTGTC GTGGGTGCCG GAACCGTTTG CGGCGGTGTC TCGTGTATTG GTTCACTCAC
TCATCCTTTT GCTCGCCCTC GCTGTCTTGA CTACTTATAG ATGAAAGAGT TGCTGGCACA
AAGTGTGCTT CACGAAAATA TGCGCCATCT CGGAGCCCAA GCGCAGTCCA AGGCGATCTC
CACGAAAGTT CGGTCCAAGC TGCCGGCGCC CTCCGTGGTG GCCGCCCCGA TGGAAGCAGC
GCCTGAATTG TCCGCGATGA ACACTTCCTC TCCCGACAAA CGGGAAGCGG CTAGTAGTTC
CGACGCACCA CTCGCCAAGC GCCGCAAAAC ACCATCACCA ACCAAAGCTA CGGCCCACAA
TTTTTTGGGA CTACAGGCTC GCAAAGTCAA GCAGCAGCGG TCAGCCCGTA CGGCGGCCCG
CGTGGGAGTC GAGCGTTCCC ACAAACATCA AACGTCTCAT ACGGGCAGTG GTGTTCCGTT
GACCCAAATT GTCCGCCTCC GGTACATCAA GGGTTTTACA CAGGCCGTTC GGGCACCGTG
TCGATTGGAA GACTTGGCGT AA
 
Protein sequence
MEPDFDEDDL INDYIEESYE PPTAEYDVDF FEEMMASGGV TKTTTGSRTN DATTMGKQSV 
LVPVENTGAV VNPAPVDPHV SVRDAFEQRA EKPTENLFTF ERYNYNMDWR APRQANSPSG
NTMQAKEWKK SEPGRRRNRD LFGAYDDDDN PEITLSVGIS RQVSNRLPSA PDAQLLEFGT
KSARSQMGRK RPCYRSHSMP TRPQVGRHQQ IPMTLGDGTR VHLNVKVPAS DGKLDGGEGI
GHKNDTHNSL GISVAELMER VQAIRRRQEH DKQQHCNTDH DDPVHGDSHR HLGTEDHRLW
VDKHAPTSFA HLLSDERTNR EVVRALRAWD PYVFRRDPPP RPDFGYSAKP SDFHSDRKNE
HGSGKSRDSS RQDRRPEESC RVILLSGPPG VGKTTLAHIV ARHAGYRPLE VNGSDERSAS
ALTERIVRAM ESTTLHTAKM RRSVHNHECK DDSLPKPNCV ILDEIDGADA KGSIQAIVNI
IRADIPAKSQ ASKAQYLRRP LILICNNKYA PTLRALLPYA KAFHVNPPSP ARLVARLRSV
LTAENLTAGG GSSLLNQLVS VASGDIRSCL HTLQFASSRS KELATHAEEA PSVIDLSDSL
RGAMSGDGLK DERNDMAGTI TSVFRKRKDR TFLDSKRVMQ DKRPSSTRIF EAVQNFGDNL
RILDVLFLNV LRVSYIDPTL DRCAAAHEWL SSSDLCPRQV PSTAGAIHLL CRVEQRPDLS
FSTRELMDSR YQFEANQSLA QKFAEGLSMQ TRSRSTSLLA TETIPYSLWV LSAGEGSSGA
LDRAATSLQI LNKAELGSFH RHVMSLRCLG LSYVAEQEEA APGEFKGITG SVLRLEPPID
RLAHFMDLTR AKSQKRIEIP IAMKELLAQS VLHENMRHLG AQAQSKAIST KVRSKLPAPS
VVAAPMEAAP ELSAMNTSSP DKREAASSSD APLAKRRKTP SPTKATAHNF LGLQARKVKQ
QRSARTAARV GVERSHKHQT SHTGSGVPLT QIVRLRYIKG FTQAVRAPCR LEDLA