Gene PHATRDRAFT_46170 details


Gene Information

Locus tag: PHATRDRAFT_46170
Symbol:
ID: 7201251
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 468751
End bp: 471998
Gene Length: 3248 bp
Protein Length: 1069 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002180644
Protein GI: 219119783
COG category:
COG ID:
TIGRFAM ID:
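
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding length is three bases per residue plus one stop codon. The record states neither convention, and the function names are illustrative rather than taken from any database API.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases for a protein, optionally counting one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the record above.
gene_span = inclusive_length(468751, 471998)
cds_bases = coding_length(1069)

print(gene_span)              # 3248, matching the listed gene length
print(cds_bases)              # 3210
print(gene_span - cds_bases)  # 38 bases not accounted for by coding sequence
```

Under these assumptions the gene span exceeds the coding length by 38 bp, which is consistent with the record marking the gene as spliced.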


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAGGG AGACCAAACG CCAGAGCGTA GCTGATAACA CACAAAAGGA TATAACGTAT 
CGATATGAAG TATCGTCACT TCAAAACAGA ACATCGGAAA ATGCTCAACG TGATTACATT
TCGGCCTATG GGAAGAGTCA GAACGATGTA ATCTCCCCAA CAGCAGTTCA TCTGATTTCA
AGGGTCTCAA CAGATCCGAG TAATTCTGTG CATAGTGATC TGCCGTCGTG CATGGGCCCG
AGTCCGAACT TATCGGTATC TGGAAGTTTG TCTCGTAATC ACACTTCCGC CGAGTCTGAT
TCTTCTCCCT ATTTTGAATC TTCCACGCCG CTGATGCCCT ATTCTTTGTC AAAACCTCAC
CCTAAGGGGG AAACCAATTC GTCGGAAGCA TGCGCCAGCA ACGCAACTGG TATGGTGTGT
TCTGTGCTCC CGGAACCAAA CCGCTCCAGG TGGCGACAAA CCTTTGGGCC GAGCCACGGG
ACCGATCAAA CGGCTCTAGA GAACGTCAGT CTTAAAGCAC CGCAGCGTGC TTTTAAAAAC
TCATCACGTG ATCAGTTTGG TGCCGTGGGC GACGATGATG ATATTTGCTC ATCTTCTGAC
GAGGAAAATG GAATGATTGC AATTGGTATT CCCGATAAGA TTCAATCACG CGCTAAAGAA
AAGATGCAAC AAAGCAATAT CTCCCAAACT ATCGCTGAGG AACTAGCGCG ACAAGAGCAT
GAGGCAAAGT TAAAGAAGAA CCAGCTTCCA GCCAAAGCAA TCGTCACACG GCGCGTCACG
GAACGCAAAC TGGGCGCTTC GAAGGATCCT CTATCACAGA ATCCCATGAT TGGAGAGAAA
GATCACGTAC TCCAACCCGG CTCGGTTGTT GCTCGAGCCG GTTCAGAATC GCATCGTACC
ATCATGGAAG AAAAAAGCCT GGCTCGTGGT ACGCCAACCC GTGAATTGTC GCTGATGCCA
GGCGCATTCC GAGCCACGAC ACGTAGTGAC TTGGACCAAA AGAGTGCAGA GCGCGGAGTG
GTGCGTCCCA TTTCATCCTC TTCGCGGAAC ATGAAGGCGC GTCGATTAAA TCGGTTAAAT
GGACGTCAGA ACCATCAAAT TGAGGCTAGC GTGAATGAGG ACGATCAGCG CCGTCTGACA
CATTCGTACA GCTCCTCGGA CGAGTCGGCT GTCTTGCTTT TGCCCATACG TACCAAATCC
TCTGATTCCA TCAAGAGTGA TGTAGATTCG AGTAATCATT CTACCCGCGC CAGAGCCCGT
TCCCGTTTCC ACCGCATGCG ACAAAAGTCT ATGGATTCTT CGGCTCACAG CTCAGATGGA
TCTACAGTAG TACCTGCTTC AATTCGGGAT CTTGTCTCAC TGCGCAGCAT GGAAGAGACC
ACTAAATTGC AGCGACACGA TTCTGGGCCT TCTTTGGCTC CGGCGACAAG TGGTTGTGTC
GTTCCAGGAG CAACAGAGTT TATTGCCGCA GTACGTCACG AAAAGGAGAA TGGTCCATCG
CTGGCGCCAG CAAGAGCCAT AAGGATTTCT GGCGAGTGCA ATGTAGAGGG CTCAAAACAA
AGTGTGAAAA TATACGGACC GGTGTTGGCT TCTGGATTTG TTCCGGATCA AGTCGCCTTG
ACTCCAGGTA TGATGGATCT GACAGGCCAA GAATGTTATC CAGATGAGGA TGAGGATGAT
ACGATTGAAG CACAAGCTGG TCTTCCAGTA TTGATCCCAG GTGCGTTTGC GATTGAAGGT
ATGGAATCGT CTCACACGGC TACCTCTAGG CACAACTCTG TGGTGGACAC GCAAAGTTTT
TCGGAAGCTG AAGAAGTATA TGGGGAAATC GAAGAAGATC AAGCAGATAC AGAAATTTTC
TTGGAGCCTT CCCCGGACGA CACACCGCCT CTCGTAGCCG AATTACACGA AGAAGTTGTT
GTAGACGGAG CGGTCCTTGA AGAACACGGT GAGGACGATC CAAAGCAACG ACATAGGCTG
CGTCTTTTTC AGGCAATGGC TTTTTTTTGT TCGGTTGTGG CAGTTACCCT CATTGCAGTT
AGTGTTTCTG GGGTTTTCCA ACCAGATCAA GCTGGTCCAC AGAAGACAGC GCCGAAAATT
TCCGGTTGGT TAGCGGCGGG TGAGGAACTT TTCGGATCTA CAGAGGAGGC GCAAGTTTTA
TTTGGTACAT CCATTGGAAT GTCAGGAGAC GGTTTCATTC TTGCCGTGAG TTCTCCGGGA
TGGGATAATT CCTCAACAGA GCTCAACGTC GGACAAGTGC AAGTTTTTTC TGGAGCAGAC
ACCTTTAACG GGACCCAGTG GGATAATGTT GTCACTTTAG AAGGTCCAGG CTCGAGTGAA
GATGAAAAGA CTTCCATAGC CATGTCTAGT GATGGTAGAC GGCTGGTAGT TGGTTATCCC
TCTTTCAATA GTGGAACAGT ACAAGTTTTT GAAGATCGCG GTCGTGGATG GAGTCCTTAT
GGGGGAGTCG AGCGTATGGA AAGAGATGGT GAAAATATTT GGTTTGGACA TGCCGTAGAT
ATCAGCGCAG ATGGAGATGT TCTGGCAGTC GGCGCACCAC TCAGAAACTC TCTTGCGGGA
GAACAGAGTG GAGCAGTTCG TGTCTTTCGG TCGTCCAACA CACGTTGGAT TCAAATTGGA
TCCGATATTT TGGGCGAATC CATGAATGAC TTTGTAGGCT GGTCTTTGGC ACTCAACTCA
CAAGACGGGT CACGCGTCGC TGTCGGTGGG CCAGTTGCTC GAGATGAGCG TGGGATTGTG
CGCATATACG ATTGGGATGG CTCAACCTGG AAGCAAATCG GGGAAACTCT GACTGGGATC
AATATCTTGA GTAGATTTGG ATCATCTGTT TCACTATCAG GGAACGGACA AGTGCTTGCA
ATTGGTGCTC GAGGTACTGC GTTCGAACCT GGGGAGGTCC GTGTTTATCG AGAGATCGAC
AATGCTTGGG TCACAGACAA TATCTTTAGC GGACTGGAGC CAAGCGAAGG ATTCGGGACA
ACCGTGTCTC TTTCAAAAGA TGGTAATGTT CTCGCGATTG GCATCCCTCA GAATAACGAA
TTTGGCAACG GCAGTGGTTC GGTGCAGGTG TGGAAATACT ATGATGATCA GAAGGCTTGG
AAACAGGAAG GCACCAATAT TGGCGGATCC GAGGGGAGCG CGTTTGGGTC GGCTGTCGCA
CTTTCCGCAG ACGGTTTTCG GGTAGGCGTT GGATCCCCAC TTGCAACGTT TGATGGCAGT
GTAGCTAA
 
Protein sequence
MDRETKRQSV ADNTQKDITY RYEVSSLQNR TSENAQRDYI SAYGKSQNDV ISPTAVHLIS 
RVSTDPSNSV HSDLPSCMGP SPNLSVSGSL SRNHTSAESD SSPYFESSTP LMPYSLSKPH
PKGETNSSEA CASNATGMVC SVLPEPNRSR WRQTFGPSHG TDQTALENVS LKAPQRAFKN
SSRDQFGAVG DDDDICSSSD EENGMIAIGI PDKIQSRAKE KMQQSNISQT IAEELARQEH
EAKLKKNQLP AKAIVTRRVT ERKLGASKDP LSQNPMIGEK DHVLQPGSVV ARAGSESHRT
IMEEKSLARG TPTRELSLMP GAFRATTRSD LDQKSAERGV VRPISSSSRN MKARRLNRLN
GRQNHQIEAS VNEDDQRRLT HSYSSSDESA VLLLPIRTKS SDSIKSDVDS SNHSTRARAR
SRFHRMRQKS MDSSAHSSDG STVVPASIRD LVSLRSMEET TKLQRHDSGP SLAPATSGCV
VPGATEFIAA VRHEKENGPS LAPARAIRIS GECNVEGSKQ SVKIYGPVLA SGFVPDQVAL
TPGMMDLTGQ ECYPDEDEDD TIEAQAGLPV LIPGAFAIEG MESSHTATSR HNSVVDTQSF
SEAEEVYGEI EEDQADTEIF LEPSPDDTPP LVAELHEEVV VDGAVLEEHG EDDPKQRHRL
RLFQAMAFFC SVVAVTLIAV SVSGVFQPDQ AGPQKTAPKI SGWLAAGEEL FGSTEEAQVL
FGTSIGMSGD GFILAVSSPG WDNSSTELNV GQVQVFSGAD TFNGTQWDNV VTLEGPGSSE
DEKTSIAMSS DGRRLVVGYP SFNSGTVQVF EDRGRGWSPY GGVERMERDG ENIWFGHAVD
ISADGDVLAV GAPLRNSLAG EQSGAVRVFR SSNTRWIQIG SDILGESMND FVGWSLALNS
QDGSRVAVGG PVARDERGIV RIYDWDGSTW KQIGETLTGI NILSRFGSSV SLSGNGQVLA
IGARGTAFEP GEVRVYREID NAWVTDNIFS GLEPSEGFGT TVSLSKDGNV LAIGIPQNNE
FGNGSGSVQV WKYYDDQKAW KQEGTNIGGS EGSAFGSAVA LSADGFRCS