Gene PHATRDRAFT_46292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46292 
Symbol 
ID7201223 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp835308 
End bp837856 
Gene Length2549 bp 
Protein Length772 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180717 
Protein GI219119933 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGATA AGAGTACCGC GAATTACCCC ATAGTGGTCG TGGATGTGGA CGACGATACG 
ACCGACGGCG AGAACGACGA AGTATCGACC ATTGTCCGTA CGCGAATCGA CACGGCGAGT
CCGACAACGT CACCGATTCG GCCCCACACT CCACTCGAGG AAGCAGTGCA CGGGGGCTTG
CGGTTGACCC GGACGCGCCG CACTTATCAC GAAACAGCCG CACCCCCGAG CTCCCCCGTG
ACCGCGTCGT CCCCCCCGTC GACACCACTC TCCTTCACGG CCCAAGCGCC GAAAAAACGA
AAGGCACCCG TCGTACACGA ACCGTATTCT GCCGCAGCTA ATATGCGAGC CGCCATGGAA
CTGGCCAAGT ACGAATTGTT GGTGGACCCG TATCGTGGAA TGACAAAGTT ACGAGACGCA
CACCACGACG AGGAACGGGA CGAGACGGAT ACGAACGGAC ACGTGGCTAC GTCGCCGTCG
CTAGCAACGG TGGAAATAAC TCCCAGGCCC AGTCCCTGGC AAACATCCTG CGCCGTCGCA
ATCCCGGAGA GTGGTGATTG TAGGAGCCCC ACGGATGATG GAGACGACAT TGCGAATTGG
GAGGCGGCCG ACATTGTGAA ACGTGCTTGT AACTGTTCTT GTCCTTCATT GACGTCCTGG
ACCAGCGCAT TGGCTGTCGC TCGCGACAAG AAGAGGAAAA AGTCCAAACA CCAACGCAAA
GCCAATTCGA CTGTCAATAC ACATGTATTG GACGGTGATA CGGTGCGGAT TGGGCAACGA
CCCGATGGAC ACGCCGTACG CTGCCCGTGC GATTATAATC CTTTCTGTCT CGTGTCCTTG
GGCGGTGTTG TGAACGAAAT TCTCGTCGAC CGTTACAAAG AGTTGGAGAC CAAAAACGAT
CGAGCGGGCA ATGGCGAGGT CGAGGAAGTT GTCGACGACT CCTCTGTCAC GAACGATGCC
GAAGGTCCCG TTCAACCAAA CAGAATCATA TCGGACGCAA CCCAAGAAGA CATGAATGCC
GTCCGCCGAA GTATATCCGT GAGAGTTGAG CCTATTCGTA GCTATCTGGA ACATACCTTG
CAAGATTTGA CGCCGGCCCT TACCCTGGAG GACTGCATCA GCCGTATTCG AAAACGCCAT
GCTGCTCTAA TATTTGTCAA TCCACTTTTG AAAGAGAAGG CCGATTCACC CAAGGATAAT
GATGCACTGG TCATGTCCAT TCCACCCGGG ATGCAAAACT TGGGCGCCAC TTGCTATCTG
AATACACAAC TGCAGTGTTT GGCACAGAAT CTGGTCTTTT TGGAAGGGGT CCTGTCGTGG
CGTCCGCCAA CGTCCACAGT GGACGGGAAT GCAAACCCCG ATCCCATTCC CCAAATGATT
GAAACGTTTC AATCGCTCCT GGCTTCCATG CGTATCGGAC CACACTATGT CTTGAATACC
AAAGACTTTT CCAACGCACT ACGCTTGGAC CACTATGAAC AACAGGACCC GAACGAGTTC
AGTCGGCTCC TGCTTGACTG TATACAGCAA AGTTTTCAAA GCGCCACGCA ACAACGCGAT
TTGGCAACCT TGCTGCCCCA TCTTTTTCAC GGCAAAACAA CCTACACGAC CACCTGTCAA
GTATGTCACA AAATGTCCAC CACGACAGAG AACTTCATGG ACGTGACTCT GCCGATCGTA
AAGCCGCTGC GAGAAAAAAG CATGCCCGGC CAACAATCGC TTGCCGATGC CTTTGGTAAA
AGCAAAGCAA AAAAGAACCT GCAAAGCTAT GATACGGATG TCCAGTATTG CTGGGATCGG
TACGTTTATG CAGAAACGTT AGAGGGTGAC AATCAGTATT TCTGCACCGA ATGCGAAGCG
AAAGTAGACG CTCAGCGGGC CTTGACTTTC TCTGCACTTC CACCCGTTCT AAACATTCAA
TTGTGTAGAT ACGTTTACGA TAGAAATCGC GGTACAAAAA AGAAGGTGAC GGACAAGGTC
CTCTTGCCAA CGGAACTAGA AATTGAGCAA GCGACGACTT CTGCACCTTC AACATCACCG
CAAGTCGAGA CGGAGCCCAC CAAACATCGG TACGTGTTGT GCGCCGTCAT GCTACACAAA
GGAAATTCGG CGTACAGTGG TCATTACGTG GCTGAAGCGA TGGATTGGCA AACAGGACAA
TGGTTTGAGT TTAATGATGC TCATGTAACT CTGCTGGAGG CGCCCTCATG CAGTTGGGAC
CCTTTGTTTG ATAGTGACAG AAGTGACGAT CGCAAGAAAG ATACAAAAGA GACCAAGGCA
AAGAAGGGCA GCGAGGATGC CTACAATATG TATTACGTTG AAGAGTCGTT TTTGGCGCAG
AGTGTTTTGG ACAGCATTCG AGAAATTGAT CCACAGACTG GTGCTTCGAA GTCTGAATCA
GAGGACGGAG CGTCCGCTTT AAAGACAGCT GCCCTGACAA GATCTGAATA CTTTGCTGAT
CTGAGACGGT AAGTTTTGGG ACACAAAGCA AGAGTGCATT TCGTTTCTGT TCTTTGTTTA
GTACGTACGG AACGTTCGGT AACTTATAA
 
Protein sequence
MTDKSTANYP IVVVDVDDDT TDGENDEVST IVRTRIDTAS PTTSPIRPHT PLEEAVHGGL 
RLTRTRRTYH ETAAPPSSPV TASSPPSTPL SFTAQAPKKR KAPVVHEPYS AAANMRAAME
LAKYELLVDP YRGMTKLRDA HHDEERDETD TNGHVATSPS LATVEITPRP SPWQTSCAVA
IPESGDCRSP TDDGDDIANW EAADIVKRAC NCSCPSLTSW TSALAVARDK KRKKSKHQRK
ANSTVNTHVL DGDTVRIGQR PDGHAVRCPC DYNPFCLVSL GGVVNEILVD RYKELETKND
RAGNGEVEEV VDDSSVTNDA EGPVQPNRII SDATQEDMNA VRRSISVRVE PIRSYLEHTL
QDLTPALTLE DCISRIRKRH AALIFVNPLL KEKADSPKDN DALVMSIPPG MQNLGATCYL
NTQLQCLAQN LVFLEGVLSW RPPTSTVDGN ANPDPIPQMI ETFQSLLASM RIGPHYVLNT
KDFSNALRLD HYEQQDPNEF SRLLLDCIQQ SFQSATQQRD LATLLPHLFH GKTTYTTTCQ
VCHKMSTTTE NFMDVTLPIV KPLREKSMPG QQSLADAFGK SKAKKNLQSY DTDVQYCWDR
YVYDRNRGTK KKVTDKVLLP TELEIEQATT SAPSTSPQVE TEPTKHRGHY VAEAMDWQTG
QWFEFNDAHV TLLEAPSCSW DPLFDSDRSD DRKKDTKETK AKKGSEDAYN MYYVEESFLA
QSVLDSIREI DPQTGASKSE SEDGASALKT AALTRSEYFA DLRRTYGTFG NL