Gene PHATRDRAFT_42968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42968 
Symbol 
ID7196205 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1657469 
End bp1659630 
Gene Length2162 bp 
Protein Length609 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176827 
Protein GI219110151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCGACACAT ACAGGCTTAT AAGGAGCACT ATGTGTTGAA CAAGTTACAA GTATAGAATC 
GGCAGCAAGA AGGTAAGACA ACAAGCGAGA ATGTCCCGAA ACAACGCTTT TTACTATTTG
CGCCGAAGAA AGGTGCGATG GCCGTGTTTG CGATGGTAAC GAGGGGCGCA AAGTGTTCCG
TCATAAATGC TTGTGGAGGT CACTTCTCAT CTTCTTCACA AGTTCAATGC CTCTATTCGA
AGCTTCCCAT TAAGGAATGA AGAAAACTGT CCTACAATGC CTCAAGATCA TCGGCGTTCT
CTCTACCGAC TTGACGAGTT GCGACTCGTT ACAGCAAGAG TTCGTCAACA TCAAACGCGT
CTATTTCAAA AAGATTCTTG TGTGCCATCC GGACAAAGGC GGCGACGCAG AAGTCTTCCG
AAATGTGCAA ACTTCCTTTG AAGTGCTGCG TGGAATCTAC GAAAAGGAGG CTATCTCATC
TTTTATCACA GAATCGGCCT TTTCCGCCCA AAATTTCGAC GATGTCTTTC GAGATTTTGG
TGAGATGCCG ACGCCATCTT GGGAGTATTA TCATGAAGCT GCAGAAGAGA ATTTTCCCTT
GTACCGCGTA GAGCGAGCTC GGTCCAATCG TAGCCGCTGT ACACAAACAA CAAAACTTGG
GAAGAAGTGC GGGGATGATA AATTGATCGC CAGTGGCTCG GTTCGTGTAG GTTCACTTGA
CGAGAAGACG GGAACATACA CACGATGGAA TCACTTGGAA TGCTGGCGCG TTCCGAGCCG
TATATGGCTT GGTATTGCAA GTTCTCGTGA GGACGAAGAT TCGGTTCTAA ATACGAAAAA
TGTGCTTCAG TCTCTTCTAC AAATGAATGA CGTTCTTCTG AGTGGAATAA ATGAGATGAA
TGTTGAAGGG CAGATGGAAT TCGTCACGCA TATCATGGAT CGGTCTAGTT GGGCTCGGAT
TTCTCAGCGC AGGAACATCA ATTCATCTAA GGAGCTAGAT ATGTCTGTCG CCAAACAAAC
TCAACCGGCA CGTCAGGCGG CCGTCTTTCC ACGTGACGGG CCTTCATCAC GGGAATCATT
CATTGTTCCA ATCCCCGGGG AAAACGGCGT AGCCTCTGAC GCTTTGAAAG GGAAACGGGT
TGTTTTGACG GGCGTATTTC CGGAAGTTGG CGGAGGCCGC GGCTTGACAC TTGGCAAAGA
GAAAGTCAAG AAGATGGTAG AATCTTTCGG TGGTCGTGTT ACAGGTTCAG TGTCTGGAAA
AACGGATATG GTACGTAAGT TGGGCACTTT TGAACTAGGA TAATTCTGAA AGTGTTCTGA
TAGTATTTAC ACTTTCTCCA TCGACGAACT TGAAGTTAAT TGTGGGCAAG GACCCTGGTT
TTTCCAAAGT TTCTCAGGCC CGAGGTCGAA GCGTTACGTT GTTGAGTTTG AAAGATCTGA
AGGAAGGCAT CGAGAGCAAC TGTCTCGAGT ATACACGGCA CGAGGACCTC AAAATCTCTT
CCTTTTCTTC GGGTTACAAA AACAACTCCG CTGCGCTGTC GGCTTCGAGC AGCGACTTCG
CCATTGCTGC TGGTCAAGCT CCAGCGCTTC CGCAGGCAGG ATTGACGAGA TCTTCTAACT
CAAAAATGCC AATTGCGAGT TCGGGGACCT ATGCATTTCA TTCTGCAATA AAAGCGCAAC
CAGTGTTGCC TGTGACGCTA TCGTCGACGT CGCCATCATC ATCCAACCTC CTAGAATCTG
CCAAATTCGT AGTAGGCAGC CATGAGGCGA GTCCTGTGCG TTCTGCTGTG CCGAAAGGAG
GGCACGGCTT ACTGGGTAGA GCCCGGTTTG TCGAAGCACC GACTATGCTC CGAACGAAGC
TACAGGAAGC TATACGAACC GGCGCAACGA TGGACCAAGA GTCGAAGATC CCGATCAGTC
AAGAAACACA AAGAACTCTA GAGAACACTC TTGCAGAAAT TGACGGAGTT GGAGAGCAGG
GTCCACAGCC GCAACCAAAC ATACCCTTGC CTGACCTTTG CGACTCAATA AAAGGTCTGC
CCTTCCGAAT AACACGATAT CGTTATAAAA CGCTGAAGGC GGAAGTAAAC TTACAAAAAC
AAAGGACGGC CAAACGTGCG CGAGAAGGTG GGTAAACGGA CGCCTCGACA AGCAATTTGT
GG
 
Protein sequence
MKKTVLQCLK IIGVLSTDLT SCDSLQQEFV NIKRVYFKKI LVCHPDKGGD AEVFRNVQTS 
FEVLRGIYEK EAISSFITES AFSAQNFDDV FRDFGEMPTP SWEYYHEAAE ENFPLYRVER
ARSNRSRCTQ TTKLGKKCGD DKLIASGSVR VGSLDEKTGT YTRWNHLECW RVPSRIWLGI
ASSREDEDSV LNTKNVLQSL LQMNDVLLSG INEMNVEGQM EFVTHIMDRS SWARISQRRN
INSSKELDMS VAKQTQPARQ AAVFPRDGPS SRESFIVPIP GENGVASDAL KGKRVVLTGV
FPEVGGGRGL TLGKEKVKKM VESFGGRVTG SVSGKTDMVL FTLSPSTNLK LIVGKDPGFS
KVSQARGRSV TLLSLKDLKE GIESNCLEYT RHEDLKISSF SSGYKNNSAA LSASSSDFAI
AAGQAPALPQ AGLTRSSNSK MPIASSGTYA FHSAIKAQPV LPVTLSSTSP SSSNLLESAK
FVVGSHEASP VRSAVPKGGH GLLGRARFVE APTMLRTKLQ EAIRTGATMD QESKIPISQE
TQRTLENTLA EIDGVGEQGP QPQPNIPLPD LCDSIKGLPF RITRYRYKTL KAEVNLQKQR
TAKRAREGG