Gene PHATRDRAFT_43034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43034 
SymboldsCYC4 
ID7196837 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1856612 
End bp1858383 
Gene Length1772 bp 
Protein Length493 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177388 
Protein GI219111273 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGGC CCTGGGAGAT TAATTTTGGC GAAGACCATA AGTAAATGGG GAGTGTACAC 
TCTGTGTAGT TCGTTCTTGG CCGCTTCCGT CGATGAAATT TCTGGACGCG CCAAGTTAGG
CAAATGTGAG TGAGTCTTGA GAATTGGTGA AGTCTTTTCG CGAGACACGC CAATCATCTA
TACCTATTTA CAACGTAGTT TAATATTAGC ATAGAGATTT TTACTTCTGC AGGCCAGGTA
AATCCTGCCT GTCTCCCCAG CAAATAAATC GCAGCACCAA AAGAACTCCC ATATCTTTTC
TGCCGTGAGT TTTTTGCGTA GAGCAGTGGA GTTTGCGCCA GGCGCACCAT CAAATTTTGT
GAGAGCACGG TCACAGACCA ACTCTCCGCT ACCAGAGAGA CTTTTCTTCT CACACAGCCA
GCATCCAATT CCGCTCACAA CTCGGCCGCC TGGGGAGTTA TGGCAGATTG TTCCGTACGG
CATACAAGTC TCCATCTTTT GCCGTATACG GGGAAATACG ACAACAGGGT CGACCATGAT
GCTCAAGCTT ACAATGAACC CGCGTAGAAG ATCCAAGAGG CATCGATCGA TCCCCCACCG
ACTGCGTTTG CCCACCAACC CCACAGCAAT GGAGACCGGT GATGATGAGG AAGATCGGGA
AAAGGACATG CTGGCAGAAA ATATCGGTGT GATAATTAAA CAAGAGGGTG CAGCCAAATA
CAGCTGCGTT GACTATTTGA GTCTAACTGT TTGGCAACAA AGTGTGTATC AGTTGATGAA
GAAATCCAAG GCGGATCCCA TAACCCACCA TGCTAACACA ATGATCGATG AATACTGCCG
CGAGCAAATT GTTGAGTGGT CATTTCGGGT CGTTGATTAC TTTCGCATAG ACCGTGAAGT
GGTAGCGCTT TCAATGTCCT TTTTGGATCG ATTTCTGGCG ACTTGCCGAT GCGATCGGAC
CAGTTTCAAA TTGGCGGCAA CAACAACATT GCACTTGGCT GTTAAACTCT TGTATCCATG
CAAACTAGCC GATTTGGGTA TCTTGAGTGA TCTCAGCCGA GGCGAGTTCG ACATGCACGA
CGTGACCGAA ATGGAAAGCC ATATCCTTCA CGCACTTGAG TGGAACCTGC ATCCGCCTAC
ATCCGCGGCA TTCACCTCAC TTTTCCTGGA CTACTTTTTC GCCACCCGCG CAGTTCATGT
GTCGAACGCT GATCTTGACG ATATCTACGA CGTATCGTCC TTTTTTTGTG AGTTGGCTAT
TTGTGATTAC TTCTTTGTTC CTACCCGAGC GAGCGCGATT TCTCTTTCCG CTATTCTGAA
CTCCCTGGAA GGTATGTACG GTCCCGACAA TCGCCTATCT CATGCCATAT TGGAAGCAGC
TCTCGAGCTG CAGGTTTGCG GCAGCGGCCT TATCGACTTA TCCGCCGCTC GCAACCGTCT
ATGGGAGCTA TACGAGCGTA GCGAAGAATG TGCATTGCAC AACGACAAGC CTGCGCAGGA
AGATATTCGG CAACATGGAA GCTGTACATA TATCAAGAAG CTCTCGGCAA CGGCGTCGCC
AGTATCAGTC TCGAAACCGT GCCATTCGTC CACCGACTTT TCACGTACGA GTCATAGCTC
GGCCCTACGT AACGAAAGCT GGTGAATCTA CGTACAGTGC GTGTCTAAAA CGTACGCTTA
GCCCAGAGCT TTGCTACTAC CGCTTGAATA TATTTTCCCG GAATGGGGAC AACTCCGTCC
GCATAACCTA ATGTATTCGA TGCTATTTCT TC
 
Protein sequence
MTGPWEINFG EDHNSFLAAS VDEISGRAKL GKSNKSQHQK NSHIFSAVSF LRRAVEFAPG 
APSNFVRARS QTNSPLPERL FFSHSQHPIP LTTRPPGELW QIVPYGIQVS IFCRIRGNTT
TGSTMMLKLT MNPRRRSKRH RSIPHRLRLP TNPTAMETGD DEEDREKDML AENIGVIIKQ
EGAAKYSCVD YLSLTVWQQS VYQLMKKSKA DPITHHANTM IDEYCREQIV EWSFRVVDYF
RIDREVVALS MSFLDRFLAT CRCDRTSFKL AATTTLHLAV KLLYPCKLAD LGILSDLSRG
EFDMHDVTEM ESHILHALEW NLHPPTSAAF TSLFLDYFFA TRAVHVSNAD LDDIYDVSSF
FCELAICDYF FVPTRASAIS LSAILNSLEG MYGPDNRLSH AILEAALELQ VCGSGLIDLS
AARNRLWELY ERSEECALHN DKPAQEDIRQ HGSCTYIKKL SATASPVSVS KPCHSSTDFS
RTSHSSALRN ESW