Gene PHATRDRAFT_35781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35781 
Symbol 
ID7201019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp736233 
End bp737390 
Gene Length1158 bp 
Protein Length385 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180108 
Protein GI219118680 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAAAA ATGAGATTGG GCTCATTGCG CCAACACCGG CGCTAGACAG TTTGTCCGAA 
GATGTTACCT TTCTTTGGCT CTCGGGCCAA GCGTCCATTC CCGTATACGA CGAAGTGCCA
TCGTCACTCG TATTTCTACG TGATCATGTT GCTTTGAGCC GTCCGTGTAT CATTCGCAAT
GCCGTTTTGG ATAAAAGTGA AAACAAATGT CCTCTGCACC TAACATTGGA CGACTTGGTT
GACTCGGATC CGACACTCTC CTTAGTGGTA GACGTGACTC CAGACGGACA AGGCGATTGT
TTGAGGCTTG CCCAACATCA AACCCTGGGC TGCAAACATA AAGAAAACAG TCAACGAACG
TTTGTCAAAC CATTTGAACA CCGCATGTCC ATATCGGAGT TTCGTTCTTG TTTGCGAGCA
ACTCGATCTG GGACGACACC ATCGCTAGAG CAAATCAAAA ATCGTATATT TCAGTCCACG
GCCGACGTAT CATGCACTGT TTCTGAGGAA GCATTCAATC ACGGTCTTCC GACGGAAGCC
GTTTACTACT ATTCTCGTCA AAACGATTGC TTGCGGAGCG AGCTGTACTC GTTGTGGCAA
AAAAAGCTCT TTCCGGAGAA TTTTGTATGG GCATCCGAGG CCTTTGGTGT GCCCGAACCG
GAGGCTGTCA ACCTTTGGTT GGGCAACGAG CAAGCAGTTT CTTCGATGCA CAAGGATCAC
TACGAAAATT TATTCTACGT CCTATCGGGC GAGAAAGTTT TCACTCTTTG TCCTCCAGCT
GATGCACCAT TCTTATACGA ACAGAATTGT TCGAGTGGAT GCTTTCAGTA CAGCGCGACC
GAAGGCTGGA CGATAAGCTC CGATGTTCAT CAAGACGGAA CAACATTGAA GATCCCTTGG
ATTTCTGCCG ACGTGGTCGA GAAAGAGAAA TCGGAGGTTC TTGATGAGTT TCCACTTTTG
ACTTATACGC ACCCTTTGGA AGTGCACATT CGAGCTGGCG ATCTCTTGTA TTTACCGGCT
TTGTGGTTTC ACAGAGTTAC GCAATCCTGC GAGACCGTTG GCATAAACTA CTGGTACGAT
ATGAAATTTG ATTCACCTTC TTGGTGTTAT TTTCATTTTT TGCAATCCCT CATACCCAAC
GAGGCCATCC AAGGCTGA
 
Protein sequence
MQKNEIGLIA PTPALDSLSE DVTFLWLSGQ ASIPVYDEVP SSLVFLRDHV ALSRPCIIRN 
AVLDKSENKC PLHLTLDDLV DSDPTLSLVV DVTPDGQGDC LRLAQHQTLG CKHKENSQRT
FVKPFEHRMS ISEFRSCLRA TRSGTTPSLE QIKNRIFQST ADVSCTVSEE AFNHGLPTEA
VYYYSRQNDC LRSELYSLWQ KKLFPENFVW ASEAFGVPEP EAVNLWLGNE QAVSSMHKDH
YENLFYVLSG EKVFTLCPPA DAPFLYEQNC SSGCFQYSAT EGWTISSDVH QDGTTLKIPW
ISADVVEKEK SEVLDEFPLL TYTHPLEVHI RAGDLLYLPA LWFHRVTQSC ETVGINYWYD
MKFDSPSWCY FHFLQSLIPN EAIQG