Gene PHATRDRAFT_37941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37941 
Symbol 
ID7202858 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp473737 
End bp474906 
Gene Length1170 bp 
Protein Length356 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182072 
Protein GI219123522 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.149432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAT GGAGACGGGT TTATGCATCC TTGTGCGTCG CGGTAGCTAC TACGGTGGTG 
CTTTCCGCTC TCTCTAGGGA TGTCCGCTCG ACCCTCATGC CGTTCTACAA TGAAACCCAG
AAGACGTCGC TACTTTTTGA TGACAGCAGT GATCGCGTTG GGCATTTCGA TGGGCTATTT
TCGGATGCCC GGGCTTATCA AAACATTGAG GAGTCAAATC AACACAGACA AAAATATGAA
GCGAAGTGCG CAGACGAAAG CTCAACCCAA GAGGAGATAG TTGAAATTTT GGGTAGTTGG
TATAGGCCAA GTCTTGATGG AGAAGTCGCA ACTCAACAGT CCAAGCTGCC GGTTGAACCG
TGCCGGTTTA CCTTTCTAGA TTTTGGCGCC AATGTTGGAG ATTCGATGGG CAAACTAGTG
GACGCTGGCA TTCCACCTTG TTCGAAGAAA GGCATTTTAG CTCCACGAAT AGATCTGGAA
CATGGATTTC TACAACCTCT TCAAAAGGGA AAGGGTTTTA GAAAACTCAT CACCTGGATA
CGTACTCAAA TGGAGGAGGT GAGCCGGCAA CTTTCGGGCC CGGTTCAACC AGAGAATTAT
TGTTACTTTG GTATCGAGGG AAATCCAATC TTTACAAATC ATCTCAATAG ATTACAGCAA
CGTCTCATGC TTACTTCGCC GAGACCACTT CGAAGAGTCC ACTTCTTCAC CGAGACGGTG
GGCGCTGCAA AAGACGAGAC TACGGTTCTG TTCTTGGACA CAGTCAATGA GAAAGAGAAT
TTTTGGGGTT CGTCTACACT CTCTGGACAT AGGGATGTTC AAAGCTCCCT CTTGAGTGGG
AATGACAAGC GTGAGGTGTC TGTGCAAGGT TTCACTTTGA CTCGCCTTCT TCACGAAACA
GTCAAGATGA TGCCTGGTGC ACATGTTATG GTGAAAATGG ATATAGAGGG TGCCGAGTAT
GCATTGCTCA ATGAAGCATT TGACTCGGGT GCACTGTGCA ACACGACTGC TCGTGCTGTC
AGGGTCGATA TAATTGTTGA AGTTCACGGC GAGGTGAGTG AAAATCGTTC GTATATGAAT
AGATATCCTA TTTCTACTCG AAGTCACATT CTCTCTGTAG ACCTTAATAG GAAGAAACTT
ACACGCCGAT AGATTCAGAA GCAAAGTTAA
 
Protein sequence
MRKWRRVYAS LCVAVATTVV LSALSRDVRS TLMPFYNETQ KTSLLFDDSS DRVGHFDGLF 
SDARAYQNIE ESNQHRQKYE AKCADESSTQ EEIVEILGSW YRPSLDGEVA TQQSKLPVEP
CRFTFLDFGA NVGDSMGKLV DAGIPPCSKK GILAPRIDLE HGFLQPLQKG KGFRKLITWI
RTQMEEVSRQ LSGPVQPENY CYFGIEGNPI FTNHLNRLQQ RLMLTSPRPL RRVHFFTETV
GAAKDETTVL FLDTVNEKEN FWGSSTLSGH RDVQSSLLSG NDKREVSVQG FTLTRLLHET
VKMMPGAHVM VKMDIEGAEY ALLNEAFDSG ALCNTTARAV RVDIIVEVHG EIQKQS