Gene PHATRDRAFT_46979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46979 
Symbol 
ID7202222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp93495 
End bp94882 
Gene Length1388 bp 
Protein Length425 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181133 
Protein GI219121562 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACTCATCT CCAGTAGGCC CTCCTCGACA AAGACAGTGC TTCAATTTCA GCCGTCATCC 
GTTACACCTC ACTGCTGCTC CCGCTGTTTC AGTCGTTCCA CAGATCTGAA AAATGGTCGA
TCCCGATACA AAAATACATC CGCGTCCGCG AAGAAGTCGA AGGAAGCCCC AGTCAGACCA
ACGCTTTTCT GCCTATTTGA GCTTTGGCTT CTCAGCCACG GTCTGTCTCG GAATTTTCGC
GGCGTGCTAT GCCTTGGTAC TTTTCGGTCT GTGTCCACTA CTAAATCAAG CGTCTCCATT
AGTTACACCG ACTCATCGCG GTGAAATTTT GCAGCCTGTT GTACATTCGG CTGTGGAGTC
TGTTAAGCAT CACATTCCCC ACTTACCTGG ACAGATGATT GCTGAAAGTG TGGCGGGTAT
AGTGAAACGA AAGATCGCAG ACATGCGCAA ACAACAGGAT GTGACTGATA CGTCCTTGAT
GGAAAAGGCT ACAGAAGAAT TTAATCTTCT TCGAAAGCGT CGAGATCAGA AAGGCGCACG
CCAGGCGAAT GACGAAAGTG CCGCCGAACC TGCGCAACTA GCACCCGGCA AACGATCTGG
CTTTTTGGTA TTGGGCATGC ACCGGTCGGG TACCAGCATG CTGGCGGGTC TCATGGCGAC
CGGTCAAGGT TACAACGTGG GAGGTCCACT GATTGGCGGT GCGTTTGACA ACGAGAAGGG
GTTTTTTGAG TTAGTTGATG CAGTTCTGCA GAATGACGAA TTTATGCAGC TTCAGCGAAT
TTGGTGGAGT TCCAATGTAA TAAACTACGA CCACGAAAAG GCCATTGTCG CCAAGAGGAA
TGGAAAAGTT TCTTTCGACC ACGGCGAAAA GGCGCTGGCT TTCTTCAATA GTCCAAGGAA
TGCTCCATAC TTGCAAAAAG ACCCGCGAAT GTGCATCACC CTCAAGACTT GGCTGCCCCT
GCTGAAAAGT GAACCGGCTG TACTTTTTAC CTACCGTCAT CCTTTGGAGG TTGCTTTGTC
CCTGAAAAGG CGTGAGAAAA ACTTCACTTT GGAACACGGT CTTCGCCTCT GGATTGTTTA
CAATATGAGG GGTGTTGAGA ATTCGCAACA ACTTTGCCGT GTTTACAGCA GCAATGAAGC
AATTCTAGCT GATCCATTGA ATGAGGTGAA CCGCATCTCC AGCGAGCTGA CGTCTAAATG
TGGCATTCCC CCACCGCCGG AACAACTCAC GCAGGAAGAC GTAGACAAGT TTGTGGATCC
AGATTTGCAA CACAACAAAA AGAAACGCGA AGCCGACGAC GAGAAGAAAG AAATAATTGC
CAAGCACGGC ACCTGTACGG TGAGAGACTA CGAAAGCAGT CTACCGGAAG GAAGTCCTGA
TTACAAAC
 
Protein sequence
MVDPDTKIHP RPRRSRRKPQ SDQRFSAYLS FGFSATVCLG IFAACYALVL FGLCPLLNQA 
SPLVTPTHRG EILQPVVHSA VESVKHHIPH LPGQMIAESV AGIVKRKIAD MRKQQDVTDT
SLMEKATEEF NLLRKRRDQK GARQANDESA AEPAQLAPGK RSGFLVLGMH RSGTSMLAGL
MATGQGYNVG GPLIGGAFDN EKGFFELVDA VLQNDEFMQL QRIWWSSNVI NYDHEKAIVA
KRNGKVSFDH GEKALAFFNS PRNAPYLQKD PRMCITLKTW LPLLKSEPAV LFTYRHPLEV
ALSLKRREKN FTLEHGLRLW IVYNMRGVEN SQQLCRVYSS NEAILADPLN EVNRISSELT
SKCGIPPPPE QLTQEDVDKF VDPDLQHNKK KREADDEKKE IIAKHGTCTV RDYESSLPEG
SPDYK