Gene PHATRDRAFT_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_2032 
Symbol 
ID7198685 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp379755 
End bp381175 
Gene Length1421 bp 
Protein Length442 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184871 
Protein GI219129386 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.688536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TACGAGCGGT ACGATGTCAC CGTAGACGAT GCGGAGGCGG ATCGTGCGAC GGAAATCAAG 
ATATTCTCTA TCCGACGCCC GCACATGCGA GCCTTTCACG TGGCCTGGTT TTCCTTCTTT
TGGGCCTTTA CCATTTGGTT CGCTCCCGCG CCACTACTAA AAGAAATACA AAAGACACTC
GGATTGACCA GAAAAGAGAT TTGGACGAGT TCCATTACCA ACGATATCAC CGCCATTTTC
TTGAGAATTT TGATTGGCCC CTTGTGCGAC GTCTACGGCG CGCGCTTGCC CATGGCGGCC
GTCCTGGTCC TCGCATCGAT TCCTACCGCC ATGGTAGGAC TCATTCAATC GGCGGCGGGG
CTTTCCGTCA CACGCTTCTT TATCGGTATT GCCGGAAGTT CCTTCGTCAT GGCACAGTTT
TGGCCTTCCC GTATGTTTAC CCGGGAATTG GCGGGCACCG CCAACGGGAT CGTTGGTGGT
TGGGGGAACC TGGGGGGTGC CTTTACACAA CTCCTCATGG GCACAATTTT GTTTCCGGCT
TTTCGGAATC TGTACGACGG GGACTCGGAA AAAGCATGGC GCGTTATTTG CGTCATTCCC
GCTGCCGTCG CCTTTTTGTG GGGTATCGCC GTCCCGTGGA TTTCCGACGA TGCCCCGATG
GGAAATTATG GAGAAATGAA AAAGCGTGGC GCCATGGATC GAATTCTGAT GACGACGGCC
CTCCGACAGG GCGCGGTCGT CAATACGTGG ATACTGTACG TCCAGTACGC CTGTTCCTTT
GGAGTCGAGC TCGTCATGAA TAATGCAACC GTGCTCTACT ACACGGATGA GTTTGGATTG
AGTACGGAAG ACGCGGCGGC TCTCGGTTTT ATTTATGGTT CCATGAATTT GTTTGCTAGG
GGCATGGGTG GATATCTCTC GGATCAGCTT AACCTCAAGT TTGGCCTACG GGGTCGCTTA
TGGCTTCAAA CCTGTTTATT GGTAGTCGAA GGCATCGTCA TCATTATTTT TCCATTTGCT
GATACACTCA GAGGAGCCAT CGTTACCATG TGCATTTTTT CTATTTTTAC GCAAGCCGCA
GAAGGTGCCA TTTTTGGTAA GCCTTGACAT AAGACGCTAT ATCTCTTTTT GTGTTTGAGT
CCTATTGCGA CTAATAGTTT GCTAATCTCT TGCTGCTTTT GACTACATTA GGGGTGGTCC
CATACGTGAC CAAATTGTAT TCGGGCTCGG TTTCGGGTTT GGTCGGCGCT GGAGGCAATG
CCGGCTCCGT CATTTTTGGT CTCGGATTCC GGTCGCTTTC GTACCGGCAA GCTTTCATCA
TGATGGGGTG CATTGTGATC GCTAGCTCTG GTTTAAGTGC CTTCATCAAC ATTCCGTTGT
ACGCGGGCTT ACTCTGGGGT AAGGACAATC ACTCCGTTAT T
 
Protein sequence
YERYDVTVDD AEADRATEIK IFSIRRPHMR AFHVAWFSFF WAFTIWFAPA PLLKEIQKTL 
GLTRKEIWTS SITNDITAIF LRILIGPLCD VYGARLPMAA VLVLASIPTA MVGLIQSAAG
LSVTRFFIGI AGSSFVMAQF WPSRMFTREL AGTANGIVGG WGNLGGAFTQ LLMGTILFPA
FRNLYDGDSE KAWRVICVIP AAVAFLWGIA VPWISDDAPM GNYGEMKKRG AMDRILMTTA
LRQGAVVNTW ILYVQYACSF GVELVMNNAT VLYYTDEFGL STEDAAALGF IYGSMNLFAR
GMGGYLSDQL NLKFGLRGRL WLQTCLLVVE GIVIIIFPFA DTLRGAIVTM CIFSIFTQAA
EGAIFGVVPY VTKLYSGSVS GLVGAGGNAG SVIFGLGFRS LSYRQAFIMM GCIVIASSGL
SAFINIPLYA GLLWGKDNHS VI