Gene PHATRDRAFT_33867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33867 
Symbol 
ID7197675 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp537301 
End bp538509 
Gene Length1209 bp 
Protein Length342 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178247 
Protein GI219114903 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.17062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGAT CGCTTTTTTA CGTTGATTCT CACTGCCGGC AAAGTTGTCG GTCGTCCTTC 
ACCGCATCTT TATTCTCACT TCTCGCGTTC ACAGCTTTTT TGAAGATGGC AAAACAGTTT
CAGCATGTTG TACGGAGAAC TGCGGGAACA AGATAGATGG CGGCATCCAG GGATTCTTCT
ATTCCCTAGG AGCCTTTGTT GGTTCCAAGC CTCGTCAAAC GATTGGAATG GCGATTCTTT
TCACGGTCGT TTGCGGAGCC GGTTTCGTGC GCTTCGAAAC AGAAAGCCGC GGCGAAGAGC
TCTGGGTTCC GCAAGATACA CGGGCCGAGC AAGAAACGCT CATGTATGAA TCCTATTTTA
ACAGTTCTAC TCGTTTTAAT ACTATGATTG TTCAGGCTGC AAATCCGGGG GGCGACGTTC
TCACGAAGGA GATCTTGTTG GAATCAATGC TCATGCATAG CGAAATTGCC ACAAAACAAG
CTAAGCTGGA CGGCATTGAC TATGGGCTCT TGCAGCTGTG TGTAAAGTCT GGAGGAACCT
GTGTGTCCAG CACGGAAGGT GCTTGTCAAT GCTTGATGAC GAGTATTTTG CGCCAGTGGA
ACTATGACTT GGCAACCTTG CACCAGGACA ATGACCGAAA TTTACGCTTC TCGCCAGTCT
CTGGTCGAGC TTGACCAGCG CCTAACGGGT CTCTCAACAG TACCTCCTTT CATTGCCGAG
CCTGTTTCAG AAGATGCGTA CCGCAATGTT ATGGCTGGCC TTTTCAACTT TTTGAGCACG
TCGGGTTCGA ATGATATTGG CAACGTGACC CTAGGTGGTG ATAATTGGCC GACTACAGAA
GCCGATTTTG TTGCCACGGT GGCGGCCTTT GCAAGCAGTT CGGGGCCCGG ATCAATTTAT
GATCGTGATG TTACCTTCTC GCAAGATGGA TCGCAGATTG AAGCGTTTCG TGTGGAGCTC
GAATATGTTC GGCTGACTAA GGAGAACCGC GGAGAATTGA TTGACGACGC TGCCCGCCAG
ATTGACGCCA TGGATAGTAC CCGCGATATG GTCAATAGTT GGGACGACCT ACCGACCGCG
TTCGCCTACT CTTCCAAGTT CATCACGATT GAGGGTTTTA AAATTATTCA ACTTGAACTT
TTCCAGATCG TTGGGTTGGC AATTGCAGCC GTCGGCGTGA TAGTTTGCTC ACCGTTCCCA
GTCCAATGA
 
Protein sequence
MSRSLFYVDS HCRQSCRFFE DGKTVSACCT ENCGNKIDGG IQGFFYSLGA FVGSKPRQTI 
GMAILFTVVC GAGFVRFETE SRGEELWVPQ DTRAEQETLM YESYFNSSTR FNTMIVQAAN
PGGDVLTKEI LLESMLMHSE IATKQAKLDG IDYGLLQLCS LVELDQRLTG LSTVPPFIAE
PVSEDAYRNV MAGLFNFLST SGSNDIGNVT LGGDNWPTTE ADFVATVAAF ASSSGPGSIY
DRDVTFSQDG SQIEAFRVEL EYVRLTKENR GELIDDAARQ IDAMDSTRDM VNSWDDLPTA
FAYSSKFITI EGFKIIQLEL FQIVGLAIAA VGVIVCSPFP VQ