Gene PHATRDRAFT_35733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35733 
Symbol 
ID7200995 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp628440 
End bp629567 
Gene Length1128 bp 
Protein Length346 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180280 
Protein GI219119027 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAGT GCTGCGGCTA TTTCGCCGCG TTCGTCTCGT GTGTCGCCTT TGGAACCTTC 
GCAGTTCCGA TCAAATGTGC AGCCGTCCGC AAGGTCGATG TGGATCCTCT CGGTACGTGG
GCAGTGAAAC TGAAAGTAAA CCAGCAATCA TATCTGCTGA TCTATTGTTT CGCTCACTCG
CTCCCTTTTG TTTGCACAGT CTTGCAGACT TACAAGATCG GCATGACGTT GCTTACGAGC
TGGTTGGTCT TGCTCTTTGG TGTACCCTTC ACTTTTACTC CTTGGGGTTT TGTTTCCGGC
TTGTTTATGG TCCCGGGGGG CACTGCGGGG TACTTTGCCG TCCAGAACGC AGGTATGGCT
GTAAGTCAAG GCATATGGTC GAGTCTTAAA GTATTGGTCG CCTTTTGTTG GGGCATTTTG
ATTTTTCATG AGCCTGTCCA TTCCAAGCTG GGGACCACCC TAGCGATCGC GCTGCTCATG
GTGGGATTGG CCGGCGTGAG CATCTTTGCT GCTCCACGGA CTTCAACGTC GTCACCACAA
GAAGAGCCGC TACTCCCGGA TGTGGAAGAA CAAAACCAGC CGGAAATTGT TGACAATAAG
GACTATTTGG GCTTTCTGAA ACGGAGACAC GTTGGCTTAC TTGGTGCCGT AATCGATGGG
GCTTACGGTG GCAGTGTTCT GGTACCGATG CACTATGCGG GCCCCAAAAC AACGAACGGA
CTTTCGTACG TTATGTCCTT TGCCATTGGT TGCTCATCCG TCGTGACCAT GGTTTGGGTT
TTGCGTCTCC TTTTCAACAG CGTTCAGGGG CAATCTCTCC GCGTTGGGTA CGATCGCTTG
CCGTCGTTGC ACGTCACAAC AATAGGGCCG TATGCAGCCT TGGCGGGGCT AATATGGAGT
TTGGGAAACG TGAGCTCAAT CTTGACGGTG GCGTTGCTGG GCGAAGGTGT GGGCTACAGT
ATTGTGCAAA GCCAGCTTTT GGTGGCCGGT CTCTGGGGCG TGTTTTGGTA CAAGGAGATT
CGTGGCATGC GAGCCATTGC GAGTTGGTTC ACCTTTGCGG TGATCACGGT TGCGGGTATT
GTGATGTTGT CTCGGGAGCA TGTACCCGTA CCAGCGGAAG CCCCGTGA
 
Protein sequence
MEECCGYFAA FVSCVAFGTF AVPIKCAAVR KVDVDPLVLQ TYKIGMTLLT SWLVLLFGVP 
FTFTPWGFVS GLFMVPGGTA GYFAVQNAGM AVSQGIWSSL KVLVAFCWGI LIFHEPVHSK
LGTTLAIALL MVGLAGVSIF AAPRTSTSSP QEEPLLPDVE EQNQPEIVDN KDYLGFLKRR
HVGLLGAVID GAYGGSVLVP MHYAGPKTTN GLSYVMSFAI GCSSVVTMVW VLRLLFNSVQ
GQSLRVGYDR LPSLHVTTIG PYAALAGLIW SLGNVSSILT VALLGEGVGY SIVQSQLLVA
GLWGVFWYKE IRGMRAIASW FTFAVITVAG IVMLSREHVP VPAEAP