Gene PHATRDRAFT_42479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42479 
Symbol 
ID7196669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp179015 
End bp180370 
Gene Length1356 bp 
Protein Length391 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177035 
Protein GI219110567 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0629377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAATCTCGAA AAAGGCCCAT CTTCCTGACC CATTGCCGGT AGGCCTATTC GTATCCGTCT 
CCGAATATAG GACACCACAA GGCAACACAA ACCATGGTAT CTTCAACTGC TGTAGCGGCT
GCTTTGCTCA GTACGGGTCT AATTAGTTTG GCTCCGAATC TCATTCTCTT GGCCTTTCCC
CGTTACACTG CGGGAAGCGG TGTTCATTCG CACCTTCTGC AGTTGGGACA AGCGCTTGCC
GCCGGTGCCT TGTTGGGAGA TGTCTTTTTG CACGTCTTGC CACACGCGAG TGCCACGGAT
CCGAACGTCG GCGCTTGGAT TCTTGTCGGA TTCAGCGTTT TCTTTGCGGC TGATCTCCTC
ATTCGATCAC TTGAGCAACA GCAATACGAA CCTCACCATC AAAGTCATTC CCACCACCAC
GGCAAGGCAG ACAGCAAAAG TAATCCCCTT TCAAAAAAAG AATCCTCCCA CCAAATTCCT
GACGAAAACG ACGACGATTC TTCTTTGAGC ACTACGGATA TCAAAGTCTC CACGGTGCTG
TTGAATTTAG CGGCCGATGC GCTGCACAAC TTTTCCGACG GCCTCGCCAT TGGAGCGAGT
TTTGCCACGC TGCAACAACT GAATCCCCAG CATCAAAGTG GTGGCACCAC AAATGCAACA
TCAACGGTTG CGGACAGCGT CCTTTCCATG GCTTCGCTTT GGGCCTCCCG CGGAGGATTG
GCGACCCTGT CCGTGCTCTT TCACGAAATT CCTCACGAGC TGGGTGACTT TTGTACTTTG
GTAAAGGCTG GCTACAGTCA CAAACAAGCC GTAGCGGCAC AGTTTCTCAC TGCCATTGCA
GCTTTTGTCG GGACCGTACT GGCACTCTAT CTGACTAGCA AAAATGAGAA CAATATGGAC
AGCTGGTTGG GTGGGGAAAA TTTGGTGCAT TTGACCGCCG GTGGCTTTAT TTATCTAGCA
GCGACCAATA TTTTACCGGA TGTTCTGGAC GAACGGGTCT CTCCGTCCTT TCGTCTTGCG
CAGTTGATGG CCTTTGGTAC TGGTATTGCC TTCTTATACA TGGTGGCCTT ATTGGAAGAT
CACGATCACG ATCATCAACA CGGCTCGGGA CATACACATG AAAAGCACGT TCACTATCAA
CACGGCTCTC CTTTTCCGAT GGAGGATTAT TATTATCAGC ATCCAACTTT GGATGCCCAT
CACCATCATT TCCAGGTCTC GGATTTTCAT AAACTGCATC AGCACCATCA CGCGCACAGT
GAACTATAGG AATTGGTGTC ATTTCCTTCC TCTATTCATT TCTACGCACA TGACTATCCA
ATATGGCTTA CAATTTTTAT CACACACATA TTTAAA
 
Protein sequence
MVSSTAVAAA LLSTGLISLA PNLILLAFPR YTAGSGVHSH LLQLGQALAA GALLGDVFLH 
VLPHASATDP NVGAWILVGF SVFFAADLLI RSLEQQQYEP HHQSHSHHHG KADSKSNPLS
KKESSHQIPD ENDDDSSLST TDIKVSTVLL NLAADALHNF SDGLAIGASF ATLQQLNPQH
QSGGTTNATS TVADSVLSMA SLWASRGGLA TLSVLFHEIP HELGDFCTLV KAGYSHKQAV
AAQFLTAIAA FVGTVLALYL TSKNENNMDS WLGGENLVHL TAGGFIYLAA TNILPDVLDE
RVSPSFRLAQ LMAFGTGIAF LYMVALLEDH DHDHQHGSGH THEKHVHYQH GSPFPMEDYY
YQHPTLDAHH HHFQVSDFHK LHQHHHAHSE L