Gene PHATRDRAFT_40467 details

Gene Information

Locus tag: PHATRDRAFT_40467
Symbol:
ID: 7198176
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 480311
End bp: 481367
Gene Length: 1057 bp
Protein Length: 328 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002184469
Protein GI: 219128540
COG category:
COG ID:
TIGRFAM ID:
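
The gene length above is consistent with the start and end coordinates if they are read as 1-based and inclusive. The following is a minimal sketch of that arithmetic, not part of the original record. The function names are hypothetical, and the second helper assumes the gene span runs from the start codon through the stop codon with no untranslated flanks, so the leftover bases would be intron sequence in this spliced gene.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def non_coding_bp(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases in the gene span not used by codons.

    Assumes the span covers start codon through stop codon, so the coding
    part is 3 bases per amino acid plus 3 bases for the stop codon.
    """
    coding_bp = 3 * (protein_len_aa + 1)
    return gene_len_bp - coding_bp


length = gene_length(480311, 481367)
print(length)                      # 1057
print(non_coding_bp(length, 328))  # 70
```

Under those assumptions, 70 of the 1057 bp are not translated, which fits the record's "Is gene spliced: Yes" flag.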


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.848099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCCGT ACGACGACGG GGAATCATCC GCCCGTGGGA TGCCAATTGA GCAGCCGCGA 
GCCATCTCTG CAAAACAAGC TGTTTCGTAT ACCGTCGGTG CCTTCATTGC TCTGTCGGTG
ATGTTCACTC CCCAGCTTTC CATGCCGGCG GAAGCGGCTT CTTCCACCGC CACTCCCACC
GCTTCCGTAA CGCGGCCCAT GTCGGTCGAA CAACGAAACT TGGACGCGGC CCAGTCAATT
CTTGATGCCG CGGACGCCAA GCTCAAGGTG ACGCAAAAGT CCGTCGCGAC GGCCAAAGCA
GCCGATGATA AGCAGCTGGC GATACTCAAA AAGGCAGAGG ACAAGGCCAA CAAAGCGAAG
GATGACCTCG TGTCTGCCAC TACCAAACTC ACTACTATGC GAGGAGACGT TAAGGTCTCG
GACAAGGCCG TGGAGAAACA AAAGGAAAAA ATAGGTACGT CTCCAAGGAT CAAAAAAGGC
TGATGGCTCC GTATTCACGT CATCTCACGT TTCTTCCTCA TCAGTTGCTT TCAAGCAAGC
CCAAACGGAG TCACTAGTTG GCTTGGACAA AGCTCGCGTC GCCCGAAAAG AAACTGCCTT
AAAGCTCACG ACGGCCCAAC AAGGGACAAA AAGCTACGAA AAGACGAAAC AAACAGCTCA
GAAAAGTTAC GACGTTGCCA ACCAAAAGTT CAAATTGTAT GAGAGCCAAC AGGCCGAAAA
GCAGAAGAAA ATTGACGCTT CCAAAAAGGT TGCCACCGAA AAGCTCAAGG CCAAGCAAGC
TGCTGACGCC AAGGTGGCCA AAAAGAATGC CGAACGAAAA GCGAAAGAAG AAGCCAAGAT
TCGCAAAACA AATCTCGACG CTTTAAAGAA ACAGATTTCG AAACAGGATG AGAAGGTGAA
GAAAGCCCAA ACAAATTTGA AGGCTCTCGA AAAGGAACTC AGGTCGCTCA ACTCACGCAA
AACGATCGAA AAACAAGAAG TGAAGATTGC ACAAGCAAAA ACAAAGCTAG AAGCGGAAAA
GAGTATATTG AAAGAACTCA AGGCCAAAAA AGTCTAG
 
Protein sequence
MSPYDDGESS ARGMPIEQPR AISAKQAVSY TVGAFIALSV MFTPQLSMPA EAASSTATPT 
ASVTRPMSVE QRNLDAAQSI LDAADAKLKV TQKSVATAKA ADDKQLAILK KAEDKANKAK
DDLVSATTKL TTMRGDVKVS DKAVEKQKEK IVAFKQAQTE SLVGLDKARV ARKETALKLT
TAQQGTKSYE KTKQTAQKSY DVANQKFKLY ESQQAEKQKK IDASKKVATE KLKAKQAADA
KVAKKNAERK AKEEAKIRKT NLDALKKQIS KQDEKVKKAQ TNLKALEKEL RSLNSRKTIE
KQEVKIAQAK TKLEAEKSIL KELKAKKV
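
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, assuming plain Python with no bioinformatics library. `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC calculation is the simple (G + C) / length definition, which may differ from how the record's "GC content" field was computed.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (simple definition)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAGCCCGT ACGACGACGG GGAATCATCC GCCCGTGGGA TGCCAATTGA GCAGCCGCGA"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this first row only
```

Passing the full gene sequence to `clean_sequence` and then `gc_percent` gives a value that can be compared against the GC content field.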