Gene PHATRDRAFT_47589 details


Gene Information

Locus tag: PHATRDRAFT_47589
Symbol:
ID: 7202644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 184028
End bp: 185421
Gene Length: 1394 bp
Protein Length: 422 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002181863
Protein GI: 219123087
COG category:
COG ID:
TIGRFAM ID:
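
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the coding region ends with a single stop codon. The helper name span_length is hypothetical and not part of any database tool.

```python
# Minimal consistency check for the record's coordinates and lengths.
# Assumption: Start bp / End bp are 1-based, inclusive positions.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed (inclusive) coordinate interval."""
    return end_bp - start_bp + 1

# Values copied from the Gene Information fields.
start_bp, end_bp = 184028, 185421
protein_length = 422  # amino acids

gene_length = span_length(start_bp, end_bp)
# Assumption: one stop codon follows the last amino acid codon.
cds_length = 3 * (protein_length + 1)

print(gene_length)               # 1394
print(cds_length)                # 1269
print(gene_length - cds_length)  # 125
```

Under these assumptions, the 125 bp difference is the part of the listed gene sequence that is not translated into the 422 aa protein.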


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00364513
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCTCGGTGTA GTAGCATATA AAGGTCATTT TGCATTTACA CGACCACATC CAACGAAGTG 
TAAGTATCTC TGTATAGTTT ATTCACAGGT AGAGGAGACT GCTGGTTTCA GGAACCTACA
ACGCCATGAA TTCGAACTCC CCTTTTTCGG CACACAGCGA TATTCTGGAT GCCCCCCTCC
CATCCAAGGA CGACCCCGCT ACCAGGCTCC GCCTCGTGGA AGAATGCAAA TCGCGTGGAA
AGGCTGCCGT CCAGGCGGGC CACTGGCCGG ATGCGGCGGC TTTGTACGGA AAAGCTTTGG
AGTGCTACGT CGGTCAGGAT AGCGATACTG CCAAAACCGA AACGGCGATC TTGTACTCCA
ACGTCTCGTT GGTTCGCGCC AAAATGAGCC AATGGTCAAT GGCACAGGAA GCTGCCCAAC
AAGCGGTCCA AGCCGACCAA GTCTACGTAA AAGGATGGTG GAGGCTGGGA CAGGCGGAGT
CAGCTATGGG AAACTACACA AAATCGGTTG AGGCCTTGCA GCAGGCCACA AAATTAGAAC
CGGATAATAA GGCATTGCAA AAGGAGCTGA CCAAGCAAGA GGAGAAGGCG AAGAAAGCAG
CCGAGGAGAA AAAGAAAGAA GCCGATACGC CGGCGGCAAT GCGAGTGGAT GAGCCACAAA
CAATTGAAAA GAAAGCTGCG GCAACGGATT CTAATTCGAC TCCTTCCAGC ACTGCCAAAA
CAAAAGAGGA CGACAGTGCC ATGCAAGTTG ACATTGACGG AACTGATTTT TCCAAATCGG
AACACATCCG TGGCTACAAA ATTGTCAACG GGAAGAAAAC ATCTTTTTTT CACAACGAAC
TGTCCGAAGA CGCGGCGAGA TTAATTGGTG ACATTGCCCC CAAAAAGTTG GACGCCGACA
CGGGATCGTC TAGTACAGCC GCCGGAGCGA AAGGGACGTC GGCTTGGAAT CAGGCCGGGA
CCTGGGAAGA GAAGGACGTA ACCAACTGGG CGAAAACTTC GCTGCGAGAA CGGCTTTTGG
CAACGACATA CACGCTTCCC GAATCCTCTC CCGCACCCGG TGCCTTGGTG CTCGTAACGG
AGGCCAAAGT GACGGGCAAC GCCAGCTGTG CGGCGGTGAG GGGCAAGAAA CGGTACATTT
ACGAATTGTG CGTCACTTTG GACTGGAGTT TCTCGCACGG GGACCACCAG GCTGACGGGA
GTATCGTTCT GCCGGACGTG GACGGCACTT GCGTATTGGG TGATGGCTAC GAGGAAGCGA
ATTGGAAGGT CGATCGCGCG GATGATCCCA GCATGCGACC GCTGCTCGAA ACCTTTGTCC
ATAAACAAGG ATGGCGTGAG GCAATTCATG AAACGATTGA CGATTGGGTG CGCCATTTCA
AAGACACGTA TTAG
 
Protein sequence
MNSNSPFSAH SDILDAPLPS KDDPATRLRL VEECKSRGKA AVQAGHWPDA AALYGKALEC 
YVGQDSDTAK TETAILYSNV SLVRAKMSQW SMAQEAAQQA VQADQVYVKG WWRLGQAESA
MGNYTKSVEA LQQATKLEPD NKALQKELTK QEEKAKKAAE EKKKEADTPA AMRVDEPQTI
EKKAAATDSN STPSSTAKTK EDDSAMQVDI DGTDFSKSEH IRGYKIVNGK KTSFFHNELS
EDAARLIGDI APKKLDADTG SSSTAAGAKG TSAWNQAGTW EEKDVTNWAK TSLRERLLAT
TYTLPESSPA PGALVLVTEA KVTGNASCAA VRGKKRYIYE LCVTLDWSFS HGDHQADGSI
VLPDVDGTCV LGDGYEEANW KVDRADDPSM RPLLETFVHK QGWREAIHET IDDWVRHFKD
TY