Gene PHATRDRAFT_47389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47389 
Symbol 
ID7202436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp499321 
End bp501096 
Gene Length1776 bp 
Protein Length591 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181738 
Protein GI219122824 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00749667 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGACA GAGCCTTACA AATCTTTGCT ATCGGACAAA CGATCGTTGC TTCCATCCGT 
GCCGGATCGC GAGCATGGCA TGTTCTAAGA AAAGGTCCCA TTGGAGTCAT GAACGGACCT
CTGATTCTGC CTCCTCCAGT CAATGAGACA TCAAATTTAT TCGATTTGAC TCTGGCTTTT
CGTCATGCAG TTCAGGTGTG TGATGCGCCG CAAACCGACC GCGAGGCAAG TGCTATGCTC
GGAAGAGAAA AAAAGTGCGC ACGCATCGAC AGAGAAAACC AGTCTGGTTT TTCGCAACCA
AGCGATCCTT TGGCCCAAAA TATAAATAGA GACCATCGAA CGGAAAACGA GGAAATTGAT
CGACTTCTGG ACGAATTGGA GAGTAGTGAG AAAGAGCCAA AAGTCTCGTG GAAAAGCTTT
TTCGAAGCAT GTAGACAAAC TGCTGTTGTC CCAGTTCAAC GGGCATCCTT GTTCGGTGCT
TCGCTACCAG TCTCTGCGTG GTTCGCTACC AAACGCTTTT GGAACCTTGA TGCGGCATCG
GCATGGAGCA CAATTGAGAT CCTGGATGGG CCCTCAGGTT TCACTGACTA TGTTTCTGCA
GAGCTAGTCT TTGAGAAAGG TGATAAAAGC GCATACCCTC GAGTAGCTCT GATAAAGGCG
TATGCACCGG AGGAGTTCAC CAACTTGCGC TCGAACTTTG GAATTTCCGA ATCAGACTAT
GCAAGGTCGA TTCTGCATTC TGGTCCGTTT GTATCTTTCC AAAGCAATTC CAAAGGAGCA
GCGCGGGTTG GTGGGGTTTT CTTCTTCACC CGTGATGGCA ACTACATGAT CAAAACAATA
AAGGCAGCGG AAGTACATGC TTTGCTACAA ATGATGCCAA AGTACGACAA CTTCATGAAG
CGAAATGGGC GTAGATCATT GCTGACCAGA ATTTGTGGTC TTTACGATAT TGATATTCAG
GACGCCTCAA GCGGTGTCAA TGAGAAATAC ACTATAGTTG TTACCAACTC GGTCTTTCCA
GCGGAAAGTT CTAGTATAAT TTCCGAACGA TTTGATCTGA AGGGATCGAC CCTAGGCCGA
GAGTGCTCGC CAGAAGAACG GCGTACCAAA GGGTCAAATG CTATCCTTAA AGATCTTGAC
CTTTCGCGAG AGGTACAGCT AGTCAAGTCT TTTCAAGACG AAGGAACACC GCACTTTGAG
GGCTACGGCC TGCATATTGG ACCTGCTGCC AAGGCAGCTG TCCTCACGCA ACTCCGAAAA
GATGTGCACC TGTTGGTGCT GTGCAACGTA ATCGACTACA GTCTGCTGGT TGGTGTCTCT
CGCTTGGATT CTCGTCACTT TACAGTCGAC GAATTGCACC TTATAGATTC GAGCACAGAA
GCTGAGCTTC GTCTAAGTTT AGCTCGTCGA GGACAAGCAG CGGATGCCAT CTTGTCCGCA
CTAATAATGC CTGTTCGATT GCTTACTGCT CCGCCAATCT ACCTGTATCG GAGAGCGTGG
TCCCTTTTTC GAAGGACAGT ATCCTGGCCT CTTCCATATT ACGGTTCCGG AGAATGTGGA
ATAGATGCGG GTGGGCTAGC CAGGGTACAG GGAGATCGGC TTGGCCACCC TTCCGTCTTT
TATTTAGGGG TAATTGACTT TCTCCAGCCT TTTAATATCC CAAAGAGAGC TGAATGGAAG
TACAAGAGCT GGAAGTACGG GGAAGGATTT AGTTGCGTCC CTCCTGAGCA GTACGCAGAA
AGATTTTTGG CGTTTCTCGA AAGCCATATC AGTTAA
 
Protein sequence
MLDRALQIFA IGQTIVASIR AGSRAWHVLR KGPIGVMNGP LILPPPVNET SNLFDLTLAF 
RHAVQVCDAP QTDREASAML GREKKCARID RENQSGFSQP SDPLAQNINR DHRTENEEID
RLLDELESSE KEPKVSWKSF FEACRQTAVV PVQRASLFGA SLPVSAWFAT KRFWNLDAAS
AWSTIEILDG PSGFTDYVSA ELVFEKGDKS AYPRVALIKA YAPEEFTNLR SNFGISESDY
ARSILHSGPF VSFQSNSKGA ARVGGVFFFT RDGNYMIKTI KAAEVHALLQ MMPKYDNFMK
RNGRRSLLTR ICGLYDIDIQ DASSGVNEKY TIVVTNSVFP AESSSIISER FDLKGSTLGR
ECSPEERRTK GSNAILKDLD LSREVQLVKS FQDEGTPHFE GYGLHIGPAA KAAVLTQLRK
DVHLLVLCNV IDYSLLVGVS RLDSRHFTVD ELHLIDSSTE AELRLSLARR GQAADAILSA
LIMPVRLLTA PPIYLYRRAW SLFRRTVSWP LPYYGSGECG IDAGGLARVQ GDRLGHPSVF
YLGVIDFLQP FNIPKRAEWK YKSWKYGEGF SCVPPEQYAE RFLAFLESHI S