Gene PHATRDRAFT_47152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47152 
Symbol 
ID7201941 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp629230 
End bp630793 
Gene Length1564 bp 
Protein Length437 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181238 
Protein GI219121781 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCCG TTTGGGGTGT CGATCTTCCT GGCGGTGCGC GGCTCCGTCC GAGTCAGCCC 
CACTACGCGC AGGAAACGGA GAAGGCGCGG CGGCGTGACC GGTACGGTAC ACGTGCCGGT
GTTCGTTTGC GCCAACGGCT ATAAAAAAAA GACATTCACT CGGCACCACA CCGCTGGCCG
TCCACAATCC CGCAGACCAA CGTGTTTCCA CACATCCCTT GCGCAAGTTT TGTACCATGG
TCGAAACGAA CGGAACAAAG CCCACGGCGC CGGTGGAAGA TTACACGCCG GAGAATATCC
TTGTAACGGG AGGAGCAGGT GAGTAACGGG CGGGTATCAA TCAGCTTACT GCAATGCAAA
ACGAACGAAC CCAGCAACGC CCCCGTATGA CCAGTAAGGC GACGACACGG GTACGGAGTG
TATCCCTCGA CTGTATTCCT GGCACCACGG TTTCGCACAC ATTTCTCCCT TTTCACGCGC
CTCACGTCAC TGTACGAACC TCTTCTTCTC TTGTTTAGGT TTCATTGCCT CGCACGTGGC
GATTCTCCTT TGCAAAAAGT ACCCGCAATA CAAGATTGTC GTCTATGATT GCCTGGACTA
CTGCGCCTGT CTCGCCAACT TGCAAGAGCT CTTCGACTTG CCCAACTTCA AATTCGTCAA
GGGAGACATT GCCTCGCCTG ATCTCGTCAG TTACGTCCTC CGCGAAGAAA AGATCGACAC
CATTCTGCAC TTTGCGGCGC AGACGCACGT CGACAACTCC TTCGGAAATT CCTTCGCCTT
TACGCAGACC AACATTTACG GAACGCACGT CCTGCTCGAG TCCGCCAAGT GCTGTGACAC
CCTCCGTCGC TTCGTGCACG TCTCCACCGA CGAGGTCTAC GGAGAAGGAG AAGACTTTGA
AACGGACCCC ATGTCGGAAG AGCACGTCCT CGAACCGACC AATCCCTACG CCGCCACCAA
GGCCGGCGCC GAATTCCTCG TCAAGAGCTA CTTTCGTTCC TTCCAATTGC CCTGCTTGAT
CACCCGCGGT AACAACGTTT ACGGACCTCA CCAGTTCCCC GAAAAACTCA TTCCCAAGTT
CACCAACCAG TTGCTCAAGA ATCTGCCCCT CACCATTCAC GGTGACGGGT CCAACACACG
CAACTTTTTG TACGTGACGG ATGTCGCCAA CGCGTTCGAC ATCATCATGC ACAAGGGAAC
ACCGGGGCAC GTATACAACA TTGGGGGGAA GAATGAAGTG CCCAACCTGG AAGTGGCCCG
TGCCTTGCTC AAGCTCTTTG ACAAAGAAAA GGAGGAAGAT ACGCTCATTA AGTTCGTCCC
GGACCGACGA TTCAACGATC TACGGTACAC CATTAATTCC AACAAGTTGC ACGAGCTCGG
GTGGACGGAG CTCATGAGTT GGGAAGAAGG CCTCGCCACT ACGGTCGATT GGTACAAAAA
GTATACCTCC CGTTACGGCA ACATTGACGC GGCCCTCGTG GCGCATCCGC GCATGCTCAA
CACCAACAAG GAGGACTTGG ACGAATCTAC CCAAAAGGTC ATTATGAAGC AGCACAAGAA
CTAA
 
Protein sequence
MSSVWGVDLP GGARLRPSQP HYAQETEKAR RRDRHSLGTT PLAVHNPADQ RVSTHPLRKF 
CTMVETNGTK PTAPVEDYTP ENILVTGGAG FIASHVAILL CKKYPQYKIV VYDCLDYCAC
LANLQELFDL PNFKFVKGDI ASPDLVSYVL REEKIDTILH FAAQTHVDNS FGNSFAFTQT
NIYGTHVLLE SAKCCDTLRR FVHVSTDEVY GEGEDFETDP MSEEHVLEPT NPYAATKAGA
EFLVKSYFRS FQLPCLITRG NNVYGPHQFP EKLIPKFTNQ LLKNLPLTIH GDGSNTRNFL
YVTDVANAFD IIMHKGTPGH VYNIGGKNEV PNLEVARALL KLFDKEKEED TLIKFVPDRR
FNDLRYTINS NKLHELGWTE LMSWEEGLAT TVDWYKKYTS RYGNIDAALV AHPRMLNTNK
EDLDESTQKV IMKQHKN