Gene PHATRDRAFT_28794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_28794 
Symbol 
ID7202589 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp738122 
End bp741288 
Gene Length3167 bp 
Protein Length818 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181620 
Protein GI219122580 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCAGATCGAC CTGCGAATCG TCGTTGCCGT TTCTTCTTGG TGAACCCAGC AAAAAGCACA 
AGTCATCATG GCGCGTTGGT TTCGTTCCGA ACCGATGGAG TACATCTCCC TCATTGTGAA
CGAGGATGCC GCTCACGACT GTTTAGCGGA CCTCGGAAAG CTTGGGGTGA TCCAGTTCAC
GGACGTAAGT TGCACGTAGA TCGTTGTCCA GCCACAGGCG GGAGTTTGAA AACGCGAGTT
GGCTTCCCGC ATTCTGTACT ATAAAGATTC TTGAATCGAA GTACTGCTTT GTCGGAAGAA
AAAGTCGGCA GATTTACTAC GATCCTCTCG GGTCGTGGTC ATCGTCGAAC TTCCATCCGG
CGCCGCGGCT TTTCGAACTC CGTTTCTCTG GTTGAAAACG TCCTTCCGAT GTCGTATGCG
GAGTTTTGTC CCCGTCAATT TTGGAATCGT CCTTTCTGCT ACCTGTTTCA CAGTCAGCGC
GCAAACATTA TCTGTAGGTT TGTTCTTTAG TTTCTCACCC TCATACTCTA CATTCCTATA
GTTGAACCCT GACTTGACTC CATTCCAGCG TCGCTACGTT TCTTACGTTA AGCGATGTGA
TGAGTTGGAA CGCAAGCTTC GTTATTTCTC CAACGAGATT GAAAAGTTCG AGATTGACCT
TGTTTCGGCT GGAACAGTCG ACAACTTTGT CATGTCTCCC ACGCTTGTGT CCAGTATGGG
TAATGGCTCA AAAAAGAGCG GTGCTCAATT GCTCGAGAGT CTCGAAGTTG AACTTGAACA
ATACGAGTCG CAGCTCAGGG AACTCAATTC TTACTCCGAA AAGCTTACCA CCGAGTACAA
TGAAAAGGTC GAGCTCCAGG AAGTCCTCGA GAAGGCCCGT CGCTTTTTTA TGACCGACGC
TCCCCGCCTT GCCGTTTCGG AACTTACCAG CGGGCCCATG GACATGACTG GAAAGGAAGA
TGGGCTCCTT GACTCGGACG CTGCCCCTCG TCCCGACTTG GACATGCGAT TCTCTTCGAT
TACCGGGGTC GTATCCACAG AAGAGAAGGT CCGCTTTGAA CGCATGATCT TTCGTGCCAC
TCGAGGAAAC TGCTACATTC GATTCGCTCC TATTCAGCAG CCCATTACCG ATCCGGAATC
TGGAAACTTG GTCGAAAAGT CCGTCTTTAT TATCTTTTAC AAGTCTGAGT CCATTGAAGG
CAAGCTCAAG CGCATTTGTG ACGCATTCTC TGCTCACCGA TACTCTCTCC CTGATATGGA
CGATGCCGGA TCAGTTGACA AGATGCTGAC GGAGAACGCA CAGGAACTCG TCGACTCTCG
CACTGTTTTG CTCAAGAACC AGGATACGCG CTTCCGTCTC TGTCAGCTGC TTGCGAAGCA
CACGGAGCGC TGGACGTGGA TCGTCCTCCG CGAAAAGGCT GTTTATCACT CTCTGAATAT
GTTCAAGGCT GATGTTCAGG GTATGCTTCG TGGTGAAGGT TGGGTCATTG CTGAGTCCAC
CGACGCTGTC CGTCAAGCAG TTGAACGTGC TCACTCCAAT ATGGACATGG CCATGCCTTC
CTTGGTGGAC TTGGTTCCCC AACCATGGCC TACTCCTCCC ACGCACTTTA TCACCAACAA
GTTTACCTAC GGATACCAGG AATTCGTCAA CACGTACGGT ATTCCACGTT ACCGGGAAGC
CAACCCTGCG CTTTTCACAG CCGCCACATT CCCCTTCCTG TTCGGTGTCA TGTACGGAGA
CATTGGTCAT GGTCTCTTCT TATTCTGCGC TGGTTGCTAC TTACTTTGGA ATGAGAAGGC
TAACGAGAAT GCAAAACTTG GTGAGCTAGG CGACGGTATG CACTCTGGTC GATACATGAT
TGTCATGATG GGCTTCTTTG CCGTGTACGC TGGTTTCATG TACAACGACG CATTTTCCCT
CGGTCTCAAC CTTTTTGGAA CTCGCTACAA GTTCGAGGGC CAGGATTCTG GTACCGTCGA
AGAAGGTGAT GTTGCCTATC AAACGTTCAG TTATGGTTCC GGTGAATCCG TGTATCCGTT
CGGACTCGAT CCCATTTGGC ACGTTACCTC CAACGAATTG CTCTTCTTCA ACTCGTTCAA
GATGAAACTT TCCGTCATTT TTGGTATCAT CCAGATGTTT TGTGGTACTT GCCTCAAGGG
AGCGAATGCC GTCTACTTTG GCGAAAGACT CGACTTTTTG TTTGAGTTCC TTCCCATGGT
TGCGTTTGCG TCTTCGATGT TTGTTTACAT GGTTATCCTC ATTGTTCTGA AGTGGTGCAT
CAACTGGAAT AGCCGGATGC TTTCCGCCAC TTGCGTTGAT CCTAATGGCG CTGGATGGGG
AGCGTCCAAT TACGTTGGAA CATGGAAGCA GTGCGATGGA GCTGTTGATG GCTGGGACGG
AACCTGTACA CCATGGGGAA TGTCCTGCAC CGGATACGAT GATACGGCGA CGAAATGTCC
TCTCAACTAT GGTGGTTCTG GTGATGGTTG CCAGCCTCCC AATCTTATCA CAACTTTGAT
CAATATCGCC CTCAACCCGG GTGTTGTTGA TGAACCTTTG TACGCTGGAC AGGGACCAAT
CCAGAACATT TTACTTTTGA TCGCCTTCGT CTCGGTTCCT ATTTTACTTT TGGCCAAGCC
TTACTATCTA TCCCAGAAGA CGCATTCCCC CGTTGTGCAC CACTCGGACG ATCTCGAGAA
TGGGCATGAC GAGGATGACC ACGAGGATGA TGACCATGGT TTCGGAGAGA TTGTTATCCA
CCAGGCCATT GAAACGATCG AGTTCGTTCT CGGTATGGTT TCGAATACGG CGTCGTACCT
TCGTCTCTGG GCTCTTTCCT TGGCGCACTC CGAACTTGCT ACTGTCTTTT GGGAGAAGGC
CATGCTTTCT ACCTTGAACA TGAACTGGTT CGCCGCCTTT TTTGGATTCG GTATCTTTGC
CGGCGTGACA TTCGGAGTGT TGCTCATGAT GGATGTATTG GAATGTTTCT TGCACGCCCT
TCGTCTTCAC TGGGTCGAAT TTCAGAACAA GTTTTTTGCC GCTGATGGCG TACGCTTTTC
GCCGTACTCG TTTAAGCAGG TGATTAAGGA TACCAGTGCC TAGAGAGAAA CTAATTGGAT
AAGTATAGGG ACGAATTAAG CATATAACCA ATGTACTCGT CAAAAGC
 
Protein sequence
MARWFRSEPM EYISLIVNED AAHDCLADLG KLGVIQFTDL NPDLTPFQRR YVSYVKRCDE 
LERKLRYFSN EIEKFEIDLV SAGTVDNFVM SPTLVSSMGN GSKKSGAQLL ESLEVELEQY
ESQLRELNSY SEKLTTEYNE KVELQEVLEK AHGLLDSDAA PRPDLDMRFS SITGVVSTEE
KVRFERMIFR ATRGNCYIRF APIQQPITDP ESGNLVEKSV FIIFYKSESI EGKLKRICDA
FSAHRYSLPD MDDAGSVDKM LTENAQELVD SRTVLLKNQD TRFRLCQLLA KHTERWTWIV
LREKAVYHSL NMFKADVQGM LRGEGWVIAE STDAVRQAVE RAHSNMDMAM PSLVDLVPQP
WPTPPTHFIT NKFTYGYQEF VNTYGIPRYR EANPALFTAA TFPFLFGVMY GDIGHGLFLF
CAGCYLLWNE KANENAKLGE LGDGMHSGRY MIVMMGFFAV YAGFMYNDAF SLGLNLFGTR
YKFEGQDSGT VEEGDVAYQT FSYGSGESVY PFGLDPIWHV TSNELLFFNS FKMKLSVIFG
IIQMFCGTCL KGANAVYFGE RLDFLFEFLP MVAFASSMFV YMVILIVLKW CINWNSRMLS
ATCVDPNGAG WGASNYPPNL ITTLINIALN PGVVDEPLYA GQGPIQNILL LIAFVSVPIL
LLAKPYYLSQ KTHSPVVHHS DDLENGHDED DHEDDDHGFG EIVIHQAIET IEFVLGMVSN
TASYLRLWAL SLAHSELATV FWEKAMLSTL NMNWFAAFFG FGIFAGVTFG VLLMMDVLEC
FLHALRLHWV EFQNKFFAAD GVRFSPYSFK QVIKDTSA