Gene PHATRDRAFT_43654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43654 
Symbol 
ID7197364 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1111876 
End bp1113991 
Gene Length2116 bp 
Protein Length629 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177754 
Protein GI219112005 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.769463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCTACCTTA CCCCGTCACT TGTTCCAAAG ACGCCGAGCA TTCTACGATT GAATCACTAA 
AATGGAGAGA TCCGGTAGAG ATGAAAATGC TGAACCGGAA GTCCGCAAAC GAAATCGTCG
AAGGTTGCTT TGGTGTTCTT CTTCGGCGCT GTGTGTTGTT GCCGCAGCTC TTAGCAGCAA
CGGTTTGTCG GCCTCGCTTA CAGGAAAGTC ACTACCTTTT CCAGCGTTAG GCCATCTGCT
TCAGTCTTCG ATGCCGATCT TGGAAGACGA TGATGAACGG TTGGGTATGA AATCGTCCAA
GCGAATGCTA GAAGATGCTG CTAATGGGAA TACTAGTGGA CAGGATGATG CACAACAAGA
CGAAAGTCAG CAAGTCACCG ATGACCAAGC AGGTATCAGC GCCCAAGATC AGGATGACGA
CAGCCAAACG GCTGGAGACG ACGACCAGAA GCAGACGGAA GAAGAGGAGC ATCGAGGTCC
AGACCAAGAC GATTGGTTTC GAGATGGCTT TGCCGACGAC ACTTTTACCG ACGATGATGC
GAAAAATAGG TACAGTGAAG TAGACGATTT TTATGCCTAT GTTGGCGACC CGCGTCCTCC
GAAGCTAATG CCACTTTCGA GTCGAGAAGT AATTGGATAC TCTGTAGTGG CAATGGCGTT
GACTCTTGGA GCTAGTGGAG GGATCGGTGG TGGTGGAGTC GTCGTACCGG TTTATCTCCT
CGTCATGGGA TTGCATCGTA AGTTTACGTG TGGGTTCAAT AGCCATTTGG ATACTCAAAC
CTCACCTATT TATTTTGTTG TCTGAACTTA ATCACCAGCT CATTACGCCA TACCTATCGC
CAGTGTCACC GTTTTCGGAG GGGCACTCGC TAGTACTATC GTGAACATGC AACGTCGACA
TCCACTAGCG GATCGTCCCA TTATTGACTG GGACCTTGTC TTAATGATGG AACCATTGAC
ATTGATTGGG ACACTACTGG GTACCCTGTT TCATCGGATC TTGAGCGAAA AGATTTTGAT
TGTTTTACTA GTCTTGCTTT TGAGCATAAC AGCTCACTCA ACGTTGAGCA AAGCCATGCG
CATGTATGAA GCTGAAAAAC GCTATATTCG GCATCTTATA GCCGCCCAGG CCGATTCTCC
AAGAGGAAAC CCATCACTCG GAGGCTACGT GCTCCCGTTC GGTGACGAAG ATGACTCTCG
GGCTGATACT GGTTGTAAGG AAGAAGCCAG AATGGCGGCA GAAGAGCGTC AACGTATTTT
GATTCTTAAT CCAGACTTTC GAACGATGAA AACAGATTTG CTAGAGCAAG AGAAAGTGAC
CCCTCGAAGC AAGATCATAG CGCTTTGCTG CATGTTTTCC GTACTTATCT TTTTGAATCT
CATGGTTGGT GGAGGTTCTT TCGATAGTCC ATGGGACATC AAGTGCGGCT CGACCGCATT
TTGGGTGGTG CATGTTGTAA TGATTGCATT TTTGATGTCA TCAGCGTGGA TGGCACAAAC
ATATCTCATT GCTCGACACG AGATCAAGGA TATGGTTCGA TTTGATTATG TCCACGGAGA
TATCAAGTGG GATACTCGCA CATCCATTAT CTATCCAGCT GTATTCACCA TCGCTGGGGT
TTTCGCTGGA ATGTTTGGCA TTGGTGGAGG TGTCGTCATT GTGCCGCTCT TACTGCACTC
CGGAGTGCAT CCTGGCGTTG CGTAAGTTCT ACATATCCTT TTACTCTTTT GTCGGTGATG
GCTACCTTGA CTTAAGTTCT AATTTGTGCT TTCACCATTG AGCAGATCCG CAACATCTAG
CGCCATGATT CTGTTTACAA GTCTCGCATC TGTCTCCACC TACTTCGTTT TTGGTTTAAT
CGTTGCCGAC TTTGCCATGG CCGGCTTTGT CATCGGTTTC ATATCTTCTA CTCTAGGACA
AATTCTCATG CGTCGAGTCC GCCAAGCCAA AAGTGCCAGC GGACGCAAGT TTGAGCGCAA
CTCTTACCTC GCTTTTGTAA TTGGTGGCGT CGTCTTAGTG TCTGCCTTGC TGATGACAAT
TCAATACGTC TTCATGATTG TCGATCAGCC CGACGAAGAT ACGTTTGGTG GTTTGTGCGA
TGGACTGCGA TTCTAA
 
Protein sequence
MERSGRDENA EPEVRKRNRR RLLWCSSSAL CVVAAALSSN GLSASLTGKS LPFPALGHLL 
QSSMPILEDD DERLGMKSSK RMLEDAANGN TSGQDDAQQD ESQQVTDDQA GISAQDQDDD
SQTAGDDDQK QTEEEEHRGP DQDDWFRDGF ADDTFTDDDA KNRYSEVDDF YAYVGDPRPP
KLMPLSSREV IGYSVVAMAL TLGASGGIGG GGVVVPVYLL VMGLHPHYAI PIASVTVFGG
ALASTIVNMQ RRHPLADRPI IDWDLVLMME PLTLIGTLLG TLFHRILSEK ILIVLLVLLL
SITAHSTLSK AMRMYEAEKR YIRHLIAAQA DSPRGNPSLG GYVLPFGDED DSRADTGCKE
EARMAAEERQ RILILNPDFR TMKTDLLEQE KVTPRSKIIA LCCMFSVLIF LNLMVGGGSF
DSPWDIKCGS TAFWVVHVVM IAFLMSSAWM AQTYLIARHE IKDMVRFDYV HGDIKWDTRT
SIIYPAVFTI AGVFAGMFGI GGGVVIVPLL LHSGVHPGVA SATSSAMILF TSLASVSTYF
VFGLIVADFA MAGFVIGFIS STLGQILMRR VRQAKSASGR KFERNSYLAF VIGGVVLVSA
LLMTIQYVFM IVDQPDEDTF GGLCDGLRF