Gene PHATRDRAFT_45249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45249 
Symbol 
ID7200263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp603868 
End bp608538 
Gene Length4671 bp 
Protein Length1421 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179250 
Protein GI219116911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTCGT TCGGTAGCTT TGATCAGATT GAAGGATCGG ACGCGATTCC GCGCTACGAT 
CAAATGGGTC GATTGCTTTC GCCAGACGAA CGCGAACAAA TTGCTCGTAG GAAAGGTTTG
TGCATGCGCT GTGGAATGAA AACGCATCAG GGTGCCTTCA AACGTCCCTT GACGGACGAT
AACTGTTACA AAGGAACCTG CATTCGTTGC AACCCCAATG CCGTCCCTAA ACGTGTGTTG
GAGTCATGGA ATCTCAAAAA TCGACCGGCC AATGTGCAGT TGGTGAGTGG AGCGGCACAG
GCCAATGCTG TGGCTTCTGG CATTACTCTG CATGGGAAAC ATCTTTTGAA AAAAGCCACA
CGCTCAGTTA TGGCTGCGGG GCGAGCTACC AATAAAAATG GACGAGTGAT TCCTACAGGA
CCTTCGACAA CAATCGCATT CACGAGTGAA CATTCAACAT CTTCCAACTT GAAAACGGGT
TCTGGTACGA GTTCAGAAGC GCAGTCTTCA CAGACTCCAT TACCCCAAAA ATCAGCGCCT
CTGCTACATG CTCAAGATCT GCATCAACCC TGGATCGAAG GAAAAAGGCC ATCAGCCCAT
ATACCCAGCC TGCACTCGGA ATCAAGCAGT ATGCGCCAAA CGTCGACATC GACTATTGGA
AAACTCAGTC CGACTGTTTC CGACCGATCC TTGTACAGTG CAGAGTCATT TGCAAGTTCC
GCCAGTCACG TGTCAACTAA ATGTGCTGCT TCTGGATGCG GAATTGGCAC TAACAATGGT
AATTGTGGCC GCGAACCAAG TGTGGGTGCC GACCAAAATG CGAACGATGA TTTGATGGCA
TCGGAGTACC GTCAAAAGCC TGCCAAATCG GCTGGTCTTT CCAATAGCGT ACGATTGAAT
TCGTCGCGTA ATGATCTAGA TCTAAGCTCT CCTTCCAACA TCCGTTCTAA GTATCTCTCA
AACAGATCAC ACAGCAAGAT GATAGAAGAC GACTGGACCG GGGGTTCTCA GGAAATGACT
CATCATACAA TCATCGAAAC TTTGAGAGCG GCCCGAGCTG ATCCAGTAAA ACTCCGTCAA
GCTCTACACG TGTTGCGAAA TGGCCTCGAA GTGGATTTGG ACGGAAGCTT AATTATTATT
GCAAGAGATA TTTTGTCCCA GTATGTTTTT GATCCAAGCA TTGCCGATGC CGCGTGTGGG
GCTGTCTGGA GAATTACCAC TCTGGAGGAT TCGCTTAAAT CCCTGGCGAT CGAATCAGGA
ATCGTTGGAC TCCTAGTCGA TGCCCTGAAA GCGCATCATG AGGATGCAAC ACTTTGTGAA
TGGGCACTCG GTGCGTTAAC AACGCTTGCG TGTAGTCCCT TCACGAAAAG CGACTTGGCG
AAGACTGGAG TGATTGAAGC TGTTTTGGAA TTGCTCGACT TTCATCAAAA TTCGGCCGGA
ATCTTGGAAT GGACATCCCG GTGCTTGCAC AACCTCGTAC ATCAATACGT GGTACTTGAT
GTTGAAGCTG CCGAGGAGGA AATGCAAGCG CAAATCAAAA AGAATATTTC CAGCATCATT
GAAGCCAACG GCGTTTCTAC GCTCCTTAAC GCAATGAAGC TTCATGCTAC CGAGCCAATT
GCGCTGCTAT GGGCGACAAA GCTCATGTGG CGTCTTTTCG GTCGTAAGGA AGAAAGTTCC
ACTGTGCGAG TCTTATTTCA ATTGCGTCAG GACGGATTCG TTCCTCTCTC CACAAAGCTG
CTACGACAGC AGTCTACGAG CAGTGAACTT TTCCAACTTA TATCTCGATT AGTCTGTATG
TTGTTGTTGA AGATGGATGA TGGAGCGTTG TATGAAACTG CCTCGGTTGC CATGCCGTCT
ATTGTTCGTC AAATGGAAGA GTTCAAAGAC GACGAAACGT TGCAAGAGGC CGGATGTCGA
CTGCTTTGCG CTCTCAGCAC CGGTGGCGAA TCAGTACAAG AAAATTTAAA GGAAGCGGAT
GGAATCTTGG CAATTGTGAA GACTATGGAG CGTCTTCCTG AAAATTTGAT GCTATTGACA
AGAGCTGGCT GGTGCTTATG GCGGCTATCC GCCAATCCCT CGCTGTTTGA TTCAGGCCTT
GTGGAGAAAT CCTTACAAGC ATTGAGCAGT GCTATGGACA GCCACAGGGA CTCTGTCGAT
CTATTGGTGT CGACGTGTGG ATTCTCGAGA AATAGTACCA TGGTAGATGG CGTGTCGCCT
ACTGCTTATC CTTTAGATGT TATTTTTCGG TGGTTGACAA CGGAGGGGCA AGCAAGCCAT
TATAAAGTAC AGGCATCACA AGCTTTGCAT ACTTTATCGA GAAAATACAA CGATTTCTTG
CATCTGCTCA ACGAAAGCGT CGGTATCCCA AGGATGATTG CATCTCTTCG CGATCCACAA
TCTTGTTGCC GTATTGATAT TTGCTGCATT CTGACAGTCC TGGCCAGAAA TTCAGAAGAC
TCACGCCAAA TGCTCGCATC TGCTGATGTT GTCCAGACAG CTTTAGCAGA GCTCGCCGAA
ACAAATGATC TCAACTGGAA GGCTTGTCTC CTGCGACTCG TTTCTACAAT TTTGGTGTCA
GAATGTGCCC TACAGATTGA CGTTCCTATA CATGCTATTC AAACAGCAAT TGGTGGAGTG
GAAGGGAGCA CATTTGATCC TTCCTTAAGC GAGTTAGCGT GCATATGTGT TCGCAACCTG
CTTTTGACAC CTAATTCACG CGTTGGTGTC GAAGGGTTGG TGAAAGCTAT GACAGACACT
ATTGATACAT GTGCCGTCTC CGACAGCCTC TGTATAGAGG CGTGCTATGT AATTTGGGCC
CTGACCTCAA AGTACTCCGA TCGGAATCCA TCCGAATTGT CTGCTATGAT GACGTCTTTA
ATTGGACTTA TGGGTAAGTT TATGGAACCG CTTAATCTTG AGATCCAGTC CGCTGCAGCT
GGCACTCTAG CTAGTGTACT CGCATCTATA GTACGCTCTC CTATCCCTTT GAAGGTTCAA
GATGTTGAAG TGGTCATATC GGTCATTTAT AAGATCATCG ATACTAAGCC CGGAGCATCC
GAAGCAATTG AACATCTACT CGCCGTTTTG TGGAATTGCT GCCTTGTGGA CGAGAATACT
CTCGTTCAAG GTGGTGTAGT TGTCGCAGTT ATCGATACTA TGGTCGACAA CGAAAGCAAT
TTGCAAATTC AGGAGCGTGG TTGCACAATC CTCGCATTGC TTGCATCTGC GGAGAATTTG
GAGGTAAACT TTAGCATCGC TGAAACCAAT GGTGTCGAGC TGCTGGTTAG TGCGCTTGCC
GTATTCGGGG ACAACGTAAA TGTCACGTTG CAGGCTTGTA AATGCTTCTC ATATTTGAGT
ATCGATCCAG AGCTACGTGT GATGATCGTG GCCCAAGGAG GTCTCCGTCT GGTTGTTGTT
GCGATAACGT CGAATCCAGA TAATGCCGAA CTAGTGGGTT TCGCTTGCTC TACCCTCTTA
AATTTGACCT TTGACGCTGA AGTATCTGCA TACATTGGAT CGGGGATCGT GGACGCAATT
GTACAGACTA TGACTGGTCA CTTAAAGTCA GCGCTTTTAC AAGAAACAGG ATTGGGAATT
TTACAAAATA TATCTATGCG CGGTCCGGAT GAGAAGGCAC GCATTGCTGA AGCTGGAGGA
GTCGAAGCTG TCGTTTCAGT ACTTAGGGAA CATATTCGGT TACCGTCCGT TGTTGAACGT
GGGCTTGCAA CGTTGTGGAG TTTGGCAGTT CTTGACGAAA ATCAGATACG AGTTGCGAAT
GCTGACGGCA TCAATCTTGT GGTCAACTGC ATGATGGCCC TGATCGAATA TGAACGAGTG
CAAAAGCAAG GCTGCGGGTG TCTATGCGCA CTGGCAGGTG ATTCAACCAG TAAAGTTTTG
CTTCGCAATG CCGGTGGATT GGATGCAATA GTTTTTGCCA TGTGGGCTCA TTTCAACAAA
AGTGGAGTTC AAAAAGAAGG ATGCAGAGCA ATATCAAATC TTGTACATGA TCCCGGAACG
AATGAGATAA TGCTAGTATC AGAGACTGAG GTTGGGGCGA TACTATCTGC TATGAGAAGA
TTTCCTTCAG TGGCCGATTT GCAGATGCAT GCTTGCTACT CTCTTCGAAA CCTTACACTA
TCTGTGGACA ATGTAGCTGT CGTACTTGGA AGTGCGGACG ACATCCGCGA GCTCGTAGCC
AAGGTTTCCT TACGAAACCC CGAATGCAGT CCAATCGTCA ACCAAATACT TTCTCATTTT
GGATAAAAGG TTGTCGCTAG TAAAAAGAAA AGACTCAAGT GTAGACTAAA CGCGCCCGTT
TGCTCTTACG TTGTCTTCAG TGGAAAAACG TCACCTAATT GGGTCACTTG CGGCGCGTAG
GGATGTACCT TTCTTTCTCC GGTTGGAGAG CCTTCACGAC TGGTAGGTGG GGTGACAATG
CTCTTCTTGC TATGTGCACC AGTGTTGACT GTTTTGTCCC AATGATCAAG AAGTTGTGGG
CACTTTGGAG ACACTATCTC TTTTGAGCAC GACGTGTCGC GTTCCAATCC TTTCCTGGTT
CGCTTACGGA GGAATCCTGT GTATTTTCTC TGGATCGAAA TGAACAGAAT TGATATAAAA
CTAATCATGC ACAGTGTTCC GATCATAAAG TCAATGATTG TGCGGTTTTG G
 
Protein sequence
MDSFGSFDQI EGSDAIPRYD QMGRLLSPDE REQIARRKGL CMRCGMKTHQ GAFKRPLTDD 
NCYKGTCIRC NPNAVPKRVL ESWNLKNRPA NVQLVSGAAQ ANAVASGITL HGKHLLKKAT
RSVMAAGRAT NKNGRVIPTG PSTTIAFTSE HSTSSNLKTG SGTSSEAQSS QTPLPQKSAP
LLHAQDLHQP WIEGKRPSAH IPSLHSESSS MRQTSTSTIG KLSPTVSDRS LYSAESFASS
ASHVSTKCAA SGCGIGTNNG NCGREPSVGA DQNANDDLMA SEYRQKPAKS AGLSNSVRLN
SSRNDLDLSS PSNIRSKYLS NRSHSKMIED DWTGGSQEMT HHTIIETLRA ARADPVKLRQ
ALHVLRNGLE VDLDGSLIII ARDILSQYVF DPSIADAACG AVWRITTLED SLKSLAIESG
IVGLLVDALK AHHEDATLCE WALGALTTLA CSPFTKSDLA KTGVIEAVLE LLDFHQNSAG
ILEWTSRCLH NLVHQYVVLD VEAAEEEMQA QIKKNISSII EANGVSTLLN AMKLHATEPI
ALLWATKLMW RLFGRKEESS TVRVLFQLRQ DGFVPLSTKL LRQQSTSSEL FQLISRLVCM
LLLKMDDGAL YETASVAMPS IVRQMEEFKD DETLQEAGCR LLCALSTGGE SVQENLKEAD
GILAIVKTME RLPENLMLLT RAGWCLWRLS ANPSLFDSGL VEKSLQALSS AMDSHRDSVD
LLVSTCGFSR NSTMVDGVSP TAYPLDVIFR WLTTEGQASH YKVQASQALH TLSRKYNDFL
HLLNESVGIP RMIASLRDPQ SCCRIDICCI LTVLARNSED SRQMLASADV VQTALAELAE
TNDLNWKACL LRLVSTILVS ECALQIDVPI HAIQTAIGGV EGSTFDPSLS ELACICVRNL
LLTPNSRVGV EGLVKAMTDT IDTCAVSDSL CIEACYVIWA LTSKYSDRNP SELSAMMTSL
IGLMGKFMEP LNLEIQSAAA GTLASVLASI VRSPIPLKVQ DVEVVISVIY KIIDTKPGAS
EAIEHLLAVL WNCCLVDENT LVQGGVVVAV IDTMVDNESN LQIQERGCTI LALLASAENL
EVNFSIAETN GVELLVSALA VFGDNVNVTL QACKCFSYLS IDPELRVMIV AQGGLRLVVV
AITSNPDNAE LVGFACSTLL NLTFDAEVSA YIGSGIVDAI VQTMTGHLKS ALLQETGLGI
LQNISMRGPD EKARIAEAGG VEAVVSVLRE HIRLPSVVER GLATLWSLAV LDENQIRVAN
ADGINLVVNC MMALIEYERV QKQGCGCLCA LAGDSTSKVL LRNAGGLDAI VFAMWAHFNK
SGVQKEGCRA ISNLVHDPGT NEIMLVSETE VGAILSAMRR FPSVADLQMH ACYSLRNLTL
SVDNVAVVLG SADDIRELVA KVSLRNPECS PIVNQILSHF G