Gene PHATRDRAFT_49808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49808 
Symbol 
ID7198470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp387101 
End bp392450 
Gene Length5350 bp 
Protein Length1718 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184535 
Protein GI219128680 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.365245 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTGGC AAAGGCAACG GGCGGGATCC GAAGACGGAG AAATCGATGA GGAAGAAGGA 
GAAATCACAG ATAATCCTCA GCCACCGGTG GCTTTATCGC TGTCTCCTTC GAAGTCTCGT
CCCACCGTCA CCCACTCGCT CCCACACAGT TCCCAAGCAA CGGCCGCATT CCATGGCGAA
TCAAACTTTC CTCCGCAGCC GCCTTTGCCG AACCGTCTGA GCAGTACTAG CGGGCCGAAC
AGCATCCATC CCTACCCGGG CGCCAGTCAC AGTAGTAGTA ACAGCAACAA CAACATACCG
GTCCCTCCCC CGCGTCGTGG AAGCTGGCGG GGCGGACGTG GCGGTACTAG TGTCGGTGGT
GGAGCCTTTG AACGACGTCC CGGTCGAGGG TTGGGCCACC GCAATACTTC TTTCGGCAGC
GGACCGCCAG CGTTCGACGC GCCCCCACCG GCACGCAGCC AAAGCTTCCA GTCATTTCAC
CGCCACAGCA GCGGAAGCAT TCCAGGCCTT CCCGCCAGCA ATGTTCTCCC ACCGGTAGCG
GCTACTGATC CGAGACGGGC GACGGATCCA CGGTTTCGGG GAGCACCCGG CGTTGCGAAC
ACCCCGGTGT CCGCACCGCA TTTTACCGAA TCGCGGGCAA CCAGCGAGGG CCGGGGCTTT
ACTACCTCGT CCAGTAGTGC TCCGGTATCG GTCAGTAGTA CTTTAGCGGA TTCAGTCAGT
CTGGCAACAC CCTACAGCAG CTTGGCTGAA GGCAAGCCAC CCCACGTACT AGCTGCTGAA
GACGCGAAGG TCGTTCGAGG ACTCCGTGCC AGTAGTGTCG CGAGAAACGA GTCCATCGGC
AATGATGTTC CACGGGATAG CAGCGTCAGT GGGCCCTTCC CGCCACTAGG GGGTTCCACG
GAAATTGCCT CTTTTCCTGG AGGTTTTCGC GATAGAGGTC CACCGCATCG TCGACATACT
GGCGACTTTC GGGGGCACCC TGGCGGCCAC GGCTCGCTCA ATCGACGAGT TTCAAGCGAA
TATGGTAGCG GCGACGGACC GAATAGCGAA GTCCTACCCT TTGGAAGGCC CAACGAAATT
CATCAATCTG CTAGGCAGGG TTCTCAGCAT GCAGTCCCCC CATCCGGTGA GCCACTGCCG
GTGTCGCGTG GTCAATCCAC AGGTAATCAG ACTCCTCCAC CTCCGTTTCA TCGAGGCCCA
GCGCCGCCTC CTACTCAAGA GCAACAGCAA CCTCCATTCC ACCGTAGTCA TCCGTTACCG
CAGGATCCAC CGCCGGGAGC TTTCCCTCGA GACGGGCCGC CTCAAGGAGA AGTGCCATCC
TTTTATAGGG ATGAGCAATC AGTGTTTTCG CGAAATCAAC ACACCCATGG CGGAGATTTC
CCACCGTTCA ACCAAAGAGC TTCGGATCAG CCTCCTTTTT CCTCGGGCCC GCCCAACGTA
GAGCAACCGC TATTTCGAGG ACCAAGACAG GATTCCTATT ACGGACCCGC TTCACGCGAC
GTAAATTCTG GAAGGTTCGG GTCGCCCTCC CAACGTGATC GCCCCATTGT CAATGCCCGA
GGGGTTTCGG GAGGTCCTCC ACCTCCCCCA CCACCACTGG GATCGCAAGG GGCGCTCACA
TCCACACAAC ATCCCTCTTT AGCTCCTGGT GTGGCCCCGC TTCATCGACG GAACGATCCA
CGCCTTCATC GAGACCCCGA TGCGGAAGGA CGAGATTTCG CGGACGCGGC ATCCGCATCC
GAGCCGCTTC GACCCAATTT TCCACCCGAG CGCACCGGAT TCCCGATGCA AAGTGAACGC
GGTTTCCGAA AGCCGCCTTT TGGACAAATG TTTCCACCGG GAGGTGCTAC AGAAAATAGT
ACCGGCTCGG ATTCGTTTGG TCGATCACGC GAACGGAATA CCGCGGCAGC GCGGTCGCCT
CAAACGTCAC CACATACGCG TAAACCCGTT TTGAGCTACT TTCAGGAATC GCCGGCGAAG
GAAATTCCGC GTCTGCCTGC CATAATTGAT GCCAAATCGG GCAGTCTGTC GAGTCGAATC
AAATCTGTAG GTCAACATAC CGAAGCAAGG ACAGAAGAGC CAGAACCTCT TTTGACATCC
GTGCTTGGTG AGGATTCAGT GGATAGGGCG GAAAAGGTTG TATTGCTTCT GACTGATCAA
AGGGATAAAG CTAGCTTGGA AAGAGATGAT AAGGGATGTA GTGAGCTTCC GAAGAAGCAG
ACAATCCTGA TTGCGCTGAA TCGTATGGAC ACCAAAATCA AGCTGCTTCA GAAATCTACC
TTAGATAAAG AAGAAGAAGT TGAAGCACAT ATCGAAAAAG AAAAGGAAGA TCAAAAACGG
GCTGCTAAAG AGGCGAAATC TGAAGCTGAA CGTTTGGAAA AGGAACACAG GCGACGCCGG
GAAGAGGAAC AACAAGCCGA TGAAAAGGCC AAACAAGAGC AGATTGAAAG TATGATAGAA
GAAGGGCAGG CTGGTTTCGA TGCAGATCTA ACAATATCTA CAGTGACGTT CGAGACCGAT
CTCGAAGCAG CTCGTAAGGT AGAAGAAGCA AGGTTTGAGC TAGAATGTCA AGAACAGATA
TCTGCGGCTA CGGAGCGATT CGACAATGAT GTGCAAACTA CACAGCAAGA GTTGGAGAAT
TCTATACAAT CTATTTCGAA TACTCAAAAC CTAATTTCGG CACTCGAGGA GGAGTACAAG
TGCAAGATGG AGGAAGGAGA TACAGCCGGT GAAGAGAAAA TGGATCAACC TGATCTAGTA
AATACAGTTT TGGAAGAAAA TCGAAGGCGC GCTGCCGAGG CCCATGTGTC TCAATGGGCA
GGTTTCCCTG TGGTGTCGGA TGATGATGAG TACGGTGTTT TAGAGAACGA AAAGGATCCT
AAAGAAGGTA AACGTCATGT ACGGTGGGCA GAGATGGCGC AGAAAGTTAC GGGAGTCGGA
GATGCACTCT ACAACGAACC TTCGGAAGCG CCGTATTTTG AGCAAAATGA GAGACTTCAT
GCACTGATCG GCCCGCTGGT AACAGAGCAA ATACGCTACA GTCAACGGCA ATTCGACACC
CACTGGAGAG AACTTGCCGA AGAATACGAA TACCGAAGAG TAGTTTACGA GGCTCAACAA
CTCAAAGATG GCACGGCTCA GAGAAGGCGC ATCAAATCCA CAAGTGTGCC CCATAGGCTC
GTTGGGAGCA AACCTAATGT CCCTATCCTC GAGTCCACAT CTGGCCACGG ACGCTCGTCG
AACAACCCAT ATCGTCGGGC ACGTAGAGGC AACGAGGTGC GGACAGAATA CGAACAAGAA
CAAATTATAG CAGAGCTGGC AGCCAAAGAA GCGCTGGAAA AGAGAATTGC AACTGGGGGG
TCAGAGCTTC CGCGTCAGAT AGGTCAGATC GAAAGAAGCT GGACAGCCAC CTACATCCAA
ACATTTTCGG CGCAAAGGGT TGACCTTGAG GAACAGGAGG CAGAGTTACG TATTACGGGT
GTTTGGACGG ACATGGAAAA GTGCATTTTC TTAGACCGAT TTATGCAGCA TCCCAAGGAT
TTCCGCAAGA TTGCTTCTTT TCTCCGAAAT AAGACGACAA CTGATTGTGT CGCCTTTTAT
TACGATTCCA AGCAAACGCT GCCTTATAAG GGTGCGTTAA AGGAACACGT AATGCGGCGG
AAGAGACGTG GCGGATATCC AATTTGGGAA GCAACTATTC AAGCCGCCCT CTCGGTAGGT
GCAGTCGTTG AAGCAGGGGA TAGTGAAGAA AAGCCATTGA TCTTCACACT TCCGTTTGAT
GATCACACTT TTTCTACTTT TGGCCTTCAT CCTTTGAAAC GCGAAGTTTT GGATTTAATG
GAAATAAAAG AGCAGGCTCT CGCTGAATTT GACGCAGATG AGGATGCAGA CGACGTTTCT
AGCAAATCAG GGCAACCCAA AAAACGTCCT CGCGATCGTC TTTTCCTGTT GGATCCGAGA
CAAAGAAAAT TCCTGAAACC CTTGCCCCAG GAATCGGCTC ACGCTACCTG CCTTAAGGTG
GACAGTGGAA AAGCAAGCAC AGCTGACGAT GATCACAACG ATTCCAAAGA GGGTACAGCA
AAAGATGAGT CGGGGCGATT AACTCCTCTA AGAAAAGCAC CCCAAAAATG GACGGCGTCA
GAGAAAAAGA TTTTTCACGA TACCTTGGAG AGTCATGGTA GGAATTGGAG CATGCTTTCC
CAGGCTGTAG GGACGAAAAC GATTTCTCAG ATTAAGAATT ACTACTACGA CTACAAGAAG
CAGAAAGATA AAAATCGGAC GACTGACAAA GACAAAAAGG TCGAAAGCAA AACTGAGAGG
ACCGAATCTC ACGAAAACAG TCCTACACCG CCACATATTG CCGCGGATCA AAGACCCGGC
GACCAGACTA GTAACGAGCC GATTTCGGAT CTACGCAAAA ATCAGCCGCC TCGCTATGAT
CCCCAATTTG AAGCTAAACA TATCGAGCGA CAGATGTTCG AAGTGTTGCA ACAGCAAGGG
CAGGGTCCAT ACCCCGAACA AGAAAGGCTT GTCGATCGAC GTCCTGTCGA ATCGTTGAGT
GATCAAGAAT TATGGGCCCA ATTACACCGA CAGGGACTTT TGGGTCAACA GCGAGGGCAT
TTATCGGACG AGGCGGCACG GCAACTTCTC CAGCATCACT CGCAGTCACA CCATCAGCAA
GTCCTCTCAA ATTTGATGCC CTGGGCTTCG GGAGGGCAAC TTCCGCAGCC AGTCAAACGA
GCGCAACCAA TCAATGTGCA AGAATGGGAG CAGCTGCAGG CAATTTTGCA GATCCAGCGT
CAACAAGAAC AACATCGCCA TCAGCATCAA CCTCACGTAC CGCACAACCC GATGGCCAAC
TTGGACCCTC AAATGCTTGC GTTGGCCCGT CTAGCGGGTT TGGATTCCAG CGCATTGGGT
ATGAACCCGC AATTATCGCG ACTTGCGCAT CATCCTGCAG TTGGCTCAGC TGGAAGTCAT
GATGACGCAC AAATGGCTTT AGCACAACGG CTTCTGAGCT ACAGTCAGAG CGCTGGGGGA
GGGGGGAATA GTGCCCAGGG GGCGCTAGAT TTGTTGACAC AGGCCATGAG TCGTGGGGGT
GCCGGACGCC ATCCGAATCC AGATCGGGGT TCAGATCGGG GTACAGATCG GTACTAGAAT
GGATACCTGA TCGAGAAAAG TGGTTGGCGT TGGGTGTTGG CCGGGTACAC AAGGTTTTTT
GTGCATTCAG AAAAGTTCAA CAGCTCAAGT CAATAGTTTT TGTGTTGAAC TGCTCCGCTC
TCTGCTATCG AGCGCTTGAT CCGTTGGAAT AGCAAATATC TGCCTCTCTT GATTTTCTAT
AGTTTACGGT
 
Protein sequence
MSWQRQRAGS EDGEIDEEEG EITDNPQPPV ALSLSPSKSR PTVTHSLPHS SQATAAFHGE 
SNFPPQPPLP NRLSSTSGPN SIHPYPGASH SSSNSNNNIP VPPPRRGSWR GGRGGTSVGG
GAFERRPGRG LGHRNTSFGS GPPAFDAPPP ARSQSFQSFH RHSSGSIPGL PASNVLPPVA
ATDPRRATDP RFRGAPGVAN TPVSAPHFTE SRATSEGRGF TTSSSSAPVS VSSTLADSVS
LATPYSSLAE GKPPHVLAAE DAKVVRGLRA SSVARNESIG NDVPRDSSVS GPFPPLGGST
EIASFPGGFR DRGPPHRRHT GDFRGHPGGH GSLNRRVSSE YGSGDGPNSE VLPFGRPNEI
HQSARQGSQH AVPPSGEPLP VSRGQSTGNQ TPPPPFHRGP APPPTQEQQQ PPFHRSHPLP
QDPPPGAFPR DGPPQGEVPS FYRDEQSVFS RNQHTHGGDF PPFNQRASDQ PPFSSGPPNV
EQPLFRGPRQ DSYYGPASRD VNSGRFGSPS QRDRPIVNAR GVSGGPPPPP PPLGSQGALT
STQHPSLAPG VAPLHRRNDP RLHRDPDAEG RDFADAASAS EPLRPNFPPE RTGFPMQSER
GFRKPPFGQM FPPGGATENS TGSDSFGRSR ERNTAAARSP QTSPHTRKPV LSYFQESPAK
EIPRLPAIID AKSGSLSSRI KSVGQHTEAR TEEPEPLLTS VLGEDSVDRA EKVVLLLTDQ
RDKASLERDD KGCSELPKKQ TILIALNRMD TKIKLLQKST LDKEEEVEAH IEKEKEDQKR
AAKEAKSEAE RLEKEHRRRR EEEQQADEKA KQEQIESMIE EGQAGFDADL TISTVTFETD
LEAARKVEEA RFELECQEQI SAATERFDND VQTTQQELEN SIQSISNTQN LISALEEEYK
CKMEEGDTAG EEKMDQPDLV NTVLEENRRR AAEAHVSQWA GFPVVSDDDE YGVLENEKDP
KEGKRHVRWA EMAQKVTGVG DALYNEPSEA PYFEQNERLH ALIGPLVTEQ IRYSQRQFDT
HWRELAEEYE YRRVVYEAQQ LKDGTAQRRR IKSTSVPHRL VGSKPNVPIL ESTSGHGRSS
NNPYRRARRG NEVRTEYEQE QIIAELAAKE ALEKRIATGG SELPRQIGQI ERSWTATYIQ
TFSAQRVDLE EQEAELRITG VWTDMEKCIF LDRFMQHPKD FRKIASFLRN KTTTDCVAFY
YDSKQTLPYK GALKEHVMRR KRRGGYPIWE ATIQAALSVG AVVEAGDSEE KPLIFTLPFD
DHTFSTFGLH PLKREVLDLM EIKEQALAEF DADEDADDVS SKSGQPKKRP RDRLFLLDPR
QRKFLKPLPQ ESAHATCLKV DSGKASTADD DHNDSKEGTA KDESGRLTPL RKAPQKWTAS
EKKIFHDTLE SHGRNWSMLS QAVGTKTISQ IKNYYYDYKK QKDKNRTTDK DKKVESKTER
TESHENSPTP PHIAADQRPG DQTSNEPISD LRKNQPPRYD PQFEAKHIER QMFEVLQQQG
QGPYPEQERL VDRRPVESLS DQELWAQLHR QGLLGQQRGH LSDEAARQLL QHHSQSHHQQ
VLSNLMPWAS GGQLPQPVKR AQPINVQEWE QLQAILQIQR QQEQHRHQHQ PHVPHNPMAN
LDPQMLALAR LAGLDSSALG MNPQLSRLAH HPAVGSAGSH DDAQMALAQR LLSYSQSAGG
GGNSAQGALD LLTQAMSRGG AGRHPNPDRG SDRGTDRY