Gene PHATRDRAFT_47619 details


Gene Information

Locus tag: PHATRDRAFT_47619
Symbol:
ID: 7202823
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 297191
End bp: 302273
Gene Length: 5083 bp
Protein Length: 1565 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182043
Protein GI: 219123462
COG category:
COG ID:
TIGRFAM ID:
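The Gene Length field is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, using a hypothetical helper named `span_length`:

```python
# Coordinates copied from the record above.
# Assumption: both ends are 1-based and inclusive.
start_bp = 297191
end_bp = 302273


def span_length(start: int, end: int) -> int:
    """Return the number of positions in the closed interval [start, end]."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)  # 5083, matching the Gene Length field
```

Because the gene is marked as spliced, this genomic span is longer than the coding length implied by the 1565 aa protein.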


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGAGC GAGGTCGTAC GTCCGTCGTA GGAGCCGATC ACTGCGCCAA AAAATTCCTT 
CCGACCCCCC TAGACTTTGT CGGTGTTGCG GACCGGGACC GTAAAAGGCA ACTTCCCTCA
CTGGAAGTTC TCGATAGACT CTGCGAGGGC ACAAGCGCGC GTTCGTTGGC TGGATGGACC
TGGTGTTTGC CTCTTTCACC AGAGAACACT AGAGTCCCTA TAGAAGGGAA GAGCTTTACC
AATAGATCGA ACCACGTACA TCCCGAAGTC ATTATGAGCG CGGCGGCAGC CGGCAGTGGC
TGGAAATGTT CTCGTTGTAC GTTCTACCAC CATGCTCCTG GATCACGTTG CGCCATGTGC
AACGAGCTAC GAGTCTCGCG ACAGCAGATG CGCGACTTCG TCATCGGCAA GCCTATTCCC
AATGACGATG CAGTATCCCC GATGGGCCAA AAGCTACACG GACCATCCAC AAGCCTGTAG
TCAATCCCTA CGTTCGCACC TGTCCCAGCA ACAAACCATA CTTTCTACAT TGCGGCCCAA
GAGCGCCTCT TCACGGGCCA TTGCGAATCC GTACGCAAAA CCGTTGGCTG TCTCACAGCA
GCAACAATCC GAATCGGTCG ACAAGACGCC GTCATCACCA ATTCAACCAC ACATCCCTTC
CGAGTCCCTG CCACTGCCGC AATATACCGT CACGAATCAA ACAACCAGTT CCACTAGCAG
TACCGCAGGT ATTGGGAATA AACTCCACCG ATCAGCAACC AGTGATGGAG ATTCTAATCC
ATCCGATACG GGACGCGGAA AGAACCCCTT CGTGGCCCTC AAACCTCGCC CTGTTTTGGA
GTACAAACCT GGTCCGGTCC CGATAGACGA GAGCACGGCT CATGAATGGG TCTATCCGAC
AGCCGAAACT TTCCCCAAAC GTCAGTACCA ATTAGAAATT ACGAAAACGG CTCTCTTTCA
TAACACTTTG GTATCCTTGC CAACCGGCCT AGGCAAAACT TTGATTGCTG CCGCGGTACT
CTACAACTAC TATCGATGGT TTCCAACCGG CAAAGTCATT TTCCTCGCTC CCACTTTACC
TTTGGTCAAT CAGCAAGTCA AGGCTTGCTA CGATATTATG GGTATTCCTC CATCAGACAC
GGCGGTGCTC ACAGGAAAAG TGCATGCCGC TCGTCGGGAA ATCGTATGGC GTGACCGCCG
CGTCTTTTTT TGCACGCCTC AGACCGTCCA AAAGGACTTG GACGCGAACC GCTGCGACTC
CTCCAAAGTC GTCTGTGTAG TGCTAGACGA AGCTCACAAG TCCACGGGAG ATTATGCTTA
CGTCAAAGTC GTGGAACGCC TCGAGGAGGC TGGTGCACAT TTTAGGGTTC TGGGATTGAG
TGCGACGCCC GGTACCAATA TAAAAGCAAT TCAGAGTGTG GTAGATGTAC TGCGCATCAA
CAAGATTGAG GCCCGTCGGG ATTCGGATCC ATCGGTAGCC CGATATATAC ACGAAAAGCA
GTCGGAGATT GTTGTTGTGA AGCAGGCGTC TGCATCACGA ACCATTGAAC GGGCCCTCAA
CGATGTTGTC GGACCATTGT TGGAGCGATT ACGGAGCGCA GGGGCATTGG GGCGTCTCAC
GGGAAACGCT ACAATAACCG CGTATAATTT AATACGTGCC CGGGAAGAGT TTTGCAAACG
TCGGAACGAC GACGGCTTGA TTGGTTTCTT TCTCGCTGCG CAACAATTTG TACAAATACG
ATCGGATTTA CACAAACACG GAGTTGGTTT GGTGAGGTCA AAGCTTAGTC GTCTACGGAC
CGAACGTCAA CGGGGTATGG TGGCTTCAAT TGTCAAGGGG AAAGAATTTC AAGCTTTGTG
GGAGGAAGTT GCCAAATCGA CGTGCGACCC AAATTCCAAT CACAATAACG TGCAGGACAA
ATTAGTGAAC AATCCAAAAC TAACGCAGCT CAGAGAGATT CTTGTCGAGC ACTTTGAAAG
AGCTCGGGCT TGCTCGACGT CTTCGCGGGC TATCGTCTTT TCGCAATTTC GAGACTCTGT
GTCGGAAATT GTGGATATAC TTTCCGCTTC CAGACCGTTA ATTCGGCCTC GACATTTTGT
GGGTCAAGGG AAGGGTACCA AAGGCGAGGG AGGAATACAG CTAAAAGGGA TGAGACAAGT
TGAACAGCAA CAAGCTATTC GCGAATTTCG CGAGGATACT TTCAATGTCC TCGTATGCAC
TTGTGAGTAA AATCGGGTCA AAGGATGTGC ATTTGCGCGC TAACACGCCT TTCTAAACCT
TGCCCCAATT TGCTTATCGC TAGGCATCGG CGAAGAGGGT TTGGATATTG GAGAGGTTGA
CCTGATTGTT AATTTCGATA CTCTTCGCTC CCCAATCCGT ATGATTCAAC GTACCGGGCG
TACTGGACGG AAGAGAGATG GTCGCGTGGT TTGCTTAGTG GCTGAAGGGC CCGAGGAACG
AACGTTGCTG GCGTCTCGCC AGTCCGAGCG AAATCTAGCG CACGCTCTGA AGAATCCCAA
GTCATTTCGA GTAGCGCAAA CAATGCCACT TTTTCCAAGC CAACCGAAGT TAAGAGAGCA
AAACATGTTG GTATCAAAAG ATTTCCACAT TTCTCAGGTT GAAGGTCACG AAGGAACTAG
GCGCAAGGTT TTGGGCCCCT TTGATGCGAG AAATAGTCAA TCTGCTCAAG AGAAAATTCG
CTGGAGGCTT GTGACAAAAG AGGAAAACGA ACGCGAATCG GCTTTAGGGA GAATACTTGT
TTCCAAGGGT TGCTCAGAGG TTTGGAGAAC GTCGCTACGG CGCCGTTTTC TGTTGGCAAG
GAGTCTTTCC AATATCTCTG ACACTCGACG ACTCAATTAC ACAACTGGAA GAACCGTCAG
AATTCTTGCA GATCTTCGGT ATTCCCACGG TGTAAATGTT TCACACAATT ATAGCAAGAC
ACACTCTAGG GGTCATGGCT GGACTTTAGA GCATCTCTTC CCTCTCAAGT TTGACTCATT
GATCGGCAAA GGATTGCATG GGGAAATTCT GCCGGTAATC AATCAGTCGC TTCCGAAGAA
CCTCCTAGTT GAGCTTGATG TGCTAAATAA AGGGAGCATG GAGACTATTT TCCGGAATGA
CCACGGATGT GTGGGGAGGA ACTCTATCCT CGGACAGAAA GGCTTTGTCA GAAACAAGAA
AACACGAGAA ACAAATACGG AGCACTCCGA AGGGGAACAT TATCTGACCA AGGAGCAAAC
AGAAGCTTTC GGAATCGATG CGCCTCCCCA TGAAGCAAGA AGAAATTGCA GAAGAAGAAT
TCAACAAGAA GAAGCCAAAA AGAGACCGTT GTCTTCATCA GGAGTCGATA TACTCGGTCA
AGAATCTATG ATGAGCAAAT TCGACTTGGC CTCCACTGCC AATGACAAAG AAAACGAGCC
TCATTGTGCG TGGACGGGCA GTACAACTCT AGAGACATCG ATAGTTGACG AATTTATACT
GCCCACTGCA GACCAAGATT GCGATACAGA TGAGTGCGCA CAAGATGAGT TTGTCCTCCC
CCCGACGGAA GATTTGTCAT CGAGTAGCGA CGAAGAAGAA TCTAGGCCAG CCTATGAATC
TGTTATTCCC AGAACTGCTG TTGCTCACAA TACAGGATCT TTTGCCGGAG CGAGCTACTT
CCCTATGTCG GAGCATGAGA CAACTACTGT TGATTCAAAC CAGTTCTATT ATCGTCTGCC
GACCCAGAGC GACTCTTCTT CCGATGAGGA TGACGAAACC GCTATGGCTT CGAGGTATGT
GGTAAACAGT CAGTCAAAGA CGGCAAGCAA GCAGGTTGCA GTCTCTTCTT CCATTAAGGA
TAAGCAAAGA TCCTCTACTG GACTTCGATC AACGGCACGT TTCGGTCTTG AAACGTCATC
AGAATGCTCA TTTGACGATT TAGTCACCAC CGATAGTATT CGCGACGCTA GCGTCATAGA
GAGTCCAGAT TCCCAAGCAA AAATGAAAGC TACTACACGA CGTGCCATTG AAGACACCCC
AGAATCTACG ATATCGCCTC GTGTAAGCTT GGCAGAACCC TTACAAGGAA AAAATGAGCT
TACCAATGAC GTAGATGGGT TACTGGATAC ACAAGATGAC GTTGTCTCGG GTGTTTCCGA
TATAGTATGT GCTGTATGCT TCTCTGGGGA GTCGGTCGAC GATGACCCAA TAGTCTTGTG
TGATGGACGA GGCAAAGGAG AAACATGTAA TTTAGCTGTG CACGCAACCT GCTATTCAAT
ACCCATCAGT TGCCTCGGTG ACGCTGAGTG GTTCTGTGAT CTTTGCCGGT TTCCTACCAA
CCCGTCACAG CCTGCTCCCT CATGCTCGCA TTGCCATCAA GAAGGAGGTT CTCTACGGCG
GATGCCTTCG TTTGAATGGT CTCACCCGCT TTGCCCTATG AATCAAAATA GCGAAGCGCG
TAGAGCAGGG TTCAAAAGGC TCCGAAAGAT CCCAAAACGA AAGTCGCTCC CCCTCGATTT
GTCATCACAG ATTGATATGT ATCATGCAGA TGACAACACC AACGTACCCA AGCGAAAGCT
TCGGCACTAT CGCTGCTTTC TGGACGAAGA AGCTGGGATT GATTCAGACG AGGACATTGA
CGGAGATCGT ATGGAGGACG AAGAGCTGGA TGCTATCGAG GACGAGGAAG CCGCAATGAG
CAGTTTCATC AACGACTCGT CACAATTGGG CCAAACGCAG GACGAATTGG ATCTAGCTGA
CCCACCTGCA CCAGAGGATT GCGTGCATCG CCAGCTCGAC CTTGATCGTG AGCGCAAAAC
CGTCTTCTCA ACTCCACTTC TTAATCGCCG AATGAAACGT CGAAAAGGAC GCGATAGTTG
GACACCAACA CCAGCGTCGG CCCCAGACTC AGAGAAAGGA CTGGGGAACA TGCATTTCAT
AAGAAGTGTG ATCGAACACC ATCGTCGAGG CGGTGACTCG GAAGCTATCG AGCAACTGTA
CCGAATGGAA GAAGACCAGA GTGTTGATGA AAGTGGTGCT CTTGACTGCG AACAGACACT
ACGTCCTCGG ATAGTTCACT ACTGTAACAG TGACTCAGAC TAG
 
Protein sequence
MSERGRTSVV GADHCAKKFL PTPLDFVGVA DRDRKRQLPS LEVLDRLCEG TSARSLAGWT 
WCLPLSPENT RVPIEGKSFT NRSNHVHPEV IMSAAAAGSG WKCSRCTFYH HAPGSRCAMC
NELRVSRQQM RDFVIGKPIP NDDAQQTILS TLRPKSASSR AIANPYAKPL AVSQQQQSES
VDKTPSSPIQ PHIPSESLPL PQYTVTNQTT SSTSSTAGIG NKLHRSATSD GDSNPSDTGR
GKNPFVALKP RPVLEYKPGP VPIDESTAHE WVYPTAETFP KRQYQLEITK TALFHNTLVS
LPTGLGKTLI AAAVLYNYYR WFPTGKVIFL APTLPLVNQQ VKACYDIMGI PPSDTAVLTG
KVHAARREIV WRDRRVFFCT PQTVQKDLDA NRCDSSKVVC VVLDEAHKST GDYAYVKVVE
RLEEAGAHFR VLGLSATPGT NIKAIQSVVD VLRINKIEAR RDSDPSVARY IHEKQSEIVV
VKQASASRTI ERALNDVVGP LLERLRSAGA LGRLTGNATI TAYNLIRARE EFCKRRNDDG
LIGFFLAAQQ FVQIRSDLHK HGVGLVRSKL SRLRTERQRG MVASIVKGKE FQALWEEVAK
STCDPNSNHN NVQDKLVNNP KLTQLREILV EHFERARACS TSSRAIVFSQ FRDSVSEIVD
ILSASRPLIR PRHFVGQGKG TKGEGGIQLK GMRQVEQQQA IREFREDTFN VLVCTCIGEE
GLDIGEVDLI VNFDTLRSPI RMIQRTGRTG RKRDGRVVCL VAEGPEERTL LASRQSERNL
AHALKNPKSF RVAQTMPLFP SQPKLREQNM LVSKDFHISQ VEGHEGTRRK VLGPFDARNS
QSAQEKIRWR LVTKEENERE SALGRILVSK GCSEVWRTSL RRRFLLARSL SNISDTRRLN
YTTGRTVRIL ADLRYSHGVN VSHNYSKTHS RGHGWTLEHL FPLKFDSLIG KGLHGEILPV
INQSLPKNLL VELDVLNKGS METIFRNDHG CVGRNSILGQ KGFVRNKKTR ETNTEHSEGE
HYLTKEQTEA FGIDAPPHEA RRNCRRRIQQ EEAKKRPLSS SGVDILGQES MMSKFDLAST
ANDKENEPHY QDCDTDECAQ DEFVLPPTED LSSSSDEEES RPAYESVIPR TAVAHNTGSF
AGASYFPMSE HETTTVDSNQ FYYRLPTQSD SSSDEDDETA MASSIRDASV IESPDSQAKM
KATTRRAIED TPESTISPRV SLAEPLQGKN ELTNDVDGLL DTQDDVVSGV SDIVCAVCFS
GESVDDDPIV LCDGRGKGET CNLAVHATCY SIPISCLGDA EWFCDLCRFP TNPSQPAPSC
SHCHQEGGSL RRMPSFEWSH PLCPMNQNSE ARRAGFKRLR KIPKRKSLPL DLSSQIDMYH
ADDNTNVPKR KLRHYRCFLD EEAGIDSDED IDGDRMEDEE LDAIEDEEAA MSSFINDSSQ
LGQTQDELDL ADPPAPEDCV HRQLDLDRER KTVFSTPLLN RRMKRRKGRD SWTPTPASAP
DSEKGLGNMH FIRSVIEHHR RGGDSEAIEQ LYRMEEDQSV DESGALDCEQ TLRPRIVHYC
NSDSD
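
Both sequences above are printed in space-separated blocks of 10 characters, so they need to be rejoined before any length or composition check. The following is a minimal sketch of that step. The helper names `join_blocks` and `gc_fraction` are hypothetical and are not part of the source database.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the printed blocks form one continuous sequence."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C characters in a nucleotide sequence."""
    seq = dna.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence, copied from the record above.
first_line = "ATGTCTGAGC GAGGTCGTAC GTCCGTCGTA GGAGCCGATC ACTGCGCCAA AAAATTCCTT "
seq = join_blocks(first_line)

print(len(seq))          # 60 characters: six blocks of ten
print(gc_fraction(seq))  # GC fraction of this one line only
```

Applied to the full gene or protein text, the length of the joined string can be compared with the Gene Length and Protein Length fields, and `gc_fraction` of the full gene sequence can be compared with the GC content field.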