Gene PHATRDRAFT_47853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47853 
Symbol 
ID7203078 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp240966 
End bp245693 
Gene Length4728 bp 
Protein Length1314 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182354 
Protein GI219124109 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCGGA GTGACCTGAC CGTAACTGAC CTCCAAGCAG TAAAATTGAG TCGATTATTG 
GAGGTCCAAA ACACAACCTC GCTGTCTGTT TCGCCACCAT CATTCCCAAA CAAGCAACTC
AGCCAAGCGT AACAACTCTG CGTCCGTCGC AAAAGCTCTA CGTTCCATCA ATACATGTAC
TGCATTTGCT GAGCCATCAC TAACAGTAAA ATACGCGAAA CGGGAATTGT TTTCTGAGCT
TTGCCTGGAT TGCTTCCAGT ATGATACGCC GAAGAGGTCA ATCAGTCGTG TTCGAAGACG
TGGGCGGTAT TGTCCACAAA ACGGCGCAAG AAAAGGCGGT AGCGTCCGCA TCAAAGCGAG
GCTCAGATGG GGGAGGGAAG ATAGTTGACG ATGTTCAGGG TGTTCCTTTT GAAAACTCGA
CAGCATCATC ATCGCAACTC AATTTATTTT CACGAAACAA AGCGCTTGTC TTTCTTATCC
TAATTGTACT CGTTGGTGTT TTTGCCAGCA GTCTCTTTCT GGTTCTTGGT GTTCGATCCG
CCAAAGAAGA AGTGGAAGAC AACTTTGTTC GGCAAGCGTC TGATGTGGTT CAACAAACCG
AACGAGTTTG GGAGGATTAC CAAACACTCG CCATGTGGAT GCATCAGGCA TGCTACGAAC
GGCAAATCAC ACGTCCCAAG TTTCGTGAAA TCCAAACGTA TATGAACGCT ACAGGTTTGG
AATTTCAGGC TGTGTGTTGC GCACACAACA TCAGCTCACC GACGGAGAGG GCTGCGGCCG
AAGCGGAGGC CAGCGCTTAT TATTCGGAGA ACCACCCTGA CATGGTTTAC GAGGGTATTA
CGGGTCTGGA ACCGGATCCA GAAACGGGGC TTTTTTCACC ACGCTACCGC TCGCAACAAC
CATTTTATTT TCCTGTTCGA TTCATCGAAC CGCTGGAAGG GAATGAGGCT GCTGTGGACT
TTGATCTCTA CTCCGCTGAG GGTCGAGCGA GAACGATTGA TGCCGCAATA TCCACCGGAA
AATCGGCTCT GACGCCACGA CTTCGTCTCG TGCAAGAGTC CGATCCGAGT GCCTACGGTG
TCATACTCAT GCATCCCGGC ATTCCAGTCT CCCCGGAATT CAAATGGCCT TTTGATCTGG
CGACAGTGGT GATTCGCATT CCGTCTCTTT TGAGCCGAGC AATTCGGTTT GAGTCGCAGA
GCATCGCCGT CTACCTGTTC GACTCAACCG CAAAAAACTC CGATCCCGAG TTTATGGGAG
GGGCTGCAGT AAATATGGTC GAAAATAAAC CGTCAGTGTC TTTTCTCGCG GAAACATCAC
TAGCGGATCA ACGGAAACGG GGCCGTACCT ATGAAGACAC GATTGAGATG GTTTCCAATC
AGTGGACAAT GATCGTCATT CCCCTGGATG GAATGTACGA AGAGAATTAT ACCTTTGTCA
TTATTGGAAC CTTAATGATT TTTCTTTCGT GTGCACTGAT CGGGTTTTTG TCTGTGCATC
AATTCGCCGC GTGGCGCATA TGAGTAAGGT CAAGTCAGAG ACTGAAGCTG AAAAGGCTGC
TCTTCTGGTT GATAACGCCA AACAGTCTGC CAAAGCCGAG CGAGAACTTA ACGACTTTAT
CGCGCACGAA GTCAGAAATC CTTTGTCAGC CGCCATGTCA GCCCTGAGCT TTGTTTCGAT
GGAAGTGGAT ACCGATCCTC CTTTAGCTTC TGAGGAAGCT CGACAATCGG TGCAAGAAGA
CCTCGATATC ATTAAAGGAA GTCTGCACTT CATCAACGAC TTACTCAGAA GTATGCTAGA
TATGCATCGT GCGGCTAGTA ACCAATTAGT GATAGCGATG TCTCCAACCG ATATCAAGAA
AGATGTCTTT GAACCTGTGG CTGCAATGAT TTACAATCGT AGAGAAAACT TTGAGGTTCT
CATCGATTGT GCGGACAATT TTACGATTTT GGCGGATCGT TTGAGATTAA CTCAAGTCAT
TCTGAATTTG GCAAGAAACT CGGCAAAATT TGTCACTACG GGGTACGTTC GGCTACGAGC
AGGAGCTGTT GGCGAAGATA GAATTGTGAT TTCAGTGGAA GATTCCGGTC CAGGTATTCC
AAGTGAGAAG CGAGGCATCC TATTTTCCAA GTTTCAGGAG AGTCTCGATT CCTTAAATCA
GGGTACTGGT ATAGGTCTTT GTCTGTGCAA ACACTTGACT GAACTTATGA AGGGAGACTT
GGCATTGGAC GAGACCTTTG ATAGCGGTAT TCCACACTGT CCTGGAACAC GCTTTGAGGT
GAACCTTCGT ACGTCGATTG TAGATCTTGA TTCTAATTCA CTTGACTTAT ACGAGAAAAC
GTCAAGGGAG GGAAGCCGCT TGCTTACCAA CAGTTTGCGA ACAGCCACTA CGTCAACGAA
ATCACAGTCG ATGTCAAATT TCGTTCCTCA CAATGAAGCA GTCGCGACCT CCATTCACCA
GCTTCCTCAG ACCATATCTG TTCTATTTGT CGACGATAAT CTTGTCCTTC GAAAGTTATT
TTCGAGAGCC ATTAAAAGAG TAGAACCTGG TTGGATTGTA CAGGACGCCG CAAGCGGAGA
AGCAGCACTG ACTATGGTAG AAAGCGACAC TTTTGATCTT ATTTTTGTTG ATCAATATAT
GGCAAGCATG GAGAAGCAGC TTTTGGGAAC GGAGACAGTC GCAGCATTAC GCAGTAAAGG
AGTAAAATCA AGAATATGTG GCCTTTCGGC TAACGACGTT GAGGTACAGT TTGTCACCGC
AGGGGCAGAC TTTTTCCTGC TCAAGCCCAT ATCGTCGGAC AAAGATATTT TGACTGCTGA
CTTACATCAT ATCCTATACG GGGCAAGGCA GTGGAAAGGC GAGGCAGGCT CTTCAAATGG
AGACAGTGAT ATCGGAACTG CCTCAACAAA GACACCAGGC TCTGTCCTTG GAAGCGACGA
CATGGTGTGA TCCATTTTTC ATTTGGATTC TATCGCAGTA ACGAAAGATT GTTCGTTGCG
AAAAGCATGC ATGCGATTAT GCACACTAGC GTTGGTGTCT CCTTGGGGAG ACTTTACAGA
TGACATTTAA TCGGAAATGA TAACTTCACA ATCAGAAAAC ATCTCGGTTG AGATTGCCGA
AAAATCCCAA TCCGGCCAAT CTTTTTTTGA ATTTCTTTGG CTGGATTGTG TCGACCCGTA
GAGTCCTTGA CATCTTATCA CCGAATTTAG AGGTATTAAC TGTAACTTCA CAGTCAATAG
TCACGCTGGA AAACGGTCAA ATTGGAAAGC TACGAAAAAC TTGATTTTCT TTTCTGACGT
TAACAGTAAA ACAGAGTTTT ATGGATTTTT CGCTCGCGTT CTGATTCGAT GTATTTCGCT
TGAAAGTGAG TCCATGCTTG TTGGCTCGGT ATTTGCAGTT AGTCAATTTG ATGCTCACGC
CACCTGTGTC GTCCCATTGT GTCCATATTC ATCCCTTTTC GACGCGCTAG TGCGCGGTAT
AGGCGCAATT TTCACAGCTA TTAACCAAAC ATGAGTCACA CAGATAACAA TGAGAGCTTG
ACGGAAGCCG AACGAAACCT TTCTTCACCA GGTAAAGCCC CTCTGTACAA GTTTGTCATG
ACGGGAGGTC CTTGCGGAGG TAAAACAACG GCTCTTGCTC GAGTTTTCAA CTTTTTGCGC
GAACGAGGAT TCGAAGTGAT TACCTGTCCC GAAGCCTACA CATTGTTGAT GTCTAACGGG
ATGTCGGTAG ATTTCTTCTC GACACCCGGA ATGGGACGGA TCATCCAAAG TACTGTGCTG
GATGTGCAAC TTAATCTAGA AGACAATGTA GCTCGAGTTT TGAAAGCGCG TGGGAAACCT
GGCATCATCC TATGCGACCG AGGATCGATG GATGGTACGG TGTACGTGAC TAAGGAAGAG
TTCCAAAAGG TTATGCAGGA ACGCGACACG GATGTTGTGC AGTTGCGCGA TAATCGCTAC
GACGCTATTT TCCATCTCGT TACGGCAGCA GACGGCGCGG AACACGCCTA CACGTTGGAC
AATAACAAAG TGCGCACCGA AAATGTGGAA GAAGCGATCG AAGTGGATCG CAGGACTCAG
AAGGCATGGG TAGGACATCC TCATCTGTAC GTGCTTGACA ACGCGACTGA CTTTGAAGGC
AAAATGAACC GCTTGATTGA TGTGATTAGT AATCTGGTCG GTCTACCATC TAATCTCAAG
CGACGATCGG CCAAGTTCTT ACTAAAATCC ATGCCAGACA CCTATTCATT CCCGCCCGAT
ATTGATCATC AAACCTTCGA AGTGGAAAAG GTTTACGTAC AACAAACTGG CCAAAAGTAC
GATTATGCCT TTGTTCGTCG CCGCAGCAAC GTAGACGCGG ACAATAACTT GTTGGGTAGT
GTCTATCAGC TCACGACGGT TCAACGTTTC GAAGAAGAAG TTATCGAACA GAAGCGTATA
ATCAGTCAAC GTGAGTATGC TGCATTTTAC ATGACACGGT GTCCAGAACG ACACGTGGTT
CGTCAAAAGC GAATTAGTTT CATTTACAAG CAACAGAGCT TTGTGATACA CATTTACGAA
GAGCCAGTCT CGGACATATG TATTCTGCAC GCGCAAGTCG AAGCGTCCAA GGAAAAGGTG
GACTTACCAC CATTCATTGA CGTAAACAGA ATACTGCTCA ATAGCAAACC GGATGAGGAG
AAATACGGGG CATTCAGCCT GTCGCTTATA AACGGTTGTG GCATGTAG
 
Protein sequence
MSRSDLTVTD LQAVKLSRLL EVQNTTSLSV SPPSFPNKQL SQAMIRRRGQ SVVFEDVGGI 
VHKTAQEKAV ASASKRGSDG GGKIVDDVQG VPFENSTASS SQLNLFSRNK ALVFLILIVL
VGVFASSLFL VLGVRSAKEE VEDNFVRQAS DVVQQTERVW EDYQTLAMWM HQACYERQIT
RPKFREIQTY MNATGLEFQA VCCAHNISSP TERAAAEAEA SAYYSENHPD MVYEGITGLE
PDPETGLFSP RYRSQQPFYF PVRFIEPLEG NEAAVDFDLY SAEGRARTID AAISTGKSAL
TPRLRLVQES DPSAYGVILM HPGIPVSPEF KWPFDLATVV IRIPSLLSRA IRFESQSIAV
YLFDSTAKNS DPEFMGGAAV NMVENKPSVS FLAETSLADQ RKRGRTYEDT IEMVSNQWTM
IVIPLDGIKV KSETEAEKAA LLVDNAKQSA KAERELNDFI AHEVRNPLSA AMSALSFVSM
EVDTDPPLAS EEARQSVQED LDIIKGSLHF INDLLRSMLD MHRAASNQLV IAMSPTDIKK
DVFEPVAAMI YNRRENFEVL IDCADNFTIL ADRLRLTQVI LNLARNSAKF VTTGYVRLRA
GAVGEDRIVI SVEDSGPGIP SEKRGILFSK FQESLDSLNQ GTGIGLCLCK HLTELMKGDL
ALDETFDSGI PHCPGTRFEV NLRTSIVDLD SNSLDLYEKT SREGSRLLTN SLRTATTSTK
SQSMSNFVPH NEAVATSIHQ LPQTISVLFV DDNLVLRKLF SRAIKRVEPG WIVQDAASGE
AALTMVESDT FDLIFVDQYM ASMEKQLLGT ETVAALRSKG VKSRICGLSA NDVEVQFVTA
GADFFLLKPI SSDKDILTAD LHHILYGARQ WKGEAGSSNG DINSHAGKRS NWKATKNLIF
FSDVNSKTEF YGFFARVLIR CISLENNNES LTEAERNLSS PGKAPLYKFV MTGGPCGGKT
TALARVFNFL RERGFEVITC PEAYTLLMSN GMSVDFFSTP GMGRIIQSTV LDVQLNLEDN
VARVLKARGK PGIILCDRGS MDGTVYVTKE EFQKVMQERD TDVVQLRDNR YDAIFHLVTA
ADGAEHAYTL DNNKVRTENV EEAIEVDRRT QKAWVGHPHL YVLDNATDFE GKMNRLIDVI
SNLVGLPSNL KRRSAKFLLK SMPDTYSFPP DIDHQTFEVE KVYVQQTGQK YDYAFVRRRS
NVDADNNLLG SVYQLTTVQR FEEEVIEQKR IISQQRHVVR QKRISFIYKQ QSFVIHIYEE
PVSDICILHA QVEASKEKVD LPPFIDVNRI LLNSKPDEEK YGAFSLSLIN GCGM