Gene PHATRDRAFT_50246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50246 
Symbol 
ID7199020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp91559 
End bp94030 
Gene Length2472 bp 
Protein Length823 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185123 
Protein GI219129916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAC GCAAGCAGCA TCAACAGTTA CCGAAAATAG CAGATCACGT ATCTGAATCG 
GAGGATGAGG AAATCGAGGA GGACGAAGCG TTCAATTCAG AAGATGAACG CAAATACGGC
GGATTCTTCG AACGAGGTTT AGCACCAGAA TCTTCCAAGA CAGCAACAGT CGATAGTGAC
GCCGAATCGG AAGAAGATGA GGATAGCGAC AATATAGCAG ATAGAAACGG GTCGGAGGAA
GGCGATGGAG GTCAATATAT GCTTGATATG CTCGATATAC TTGGTGAAGA CAGCTCCAAA
AAGAACTCGA GAGAAATCAA AACACCTCAA ATGGCAATCA GTGTCAAAGA ATCCGAGTTT
TCTGCATCGG TCTTGCCATC GGCAAACCTT ACGCTAGACT CACTGATGAA GGGATTGGAG
GACACCAAAG GATTCGGAGT GGTTCAAAAA ACCATGAAGA AAATTGCGCA GGGCCATGCC
ACGGCGGCAC CGGTTGCCCG TGTTGTTTCC GAGCGTGCGC AACGCAAGGT TCACTATGAG
CAGCAAACGA AAGAAGTTGA TAAATGGATT GACGCGGTGC AGGAAAATCG ACAGGCAGAG
ACTCTAGATT TTCGCCCGAA AGAACGATTA GAGATTTCCC GTGACGTCTT GGTGGACAAG
TTTGTGCCGA CGACCGATTT CGAAAAGCAA CTTCACGAAG CGTTACAAGA AGCCGGGCAA
CTAGACGAAG AAGATATGCT CAGGGCGGAA GAACGGGCTC TACAGGATGA CCTTGGTGCG
AATGAGATTA CCATGGAAGA ATACAAGCAG AGAAGAGGGC AACTCGCCAA GATGCGTGCT
CTCATGTTCT ATCACGAACA AAAGCGCCAC CATATGAACA AGATAAAATC GAAGAAATAT
CGTCGAATTC GGAAAAAGCA ACGCCTTCGC GGGAAAGAAG GCGAACTAGA AGCCGAAATG
GAGGAGAACC CTGATCTTGT CCGAGAGCTT CAGGAGAAAG AAGAAGTTGA CCGAATGAAG
GAACGAATGA CGCTCGCTCA CAAAAATACA AGCAAATGGG CGAGGCGGAT CTTGAAGCGA
GGCAAAAACG TTGATGTTGA TACTAGACGA GCCTTGTCCG CACAGAATAA ACGCGGAGAC
GAACTTTTAA AGAAAATGTA TTCAGGATCA GGCGAGGAAG ACGGAGATGA CTCAGACAGC
GAAGATCTCA TCGAAGCGGC TAGGAAAGTT CTGCAAGATA CAGAAGAAGA AGAAGTTGCA
GGGTCTTCTA AAGGCAAAGG GCTCCTGAAC TTATCCTTTA TGCAACGGGG AATTGAAAAG
CAGCGGGAAA AAGCCAAAGA AGAGGCTCGT CAGCTTTTGC TCGAATTGGA GGCAAACGAG
CGTATCGAAA CAAGCGACAA TGATGGTGAC ACTAATATGA ACTCAAAAAA GAAGAAGAGA
GTCGCCGGCG CTGCTGAGAT GAAGGCTGTA CTCAAGGAGG GAGCGCTTGT TGTTTCTTCC
CTTCAAACTG GCGGTTCAGC TAGTGTAGCC ATGAGTGGTG GCATAGACAT CAATTCTGAC
TTCGCAGATC AGAATGAAGC AAAGATGTCA AGCTACGCCA GTGAACATAC TGCGGCCCTC
TCATTGGGAA ATTCGCCTAA ATACATTCAG CCAAGGCAAC TTGTGAAGCC GATGGAGAAA
AAGGGCTCAA ACACACAGGA TCTCTGCCCA CAACCCGATA ACGAAGTAAA TCCCTGGCTA
CTTTTGAAAT CACAGGGAAA CGAAGTCTCA GATACTGCTA GCATGACATC CAGACCGGGA
ATAGGTGCCA AACTATCGTT ATCAAATCAA GCGTTGGTGA TTGACCCTGA GAAAGCGGTT
TATATGATGG AACAGAAAGG AGACACGGAG CTTTCTGTAA ATAAGATATT CACGAACGAT
GTTGTGACCT CGACGGAGAA GAAAATAACT ATGCTCACAC AAGAAGAATT GGTGAGAAAG
GCGTTTGCGG CTCCGTCGGA CAAGGAAATT GAAGAAGAAT TTGCAAACGA AAAAGATGCC
ATTCAGGACT CTGAAGACCC TACTCGCACA AGAAAGAAAG ATAAGCTTTC GAATACAGTG
TCGGGATGGG GTTCTTGGAC TGGGAAGGGA GCCCCTCCAC CTAAGCCTCC GAAAAAGATT
CCAAGGCACT TGTTGCCTCC TGAACAGAAG CTTTCGAAAA GAAAACGTGA AGATGCTACG
AAGCCAAATG TGATCATCAG CGAAAAGCGG ATAAGGAGAA CCGCCGACAA GTTTATGATA
TCACAGATTC CGTATCCGTA CACTTCGCGT GAGGAGTACG AACGAGCCAT GGTTGGGGGG
TTAGGAAGGG AGTGGAATGT TACAAGCAGC ATGAAAGACA TGACACGTCC AGAAATCATG
ACTCGATCGG GCAAAGTGAT TCAGCCAATT TCGAAGAAAG TGAAGCAAAA ACGCCCAGCT
GCAAGATTTT AG
 
Protein sequence
MGKRKQHQQL PKIADHVSES EDEEIEEDEA FNSEDERKYG GFFERGLAPE SSKTATVDSD 
AESEEDEDSD NIADRNGSEE GDGGQYMLDM LDILGEDSSK KNSREIKTPQ MAISVKESEF
SASVLPSANL TLDSLMKGLE DTKGFGVVQK TMKKIAQGHA TAAPVARVVS ERAQRKVHYE
QQTKEVDKWI DAVQENRQAE TLDFRPKERL EISRDVLVDK FVPTTDFEKQ LHEALQEAGQ
LDEEDMLRAE ERALQDDLGA NEITMEEYKQ RRGQLAKMRA LMFYHEQKRH HMNKIKSKKY
RRIRKKQRLR GKEGELEAEM EENPDLVREL QEKEEVDRMK ERMTLAHKNT SKWARRILKR
GKNVDVDTRR ALSAQNKRGD ELLKKMYSGS GEEDGDDSDS EDLIEAARKV LQDTEEEEVA
GSSKGKGLLN LSFMQRGIEK QREKAKEEAR QLLLELEANE RIETSDNDGD TNMNSKKKKR
VAGAAEMKAV LKEGALVVSS LQTGGSASVA MSGGIDINSD FADQNEAKMS SYASEHTAAL
SLGNSPKYIQ PRQLVKPMEK KGSNTQDLCP QPDNEVNPWL LLKSQGNEVS DTASMTSRPG
IGAKLSLSNQ ALVIDPEKAV YMMEQKGDTE LSVNKIFTND VVTSTEKKIT MLTQEELVRK
AFAAPSDKEI EEEFANEKDA IQDSEDPTRT RKKDKLSNTV SGWGSWTGKG APPPKPPKKI
PRHLLPPEQK LSKRKREDAT KPNVIISEKR IRRTADKFMI SQIPYPYTSR EEYERAMVGG
LGREWNVTSS MKDMTRPEIM TRSGKVIQPI SKKVKQKRPA ARF