Gene PHATRDRAFT_49932 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49932 
Symbol 
ID7198629 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp333529 
End bp337027 
Gene Length3499 bp 
Protein Length1102 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184783 
Protein GI219129200 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGCTG CGAATTCCGC CAACGAGGCC GTGGTGGAAG TCGACGACGA TGCCGTTCGC 
TTCGAGGATT TGACCGTCGA CGACGATCTG CCGATTCTAG AACGGGTCGT CCGCTACAGT
CGGTCGCAGA TTGCCCTACA GCGCCTAGTG CACGTTAAGA TGCTGGCGGA AACAGCCGAA
ATTGTTGGGT ACGTGTGCAA CGAGCATTGT AGAAAGAACG TCAGGAATGG AAAATGCTCT
AGAAAAGGGG AGAGAGGGTC TACGTTTGTA CTGTTGAAAG GGACGTGTCT TTCTGTAGAT
ATGTATTGTT GCTTGTGTTA CTCACGACGC TCTTCTCTCC CCATTGATCA AACCCTCTAT
CCTTTGAACC CGTTACAGGC AACGTTCGAC ACAAGAAGTG TTAATCCCCT TGCTGCGATC
GCTGGTCAGC GATCCGGAAA GTATCATTCG GCAACACGTT TCTACACAGC TCGTCCCCGT
CTGCATCGTT TGCATGGTCA AGAACGTCAG CAACGTCGCC GAACTCGTAC AAAACCCCGT
CTTCTCTAAA GATTACGACG AAAAGGGTTA CACTCTCGTT ACCACAACCG TGCTCGGACA
TATAAACACG CTCCTGGAGG ATTTTGATCT GGATGTCCGT AGAGCGGCCG CCGACGCTCT
TTCGGGACTC GCTTTGCAAA TTCGACCCGC TGATGTTCCC CAGGCCTTGC TCCAAATCCC
TCTCGCGCTC GCCGCAAAGT TACCGAAAAA TCCACACGCC AAGAAAAAGA CCGAAGCGGA
TCAGCACGTT GAGGAATTGC GCATCACGGC GGGTAATCTG TTGGCCGAAC TTGGTGGTGC
CGCCTCGGAA CACTCCACTA CCTTACTCGC ATCGTCCACG TACGTTTCTG GATTGATTCT
CCCAGCCGTC CTGAAATTGT GTGACGACGT TTCCTTCCGT GTCCGACGAA GTGCGGCCCA
AGCCTTGCCA CGCATTCTCG GAGCATGCTC GTTGAACGAT GTGGAGGAGA CTATTCTACC
GGCGTTCGAT CAATTGAGTC GAGACGACTT GTATCGCGTT CGGAAATCCA CAGGCGAGTG
TCTGGTGGAT ATGAGTAGAT CCATGATGTT ACTGGCGGCG AGTAACAAAA AGGCTGAAAG
AACGCTTTAC AAATTAAGAC GCGAGACACT AATTCCTATT GCCGATCGCT TGATTCAAGA
TTCGCATAAA ATGGTCCGTC AAGGTATGAT GCAGTTTCTG GGACCCTTCA TGGCTAGCTT
TTATCCATAC CAGACGTCTG CTCTGCGCGA TTTATTGCCC GGCACTGTCG AATCGGACGG
CAGCAATCAC ATGGGGATCG TGGCCCAATT CTTTCCGCAC GCCACATCCA TGGTGTCCCG
TCTCAACTCG GCACAAAACA TTAGCATGTC GGCACCTACC CCGGTGAACG TACATTTGGA
CGAAATTTTA CACCGTGTTC TATCCGAGAT GGATGTTTTG CACCAGGCAC TGCCGGCATT
TTTGCAAGCC TCCCGCATGT CGGCGCTGTC ACTAGCCGCC GTCGCGACCC ACCGGAATCG
TAACTTACCG GACTCGGAAG ACGTTGACGT ACTGATAGAC AAACTATTGG ATTACTTTGC
CGCACTCGCA ATTGTATCGA CGGGCGACGA AAACACCGAC GCGGAAATGC GTGTATACTG
TGCGTATAGC TTTCCCGCTC TTATATTGCT ACTGGGAGCG GACAATTGGG AGGGGGCGAT
GCGTACATGC TTTTTTACAC TCATGAACCC GAACTATGCC AAAACTCAAC AACCGGAAGA
AAAGCAGTCC GACCCGGCGG AAAATCTGAA CGTCGCCGAG CCACCCCTGC CAGTGAAACG
CTGTTTGGCG AGTAGTTTAC ACACGGTTGC CAACATCCTG GGACCCGAGC TGGCCGCCTC
CGATATTGTA CCTGTTCTGC AGGACTTCTT TTTGAAAGAT CCAGATGAAT CCGTACGACT
GAACGTCATT CGCAACTTTC CAGCATTGTT GCAAGTGCTC TCCCCTTCTG ATCGCAAGGG
CCCCTTTCTC ATGTGGAGCG AAATTGTCCG TGGTGAAGAG TTGCTGGGTA TCAAGAAGCG
CAGTGCACAC AATCCGGTTG TGCTTAACTG GCGACAGCGT GATTATTTGG CGCGATCTCT
GCCGGATCTG ATTGGTTTGG TGGAGCCGTC TCTGGTGCAC GAACATATCT GGCCCATTAT
GAAAACGCTC TTAACCGATG CGGTCTCTAT AGTCCGAGAT GACGCCATTT GGTCGATTGC
CATGATCCTT AAAGCTTATT GCTTGGAGTC ATTGCAGGCA TGGCCGAATG TGTCCAATTT
TCGAGCTTTC GGTGCTCAAT CGTGTGCGGA AGTCATTGAT TGGTTGAAAG AAAGTATTCT
CAAGCTGGGA GTACCTAGAG AAGCTCGTAT AAAACCCGTC AACTTTTCGG AGCGCCAGCT
ATACTGTCGA ATTTGCGCCA CGCTAGGATT GGCCCTGCGC TTTAGTGAAC AGATTGAAGG
GGACAAACAA GACCCAGTCT CGGTGTTGAG TGGCAAGTTT AAGACGTTTT TCTTCCCAAA
GTCGAAAAAT CTGCAAGACG ACCTCCCAGG GCCGTATCAA GCCATGACCA AGTCGGAACA
GAAACACCTT CGACGCCTTC TGCTGAACGA GCTATTGCCT CCAGCATTGG AAATGAAAGA
AGATCGGATA TCTAACGTCC GCGTGTCTTT AATGAAAACT CTGCAACTCA TGCCAGCCGA
AATTCGTGCA ACGCCTCTTG TCCAACCAGT TCTTCAGGGT TTGGTCGAGG AAGTGGAGAC
ATGGGAAAAT TTTGCAATTT CTGATCAGCC AGTGCCCAAC CCGCTAAAGC AATCCGCATC
ACAGGCTGCG CTGTATCAGC CCGCGCAACG CTCACTTGCC AGTGGTGCTG TGTCGCCACG
AGCGCAACAG AGTGTCCCGG TGGATTTAGA TGCGGAGATG CCCGACGACC GACGGTCGTC
TTCGGACGAC TCGAGTGCAT CAGCAGAGGA CGTTGCCGAA TCGTCAGATT GGAAGACGGT
AGTGTTTCAA GCTGGACCGA TCGGCATGCA GCTGGAACCC ACTGCCGATG ATCGTGCTTG
TCGGGTTTAT GGTTTTTTGG ATTCGGGTGA TGGAAAGCCG TCACCGGCGC GGCATTCGGG
CAAGATTGAG TTGGGCGACG TCATCGTCAA GGTGAACGGT AAAGATGTAC ATTCCTACGA
CGACACGATT GCGGTTCTCA AAGCAGGTGG CCGTCGGGAA ATCACTTTTC GACAGGGAAC
AGCCGACGAC GATTACGACG ACGACGAAGA AGAAGAGAGT GTAGGAGGAT TTTCGTCTAC
CGACGACGAA ACCGATCGAA AAGAACGGGA ACGCAAGGCA AAGAAGGAAG CCAAGAAGGC
AAAAAAGGCC AAGAAGGAAA CAAAGAAGAA AAGCCGTGAC AAGAAAAAGG ACAAACGGAA
AAAGGAGGAA ACAGGATGA
 
Protein sequence
MEAANSANEA VVEVDDDAVR FEDLTVDDDL PILERVVRYS RSQIALQRLV HVKMLAETAE 
IVGQRSTQEV LIPLLRSLVS DPESIIRQHV STQLVPVCIV CMVKNVSNVA ELVQNPVFSK
DYDEKGYTLV TTTVLGHINT LLEDFDLDVR RAAADALSGL ALQIRPADVP QALLQIPLAL
AAKLPKNPHA KKKTEADQHV EELRITAGNL LAELGGAASE HSTTLLASST YVSGLILPAV
LKLCDDVSFR VRRSAAQALP RILGACSLND VEETILPAFD QLSRDDLYRV RKSTGECLVD
MSRSMMLLAA SNKKAERTLY KLRRETLIPI ADRLIQDSHK MVRQGMMQFL GPFMASFYPY
QTSALRDLLP GTVESDGSNH MGIVAQFFPH ATSMVSRLNS AQNISMSAPT PVNVHLDEIL
HRVLSEMDVL HQALPAFLQA SRMSALSLAA VATHRNRNLP DSEDVDVLID KLLDYFAALA
IVSTGDENTD AEMRVYCAYS FPALILLLGA DNWEGAMRTC FFTLMNPNYA KTQQPEEKQS
DPAENLNVAE PPLPVKRCLA SSLHTVANIL GPELAASDIV PVLQDFFLKD PDESVRLNVI
RNFPALLQVL SPSDRKGPFL MWSEIVRGEE LLGIKKRSAH NPVVLNWRQR DYLARSLPDL
IGLVEPSLVH EHIWPIMKTL LTDAVSIVRD DAIWSIAMIL KAYCLESLQA WPNVSNFRAF
GAQSCAEVID WLKESILKLG VPREARIKPV NFSERQLYCR ICATLGLALR FSEQIEGDKQ
DPVSVLSGKF KTFFFPKSKN LQDDLPGPYQ AMTKSEQKHL RRLLLNELLP PALEMKEDRI
SNVRVSLMKT LQLMPAEIRA TPLVQPVLQG LVEEVETWEN FAISDQPVPN PLKQSASQAA
LYQPAQRSLA SGAVSPRAQQ SVPVDLDAEM PDDRRSSSDD SSASAEDVAE SSDWKTVVFQ
AGPIGMQLEP TADDRACRVY GFLDSGDGKP SPARHSGKIE LGDVIVKVNG KDVHSYDDTI
AVLKAGGRRE ITFRQGTADD DYDDDEEEES VGGFSSTDDE TDRKERERKA KKEAKKAKKA
KKETKKKSRD KKKDKRKKEE TG