Gene PHATRDRAFT_49924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49924 
Symbol 
ID7198535 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp311504 
End bp315142 
Gene Length3639 bp 
Protein Length564 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184777 
Protein GI219129188 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTAATGGCCG GATCGGTGTC GCCGTTGTAC ACCAGCACGC GAATTTTCCC ATTGATGCTT 
CGGTAAAAGC CGCTCAAATC TGGTTCGGTG GGGGTGTAGT CGAAATCGCC GTCGGCATTG
TCGACGGAAA AGAAAGTTGC GTTGGTCACG TGCAGGGCTT CCTTGACCAC TGGCAAGCTA
AGATAATGTT GCATGACAGC GTCGCCTCCG CAGGGGTAAT CGTTCAGTGC TCCTCCGAGA
AGACCACCGC TGTACGTGCA TTCATCGTAC AAGGAATATC CGTAGTAGCC ACCTACTTGA
CGATCTACCT TGGCGAGGGC GGCTTGACAT ATATCGCTGT CCTTGAGAAA GCGATAATTC
TCCTCATTGC TCGAAAAATC GTCCGCTGAG GACGCTGACT GAGGCGACAA AGGATGTTTG
TGTGGACGGC AAGCGTGCAT GACTTCCTGA AAGGTTGCCA TGGGCATTTG ACCGTGACCC
GCGAGAAATA AAATATTCCA AATATCAATC GCATCGGGAT TGGAAAGATC ACCACAGATG
GAAGTTTCGG TTCCGAGACA TCCGTCACCG ACGGCCAATC CTTTCAACGG AATGTGATTT
TCCGGATCTT CCAAAATTCG CCGTGCGAGC GTGGGAATGT AGATGCCAGC ATAGGATTCT
CCGGTCAGAT AGAGCTCGTT GGTACCGTAG CATGGGAATT TTTCGTGAAA GGCCAAGAGC
GCCAGATGAG CGTTTTCCGC CGCGAGTTCG TCCGTCCAGG CCAATCCAGC GCAGGAATGC
GAATCAACGT CTTCGTTGCA GTAGCTGAAT CCTACCGGAG CGGGTTGATC GATAATCAAA
ACATGTCCCA GCCTTGTCCA CGCGCACGGG TTGTAAATGG GTGTAGGAAT TCCCGTGGCT
TCGTACTCGG CAGTGTGGAG TGATTCGTCG CTGAAGAGCA ACGGCCCTAG TTCCGTCAAG
AGTCCGAAGA GAGAGCTCGC ACCGGGACCA CCATTGCTCC AGTAAATTAA AGGCTTGTTC
TTATGGAAGT CTTCTTCAGC CTCGACGAGG ACGTAATGTG TGTGGACCGT GCGGCCCTCG
AGTTCATACT CGACAAAGCC GGAATACCAC GGTGACGGCA AGGGCTGATT GAACCCCGGC
AGACGCGAGA CAAAATCGGG ATTGGAGGAG CCCAAGCACG CCAACGCGAA CGCCAATTTG
GACACGAAGA AGGCTACAAA TGTTGATGAA CGAAATTGCA TCATGCTCGG TTTGAATGAC
TGTGAGTTGG ATCTTACGGA ACTGCAGAAA GAGAATTCGA TGCAAGCGGT CGATGGAGTT
TGCGTTTTGC ACACCACGAC GAGGCAGTCA CAGTCACAGT CAAATCGGAT TTGACGGTTT
CTTTAGAACC CGGATATAAG AGGGATCCGT CCGTGACGGA TAGACAGACG GAAAAGATCG
ACTCGCTGTG TTTGACAGCG CAAGTGACTT GTCTTCTCGA CCCAAAACGG GAAGCACGGT
TTGAAATTCG AACAAATTCC GCGGATCTTT CCGAATTTCG GGTTGGAGCG AACAGCTTCT
CGGAACGTCG CGCATCGCTC GAGGAGGTAC CATTGGATGG TCCCCGGTGC CTTCGGGTTG
ACACTGACAA CCGACGTCCG TGCTCTTGCG TTCCTTCCTT ACAGTCAGCA CAAGCTTCCC
CATTTCCGTG AGCGTGAGGA TTGCCCCACA GGCAAAAGAC TAGCCATTCA CTTTTCAGGG
TTGGGAGCCG TATCATGCCG GATTCTAGCC AACGTGACGA CGAGGCTTGT ATGCGGGAGG
CGATTGCCGA AGCGGCGGCG GCGACATCCG AAGGGAAAAT GCCCTTTGGG GCCGTCTTGG
CTATCGACAG TGTCATCGTC GCACGAGCTC ACAATCAGTG TCCGGCGGCT GCCAAACGAG
GGGGTGGAAC GGGCGACGTC ACCCGACACG CCGAAATGGA ACTCGTTCGA CTCTTCACCA
GCAAACTCAC CGCGGAAGAA CGATCCAACG CCGTCCTCTA CACCAGTACC GAGCCGTGTG
AGTGCAATAC AGACGTGTCA AATGTGTCAG ATTAACGGCG TGACTAATGT TATATATTTA
CTTTCTTGTC TTGCTCACGT CGGGGTCTTT GGATATCATG TTAGGTGTCA TGTGTGCGGG
AGCGATTTAC TGGAGCGGTG TTTCCAAGGT TGTATACGGA TGTTCGGCGC GACAGTTAGA
GGCCTTGAGC GGTCCGGGCG GCTTTGACAT ACCCGTCGAC ACGCTCTACG GAATGGCGTC
GAAGGGAGCG CGACGAATGG AATGTCTTGG TCCCTTGCTG GCGGAAGAGT CCCTACAGGT
TCATGTCGAT TCCGGTGTCT GGAAGAATGC ACCGGTCCCT ACTACTGCGG AATTTCCCCC
CGTCACCCAG GCGGATCTAG ATATTGCGGT CGAGGCCGCC CTACTGAAAA GCGGCTTGGG
GTCCGCCAAG GTCGTCGACG ACGGTGTCGT GCCGGTCATT GATCTCTCCG TTGGCACGGA
CGAGCAAGTC GCCGAGAAAC TGTGGCAAGC GGCCATGGAG GTTGGTTTCT TCTGCGTGGT
TGGTCACGGA ATCGACCAAT CCATTATTGA CGGAGCCTTT GGCGCCTCGG AAACTTTCTT
TGCGCAGCCG CTCGAAGACA AGAAAGCACA ATCACCTCTG GATATGAGTA TCAACTCGGG
TTTCGAGTAC TTTGCCCAGG TCCGCCCGAG TACCGGCGTA GCGGACCAAA AGGAATCACT
ACAAATCACG GCCCGCCAAG GCTGTATGGA CGATCGCTGG CCGTCGGACG AATTCCACAA
GAGTGCCGAT GCATTGCTCG AAGCATCGCA CCAGTTGGCC AAGCGAATTT TGAACCTGTT
GCAACCCCAA GCGATTCCCC ATGTGGAACC GGAAACGCTG GCGAATTCGC ACACGCTTTG
GGAAGAAGAC GGGCAGTGTA CCTTACGATT CTTGCATTAT CCCCCACTGG ATAGTGACAC
CACGGCCAAG CTCATTGACG ATGGCTATTG GCGGGCGGGA CCGCACACCG ATTGGGACAA
CGTAACGTTA CTCTATCAAC AAATGGGACA GAATGGTTTG GAATGCTGTG CCAATCCACG
GACAGGCGAC CCCGCTTCCA TGTACTGGAC AGCCGTGAAT CCAGTGGAAG GAGGGATCGC
CATCAACGTG GGTGATATGC TGGCACGTTG GAGCGACGGC AAGCTCTTCA GCAACCTGCA
CCGCGTACGT CTACCACCCG ATGCGTCCAA ATCGAGATAT TCCATCGCAT TTTTCGCCCA
GTCCGACAAG AAGGCTCTGA TTGAAAGCAA GGAGTCGGAA CCGATTACGG CTGGCGACTA
CATACTTTCG CGAATTCGTA GCAACTTCGA TAAGAAGTAG ACTTCCCACC GTTAAGATCG
CTTTCGTTTG TTGAATATAT ATTTTTATGA TTTTCTCATA CCAAGCTCCT TCCTTACAGT
CGAACAACAC CTATGTTGAT CGCCATTGCT GCAAAGTATA GGAGTCTGAT TGGTTTGGAA
GCCCAGCGCC ACAACTTCGA ACTCAAAAAG GAATTGCCGT CCGAGGGGCT GTATTATAAA
TTGATGAGTC CGGAAAACAT GTTTCGAAGC TTTGCATAG
 
Protein sequence
MPDSSQRDDE ACMREAIAEA AAATSEGKMP FGAVLAIDSV IVARAHNQCP AAAKRGGGTG 
DVTRHAEMEL VRLFTSKLTA EERSNAVLYT STEPCVMCAG AIYWSGVSKV VYGCSARQLE
ALSGPGGFDI PVDTLYGMAS KGARRMECLG PLLAEESLQV HVDSGVWKNA PVPTTAEFPP
VTQADLDIAV EAALLKSGLG SAKVVDDGVV PVIDLSVGTD EQVAEKLWQA AMEVGFFCVV
GHGIDQSIID GAFGASETFF AQPLEDKKAQ SPLDMSINSG FEYFAQVRPS TGVADQKESL
QITARQGCMD DRWPSDEFHK SADALLEASH QLAKRILNLL QPQAIPHVEP ETLANSHTLW
EEDGQCTLRF LHYPPLDSDT TAKLIDDGYW RAGPHTDWDN VTLLYQQMGQ NGLECCANPR
TGDPASMYWT AVNPVEGGIA INVGDMLARW SDGKLFSNLH RVRLPPDASK SRYSIAFFAQ
SDKKALIESK ESEPITAGDY ILSRIRSNFD KNRTTPMLIA IAAKYRSLIG LEAQRHNFEL
KKELPSEGLY YKLMSPENMF RSFA