Gene PHATRDRAFT_49122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49122 
Symbol 
ID7195203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp659220 
End bp662396 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183649 
Protein GI219126825 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0406753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACGG TCAAGCCGAG CGAAATGAAA AGGCGCGAAG CCGAAAAGAA GCACGTGTAC 
TCGGCATCCG GAGCACCGCT GGGTACGATT TCTTCCCCCC CACGCACGAC CGTCTCCACG
CCGCCCTTGG GATTGAAAAC GACCATTGTC TTCTCCGGTA CCGGTGTCCC GATCGATACG
CGTCTCGAAT CACCCCCCGA GCTACCACCG GCCAAAACTG GGACGAGTCC CAACACGCGA
CCCCCGTTGC CACGCAAAAG ATCCAAACGG AGTCAGCGGA GTAAAGGCAA AAGCCGAGAC
AAGAATACTT CCTCCACACC CACGATTTTG TGTTGGCTGA TTCAGCGAGG CAAGTATGAG
GATGCTACGG AACGCTTGCA CGAAACTCCG CAGGAGGCTA GTATTTGGTG GGTGGAACGA
CCTCCGGGAG ACGATGCCGC TGTTTCCCGG GCCTTGCCCA TTCATTTGGC GTGTCGGAAA
CTCGCGGAAG AAACGGACGA AGCGGCTCGG GCTCGCCTCG GCGACTTTCT TTCCCACCTT
CTTTTGATTT ATCCACAAGG GGCAAGAATG CGTGACGACG GACACGTCGA CCGGACTCGG
ATCGAGGACA GCCTACTGTT TTGCAACAGT AGTAACAACA GCAACAATAA CAGCATTCGA
TCCGCCCGAC GTACAGCGAT TCCCGTCACG GGTCGATTGC CCGTGCACGA TGCCGTGGCG
GGGGGTGTCG ACGAAGAAAC ACTCGCCTTA TTCCTCACTG TATATCCGGA ATCGATCTAT
TCCGTAGATG AGCGACGATT GTCTCTGTCG GAACTCAATC GACTCGCCAC AAACGATCCG
AATATTCAAG GTGTTCTGGA TTTAGGCTAC GAAGACTGGA AGTCGGCCTA CGAATCGTCT
CCGCTATCTG GAAGGTCCGC CTCAAAAATG GATGATTTGG TAAGTTGCGC TTCCGAGGCT
CGGAACGCGG ATATCAGCGC GTCAACGCCG ATTTGCCTAT TAGTCGACGA AGGCGACGAA
CTCTTTCCGG ATAATGTCAG TGCCTTAACC ACACCTGACG AGCTTTTACC TTCCGGTAAC
AATGTGTGGG GGATAGACCG ACATGGCGAC AAGATTGAGG ACAAAGACAA AGAGGAAGTC
TCTTCCGAAC ACGACGTTGT CGAAGAGGCA AAGCACGAAA CTAAACCAGA ACAGAACTCG
CCGAGCTCCG ACCCGGCTCC TATTGTGACC TGGGAACAGA TTGAGGAACG CGCCCTCGCT
TTGGAACGAG TACTTGGCGA GATGAAGACC AAAAATTACG ACTTGCACGA GAAGATTCAA
GTTTTATCCA AAGACCAAGG GAGGGAGATC ATTTTGCGCG TAGATCGATC TCAGAAAACC
GATTTGTACG GAATGGTGGA TGTGTTACAA CACCAGAATT TCGCTCTTGA TCAAAATATT
TATAAAACGG AAACATTGCT TCACTACTCA GTTTTTCCGA GCGACGAAGA GTCGGTGGGA
CGGCAACGGC GACGAGGAGA AATCGCACGT ATGCTAGGCT GGCTGGATGT TGAAGAAAAT
GATAAAAGCA GCACTATCGA GGACAGCAGC GACGAGGAAA AGGTTAATGA CACCACGTTG
CAACAAATAT ACAATGAGCT CCACGAATCT TATGATCAGC AAAAAGCTGC CATCAAACAG
TTTGGTTTCG TCTTTGAAAA ACTCGGGATC AATCGTTTTG TGGACGAGGA CAATGCATCT
GCTGATACGG TGCCCCGGAG TGTCGTTTCC AATCTGACCG TGAACTCCGA CGACTGGAGC
TTTGGCTCCG ATCATTGTGA CGATGTCTCG CGTCCAGAGC GGTCTCGGAA TGTCTTGCGC
GAAGTAGAGA TTGAATGGCC TGAAGACAGC GTAGATGCAT GCCAAGTGAG TGACAATTTA
AGTACCATTT TCCGCCACGC TGCCGCCATC ACCGAAGAGA ATGAAGATTG CCTGATGCCA
ACATCCTCTG GGGGGCACAA TTTTGGAGTA GATAATCTAA GCCAAATCCT ACGGTCGGCA
GCTGCCAAGG AAACGAGGCG GAAAAAGGTC AAGCGTATGA AACCTGTCAG TCAAGAATTG
ACCATTCCGG CGCTCATGCC TGAAGGTGGT GCAAAGATGT TACCAGCAAT AGCGGGCAGC
CTGCAACACA CCAAGTCTAT TGGCTCTTTT CGATCGGCAT CCAAATCCTC CTTGAAACAG
AGCTTTTTTA CATCAACTAC CAAGATTTAC GAGGTGCCTT CTGTCTTGCC GGAGGCTCCT
GTTAAAAGTC CATCGTTAAA GTCTACGATT TCGGCCGAAC GCTCCTGTCA TGTAGCGTCG
GCGTGTATAT CAGACCCAAT CCGGCCTCGT AGCATCAAGT CCAGCGAGGA TCCGAAAAGA
AGTGACAGTG AAAGTAAAGG TAGCGGTGGT GGATTCCGGC GGCAACAGCG GCGCCCGAGC
TTGATGGATG GTGTGACGGC CTTGGCCGAA AAGGGGCGAC TCCATACACG CGAGCACAAT
GATCAACAAT CTCAATCTCC TTCGCATTTG CAGTCCGACG TTACAGATCT CCTGACTCCC
GATGGTGACA AAAACGGCAC GGACAATGAC CATTGTGAGC ACGACGAATT GAAGCTGTCC
GACAGAAAAC CTCCTATTGT GTCGTTTAAC ACTGTCTCTA TACGAGTGTA CGATCGTATT
CTCAGCGATA ACCCGGCAGC TGCTAGCGGC CCGAGTCTTG GTATTGGGTG GGTCTTTGTA
CCTCAAGATG TCAAATCAGT TGACGACTTT GAGATTTTGC GTGAGCCTAT GCGCGCTCCG
GAGAGGTTGC TGCTGACTCG CCAAGAGCGT GAACAGGTCT TTTTTGACCT GGGGTATACC
CAGAAGGACG TGGCTGTTAA CGTACGCGAG CTCAACAAAT TGCGATCACA GCGTCGAAGA
ACGATTGTGA ACCTTGGTTC CACAAGAGTT GAAGAAACGG TGGAAGTCGC CAAGCGGAAA
CTCAAATCAA TCTTGCGACT GAAACGTAGT AGTCCTCTGG AAACATCCGC TAAGAATTTG
CTTTCATCTA CTTCCACAAA TTCGACTACA TCGTCCTCGA AAGACAAGGA AAAGCTCACA
AGTGCCATCG AACAGAGTAC GAGCAAGCCT TTGCAGCTAG CAATCAACCC AATTTAA
 
Protein sequence
METVKPSEMK RREAEKKHVY SASGAPLGTI SSPPRTTVST PPLGLKTTIV FSGTGVPIDT 
RLESPPELPP AKTGTSPNTR PPLPRKRSKR SQRSKGKSRD KNTSSTPTIL CWLIQRGKYE
DATERLHETP QEASIWWVER PPGDDAAVSR ALPIHLACRK LAEETDEAAR ARLGDFLSHL
LLIYPQGARM RDDGHVDRTR IEDSLLFCNS SNNSNNNSIR SARRTAIPVT GRLPVHDAVA
GGVDEETLAL FLTVYPESIY SVDERRLSLS ELNRLATNDP NIQGVLDLGY EDWKSAYESS
PLSGRSASKM DDLVSCASEA RNADISASTP ICLLVDEGDE LFPDNVSALT TPDELLPSGN
NVWGIDRHGD KIEDKDKEEV SSEHDVVEEA KHETKPEQNS PSSDPAPIVT WEQIEERALA
LERVLGEMKT KNYDLHEKIQ VLSKDQGREI ILRVDRSQKT DLYGMVDVLQ HQNFALDQNI
YKTETLLHYS VFPSDEESVG RQRRRGEIAR MLGWLDVEEN DKSSTIEDSS DEEKVNDTTL
QQIYNELHES YDQQKAAIKQ FGFVFEKLGI NRFVDEDNAS ADTVPRSVVS NLTVNSDDWS
FGSDHCDDVS RPERSRNVLR EVEIEWPEDS VDACQVSDNL STIFRHAAAI TEENEDCLMP
TSSGGHNFGV DNLSQILRSA AAKETRRKKV KRMKPVSQEL TIPALMPEGG AKMLPAIAGS
LQHTKSIGSF RSASKSSLKQ SFFTSTTKIY EVPSVLPEAP VKSPSLKSTI SAERSCHVAS
ACISDPIRPR SIKSSEDPKR SDSESKGSGG GFRRQQRRPS LMDGVTALAE KGRLHTREHN
DQQSQSPSHL QSDVTDLLTP DGDKNGTDND HCEHDELKLS DRKPPIVSFN TVSIRVYDRI
LSDNPAAASG PSLGIGWVFV PQDVKSVDDF EILREPMRAP ERLLLTRQER EQVFFDLGYT
QKDVAVNVRE LNKLRSQRRR TIVNLGSTRV EETVEVAKRK LKSILRLKRS SPLETSAKNL
LSSTSTNSTT SSSKDKEKLT SAIEQSTSKP LQLAINPI