Gene PHATRDRAFT_50562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50562 
Symbol 
ID7199390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp132365 
End bp135484 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185489 
Protein GI219130684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAAGA AACGGAAAGC TTTCTCGCCT CTGCATGCGA ATACGCTGGG ATCCTCCCCT 
TCGCAACGGA GCTCCTCGAC ATCCTCATCG TTGCCGCGTT TGGACAAGGA CGACGACTTT
CTGTCGCGTC CACCGAAGAA GCCACAGCCT GTGCGGAGTA GTGATACCAA GACGCCGCGC
ATCAGCAATT GTACGAACCC CCAGGGATTG GTACGTACCA CAACCGCGGT CCCGGCTCCG
TCCACGGCAT CGTTGGGAGC CACATCCCGT ACCAGCCGTC CCTCCAAGGC GCGTTCCAAA
GTACCCCGCC AGACAAAGAG AACCTTGAAG AAACCTCCAC CACCAAGGAT TCAGCAGTCA
AACCAGCCTG CGGACCAAAG TCGAGCAGAT GTGCCCAAAC GTGAAGCACA GGGACCGGGA
GTGATCATTC CTTCGAATCC GCAAACGTCC AAAGATGACA CACAACGACA ATCGCGCCGG
TCCACCTCCA CCCTGGAAAC AATATTGGAA GAAGACTCCT TGTCTTCCAA CGGATCTTAC
GGTCCTCATG GCATATCCAC TATTAAAGCA GCTATCCAAG CGTGTTTACC GAATCCCTCT
TTACCAACAG CAAATCCAAG TCGCCCAATA CCCATCGCAT CTTCGCACCA CGTCATGGAT
CCACAGCATT GCCAAACAAA ATCGCCTCAC TTTCAAAAAC TGAACGCATC ATCGTCGGCT
CGCAACTGCT CGGATGACGA CATGCAGCTT TCCTCGGACG ATGAGACTCT GTCACTGTCG
TCTCCCGAAA CAAGCATACC AAATGTTCCC GACAGGGCTA TCCCCTACCA AAAGGGGACT
TCCCACGCTC TTCATTCTTT TGCTCGGGGC TCAATAGATC ACACAACGAG CCGTCTGAGA
GACGGTAGAG GTCTTCCAAA TGCCACCTTC CAGTCCAATG TCGGCAAGTC GGTGACCAAT
GGTTCCTCAA ACTTTGATGT TGCCACTAAA AATGGGGAGC AACACATTAC GAAAGCATTG
AAGGCTCAGG TGACGGACAA GGCAATGCAG CGCTTTTTGG ACACCCCGGT CGACAATCCG
GACGTTGTCA AGGCCATGCT GGCTAATAGC GACCCATCGC AAGGGCTCAT GGGACTCTCG
CTACAAGTTC AAGCTGGAGA TGATAAATCT TTGGCTCAAA GCGAATTGAC CGATCAGACT
GACTTGGACT CTTTCACGAA CACTCGGCAC GACATTCATG AAGGCTTCGA ATCAGCTGAC
GGCTGGATGG AGCCGCGGAC TTGGACTCGT TTGGCTGCCC GAAACGCGTT ACGAGATCAA
ACTGGGAAGC TATGCGCCGT GCCCGGACGA GCCTTGTTTT CTCCATCTTC CGAGCCGATT
AATGCTGCAT GCAACTCGAC GCCGCACGAG TCTTCGGTAC AATCGGCAAA GCTTTGGACA
GCCAAGGAGG CGAACGATTT TAATCCAGCC CCAATATTTG TTTTACGTGA TGGAAAGCGG
TACCGCCACC CACCCTTGCC ACCGGGATGG ATGATTGGAG TCTCACAGAC CAAGAACCGG
CCCTACTATT ACCACACGGA TTTTGGAACA TCCTTCTCTT GTCCTGTCCA CCTACCTGAT
GACGATGGAC AGGTTTATGG CGATACTCCT TCACCCGTTC CGTCTGCATG TGCAAAGAGT
TGCGCAAGCC TGTATAGTTC TACAGTAGGT CATGACGAAG CTTCAAGAGG AAGGGCGTCG
ACCATATCAT CAAACACTGT CTCTTTGCAA GCATCCAGAA AAAGCAAATT CGCATTAAGT
CCTGTATCAA AAAGGGATGG GCTTACTGTA GTCAGCTGTA GATTGCAAAG TTCCCCTTCC
TTCGAAACAC CGAAAAAGAC AAACGACGCA GTGGAAACCA TTGAATTGGG AGTGTGCGTG
CATAACATCA CAAAAGAGCA TTCCCATATG ACCCCAGCAG TCCATGAAAG TGAATACGTG
TCCGGAAATC CGCTCACTTC TTCTCTTCGA AACAGCAAGG GGCACGAGGC CCATCTAGAA
GAGCATACGG AACAATTGAA ACTAGCCCAG AATCTTATCA GGACGTCCAC AAATAAAATG
TTTAGTCCAC TGTCATTGGC AATGAGGAAG AAGTCCTGGC CTCGGGGTCA GGGATGTGTG
GCCAATGCTC GTATGCCGGA CAACGAATTT GACGCTTCCA GCAGCGAGGA TTCTCGCGTG
CGTTCAGAAA AAGTTTTGTG TTCGACAACG CACAACCCGA CTTCCCATCT TTACTCTCTA
TCATCCCCTC ACGGCACTCA CACCATTGAT CGACGCGACT CCTTGAGGAC AACCGAAAAT
CTGTCGAATT ATGCCACCTC TGGAGCAAAG TTTCCCGAAA CCTACTTGCC TAAAGAAGGC
GTTGACAGAT TCTTCGTCAA CCGGATCTCT GAGGCCCCTG ACAAGACGGT AGACATGAGC
AGTTCTGTTC CCTCAAAATC TTCGTTTACC CTACATTCAT CTCGTTTAAG CAGCCCGAGT
AACGTTGAAG GTGACTCAAG GCAAGGATTG GCTTATGAGA GGGGGAAGGA TAGTGACGAG
ATTCCAGGTG TGGCCAACGA TGACTTCCCA GATGTCGAGT ACGACGAAAG TCCGATCGGA
CTACCCACCA TCGGTGACAG AGATATGACA ATCGGCCTCC GACAGAAGCT AGACTTCACG
TCCACCGACC AAGAATTGGA GGAGATCTTA CCTATTGAAA TGTCTGCAAG CAGGCGATCG
TCAAGACTCC AACCTTTTGT TCTGACATCC ACCGGATCCA AAGACGACGA AATATCCGCG
TTGGGAGTAG ACGATCTCTC CCAACAAAGC AGAGTAGAAG ATTTTATTCA AGAGAGGGGT
GAGATACGGA GCCATAGGTC TTTAGGAGGA TCTGCGTCGA CGTTTGGTAC AAATTTTAGC
CACCGGGTGC GACATCCGCC AATGCCACTG TGCAGTTTAC AGAACGTTGG ACAGCTCGAG
CGATATGCCT CACCGAGTCA CAAACTTTCA AGGAAATCAA GAGACAAACG GGGACGTCGT
AAGTCGAAAG GCGACCTGAG AGTGATCTCG CCGCGAATTT CGGTGATGGT GTCGAACTGA
 
Protein sequence
MTKKRKAFSP LHANTLGSSP SQRSSSTSSS LPRLDKDDDF LSRPPKKPQP VRSSDTKTPR 
ISNCTNPQGL VRTTTAVPAP STASLGATSR TSRPSKARSK VPRQTKRTLK KPPPPRIQQS
NQPADQSRAD VPKREAQGPG VIIPSNPQTS KDDTQRQSRR STSTLETILE EDSLSSNGSY
GPHGISTIKA AIQACLPNPS LPTANPSRPI PIASSHHVMD PQHCQTKSPH FQKLNASSSA
RNCSDDDMQL SSDDETLSLS SPETSIPNVP DRAIPYQKGT SHALHSFARG SIDHTTSRLR
DGRGLPNATF QSNVGKSVTN GSSNFDVATK NGEQHITKAL KAQVTDKAMQ RFLDTPVDNP
DVVKAMLANS DPSQGLMGLS LQVQAGDDKS LAQSELTDQT DLDSFTNTRH DIHEGFESAD
GWMEPRTWTR LAARNALRDQ TGKLCAVPGR ALFSPSSEPI NAACNSTPHE SSVQSAKLWT
AKEANDFNPA PIFVLRDGKR YRHPPLPPGW MIGVSQTKNR PYYYHTDFGT SFSCPVHLPD
DDGQVYGDTP SPVPSACAKS CASLYSSTVG HDEASRGRAS TISSNTVSLQ ASRKSKFALS
PVSKRDGLTV VSCRLQSSPS FETPKKTNDA VETIELGVCV HNITKEHSHM TPAVHESEYV
SGNPLTSSLR NSKGHEAHLE EHTEQLKLAQ NLIRTSTNKM FSPLSLAMRK KSWPRGQGCV
ANARMPDNEF DASSSEDSRV RSEKVLCSTT HNPTSHLYSL SSPHGTHTID RRDSLRTTEN
LSNYATSGAK FPETYLPKEG VDRFFVNRIS EAPDKTVDMS SSVPSKSSFT LHSSRLSSPS
NVEGDSRQGL AYERGKDSDE IPGVANDDFP DVEYDESPIG LPTIGDRDMT IGLRQKLDFT
STDQELEEIL PIEMSASRRS SRLQPFVLTS TGSKDDEISA LGVDDLSQQS RVEDFIQERG
EIRSHRSLGG SASTFGTNFS HRVRHPPMPL CSLQNVGQLE RYASPSHKLS RKSRDKRGRR
KSKGDLRVIS PRISVMVSN