Gene PHATRDRAFT_46192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46192 
Symbol 
ID7201264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp529463 
End bp532978 
Gene Length3516 bp 
Protein Length1171 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180662 
Protein GI219119820 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.320112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAGA TGAACGAAGC CTTGGCGGTA GCTTTGAGCA GCAACGGCGC CGGGAGTAGC 
GTCCGGAGTA GAGGCCCAAA GCCCAAACAT CCGCACAACC AAACGATTCA AGTCAAAATC
GGTTCACACG CACCGCAGTC CATGACACCG GATCAAGCCG CCGAATACTT AAAAAGCATT
GCCAATACGG GAGGCTTTGA CGGAAGCTCG TCGGGCGCAG GTAGGGATGT TAAAATGACC
TTTGGCAAGG GACCGACGGC AGCGTCGCTG ACGACGGAAC TCGCTGCCGC TGTGTCGTCC
CTCGGAGGCG GGAACATTCT CCACAACTAC GCGACGGCTG CGGACACGAA CGCCGCCGTC
GGTACCGCCA TTCCCTTTGG TCCCAGACGA TCTATTTGTG GTGACTCGGA ACACGCCGAC
TTTGTTTTCT CCTCCAACTC GGCGCTTTCG ACCCTCGACT TCGCCGCCTT GGCAGCGGTC
GGTGGCGCCG ATTTGCCCCA CAAACGGAAA GACGGTGCAC CCTGCACGGA AAAGGAAATG
AAAGCGCTCA TGAGCATGTT TGTAGAAATT ATGGGACTGC AAATGAAAAC CGATCAACTG
CAACCGCCGA TGCACGTTCG GGGTGATGTG CTTCAGGCAT TCCCGTCGCA GTGTGCACCA
CCCCCATCCG GTTGGCCGGA AGAGCTGCTC TACCCCGCGT TACCGCCGAC TTTGAATGAT
GACGACTCTT TGCCGGATCT AGAAGAAGCA CCTGTGTGGT CGTTTGCGAG TCTAGGAAAA
TTGACGGCAC TCCTGGAGCA AACGCCCCCG GTACCCGAAC GATTCAACCC AGCTTCCGGT
ATTGAACGTT TGAACTGGGA CTTGTTGGAA CGGTTTGCCA TGGAAGATGC CTTGGAACAG
GAAGAGCGAG CTCGCAAGTC GGCAAAGTAT GTGTCACAAC AGCAATCTGC GTCGCGACAA
GCCTCACAAG CTCGATCGGA AGCCGAAGCG GTGGCGAAAC GACTCGACAA GGTCTTCCAC
GCGTGGCGCT CCAAAGTAGT CAGCGCAATC AACGTAAACG ATGTTGACGC GTTGGAAGTC
CTACTGGTGA ATTCTCCCTC TGTTAATGAC GAGAGAGTTG CCAAAACGCA CTGGCGGGCT
TTGGCTCCAC AGCTTTTGGC CAAGAATCGG CAAACCTTGC CCAAGAATAG ACAAGCGCGG
GTGCTGCTGG CCACTCTGTT GGGACGCAAG TCGATAGACT TGCTATCAGA ACCTCAACGG
AATGGGCGTA GTGTACTGCA TACGGCTTGC TTTCATGGTG ATATAGCTCT GGTTCAGGCA
TTATTGTCAT CCTGGGGCAG TGCTTCTGAG GGTTCGGATG AAAAACAGAA AAAAATCAAC
GTTCGCTGTC AAGATTCGGG ATGGACACCG CTGCAATATG CTGTCGCGTC CGGATCAATG
CGTGTTGTCG AACTTATACT TCAGCACGGA GCCGATATGC TGGTGCCAAC GAACGATACA
CACACTTGGA AGCGACAAGG AAAGGGTGGT TTGACCGCTG TGGAACTAAC CGACGTTATT
CGTAAGCAAG CATGGAGTAA ACAGATTGAA AGTCATGGTA TGGCGCTCCC GGAAATTACC
AAGGAACTTT TGTCGGGCAC CGAAGCACGG CTCTATCTAG AGCGTTTGGC ACAAATTGCG
CATCGCTTGC GTGAGGTGGC GTCGCAAGGC AACAGCATCG AGCCACTGTC AGAAAAGCAA
TTGAGCAAAC TTGAAGAGCA AGTCCGTCAG CAGGCTACAA AAACGCGCAA GCCAGAAGCC
GGTTCAGTTA CCAGCAACGA TGGCAGCCTA CGCAAACAAG AGTCCATGGC CAGCAATACG
CCCCAGAGGT CGAGTTCTAA AGCGCAAAAG AATTATGCTG AAACTCTACC GGTATCGAAA
ATCGACACGA AAAACATAAC AGAACCTGTG ACAAAATCTG TACCAGAAGA TCCAATGATT
TCTGCCCTAA TAGGTATGGG ATTTGTACGA GATCAAATTA TGAACGGTGT TCACGCTTGT
GGAGGCATGG ACCGAGCAAC AGCGGATGAC GTTGTGGCTT GGATTTTTGG ACAAGATACA
AGTCCAGCTG CGCCTGACGA ATCTCCGAAA ACAATAGAGG ACCAACCCGC GTCTCAGATT
TCCGGTGGTA TCCAATCTCG TGATTCGCTC CGGCATGCGC GAGCCGCTAA CGCAGCTGAA
GTTAACCGTT TGACAACTTT GCGTAAAGCC GAAGAAGAAA GACTTGCCGC CGAGCGACAA
GCCGCCCACC GGGAAGAGCA GCGTCGGAAA AATCGGGAAT GGAACAATCG ACAAAAGCGA
CAGGTGCAGA CAGCCACAGC CGCGGATGTC AAGAGATCGT CAGGTGGATT GGAGGGTAAG
GCCTTGCCTC CGAGATCGCT GTCGAAGCAG ATCTCCGGCA TGCCTGATAA AGGACCCATC
CCGGCTGCCA TGTCGGCGAT GGCTACTGCC CCTACATCGA CTTGGGTTAA CAGGGAAGTT
CGAGAAAATG GAAACGATTC CTCGACGGTC AATTCGTTTA ATGAGATTAC AACCATTGAG
ATCTACAACA ATGACGACGC AACAGTATCG ACTATTGGGA GTTTGCAAGC CCGTGCTACT
TCCATTCCCG TTCCTTCATC ACAGCCAGTC GCTCCCCCTG GCTTTGGGTT AACCGTTCCG
GCTGTTCCGG AGGCGATGGA AGCAACGCAT CCATGGGGCT TGACGCCAGC TGGGGTCCCT
CATCTACCTG GAAACGATGG ACCGGGGTGC TTTCTTCCTC CTCCAGGGCT CTCCGCCGAC
CTGAATAGTA TGCAGCATCC AGCAATTTCA GGAGCTAGCG AGCGTCAATT CCCAATGAAG
AATGATCCTA TGGCGGGAGG AAGCATTTCC CGTTCTGCGG ATTCTTTCAC TAGATTCGCG
ACTGGGAACA ATGGCACAGG TCTTTTGGGG AGTCATTTAA TGTCCACGAC AAATCCACCA
GTAGATCCAT CGTTGCTCTT TTCCGACTCG TTGACCGGTA ATTCCTTTCG GAACAATCTT
TCGCAACAAA CTTGTCTACC GCCGTTGCCC TCGCAATCTT TCACAAACGG ATCGTCGTTT
TCTGGTCCGA CGTTGCCATC ATCTTTGGGT ATCGGTCGGA GTGGGTTACC GAATCAAGCC
GCTCACCTAG ACTCATCATT CATCGAATCC ATTTCTACGG GTGATCCCTT GCTGGATGGA
GCCTCCCTGT GGGGCGGCAT CGATCCCCAG TCACAATCAC CGTCGGTGTT GCACAACTTG
CTACACGAAG ATTTAAATCA TGACGGCATC TTTCCAGCTC CGCAAACTTC GGAACAAGAA
AGAAGCTTTG CGACATGGGG GACAAATCAG CAACAATCTC ACGCTAACCT GCATCTCCTT
CACCAACGAC CTACAGCTAA CTCGCATCAA CAACCGCAGT CTCTAAACGT GGCAAGTACT
ATGCAGCGAG ATAACCGTGG CAGGTCCATA TGGTAA
 
Protein sequence
MEEMNEALAV ALSSNGAGSS VRSRGPKPKH PHNQTIQVKI GSHAPQSMTP DQAAEYLKSI 
ANTGGFDGSS SGAGRDVKMT FGKGPTAASL TTELAAAVSS LGGGNILHNY ATAADTNAAV
GTAIPFGPRR SICGDSEHAD FVFSSNSALS TLDFAALAAV GGADLPHKRK DGAPCTEKEM
KALMSMFVEI MGLQMKTDQL QPPMHVRGDV LQAFPSQCAP PPSGWPEELL YPALPPTLND
DDSLPDLEEA PVWSFASLGK LTALLEQTPP VPERFNPASG IERLNWDLLE RFAMEDALEQ
EERARKSAKY VSQQQSASRQ ASQARSEAEA VAKRLDKVFH AWRSKVVSAI NVNDVDALEV
LLVNSPSVND ERVAKTHWRA LAPQLLAKNR QTLPKNRQAR VLLATLLGRK SIDLLSEPQR
NGRSVLHTAC FHGDIALVQA LLSSWGSASE GSDEKQKKIN VRCQDSGWTP LQYAVASGSM
RVVELILQHG ADMLVPTNDT HTWKRQGKGG LTAVELTDVI RKQAWSKQIE SHGMALPEIT
KELLSGTEAR LYLERLAQIA HRLREVASQG NSIEPLSEKQ LSKLEEQVRQ QATKTRKPEA
GSVTSNDGSL RKQESMASNT PQRSSSKAQK NYAETLPVSK IDTKNITEPV TKSVPEDPMI
SALIGMGFVR DQIMNGVHAC GGMDRATADD VVAWIFGQDT SPAAPDESPK TIEDQPASQI
SGGIQSRDSL RHARAANAAE VNRLTTLRKA EEERLAAERQ AAHREEQRRK NREWNNRQKR
QVQTATAADV KRSSGGLEGK ALPPRSLSKQ ISGMPDKGPI PAAMSAMATA PTSTWVNREV
RENGNDSSTV NSFNEITTIE IYNNDDATVS TIGSLQARAT SIPVPSSQPV APPGFGLTVP
AVPEAMEATH PWGLTPAGVP HLPGNDGPGC FLPPPGLSAD LNSMQHPAIS GASERQFPMK
NDPMAGGSIS RSADSFTRFA TGNNGTGLLG SHLMSTTNPP VDPSLLFSDS LTGNSFRNNL
SQQTCLPPLP SQSFTNGSSF SGPTLPSSLG IGRSGLPNQA AHLDSSFIES ISTGDPLLDG
ASLWGGIDPQ SQSPSVLHNL LHEDLNHDGI FPAPQTSEQE RSFATWGTNQ QQSHANLHLL
HQRPTANSHQ QPQSLNVAST MQRDNRGRSI W