Gene PHATRDRAFT_46132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46132 
Symbol 
ID7201355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp361326 
End bp364358 
Gene Length3033 bp 
Protein Length1007 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180622 
Protein GI219119737 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGAAGCGCCA TGGCGACCGT ATCGCCCCCA CCTAGTGGGC AAGTACTGCA GCTCGTCTCG 
GCCCTTTCCA ATCCAGAAAA TCACAATGTT CATGTCCAAT CAATCCGCGC CCGAGACGAG
GCGCTCTCGG CCTCGGCTGA ATCTTACGGA AATCTTTGCT ATCAGTTGGC TCTCGTGCTC
GTAGGCAGTG ATAATGCGGC TGAGATGACA GCGCACATAA ATCCATCCGA GCTAGATTCA
TGGAGACAAG CCGATCCTTC CACGGTGCTG AGGCTACAGC AAGATATGTC CATGTGGATT
CCATTCGGAC AAATGGCAGG TCTAGTGCTC AAGAATGCTC TCTTGAGGCC ACCAATTTTG
CAAGGACGTC AGTCGCTTTC CATCCAACCA CCAAGCTCGG ATCTTCTTAA AGAAGCGCTG
TTACAAGCCC TCGGATGTCA GCACTCTGAG CTCCGAGCGG TTGCAAGTTC TGTGATAGCC
ACTTCTGCGG TTTCGGCAGA TAGTGTCCAA CCGGGGCTAT GTGTTCGCGC GTGGCCTCAG
CTGATACCTG CTTTGATTGC AAACTTACAG AAGACAGAGA ACGCTGCCTT AATGGAAGGG
TCGCTTGCTA CAATTCGAAA AATGATGGAA GATGGACCGA CCGAGTTGAC GCAGGAAGAA
TTGGATAGTT TGATTCCTGT GCTGATTCGA TTCCTTTCAT GCAACAGTGA ATTCTGTAAA
GTTGCTGCTC TGCAATCGCT CACAGCCTGT CTCTCCGACA ACGTTATGCC GAGCGCTTTG
GTCCTGTACT TCAACGACTA TCTCGGCGGA CTAAGCGCCC TTTCTACGGA TCCTAGCGCC
TCGATTCGAA AGTGGGTGTG CCGTTCGATT GTCACGCTGT TACAACTCCG AACCGAATAT
ATTCAACCCC ACCTCCAGGC CGTCAGCCAA TTTATGCTCA CGAGTACGGC AGATCGTCAT
CACGATGCAG TGGCATTAGA AGCTTGCGAG TTTTGGTTCA CGTTTGCAAC CTTGGACGAA
GATGTGTGCA CACCAGCGAT GGTCGAAACA ATTGGAGGAG TTCTACCTAA ACTGATTCCC
ATTCTTTTGG AGAACATGGT ATACCTTCCC GAGCAACAGA TTGAGCTCCA GGCAAGAAAC
GAGATTGACC AACAGGAAGG ATACAACGGG ATGAGTACAA TCAAGCCCGT ATTTCATCGC
AGCCGGGCAA AACATGTGGG CGGTCCCGAT GAAAGTAGCG ACGACGATGA TGGCTATGAT
CAAGACGATG AGGATGATGG CGAGTTTGAC GACGACAATA ATGAATGGAC GTTACGTAAG
TGCGCCGCAG CAAGTCTTGA CTCTCTGTCC AGTCTGTTTG GTGCCGATTC TATCCTCCCA
AGCTTACTAC CCGCTTTGCA AAATGGACTC TCCAGCTCAT GTCCGTGGGT ACAGGAGGCG
TCTATCCTGG CACTTGGGGC AGTCGCAGAA GGTTGTCGGG ATGCTTTGAA TGTACACATG
TCTCAAATGC ACCTGTATCT AGTAAATCAT CTTGCGGCTC CTGAATCTCC CAGTACTTTA
CCTCAAGTAA AATGTATTGC CGCGTGGACG ATAGGACGGT TTGCTTCATG GGCCGTAGAG
CAAGTTCAAA CCGGAGCCCA AGGTCATCTA CTGGCGCACA TGACAGAGGT ATTTCTGACT
CGCTTGAGCG ATAGGAACCG GAGGGTCCAA ATTTCTTGCT GCTCTGCATT CGGCGTCATT
ATCGAATCAG CAGGGGATCT GATGACGCCA TATTTGTCGC ACATTTACTA CGGCCTTGTC
TCAGCCTTGT CACGCTACCA GGGCCGGAGT CTCTTAATGA TCTTCGATGT GGTTGGAATA
ATTGCTGACT GCTGTGGCCC ATCAATCGCT GAAGGGGATC TGCCATCGAT CTACGTCCCC
CCATTGTTGC AGATGTGGAG CGGTTTAGCC AAGAACGACC CCACCGACCG GACGTTGCTA
CCACTCATGG AGAGCCTGGC AAGTGTAGCC ATGACTTCGG GAATGAACTA TCAGCCCTAC
TCGCTCGAGT CATTTGACAA CGCGATGGGT ATCATCGAAG CAGTTCAGCT AATTCTTACT
GCTTCTGGCG AAAAACTGGA ACACGAAGAA GAGGCGGACC CCATTGTTTG CGCAACGGAT
CTCTTGGACG GATTGGTCGA AGGTCTCGGA GAGAGTTTTC CATCGCTGGT TTCAAGTAGC
CGACGATACG GGCAGCATTT TCTTCCGGTA CTTCTGGCAC TTTGCAAACA TGATATTCCC
GGCGTGCGAA TGAGCGCTAT CGCTTTGGTC GGCGACCTGG CTCGCAGCTC CCCGGCTTTA
CTGGAACAGG CATTGCCAGA GCTTCTGAAA GAACTCGTTG CAAATATGGA TCCGGTACAA
CCGTCTGTGA GTACAAATGC AGTCTGGGCA CTGGGCGAAA TTTGCGTTCG ATGCGAACGA
AATTCCTCGC CTCTGGAAGC TGTTGTGCCT GATCTTGTTC AGAATCTCAT TGCATTGTTG
ATGGGCAATG GTATTGAGCG GAACGGCAGG GGATCGGATA TTCCCGGCAT CGCTGAAAAT
GCAGCAGCAT GTGCCGGGCG ACTCGCCAAG GTTAACCCCC AGTTTCTTGC GCCTGACCTC
CCTCGATTTT TGCTCGGATG GTGTGACGGG ATGGCAAAAA TTGTGGACCC CAAAGAGAGG
CGTGACGCAT TCCAAGGATT TGTTGCTGCT ATCTACGCCA ATCCCCAGGC ATTTCAGACA
TCTTCCGCAA CCGTTTCTGA TGCGATCGCA TCCATCATTT TTGCTATCGT GACTTGGCAT
ATGCCAGCGG AAATACCAGA GCAATCAGTT GTCCTTCTAA ATGGAGACTA CAAATTCCGT
CCGTTCCCCG CTAACGAGCC GGAACTTGGC GAAGCCCTTT TTAAACTCAT CTCAGACCTA
AAGACATCCG TCGATGAGAC GACATGGAGA GCAGTTCAGC AAGGACTGCC GGTGAATATT
CGACGTCTCC TCCGCGAGTT TTATAACATG TAG
 
Protein sequence
MATVSPPPSG QVLQLVSALS NPENHNVHVQ SIRARDEALS ASAESYGNLC YQLALVLVGS 
DNAAEMTAHI NPSELDSWRQ ADPSTVLRLQ QDMSMWIPFG QMAGLVLKNA LLRPPILQGR
QSLSIQPPSS DLLKEALLQA LGCQHSELRA VASSVIATSA VSADSVQPGL CVRAWPQLIP
ALIANLQKTE NAALMEGSLA TIRKMMEDGP TELTQEELDS LIPVLIRFLS CNSEFCKVAA
LQSLTACLSD NVMPSALVLY FNDYLGGLSA LSTDPSASIR KWVCRSIVTL LQLRTEYIQP
HLQAVSQFML TSTADRHHDA VALEACEFWF TFATLDEDVC TPAMVETIGG VLPKLIPILL
ENMVYLPEQQ IELQARNEID QQEGYNGMST IKPVFHRSRA KHVGGPDESS DDDDGYDQDD
EDDGEFDDDN NEWTLRKCAA ASLDSLSSLF GADSILPSLL PALQNGLSSS CPWVQEASIL
ALGAVAEGCR DALNVHMSQM HLYLVNHLAA PESPSTLPQV KCIAAWTIGR FASWAVEQVQ
TGAQGHLLAH MTEVFLTRLS DRNRRVQISC CSAFGVIIES AGDLMTPYLS HIYYGLVSAL
SRYQGRSLLM IFDVVGIIAD CCGPSIAEGD LPSIYVPPLL QMWSGLAKND PTDRTLLPLM
ESLASVAMTS GMNYQPYSLE SFDNAMGIIE AVQLILTASG EKLEHEEEAD PIVCATDLLD
GLVEGLGESF PSLVSSSRRY GQHFLPVLLA LCKHDIPGVR MSAIALVGDL ARSSPALLEQ
ALPELLKELV ANMDPVQPSV STNAVWALGE ICVRCERNSS PLEAVVPDLV QNLIALLMGN
GIERNGRGSD IPGIAENAAA CAGRLAKVNP QFLAPDLPRF LLGWCDGMAK IVDPKERRDA
FQGFVAAIYA NPQAFQTSSA TVSDAIASII FAIVTWHMPA EIPEQSVVLL NGDYKFRPFP
ANEPELGEAL FKLISDLKTS VDETTWRAVQ QGLPVNIRRL LREFYNM