Gene PHATRDRAFT_44694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44694 
SymbolSMP1 
ID7197898 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1285795 
End bp1289470 
Gene Length3676 bp 
Protein Length1028 aa 
Translation table 
GC content51% 
IMG OID 
Productmetalloprotease 
Protein accessionXP_002178401 
Protein GI219115211 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACCCGTCTCA CAATCTCTTT GTCCCGACAA ACCTCAACAC AAGAACGAAG CCTAAAATAT 
CTCACAGTCA CCTGCCTTTC TCATGAAGTT TCCTTCGCAA CTCGGAGCCA CGGTCGCTTT
CGTCGTCGTT GCCGCAACGA CAACTAAAAC GGTGTCGGGC AATCTAGCCT CGCCGTTCCC
TTTCGAAGAA ACCAACGCTG ATGGAACCCC CACAGGTGAG ATGTACATCC ACGGAGGACC
TGGCGAAAGC TGGTTCGAAG ACAAGCAGGG ATTCACTATT TGTCCAGTGG ATGAGCAGCC
AACTAGACGT AGTCGACGTT TTCTTGGTGT CTTGGACTGG TTCACTGGTG GGACAAGCGA
GGAAGATTTA GCCTCTTCTG ATCCAGAAAG GACCTTCTAC TACTGCGACC AGGACGAGAA
CGGCAACGCG GTACCCAGAA CGGATCTGAA AGTGGGTGAG AGCGATCCCC AAGCCTCTGG
CCTCCCCGCA CATATTGAAA TAACGAGCGA TCAGCTCGAA GCGCAATGTG GGCCTTACTG
TCAGTTGGAC GAAAACGGTC GCCGCTTGGG TGAAGAAGAG ATCGATGGCA GCCATCGCAA
GCTCCAGGGA ACTTTGAAGA ACCTCGTGGT CCTTATGCGG TTCAGCGATC ACGCTACTCG
GGATCTTCCG TCGGTGGGTG AAGTGGACGA CCTGATGAGT GCCGATAGTC CCACCACGTC
CTGTCCTACT GGTAGTGTAA AGCAGGTGTT CTTGGAGAAC AGCAATGGTC GATTGATTCT
CGATTCCACC GTCTACCAAT GGGTGACCCT CGACCCCTTA TACACGGAGA TATACTGTGC
CAATGGACGG TCCGGTCTTG ACGCACGCGT CCACGAATGC ATTGCGAACG CACTGGACCA
GGTTGAAGCT GGTGGATTGG ACTTCCGTGA CTTCGATCAG GACAGCGACG GCAGAATCGA
TGCCATTACC TTTCTGCACT CGGGCTATGG GGCAGAATGT AAGTTTCGAA AATAGAAAAT
ACGGCAGTCG CATGAGATGA TAGTGCTCTC CTATAAATAT CTGACAATCT CTTCTTCTGA
ACCCTTAGGG AATGGAGCTC CTAATCGCAT CTGGTCGCAC AAGTGGGTGC TATACTCCAT
CAACGACGGT GGCGGGAGTG GCTGGACGTC TAACACGGGT GTGAGCGTGT ACACCTACCA
CATCAGCCCG TCTCTGTGGG GCACTAGTGG CAATGCAATT GGTCGCATCG GAGTCATTGG
TACGTAGCGT AACGGCACCA CACAAGGAGA ACTGGCGCTC TGTGGTAGTC GCGCTTCGAC
TTTTGCCCCG AGTAACTTCC TGTTGGGACG AACTCACACA TTTCTCAATC ACTCTGTCTT
TGCAGCTCAT GAAACCGGCC ATTTTCTGGG ACTACCTGAT CTCTACGACA CCGATGGAGG
CGGACAAGGG CTAAACTCCT GGAGTCTGAT GGCCAACAGT TGGGGATTTG ACGGGTCGCA
GCTGTACCCG CCTCTCATGG ATCCTTGGTG TAAAATCCAG CTAGGTTGGG TCACTCCCAC
AGTCTTGACG GAAACCTCTC GAAGTATCGA GATTAAACCT TCTTATACCG AGGACGACTA
CTATGTCATC CAAAAGGGTT TCCCACAGGG AGAGTATCTC GTGATTGAGA ATCGTCGCAG
GGTGGGATAC GACAGCCAGA TTCCCAGGGT AGGTTTATGT TTTGTTTTGT TTTCCAATGG
GCCATAATGC TGACTGTTGA GTCGACGTGA ATTCTTACCT GTTATCCTGT CATGCCTTGC
CTTGCTCTTT GACTGTGAAT GTCGTTTGAT TGTAGGAGGG ACTTATGATC TACCACATTG
ACGACAGTGC TAGTTTCAAT CGAGAGGGCT TCCCAGGCCA ATTTAATTGG CCTCAAAACG
GAAACCACTA CAGAGTAGCG CTCTTGCAAG CTGACGGGGC CTACAATTTG GAACGAGGAC
GCGGTGGAGA TAGTGGTGAT GTGTTCCACG GCGGGAGCGG CGGCGTTTCG TCCCTTGGCC
CTGGTCCGAA TGTCTATCCA AACACTGATA GTTACAAATT TGGTAACATT GTTCAAACTC
GAGTCACCAT TGAGAATATA AGTCCGGCCG GCGAGATGAT GACTTTTGAT TTCCTGGACG
GCGACGATGT GGTAGAGGGC GCTCCGACCC AAGCACCCAC AACAGAAGGC CCGCACGAAA
TTAGTGTTAG AGTGACGCAC GATAGATATC CCGAAGAAAC GTCCTGGAAC CTAGTAAACG
TGGCTGGAGG AGGCGTACTT GCTCGTCAGA ATGCAGGCGA AGTCACAACC GACAATACGG
TTGTTGTAGA GACAATTCGA GTTTACCCCG GCACCTTTCG GTTCGACATA TCGGATGTGT
ACAACGATGG CATTTGCTGC ACCTATGGAA TAGGGTCTTT CGAGATCAAG GTTGATGGTA
TGGTTGTTTA TGCTTCTGAC GGGTCGTTTG GTCAGTCTGA TAGTACCACG TTCGAAGTCG
GTGGCGTCAT CATCGATACG TCCGCACCAA CGGGACAATC CAGTCCTGCG CCCACGCCGA
TTCCGACTGT ACAACCCACA GGGCCGCCCA CTGTTTTTCC TACAGTTTCG CCTAGCACGA
GTCCCACCTT CCCTGCTCCA ACAGCTCCAC CGACTGTGAC GCCAACGGAT AAGCCTACCA
CTCTTGCTCC GACAAACGCT CCAACCAGTC AGCCTTCAAG TGTTTTCACG GATGCTCCAG
TACCGACATC GACCGCTACG CCAGTCTCGA CTTCAAGCGC TGCACCTACA ACTGTTTCAA
CAATGTCACC ATCCGCAGAA GAAACAAGCA CTGAAGAATT ACACCTAGTT GAGATTGAAG
TCATCCACGA CAATTATCCT CGGGAAACAT CCTGGACGTT TACAGAATCG AATGGTTCCA
ATGAAATTCT TGCATCCCAA GCAAGGAATT CTGTGGTCAC AAACGGGCAC GTTGCAAACG
TAGCTCTGCT GGTTTCGTCA GGAGAATATC TTTTTGAGAT CAAGGATTCT GCCTGGGACG
GTATCTGTTG TGCATGGGGA AATGGTAGCT ACACAGTGAA GGTGGATGGT ACCGTTGTCT
CTACAGGAGG TGAATTTGGA TCCTCAAAGA TTGAAAGTGT CAGTGTCGGC TCCTTGGAAC
CACCAACCAC CGATTTGACT ACCGTAAAGA TTCAAGTGAA GCACGACAAC TACGCAAGCG
AAACTGGTTG GGAACTGCTA AATGCAGCAA CCAATGCTGT GTTGGCTAGG CAACTGAAAG
GTACCGTTTT TAATCGCGGG CAAATCATTA CGAAGACGGT ATTGGTCCCT GCTGGAGACT
ACACTTTTCA TATTACGGAT TCCTTCAGCG ATGGCATTTG CTGCTCCTAT GGCGAAGGCC
TTTACGAAGT GACTGTTGGC AACACCGTTG TCGCAAGTGG TGGCGATTTT GGAGCCGAAA
GCTTTGACAG CTTCACGGTA GATGGTCCTT AGGTTGGTAG AGAAGCATGC TGTCTGTCTT
CTTTTTTGCG CAAACAATCA AATGCTGCGA TACTAAGTGT AACCTCTACG TAACGATTTG
TTGCATTTGT ATTATCGGTT TGAATAAGTG TACAGATGCA TCTTGCTTGC TGTATGGCAT
CGTTCAAAAA GATCGC
 
Protein sequence
MKFPSQLGAT VAFVVVAATT TKTVSGNLAS PFPFEETNAD GTPTGEMYIH GGPGESWFED 
KQGFTICPVD EQPTRRSRRF LGVLDWFTGG TSEEDLASSD PERTFYYCDQ DENGNAVPRT
DLKVGESDPQ ASGLPAHIEI TSDQLEAQCG PYCQLDENGR RLGEEEIDGS HRKLQGTLKN
LVVLMRFSDH ATRDLPSVGE VDDLMSADSP TTSCPTGSVK QVFLENSNGR LILDSTVYQW
VTLDPLYTEI YCANGRSGLD ARVHECIANA LDQVEAGGLD FRDFDQDSDG RIDAITFLHS
GYGAEWNGAP NRIWSHKWVL YSINDGGGSG WTSNTGVSVY TYHISPSLWG TSGNAIGRIG
VIAHETGHFL GLPDLYDTDG GGQGLNSWSL MANSWGFDGS QLYPPLMDPW CKIQLGWVTP
TVLTETSRSI EIKPSYTEDD YYVIQKGFPQ GEYLVIENRR RVGYDSQIPR EGLMIYHIDD
SASFNREGFP GQFNWPQNGN HYRVALLQAD GAYNLERGRG GDSGDVFHGG SGGVSSLGPG
PNVYPNTDSY KFGNIVQTRV TIENISPAGE MMTFDFLDGD DVVEGAPTQA PTTEGPHEIS
VRVTHDRYPE ETSWNLVNVA GGGVLARQNA GEVTTDNTVV VETIRVYPGT FRFDISDVYN
DGICCTYGIG SFEIKVDGMV VYASDGSFGQ SDSTTFEVGG VIIDTSAPTG QSSPAPTPIP
TVQPTGPPTV FPTVSPSTSP TFPAPTAPPT VTPTDKPTTL APTNAPTSQP SSVFTDAPVP
TSTATPVSTS SAAPTTVSTM SPSAEETSTE ELHLVEIEVI HDNYPRETSW TFTESNGSNE
ILASQARNSV VTNGHVANVA LLVSSGEYLF EIKDSAWDGI CCAWGNGSYT VKVDGTVVST
GGEFGSSKIE SVSVGSLEPP TTDLTTVKIQ VKHDNYASET GWELLNAATN AVLARQLKGT
VFNRGQIITK TVLVPAGDYT FHITDSFSDG ICCSYGEGLY EVTVGNTVVA SGGDFGAESF
DSFTVDGP