Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44694 |
Symbol | SMP1 |
ID | 7197898 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1285795 |
End bp | 1289470 |
Gene Length | 3676 bp |
Protein Length | 1028 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | metalloprotease |
Protein accession | XP_002178401 |
Protein GI | 219115211 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACCCGTCTCA CAATCTCTTT GTCCCGACAA ACCTCAACAC AAGAACGAAG CCTAAAATAT CTCACAGTCA CCTGCCTTTC TCATGAAGTT TCCTTCGCAA CTCGGAGCCA CGGTCGCTTT CGTCGTCGTT GCCGCAACGA CAACTAAAAC GGTGTCGGGC AATCTAGCCT CGCCGTTCCC TTTCGAAGAA ACCAACGCTG ATGGAACCCC CACAGGTGAG ATGTACATCC ACGGAGGACC TGGCGAAAGC TGGTTCGAAG ACAAGCAGGG ATTCACTATT TGTCCAGTGG ATGAGCAGCC AACTAGACGT AGTCGACGTT TTCTTGGTGT CTTGGACTGG TTCACTGGTG GGACAAGCGA GGAAGATTTA GCCTCTTCTG ATCCAGAAAG GACCTTCTAC TACTGCGACC AGGACGAGAA CGGCAACGCG GTACCCAGAA CGGATCTGAA AGTGGGTGAG AGCGATCCCC AAGCCTCTGG CCTCCCCGCA CATATTGAAA TAACGAGCGA TCAGCTCGAA GCGCAATGTG GGCCTTACTG TCAGTTGGAC GAAAACGGTC GCCGCTTGGG TGAAGAAGAG ATCGATGGCA GCCATCGCAA GCTCCAGGGA ACTTTGAAGA ACCTCGTGGT CCTTATGCGG TTCAGCGATC ACGCTACTCG GGATCTTCCG TCGGTGGGTG AAGTGGACGA CCTGATGAGT GCCGATAGTC CCACCACGTC CTGTCCTACT GGTAGTGTAA AGCAGGTGTT CTTGGAGAAC AGCAATGGTC GATTGATTCT CGATTCCACC GTCTACCAAT GGGTGACCCT CGACCCCTTA TACACGGAGA TATACTGTGC CAATGGACGG TCCGGTCTTG ACGCACGCGT CCACGAATGC ATTGCGAACG CACTGGACCA GGTTGAAGCT GGTGGATTGG ACTTCCGTGA CTTCGATCAG GACAGCGACG GCAGAATCGA TGCCATTACC TTTCTGCACT CGGGCTATGG GGCAGAATGT AAGTTTCGAA AATAGAAAAT ACGGCAGTCG CATGAGATGA TAGTGCTCTC CTATAAATAT CTGACAATCT CTTCTTCTGA ACCCTTAGGG AATGGAGCTC CTAATCGCAT CTGGTCGCAC AAGTGGGTGC TATACTCCAT CAACGACGGT GGCGGGAGTG GCTGGACGTC TAACACGGGT GTGAGCGTGT ACACCTACCA CATCAGCCCG TCTCTGTGGG GCACTAGTGG CAATGCAATT GGTCGCATCG GAGTCATTGG TACGTAGCGT AACGGCACCA CACAAGGAGA ACTGGCGCTC TGTGGTAGTC GCGCTTCGAC TTTTGCCCCG AGTAACTTCC TGTTGGGACG AACTCACACA TTTCTCAATC ACTCTGTCTT TGCAGCTCAT GAAACCGGCC ATTTTCTGGG ACTACCTGAT CTCTACGACA CCGATGGAGG CGGACAAGGG CTAAACTCCT GGAGTCTGAT GGCCAACAGT TGGGGATTTG ACGGGTCGCA GCTGTACCCG CCTCTCATGG ATCCTTGGTG TAAAATCCAG CTAGGTTGGG TCACTCCCAC AGTCTTGACG GAAACCTCTC GAAGTATCGA GATTAAACCT TCTTATACCG AGGACGACTA CTATGTCATC CAAAAGGGTT TCCCACAGGG AGAGTATCTC GTGATTGAGA ATCGTCGCAG GGTGGGATAC GACAGCCAGA TTCCCAGGGT AGGTTTATGT TTTGTTTTGT TTTCCAATGG GCCATAATGC TGACTGTTGA GTCGACGTGA ATTCTTACCT GTTATCCTGT CATGCCTTGC CTTGCTCTTT GACTGTGAAT GTCGTTTGAT TGTAGGAGGG ACTTATGATC TACCACATTG ACGACAGTGC TAGTTTCAAT CGAGAGGGCT TCCCAGGCCA ATTTAATTGG CCTCAAAACG GAAACCACTA CAGAGTAGCG CTCTTGCAAG CTGACGGGGC CTACAATTTG GAACGAGGAC GCGGTGGAGA TAGTGGTGAT GTGTTCCACG GCGGGAGCGG CGGCGTTTCG TCCCTTGGCC CTGGTCCGAA TGTCTATCCA AACACTGATA GTTACAAATT TGGTAACATT GTTCAAACTC GAGTCACCAT TGAGAATATA AGTCCGGCCG GCGAGATGAT GACTTTTGAT TTCCTGGACG GCGACGATGT GGTAGAGGGC GCTCCGACCC AAGCACCCAC AACAGAAGGC CCGCACGAAA TTAGTGTTAG AGTGACGCAC GATAGATATC CCGAAGAAAC GTCCTGGAAC CTAGTAAACG TGGCTGGAGG AGGCGTACTT GCTCGTCAGA ATGCAGGCGA AGTCACAACC GACAATACGG TTGTTGTAGA GACAATTCGA GTTTACCCCG GCACCTTTCG GTTCGACATA TCGGATGTGT ACAACGATGG CATTTGCTGC ACCTATGGAA TAGGGTCTTT CGAGATCAAG GTTGATGGTA TGGTTGTTTA TGCTTCTGAC GGGTCGTTTG GTCAGTCTGA TAGTACCACG TTCGAAGTCG GTGGCGTCAT CATCGATACG TCCGCACCAA CGGGACAATC CAGTCCTGCG CCCACGCCGA TTCCGACTGT ACAACCCACA GGGCCGCCCA CTGTTTTTCC TACAGTTTCG CCTAGCACGA GTCCCACCTT CCCTGCTCCA ACAGCTCCAC CGACTGTGAC GCCAACGGAT AAGCCTACCA CTCTTGCTCC GACAAACGCT CCAACCAGTC AGCCTTCAAG TGTTTTCACG GATGCTCCAG TACCGACATC GACCGCTACG CCAGTCTCGA CTTCAAGCGC TGCACCTACA ACTGTTTCAA CAATGTCACC ATCCGCAGAA GAAACAAGCA CTGAAGAATT ACACCTAGTT GAGATTGAAG TCATCCACGA CAATTATCCT CGGGAAACAT CCTGGACGTT TACAGAATCG AATGGTTCCA ATGAAATTCT TGCATCCCAA GCAAGGAATT CTGTGGTCAC AAACGGGCAC GTTGCAAACG TAGCTCTGCT GGTTTCGTCA GGAGAATATC TTTTTGAGAT CAAGGATTCT GCCTGGGACG GTATCTGTTG TGCATGGGGA AATGGTAGCT ACACAGTGAA GGTGGATGGT ACCGTTGTCT CTACAGGAGG TGAATTTGGA TCCTCAAAGA TTGAAAGTGT CAGTGTCGGC TCCTTGGAAC CACCAACCAC CGATTTGACT ACCGTAAAGA TTCAAGTGAA GCACGACAAC TACGCAAGCG AAACTGGTTG GGAACTGCTA AATGCAGCAA CCAATGCTGT GTTGGCTAGG CAACTGAAAG GTACCGTTTT TAATCGCGGG CAAATCATTA CGAAGACGGT ATTGGTCCCT GCTGGAGACT ACACTTTTCA TATTACGGAT TCCTTCAGCG ATGGCATTTG CTGCTCCTAT GGCGAAGGCC TTTACGAAGT GACTGTTGGC AACACCGTTG TCGCAAGTGG TGGCGATTTT GGAGCCGAAA GCTTTGACAG CTTCACGGTA GATGGTCCTT AGGTTGGTAG AGAAGCATGC TGTCTGTCTT CTTTTTTGCG CAAACAATCA AATGCTGCGA TACTAAGTGT AACCTCTACG TAACGATTTG TTGCATTTGT ATTATCGGTT TGAATAAGTG TACAGATGCA TCTTGCTTGC TGTATGGCAT CGTTCAAAAA GATCGC
|
Protein sequence | MKFPSQLGAT VAFVVVAATT TKTVSGNLAS PFPFEETNAD GTPTGEMYIH GGPGESWFED KQGFTICPVD EQPTRRSRRF LGVLDWFTGG TSEEDLASSD PERTFYYCDQ DENGNAVPRT DLKVGESDPQ ASGLPAHIEI TSDQLEAQCG PYCQLDENGR RLGEEEIDGS HRKLQGTLKN LVVLMRFSDH ATRDLPSVGE VDDLMSADSP TTSCPTGSVK QVFLENSNGR LILDSTVYQW VTLDPLYTEI YCANGRSGLD ARVHECIANA LDQVEAGGLD FRDFDQDSDG RIDAITFLHS GYGAEWNGAP NRIWSHKWVL YSINDGGGSG WTSNTGVSVY TYHISPSLWG TSGNAIGRIG VIAHETGHFL GLPDLYDTDG GGQGLNSWSL MANSWGFDGS QLYPPLMDPW CKIQLGWVTP TVLTETSRSI EIKPSYTEDD YYVIQKGFPQ GEYLVIENRR RVGYDSQIPR EGLMIYHIDD SASFNREGFP GQFNWPQNGN HYRVALLQAD GAYNLERGRG GDSGDVFHGG SGGVSSLGPG PNVYPNTDSY KFGNIVQTRV TIENISPAGE MMTFDFLDGD DVVEGAPTQA PTTEGPHEIS VRVTHDRYPE ETSWNLVNVA GGGVLARQNA GEVTTDNTVV VETIRVYPGT FRFDISDVYN DGICCTYGIG SFEIKVDGMV VYASDGSFGQ SDSTTFEVGG VIIDTSAPTG QSSPAPTPIP TVQPTGPPTV FPTVSPSTSP TFPAPTAPPT VTPTDKPTTL APTNAPTSQP SSVFTDAPVP TSTATPVSTS SAAPTTVSTM SPSAEETSTE ELHLVEIEVI HDNYPRETSW TFTESNGSNE ILASQARNSV VTNGHVANVA LLVSSGEYLF EIKDSAWDGI CCAWGNGSYT VKVDGTVVST GGEFGSSKIE SVSVGSLEPP TTDLTTVKIQ VKHDNYASET GWELLNAATN AVLARQLKGT VFNRGQIITK TVLVPAGDYT FHITDSFSDG ICCSYGEGLY EVTVGNTVVA SGGDFGAESF DSFTVDGP
|
| |