Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_18665 |
Symbol | SHMT |
ID | 7204241 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 884561 |
End bp | 886280 |
Gene Length | 1720 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | serine hydroxymethyltransferase |
Protein accession | XP_002186147 |
Protein GI | 219113127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACAATCCCA TAACAGACTC TCACAGGAAG GCATCGGCAG TTAGAAGCAA CGCAGATTTC AAAGATCCGA GTCGTCTACA GCATTCTCCA AAAAATCTTT ATCGCATCTC CTCTCTAAAA CAACAGTCAT GACTTCTTTC AAAGATCAGG TAGGTGTATC GATCCTTTCC TTGAAACGAC AATGCTGCTC GCGAGAATGG CGCCTTGCTG AGTTGGCGAC GCGTGTTCTT CTTCTTCGAG CTGGAATGTC ATTTTCGAAT GCCAATATGC GTCCCCACGA CGGATTCTTA CGATTGTGGT TTCTATTTCT GTTGATTAGG AATTCCGCGG CTTACTCAGT CTGGAAGAGC ACGACCCCGA ACTCTTCGAT CTCATTGAAC AGGAGAAGTC TCGCCAATGG CGATCTTTGG AGCTGATCGC CAGCGAGAAC TTTACATCCC GTGCGGTTAT GGATTGTCTC GGTTCAGCTC TCACGAACAA GTATGCCGAG GGTCTCCCCG GCGCTCGTTA CTACGGAGGA AACGAAGTTG TCGATCAGGT CGAAGCGCTT TGCCAAAAGC GTGCTCTGGA AGCCTACGGA CTCGATCCGG AGAAGTGGGG CGTCAACGTC CAGCCCTACT CGGGATCCCC GGCCAATTTT GCCGTCTATA CGGCTCTGTT GAAGCCTCAC GATCGTATTA TGGGACTGGA CCTTCCCAGC GGTGGACACT TGACGCACGG ATTCTACACC TACTCCAAGA AGGAAGGTAC CCGTAAGGCC GTCTCGGCGA CCTCGGTTTA TTTCGAATCA TTGCCCTACC GCGTTCACCC CGAGACAGGT TACATTGACT ACGATCAGCT AGAGCGTGAT GCTGGTCTCT TCAAGCCGGC CATGATTATT GCCGGAGGTT CGGCTTACCC ACGTGACTAC GACTACAAGA GATTCCGTGA AATTGCTGAC GCCAATGGGG CACTTCTAAT GATGGATATG GCACACACAT CGGGCTTGGT CGCCACCGGC GAACTCGACA GCCCGTTCGA ATACGCTGAT GTGGTCACTA CGACCACGCA CAAGTCGTTG CGTGGACCCC GCGCTGGTAT GATCTTCTTC CGCAAGGACG AACGCGGCTT TGAGTCGCGG ATCAATCAGG CCGTCTTCCC CGCGCTACAG GGCGGTCCCC ACGAGCATCA GATTGCCGGT GTTGCCACGC AACTGAAGGA AGTGTGCTCT CCTGATTTTA AGGTCTACTC GCAGCAGGTG AAGAAGAACG CCAAGGCCCT GGCCGACAAG CTTACATCGA TGGGATACTC TATGGCTTCG GGCGGTACCG AGAATCACTT GGTGTTGTGG GATCTCAAAC CCCAAGGAAT TACCGGCAGC AAGTTTGAAA AGGTATGCGA CGCTGTGTCT ATTACTCTGA ACAAAAATTG TGTGCCGGGA GATGTTTCTG CCGTCACCCC AGGTGGGGTC CGTATCGGTA CCCCGGCTTT GACTACCCGT ACCATGGTGG AGTCCGACTT TGAACAGATT GGGCAGTTCC TGCACGAGGC TTTGGAGATT ACCCTTGCGA TCCAGGAAAA GAGCGGCCCG AAGCTGAAGG ACTTCCTGCC ATTGTTGGAG AAAAATGCGG ATATTGAGGC CCTCAAGGTG AGGGTCCACG ACTTTGCCAC TACGTTCCCC ATGCCGGGTT TCGATCCCGC GACGATGAAG TACAAGAACC CTGCTGGCCC ATCCCACTAA
|
Protein sequence | MTSFKDQEFR GLLSLEEHDP ELFDLIEQEK SRQWRSLELI ASENFTSRAV MDCLGSALTN KYAEGLPGAR YYGGNEVVDQ VEALCQKRAL EAYGLDPEKW GVNVQPYSGS PANFAVYTAL LKPHDRIMGL DLPSGGHLTH GFYTYSKKEG TRKAVSATSV YFESLPYRVH PETGYIDYDQ LERDAGLFKP AMIIAGGSAY PRDYDYKRFR EIADANGALL MMDMAHTSGL VATGELDSPF EYADVVTTTT HKSLRGPRAG MIFFRKDERG FESRINQAVF PALQGGPHEH QIAGVATQLK EVCSPDFKVY SQQVKKNAKA LADKLTSMGY SMASGGTENH LVLWDLKPQG ITGSKFEKVC DAVSITLNKN CVPGDVSAVT PGGVRIGTPA LTTRTMVESD FEQIGQFLHE ALEITLAIQE KSGPKLKDFL PLLEKNADIE ALKVRVHDFA TTFPMPGFDP ATMKYKNPAG PSH
|
| |