Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54478 |
Symbol | MS |
ID | 7201080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 403450 |
End bp | 405322 |
Gene Length | 1873 bp |
Protein Length | 553 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | malate synthase |
Protein accession | XP_002180227 |
Protein GI | 219118921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0693019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTAATCCTT AGGCGCCTCA GCCGACTTTA CCCATACCGA GATACTGCCC TCCTTTTCAA GACGTCTCTT GCGTCAGACT TTTAACAGCG CTCATCGTTT TTTTACCAAT ACAATGATTG AATTTCGTTC GGAACAAGTA CACGTTCGCG TGCACGCGCC TGCCAACAAG GCCGCCGAAG AAATGTTGAC GCCGGATGCG CTACGTTTGC TCGGACTGCT TTGCGAACGC TTCGATGTCC GTCGGCAGGC TCTACTTGCA GCCCGCAAAA CCCACGCCAC TAGCTTTGAT GCTGGTGATG TTCCTCACTT TCTGTCGGCA GAAGACCATC CCGCCCAGCG AGATCCCCAC TGGCGCTGTG CTCCCGTCCC TGACGACGTC CAGGATCGTC GCGTGGAAAT AACGGGACCA GTGGATCGGA AGATGGTCAT CAACGGGTTG AATAGTGGAG CATGTGTGTA CATGGCGGAC TTTGAAGACT CCACAAGCCC TACATGGTTT AACGTGATTG ACGGACAGTT GAATTTGCGC GACGCAGTCC GCGGTACCAT CGCGTTCACC AACGCTGCCG GAAAGGTGTA CACAGTGCAG CATGCGAGTC GTCCAGCCAC GCTCTTTGTC CGTCCTCGTG GCTGGCATTT GGATGAGGCA CACGTCACGG TCAACGGCAA GGTCGCGTCG GGGTCCCTCT TCGATTTTGC CATGTACTTT TTCCACAATG TACACCATTT GAAGGAGAAG GGTACCGGGC CCTATTTCTA TTTGCCCAAA TTGGAGTCTC ACAAGGAGGC TGCTCTATGG AATGATGTCT TTGTCGCAGC GCAACAGTTT ATGGGAGTGC CTATCGGTAC CATTCGGGCA ACGGTTTTGT TGGAGACGAT TACAGCTGCT TTTGAAATGG AAGAGATTCT GTACGAACTG CGTGATCATT CTCTGGGTTT GAATTGCGGT CGTTGGGATT ATTTATTTTC GTTCATTAAA AAATTTAAGC ATCATACGGA CAAGTTGACT CCGGATCGAA ATCACCTGAC CATGACTACG CCCCTGATGG AGGCGTACGT CAAGCGACTC ATCTACATTT GTCACAAGCG GGGAACTTTT GCAATGGGCG GTATGAGTGC ATCAATTCCC ATCAAGAATG ATCCAGCAGC CAACGATGCT GCCATGCAAA AGGTTGCCGA CGATAAGTTG CGCGAAGTGA CAGCTGGACA CGATGGCTCC TGGGTTGCTC ATCCCGCCTT GGTCAAGGTG GCTAAGGATG TGTTTGATGA ACACATGCTG ACTCCGAATC AAATTACATC CAAGCCGGGC TATGTTGGAT CCTCCATAAA CGAGCAAGAT CTGCTGCGCT TGCCACCGAT CCCGCACGGA AAAGCCATCA CAAGTGAGGG TCTAGCTCGA GGCGTTGGTA TCGTATTGGC GTATACCGAA GCCTGGTTGC GCGGCATCGG GTGCATTCCC TTGCACAATG CCATGGAAGA TGCCGCTACG GCAGAAATTA GCCGAGCCCA AATTTGGCAA TGGCGCAGCC AGAAAGCGTC GACACAAGAT GATAATAGGC CAATCACGGC GTCTCGTGTG GCCGCATTGG TACAGCAGGA GGTAGACCGT CAATGCAATG GTGTTGCTGG AAAGTCCAAG GGCAAATGGC GACTTGCCGG TAATTTGGTG GAAAACATGC TCAACAAGGA TGAATTGGAC GACTTTTTGA CATCTGTTTG CTATCCACAC ATTGTCACAA CAGCATACGA TGATGGTCGA ATCGCCAAGC TGTAGGAACC AGAACGGAGA GTCGTTGTTT TCGATGGAAC GGTAAATTGC TGGCAAGAAA CTTGTGGACT TATTGACTAT GAGGTATTGT TTTACAACAT TAA
|
Protein sequence | MIEFRSEQVH VRVHAPANKA AEEMLTPDAL RLLGLLCERF DVRRQALLAA RKTHATSFDA GDVPHFLSAE DHPAQRDPHW RCAPVPDDVQ DRRVEITGPV DRKMVINGLN SGACVYMADF EDSTSPTWFN VIDGQLNLRD AVRGTIAFTN AAGKVYTVQH ASRPATLFVR PRGWHLDEAH VTVNGKVASG SLFDFAMYFF HNVHHLKEKG TGPYFYLPKL ESHKEAALWN DVFVAAQQFM GVPIGTIRAT VLLETITAAF EMEEILYELR DHSLGLNCGR WDYLFSFIKK FKHHTDKLTP DRNHLTMTTP LMEAYVKRLI YICHKRGTFA MGGMSASIPI KNDPAANDAA MQKVADDKLR EVTAGHDGSW VAHPALVKVA KDVFDEHMLT PNQITSKPGY VGSSINEQDL LRLPPIPHGK AITSEGLARG VGIVLAYTEA WLRGIGCIPL HNAMEDAATA EISRAQIWQW RSQKASTQDD NRPITASRVA ALVQQEVDRQ CNGVAGKSKG KWRLAGNLVE NMLNKDELDD FLTSVCYPHI VTTAYDDGRI AKL
|
| |