Gene PHATRDRAFT_54478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54478 
SymbolMS 
ID7201080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp403450 
End bp405322 
Gene Length1873 bp 
Protein Length553 aa 
Translation table 
GC content51% 
IMG OID 
Productmalate synthase 
Protein accessionXP_002180227 
Protein GI219118921 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0693019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCTAATCCTT AGGCGCCTCA GCCGACTTTA CCCATACCGA GATACTGCCC TCCTTTTCAA 
GACGTCTCTT GCGTCAGACT TTTAACAGCG CTCATCGTTT TTTTACCAAT ACAATGATTG
AATTTCGTTC GGAACAAGTA CACGTTCGCG TGCACGCGCC TGCCAACAAG GCCGCCGAAG
AAATGTTGAC GCCGGATGCG CTACGTTTGC TCGGACTGCT TTGCGAACGC TTCGATGTCC
GTCGGCAGGC TCTACTTGCA GCCCGCAAAA CCCACGCCAC TAGCTTTGAT GCTGGTGATG
TTCCTCACTT TCTGTCGGCA GAAGACCATC CCGCCCAGCG AGATCCCCAC TGGCGCTGTG
CTCCCGTCCC TGACGACGTC CAGGATCGTC GCGTGGAAAT AACGGGACCA GTGGATCGGA
AGATGGTCAT CAACGGGTTG AATAGTGGAG CATGTGTGTA CATGGCGGAC TTTGAAGACT
CCACAAGCCC TACATGGTTT AACGTGATTG ACGGACAGTT GAATTTGCGC GACGCAGTCC
GCGGTACCAT CGCGTTCACC AACGCTGCCG GAAAGGTGTA CACAGTGCAG CATGCGAGTC
GTCCAGCCAC GCTCTTTGTC CGTCCTCGTG GCTGGCATTT GGATGAGGCA CACGTCACGG
TCAACGGCAA GGTCGCGTCG GGGTCCCTCT TCGATTTTGC CATGTACTTT TTCCACAATG
TACACCATTT GAAGGAGAAG GGTACCGGGC CCTATTTCTA TTTGCCCAAA TTGGAGTCTC
ACAAGGAGGC TGCTCTATGG AATGATGTCT TTGTCGCAGC GCAACAGTTT ATGGGAGTGC
CTATCGGTAC CATTCGGGCA ACGGTTTTGT TGGAGACGAT TACAGCTGCT TTTGAAATGG
AAGAGATTCT GTACGAACTG CGTGATCATT CTCTGGGTTT GAATTGCGGT CGTTGGGATT
ATTTATTTTC GTTCATTAAA AAATTTAAGC ATCATACGGA CAAGTTGACT CCGGATCGAA
ATCACCTGAC CATGACTACG CCCCTGATGG AGGCGTACGT CAAGCGACTC ATCTACATTT
GTCACAAGCG GGGAACTTTT GCAATGGGCG GTATGAGTGC ATCAATTCCC ATCAAGAATG
ATCCAGCAGC CAACGATGCT GCCATGCAAA AGGTTGCCGA CGATAAGTTG CGCGAAGTGA
CAGCTGGACA CGATGGCTCC TGGGTTGCTC ATCCCGCCTT GGTCAAGGTG GCTAAGGATG
TGTTTGATGA ACACATGCTG ACTCCGAATC AAATTACATC CAAGCCGGGC TATGTTGGAT
CCTCCATAAA CGAGCAAGAT CTGCTGCGCT TGCCACCGAT CCCGCACGGA AAAGCCATCA
CAAGTGAGGG TCTAGCTCGA GGCGTTGGTA TCGTATTGGC GTATACCGAA GCCTGGTTGC
GCGGCATCGG GTGCATTCCC TTGCACAATG CCATGGAAGA TGCCGCTACG GCAGAAATTA
GCCGAGCCCA AATTTGGCAA TGGCGCAGCC AGAAAGCGTC GACACAAGAT GATAATAGGC
CAATCACGGC GTCTCGTGTG GCCGCATTGG TACAGCAGGA GGTAGACCGT CAATGCAATG
GTGTTGCTGG AAAGTCCAAG GGCAAATGGC GACTTGCCGG TAATTTGGTG GAAAACATGC
TCAACAAGGA TGAATTGGAC GACTTTTTGA CATCTGTTTG CTATCCACAC ATTGTCACAA
CAGCATACGA TGATGGTCGA ATCGCCAAGC TGTAGGAACC AGAACGGAGA GTCGTTGTTT
TCGATGGAAC GGTAAATTGC TGGCAAGAAA CTTGTGGACT TATTGACTAT GAGGTATTGT
TTTACAACAT TAA
 
Protein sequence
MIEFRSEQVH VRVHAPANKA AEEMLTPDAL RLLGLLCERF DVRRQALLAA RKTHATSFDA 
GDVPHFLSAE DHPAQRDPHW RCAPVPDDVQ DRRVEITGPV DRKMVINGLN SGACVYMADF
EDSTSPTWFN VIDGQLNLRD AVRGTIAFTN AAGKVYTVQH ASRPATLFVR PRGWHLDEAH
VTVNGKVASG SLFDFAMYFF HNVHHLKEKG TGPYFYLPKL ESHKEAALWN DVFVAAQQFM
GVPIGTIRAT VLLETITAAF EMEEILYELR DHSLGLNCGR WDYLFSFIKK FKHHTDKLTP
DRNHLTMTTP LMEAYVKRLI YICHKRGTFA MGGMSASIPI KNDPAANDAA MQKVADDKLR
EVTAGHDGSW VAHPALVKVA KDVFDEHMLT PNQITSKPGY VGSSINEQDL LRLPPIPHGK
AITSEGLARG VGIVLAYTEA WLRGIGCIPL HNAMEDAATA EISRAQIWQW RSQKASTQDD
NRPITASRVA ALVQQEVDRQ CNGVAGKSKG KWRLAGNLVE NMLNKDELDD FLTSVCYPHI
VTTAYDDGRI AKL