Gene PHATRDRAFT_40988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40988 
Symbol 
ID7198826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp70547 
End bp71617 
Gene Length1071 bp 
Protein Length356 aa 
Translation table 
GC content57% 
IMG OID 
Productenyol-coa hydratase 
Protein accessionXP_002185042 
Protein GI219129745 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.569479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGAG CGTTGTTCGG TTGGTCTCGG CATTGCGGGA GTTCCGTTGC GGACGCCGGA 
GTCCCACGCC ACCGACTCGA GCGTGAGCTT GTCCTTCCCC GCTCGTTCCG TACAAGGTTC
TACGCGTCTA CCGGTGAAAC ATCAACAACT CCCAATGCTA GCAAATCCTC GTCGACGGAA
ACGTCACCGT CCGATGAATC CCGCGTCTTG GTGGACGTGG ATCACCAAGG CGTCGCTCGT
GTGTGCCTCA ATAGACCCAC CAAACTCAAC GCACTCGACA TGCCCATGTT CGACGCCGTA
GCCGATACGG CCCTTTCACT ACAAAAGGAT CGAGCCATTC GCGCCGTCAT TCTATCCGGC
TCTGGCCGTG CTTTTTCCGC CGGTCTCGAC GTCGCTTCCG TTCTCTCCAC CAATCCCTTG
AAAAACTCGG AACGTCTCCT CACGCGAGAC GACGTTGACG ACAACAACAA CGACAACAAA
AACAACTACT CGAACGAAAG ACGGTCCATC GCGAATTTGG CACAACGCGT CTCCATGGCC
TGGCGGGATA TCCCGGCACC CGTCGTAGCC TGTCTGCACG GTGAATGTTT CGGGGGCGGT
TTGCAGATTG CCCTAGGGGC CGACGTGCGC TTGGCCACCC CGGATTGCCG CTTGGCGATT
ATGGAAGCCA AGTGGGGATT GATTCCCGAC ATGGGTGCGT CCGTGCTCTT GCGGGAACTC
GTACGCATCG ATGTCGCCAA GGAGTTGACC ATGACGGGAC GGATTGTGGA CGGGAACGAG
GCTGCCGCAC TCGGACTCGT CACGAGGGTC GTCGACGATC CCCTCGAACA AGCCGAAACA
CTCGTACAAG CCTTCCTGCA GAGGTCGCCG GATTCTCTGG CCGCCACCAA ACAACTTTAC
CACCAAACGT GGGTTGCACC GGAAGAGTAT AGTTTAAAGG TGGAAACAGC ACTGCAACGA
AAACTGTTGG TTTCGTGGAA CCAAATGGCC GCCGCGGGTA GGAGCTTTGG CTGGAAGGTT
CCTTACTTTC AACGCAAAGA CGGGACCCTT GATACGGAGA AGAAGTTATA A
 
Protein sequence
MKRALFGWSR HCGSSVADAG VPRHRLEREL VLPRSFRTRF YASTGETSTT PNASKSSSTE 
TSPSDESRVL VDVDHQGVAR VCLNRPTKLN ALDMPMFDAV ADTALSLQKD RAIRAVILSG
SGRAFSAGLD VASVLSTNPL KNSERLLTRD DVDDNNNDNK NNYSNERRSI ANLAQRVSMA
WRDIPAPVVA CLHGECFGGG LQIALGADVR LATPDCRLAI MEAKWGLIPD MGASVLLREL
VRIDVAKELT MTGRIVDGNE AAALGLVTRV VDDPLEQAET LVQAFLQRSP DSLAATKQLY
HQTWVAPEEY SLKVETALQR KLLVSWNQMA AAGRSFGWKV PYFQRKDGTL DTEKKL