Gene PHATRDRAFT_52208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_52208 
SymbolADH_1 
ID7202502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp337686 
End bp338747 
Gene Length1062 bp 
Protein Length353 aa 
Translation table 
GC content51% 
IMG OID 
Productalcohol dehydrogenase 
Protein accessionXP_002181532 
Protein GI219122397 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCTG TTCGCTACCA CGGCGCAAAC GTTGCGTTGA CGGTCGAATC GATTCCCAGG 
CCCACAAACT TGGCCGACAA CGACGTCCTA ATCCAGGTTC AAGCAGCCGC GTTGTGCCAC
ACCGAGCTGC ACTTTGCCGA CGGTACGCTA AATCTAGGGG TTGCTCCCAT GACACTAGGG
CACGAAGCTT GTGGAATAGT AATCCAAGTT GGTAACAGCG TTCCCGATAC AAGAATTGGC
GAACGTGTCA TTCTGTATTA CTACGTTGGT TGCGGATCCT GCCGATGGTG TCTGCAAGGT
GACGAGCAAA TTTGTGGATC ACTGCAGGCC GAATTTGGCT TTATAAGCGA CGGTGGTCTA
GCAGAATACA TCAAGGCGCC TTCTCGTAAC GCCGTGCCGC TACCTAGCAA TATTTCTTTC
GTCGACGCCG CCCCCATTGG TTGTGGTGTA ACGACGGCGG TCCACGCGAG CAAAATAGGA
AGGGTCCAGA AAGACGATTG GTGTTTGGTA TATGGCGTAA ATGGCGTTGG TTTCGGTCTC
ATACAGCTTC TGAAAAATCA TTACGGTGCC AAAGTGATCG CTGCGACCCG TTCTCCAGCC
AGACGGAAAC TGGCGCTCGA ACTGGGCGCC GACGTATCCA TTGATACTAC AGATTCCTCG
ACTGTGGCCA AAGCAGTCCA CCAAGCAACA TATGGGGCTG GTGCAGATGT CATCTTTGAG
TGCGTTGGAC GGCGTGAAAC AATGGATGCG TGCGTTGGCT GGGACGGTGC GTTAGGTAAA
CGTGGTCGTT TGGTTTTAGT CGGATACGAG GCTGGAAGTG AGCACGAATT TCGATGCCAT
CCGATTCCAA TGATTGTACA AGAGCAATCC GTTTGCGGTA GTGTTGGTGC TACTCTCAAT
GATCTCAAGG AGGCACTTGA ATATGTTTCC TCTGGAAAGG TCAAAACCAT TGTGGACAGC
CTCCTTTCCT TGCAGGATTT TCAGCGTGGC ATAGATAAAA TCAAATCATG CGACTGCATC
GGAAAAATTG TTTGCCGACC CGCAGAAACC TCATTTGGCT AG
 
Protein sequence
MQAVRYHGAN VALTVESIPR PTNLADNDVL IQVQAAALCH TELHFADGTL NLGVAPMTLG 
HEACGIVIQV GNSVPDTRIG ERVILYYYVG CGSCRWCLQG DEQICGSLQA EFGFISDGGL
AEYIKAPSRN AVPLPSNISF VDAAPIGCGV TTAVHASKIG RVQKDDWCLV YGVNGVGFGL
IQLLKNHYGA KVIAATRSPA RRKLALELGA DVSIDTTDSS TVAKAVHQAT YGAGADVIFE
CVGRRETMDA CVGWDGALGK RGRLVLVGYE AGSEHEFRCH PIPMIVQEQS VCGSVGATLN
DLKEALEYVS SGKVKTIVDS LLSLQDFQRG IDKIKSCDCI GKIVCRPAET SFG