Gene PHATRDRAFT_34716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34716 
Symbol 
ID7200181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp209352 
End bp210545 
Gene Length1194 bp 
Protein Length397 aa 
Translation table 
GC content51% 
IMG OID 
Productnad-dependent epimerase/dehydratase 
Protein accessionXP_002179157 
Protein GI219116725 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTT CCAAGCAGCT TCCACAGGAC ATCGACATCC CTTCTTCCAC TTCCGCAACC 
GACAGTGAGA CGGAGTCAAT CTCGACGGGC TCTTACTCCT GGGAAACTAC CGACATTGTG
GAACCCCACG TCGGATACAA GAAGGTCTTG GTCACGGGAG GAGCCGGTTT TATCGGCAGT
CACGTTGCCG ATGTGCTGTT GGAGCGCGGT GATGATGTGG TTATTATCGA CGAGATGAAC
GACTATTACA GCCTCGATAT TAAACAAAGC AACTTGAAAC TGTTACAGAG CAAGTACGGC
AATGATCGTC TGAAAATCTA CTTTGGCGAC GTTTGCGACG AAGAGTTGGT CACAAACATT
TTCGAAACCG AGCATCCTAC GTGGGTCTGC CACATGGCGG CACGCGCTGG TGTCCGTCCG
TCAATTCAAG ATCCCTACGT CTACATTCAC TCCAATATCA AAGGAACAAC GCGGCTGATG
GAGTTATCGG CCAAATATGG CGTTCAGAAC TTTGTCTTCG CCAGTTCATC TTCTGTGTAC
GGAGGTTCCA AGTCAACCTT CTTTTCCGAA GACGAAGTGG TGGACAATCC CGTTTCACCT
TACGCCGCCA GTAAAAAAGC CTGCGAGCTG TTGGCGTACA CGTACCATCA CTTGTACAAT
CTCAACACGA CTGGCTTGCG ATTCTTCACC GTCTATGGCC CCCGCGGTCG ACCGGACATG
GCGCCCTTCA AATTTATTGA TCGAGTCAGT CGTGGCGTTG AAATTCAACA ATTTGGTGAC
GGTTCGTCTT CTCGTGACTA CACTTACATA TCCGATATTG TCGACGGCGT GGTACGGGCA
ATCGACCGAC CTTACCGGTA TCAAATATTC AATCTTGGTA AAGGCAGCGG CACATCTCTC
CGGGAATTTA TTGATCTGGT GCAAAAGCAC GTGGGCCAGA AAGCCAAGAT TAAGATTCTC
CCCGATCAAC CAGGTGACGT ACCATACACG TGTGCGGATG TCAGCAAGGC AGCTCGGTTG
CTTGGCTACG AGAGCGAAGT ATCTTTTGAG GATGGTATAC GGCTTACGGC TGAGTGGTAC
AAGGATGCGT ACGCACACCG CCAGCTTCAG GTGTGTCCCG AAACGCAGGC CAACGGGTTG
GGTCGGGCAC CGAGTATGGT GCAATTGTTT GAAGGCGTCA GTGCTTCTAC ATGA
 
Protein sequence
MNISKQLPQD IDIPSSTSAT DSETESISTG SYSWETTDIV EPHVGYKKVL VTGGAGFIGS 
HVADVLLERG DDVVIIDEMN DYYSLDIKQS NLKLLQSKYG NDRLKIYFGD VCDEELVTNI
FETEHPTWVC HMAARAGVRP SIQDPYVYIH SNIKGTTRLM ELSAKYGVQN FVFASSSSVY
GGSKSTFFSE DEVVDNPVSP YAASKKACEL LAYTYHHLYN LNTTGLRFFT VYGPRGRPDM
APFKFIDRVS RGVEIQQFGD GSSSRDYTYI SDIVDGVVRA IDRPYRYQIF NLGKGSGTSL
REFIDLVQKH VGQKAKIKIL PDQPGDVPYT CADVSKAARL LGYESEVSFE DGIRLTAEWY
KDAYAHRQLQ VCPETQANGL GRAPSMVQLF EGVSAST