Gene PHATRDRAFT_41531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41531 
Symbol 
ID7199384 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp95315 
End bp97000 
Gene Length1686 bp 
Protein Length514 aa 
Translation table 
GC content55% 
IMG OID 
Productnad-dependent epimerase/dehydratase 
Protein accessionXP_002185484 
Protein GI219130674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCGCT TGCAGGCTGC GCATCATCAT CCTCCACCAG GGCAGCACAA TCCCCGCAAT 
GCCGAAATAA GTCGTCGGTT GGGAACCCCG AACGGTAGCA GTACCAACAA TACCAACACC
AACAACGGCA GCCACGCCGC CATGTTTTGC AGTCGTCAGA GTATCTTGAC GCTGACGGTG
GGAGTGTTGG TGGGTTACAT TCTCTTACCC GTGTTGTTGG TGGAGATGGA GTTGCAGGAC
CTGATGGGAG CACTGCCGGA CTGGGAAACG ACCGGCAGTG GGCCGTACGC CCGCCCGTCG
CCTCCGGAGA TGCTGCACCA GGGTCTCTAC AGTACCGCAC GCATTCCGGC ATCCTTGTCC
GAGACGCCGC GTTTGCGCAG TGTCGAGCAG GAACCGCCGC GAACGCTCGA AGCTGCTCCG
CGGGACTTTG CCGGTGTCGT GGCGGAAGGC GTCACCGATA TCGAAAAACG AATCGTGCAA
GATCACGATC TGCTCGGGAG ACAGTCCTTG CCCACCGCAA CGACGCCCTA CATTATGCCC
ACCAAGGTCT TGCCCGATCA TCAACGCAAG AAGATTCTCG TCACGGGAGG AGCCGGATTC
GTCGGCAGTC ATCTCGTGGA TAAACTCATG ATGGATGGGA TGGAAGTCAT TGTCGTGGAT
AACTTCTTTA CCGGACAAAA GAAAAACGTG GCGCACTGGT TGCATCATCC CAATTTCAGG
TAAGCACCAT TACAAGAAGG GAGGAAGTAT GGAGCTAGCA TCGAACCTCC AAGGACACGC
ATACAAAGCT TTCCATGGCA CACACAAACA CACACTCACA ACACACCGTT TGTGCATGCT
GTTATTCATA CACCGAACAG TCTCGTGGTG CACGATGTCA CCGAACCAAT CCAACTCGAA
GTAGACGAAA TCTACCACTT GGCCTGTCCG GCGTCACCTC CGCATTACCA GTACAATCCG
GTCAAAACCA TCAAAACGTC CACCATGGGG ACCCTCAACA TGCTCGGACT CGCCAAACGC
GTCCGCGCCA AGATTCTACT CACCAGCACC TCCGAAATAT ACGGCGATCC CAAGGTACAC
CCACAGCCCG AATCCTACTG GGGAAACGTT AACACCATCG GACCCCGCTC CTGCTACGAC
GAAGGCAAAC GCGTGGCCGA GACCATGATG TACAGTTACA AGAACCAAAA CGGCGTCGAC
GTCCGCGTCG CACGGATATT CAACACCTTT GGTCCGCGCA TGCACCCCAA TGACGGACGC
GTCGTTTCCA ACTTTATCAT ACAAGCCCTG CAGAACAAGA ACATGACTAT TTACGGCGAA
GGCAAACAAA CACGATCCTT CCAGTACGTT ACCGATCTCG TCGACGGTCT CTACGCGCTC
ATGAACGGCA ATTACGATCT TCCCGTCAAT CTCGGCAATC CGGAAGAATA TTCCGTCAAG
GACTTTGCCA CCTACATTCA AGAACTCACC AAGAGTACGT CGGACATTAT CTTCTTACCC
AAATCCGAGG ACGACCCCTC CCAACGTCGA CCGGATATCA CCACGGCCAA GCGAGAACTG
GGCTGGGAAC CCCAGGTCAA GGTACAAAAA GGCTTGGAAA AGACCATTGA ATACTTTGCC
CGTGTTTTGG AAAGTGCGGG GGAAATCATT CCGACCGGAC CCGGGGCCGC CAAGCCCGAA
GCCTAA
 
Protein sequence
MVRLQAAHHH PPPGQHNPRN AEISRRLGTP NGSSTNNTNT NNGSHAAMFC SRQSILTLTV 
GVLVGYILLP VLLVEMELQD LMGALPDWET TGSGPYARPS PPEMLHQGLY STARIPASLS
ETPRLRSVEQ EPPRTLEAAP RDFAGVVAEG VTDIEKRIVQ DHDLLGRQSL PTATTPYIMP
TKVLPDHQRK KILVTGGAGF VGSHLVDKLM MDGMEVIVVD NFFTGQKKNV AHWLHHPNFS
LVVHDVTEPI QLEVDEIYHL ACPASPPHYQ YNPVKTIKTS TMGTLNMLGL AKRVRAKILL
TSTSEIYGDP KVHPQPESYW GNVNTIGPRS CYDEGKRVAE TMMYSYKNQN GVDVRVARIF
NTFGPRMHPN DGRVVSNFII QALQNKNMTI YGEGKQTRSF QYVTDLVDGL YALMNGNYDL
PVNLGNPEEY SVKDFATYIQ ELTKSTSDII FLPKSEDDPS QRRPDITTAK RELGWEPQVK
VQKGLEKTIE YFARVLESAG EIIPTGPGAA KPEA