Gene PHATRDRAFT_16140 details

Gene Information

Locus tag: PHATRDRAFT_16140
Symbol: hemE
ID: 7198277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 201081
End bp: 202181
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table:
GC content: 54%
IMG OID:
Product: uroporphyrinogen decarboxylase
Protein accession: XP_002184319
Protein GI: 219128227
COG category:
COG ID:
TIGRFAM ID:
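
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are inclusive and that the gene length counts one terminal stop codon that is not translated. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 201081
end_bp = 202181
gene_length_bp = 1101
protein_length_aa = 366

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no residue.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(expected_protein_length == protein_length_aa)  # True
```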


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.943071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCGCAGAACG ATCTCCTGTT GCGTGCCGCA GTCGGAGAGA AAGTCGAACA AACACCGCTG 
TGGCTCTTCC GTCAAGCCGG TCGGCATCTT CCGGAATATC AGGACTACAA GGCGCAAACG
AACAAGAACT TTTTGGAACT CCTGGCGTCT CCCGCCTGCG TAGCAGAATG TACCATGCAA
CCCATCCGTC GGTACGATTT GGATGCGGCT ATTTTGTTTT CCGATATTCT GGTCGTCCCG
GAGGCACTCG GGATCCAAGT CACCATGCCC GGAGGCGTCG GGATTCTCGT TCCCGAGCCA
CTCACGTCGC CGGAAGAAGT ACACACGCGA CTCCCCTCCA TCGACCAGAT TACTCCCGAC
TTTGTGCAAA CTAAGCTCGC GCACGTCATT GAAGCAGTCC GGACGATTCG CACGCAAATG
GCGGAAGAAA ACAAATCCAT TCCCTTGATT GGGTTTTCCG CAGCCCCCTG GACACTCATG
TACTACATGG TGGGTGGGAG TTCCAAAAAG AATACCGAGC TCGGTGTGAC TTGGTTGGAG
GACTATCCGG AGGCGTCTGG AGACCTGTTG GCGCTCTTGA CCAAAATTGT GGTGGAATAC
ATGGACGCGC AAGTACTGGC CGGAGCACAC GTGTTGCAAG TCTTTGAAGC CATGGGTATG
ATGATTGACG ACGTGAACTT CGAAAAACAC GCGTTGCCGT GTTTGCGAAC CATAGCGCAA
GAGCTTAAAA CACGCCATCC GGATATTCCG CTCATGGTGT TTTGTCGGGG TGCCTGTCAC
CTGAACAACC AACTGGTTGG CCTAGGATAC GATGTCATCA CGATGGACGG CAGTGTGGAC
CGCACTACGG TAAGGCAGCA ACTAGGCAAC ACTGTCACGT TACAGGGCAA CTACGATCCG
GCGGAACTTA TTGAAGAAAA CGGCAAAACG GTCGAGACGG TCCGAGCGAC TGCGAAAAAA
TTGCTGCAGG AGCTGGGACC CCAGCGACTG ATCGCCAATC TAGGTGAAGG GCTGGGTGGG
AAAGAAAGCC CGGAACTTGT GGACGCCTTC GTCAAGGCGA TTCACGAGGA GAGCGCCGCC
ATGATTCTTC AAGATAGCTA G
 
Protein sequence
PQNDLLLRAA VGEKVEQTPL WLFRQAGRHL PEYQDYKAQT NKNFLELLAS PACVAECTMQ 
PIRRYDLDAA ILFSDILVVP EALGIQVTMP GGVGILVPEP LTSPEEVHTR LPSIDQITPD
FVQTKLAHVI EAVRTIRTQM AEENKSIPLI GFSAAPWTLM YYMVGGSSKK NTELGVTWLE
DYPEASGDLL ALLTKIVVEY MDAQVLAGAH VLQVFEAMGM MIDDVNFEKH ALPCLRTIAQ
ELKTRHPDIP LMVFCRGACH LNNQLVGLGY DVITMDGSVD RTTVRQQLGN TVTLQGNYDP
AELIEENGKT VETVRATAKK LLQELGPQRL IANLGEGLGG KESPELVDAF VKAIHEESAA
MILQDS
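
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last lines of the gene sequence. This is a minimal Python sketch, not the method used to produce the record. It assumes the standard genetic code (NCBI translation table 1), because the record leaves the translation table field blank. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Assumption: standard genetic code (NCBI translation table 1), built from
# the conventional TCAG-ordered amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # Remove the 10-base spacing and any line breaks used in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence in this record.
first_bases = "CCGCAGAACG ATCTCCTGTT GCGTGCCGCA"
last_bases = "ATGATTCTTC AAGATAGCTA G"

print(translate(first_bases))  # PQNDLLLRAA
print(translate(last_bases))   # MILQDS
```

The two outputs match the first ten and last six residues of the protein sequence, and the final codon TAG is a stop codon, which is why it contributes no residue.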