Gene PHATRDRAFT_19188 details


Gene Information

Locus tag: PHATRDRAFT_19188
Symbol: HemE_2
ID: 7197643
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 1162567
End bp: 1165840
Gene Length: 3274 bp
Protein Length: 409 aa
Translation table:
GC content: 47%
IMG OID:
Product: uroporphyrinogen decarboxylase
Protein accession: XP_002178653
Protein GI: 219115715
COG category:
COG ID:
TIGRFAM ID:
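
The listed gene length agrees with the start and end coordinates if those are read as 1-based and inclusive, which is an assumption here and not something the record states. A minimal sketch of that arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information fields above.
start_bp = 1162567
end_bp = 1165840
protein_length_aa = 409

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Nucleotides needed to encode the protein: one codon per residue plus a stop codon.
coding_length_nt = (protein_length_aa + 1) * 3

print(gene_length_bp)    # 3274
print(coding_length_nt)  # 1230
```

The coding portion (1230 nt) is much shorter than the 3274 bp span, which fits a gene marked as spliced whose listed sequence also includes non-coding regions.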


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAACATTGCC TACTCTCGTG CTTGTACAAA TCTTACTTTG TACGTATCCA TCGATCACTC 
AAGCAATCGT TTTACAGTTC TTTGTACAAT TCTGTTGCTT GCAATTATGA AAGTGTCCCA
CGTCGCTTTC AGCCTGTTCC TGGCTGGTTC CCAAACGACT GCCTTTACTA ATCCGGTATC
CAACTCGGCG AAACAACAAT CGTTCCGTCT TCCAAGTTCG GCATCGTCGG CAGCGGATAC
GACCTCCTCG GCCAATGTCG CTCAAAAGAG CACGCAAGAT CCGTTGCTTA TTCGGGCGGC
TCGTGGTGAA AAAACAGAAC GCACTCCTGT TTGGATGATG CGACAAGCTG GTCGCCACAT
TGCGGAATAC CGGGACTTGT GCAAAAAGTA CCCCACCTTT CGCGAACGTT CCGAAATTCC
GGAAGTCGCC GTTGAAGTCT CGCTGCAGCC TTGGCGCAAT TACCAGACTG ATGGATGCAT
TCTCTTTTCC GATATTTTGA CTCCTTTGCC AGGTATTGGA TGTGAATTTA CCATTGATGA
AAAAGTTGGT CCTCTTATGG AACCGATGCG CTCCTACGAT GACATCAAAA AGGTACGGAA
TGACTAGCGA CACGAATGGC TTCTTGCTTA ATAGTGTTAA TATAGTCTTA CATTTTCTAT
TCGTGTTTTG TGTAGATGCA CCCCATGGAC CCATACAAGT CCACGCCATT TGTCGCCGAA
GCTCTCAAGG CTTTGCGCCA AGAAGTTGGT CCGGAAACTG CTGTATTAGG CTTTGTTGGA
TGCCCTTATA CACTGGCAAC ATACATTGTG GAAGGCAAAA CATCCAAGGA ATACCTGGAA
ATTAAGAAAA TGGCCTTGAA CGAACCTGAT CTTCTTCACG CTATTCTGCA ACAATTGGCG
GACAACATTG GAGACTACGC CCTTTTCCAG ATTGAGAACG GCGCCCAGCT GATTCAGATT
TTTGATTCTT GGGCCGGACA TCTATCACCC CGGGACTACA ATACTTTTGC AGCTCCTTAC
CAAAAGCAAA TTCTTGACAA AATTAAGGAA AAGTACCCTG ACGTCCCGAC GGTCATTTAC
ATCAAGCACT CAGGTGCATT GATTGAACGC ATGGCGGCGA CGGGCGTAGA TGTCGTTTCG
TTAGACTGGA CTGTCGATAT GGCGGACGGG CGTGATCGCA TCGAGGCCGG ACGCAAGTCC
GCTGGTCTTG AAGGACGTGG GGGTGTACAA GGAAACTTGG ATCCAGCCGT TTTATTCGCC
AATCACGATG TTATTGAAGA ACGAGCGATT GAAATTCTCA AAAAAGCAGG CAGCGTTGGG
CATGTCATGA ACTTGGGGCA TGGTATTGAA GCCGCTACAC CAGAAGAAAA CGCACACCAT
TTTATTGAAA CTGTCAGAGG CTATCGTCAT GAGGAGTAGA CATTCATTAA ATATAGATTG
GTTGATCATT ATTGTATAAC TTCAAAATAG TACAGCGATA AACACACTCT TTGGCTTCTG
AAGGTTGTCT TAAACCATAA GTCTTACTTC GTTGTCTGTT CCACGAACTC CGGCTCCGTC
TTCTCTGTGA AAGTCGGCGA TCCAAACCGA GCCTAGGGAT TCTAGTGCTG CAGCATAGCT
TGACACGTCT TTTTCACGTT TGAATGGAAG AGCGAGCCCT CCTTGCCAAG TGCTGAACCG
ATATGCCGAA ACTGTATGTT TCGAAGGATG AATTCGGATC GAATGACCTA CGTGACAAGC
ATGGCTGCTG CTCGAATGAA ATTTGGAATG ACCGCACTCT AATCGAAATA CCTTGAGATT
CGACAGTGAA TTCACGCTCT TTCGAAAGTC GTCCAATCCG ACTTTGGGAC TTGCTGAGAC
TAAGCTGCCT ATACGCAATT CCGGCCGTGC GGATGCATAC CGGTAAGAGG CCAATGTAGC
CAAACCAGCG GCGTACCCGT GTCCTGAAAA GACCACGTCA TAAAAAGGGT TCTCTTCAAC
AAGTTGATCT ATCAAGGTAA ACAGCTTTCC TTCAATTTCG CATAGCGCCT TGTACCGATC
AATGAAGACA CTAACGTGTA TTCCAGTATC CTTCAGAATG GTTACATTGG TGTCTTTGGG
AGCTTTTCCT AGTTGTTCGG TATTCGTACC GCGAAACACG CAAAGAAATT GCCGTTGCCT
CTCAACAATC AAAACTTCTC GGCGGCTGCT TGGATCGCTG TTAGACACGT CCGAGGTGAG
AAAGCCACAG ATTTGCTGAA TGTCACTGCC GCACTCCAAA AGTGACTCCA CGATTGGTCG
GTGACAATCA GCTATCTTTC CTGTAAGACT ACCATAGCTA CCACGAGCAA ACGAAGCAAG
CTCCATCCGA CTTGCTCCTG TTAAGTCATT TAGATTCAGC AAACATTCCT CTTTTTGATA
TGTCTCAAGA TATTGAGAGA GTCTGATGCA AACCTGGAAC TCAGTCTTGG CTGGTATGCT
AACTTGCCAT TCAGCTGCTT CGTATTCAGG GGTACGAATG GTTTCCATAC CGGTCAAGAA
CGATGTTGGA CGCGGCTTGG CAAGTGATGG TATGGCAAGG GAATCCTTCA CTTTCGCTGT
CTTGTCGAAT TCTTTCTGCA CGTCCCACAA TTCTTGAAGC AATGGAGACG TGGGGTTCGC
TTTGATGCTA GCACCGGTAG ATCTGCCTCT AGTCGCTGTG GCTGGCTGTT TGTTTTGTGT
TCGTAGACCC TTTGATGTAC TAGTCGCAGT TGCAAAATAA TCGGTCTTGC GATTTGTTTT
TCTTTGCCAT GGAAGAAATG TTCCTTTTTC CTTTTTCACG GGGCTTGTGT TTTTAGGAAT
CACTTTCTGA TTATCATGTC GGATGGTGGT CAGAGTCTGA GGCTCTAACT GAGAACTTGA
GTTGCTAGAC GATAGTTTGC TTAGTTCCAT TGCTAAGACG CGGAGTTCCG AGGCCATATC
CGGGGGAGTG AATGGACGAT TATCCGGTGA TCCTTCCCCA TCTCGAGACT CTTCTGGCTG
ACAAGGTGGT GTGACGCTTC CTGAGAGGTG CGTCGACCGA GCGGCTACGG GATGCTTTGT
TTTCGTGGAG AAAAAGGGGT CAAACCCAGC TAAAGCGTTA TTCTGATATT TGGTGCTTCC
TATCGGGACG GTGTTGTGTC CAGCTAGGTG ACTGCTGTTT GAAATAGGCA TGGTGGTCGG
CTGTCTTCAA CTGTCCTTTG TTCAAAAAAT TAGTGATGCG ATAAGAGCAA CACTGGTTGG
TACCTGGGAT CAAATCGGGC GTGATTAATA AAGG
 
Protein sequence
MKVSHVAFSL FLAGSQTTAF TNPVSNSAKQ QSFRLPSSAS SAADTTSSAN VAQKSTQDPL 
LIRAARGEKT ERTPVWMMRQ AGRHIAEYRD LCKKYPTFRE RSEIPEVAVE VSLQPWRNYQ
TDGCILFSDI LTPLPGIGCE FTIDEKVGPL MEPMRSYDDI KKMHPMDPYK STPFVAEALK
ALRQEVGPET AVLGFVGCPY TLATYIVEGK TSKEYLEIKK MALNEPDLLH AILQQLADNI
GDYALFQIEN GAQLIQIFDS WAGHLSPRDY NTFAAPYQKQ ILDKIKEKYP DVPTVIYIKH
SGALIERMAA TGVDVVSLDW TVDMADGRDR IEAGRKSAGL EGRGGVQGNL DPAVLFANHD
VIEERAIEIL KKAGSVGHVM NLGHGIEAAT PEENAHHFIE TVRGYRHEE
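
Both sequences above are printed in space-separated blocks of 10 characters, so they need whitespace removed before most tools will accept them. The following is a rough sketch of that cleanup plus a simple GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Final line of the gene sequence and final line of the protein sequence above.
gene_tail = clean_sequence("TACCTGGGAT CAAATCGGGC GTGATTAATA AAGG")
protein_tail = clean_sequence("VIEERAIEIL KKAGSVGHVM NLGHGIEAAT PEENAHHFIE TVRGYRHEE")

print(len(gene_tail))     # 34
print(len(protein_tail))  # 49
```

With 60 characters on each full line, the gene sequence comes to 54 × 60 + 34 = 3274 bases and the protein to 6 × 60 + 49 = 409 residues, matching the Gene Length and Protein Length fields. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the listed GC content.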