Gene Information |
Locus tag | PHATRDRAFT_19188 |
Symbol | HemE_2 |
ID | 7197643 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1162567 |
End bp | 1165840 |
Gene Length | 3274 bp |
Protein Length | 409 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | uroporphyrinogen decarboxylase |
Protein accession | XP_002178653 |
Protein GI | 219115715 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | AAACATTGCC TACTCTCGTG CTTGTACAAA TCTTACTTTG TACGTATCCA TCGATCACTC AAGCAATCGT TTTACAGTTC TTTGTACAAT TCTGTTGCTT GCAATTATGA AAGTGTCCCA CGTCGCTTTC AGCCTGTTCC TGGCTGGTTC CCAAACGACT GCCTTTACTA ATCCGGTATC CAACTCGGCG AAACAACAAT CGTTCCGTCT TCCAAGTTCG GCATCGTCGG CAGCGGATAC GACCTCCTCG GCCAATGTCG CTCAAAAGAG CACGCAAGAT CCGTTGCTTA TTCGGGCGGC TCGTGGTGAA AAAACAGAAC GCACTCCTGT TTGGATGATG CGACAAGCTG GTCGCCACAT TGCGGAATAC CGGGACTTGT GCAAAAAGTA CCCCACCTTT CGCGAACGTT CCGAAATTCC GGAAGTCGCC GTTGAAGTCT CGCTGCAGCC TTGGCGCAAT TACCAGACTG ATGGATGCAT TCTCTTTTCC GATATTTTGA CTCCTTTGCC AGGTATTGGA TGTGAATTTA CCATTGATGA AAAAGTTGGT CCTCTTATGG AACCGATGCG CTCCTACGAT GACATCAAAA AGGTACGGAA TGACTAGCGA CACGAATGGC TTCTTGCTTA ATAGTGTTAA TATAGTCTTA CATTTTCTAT TCGTGTTTTG TGTAGATGCA CCCCATGGAC CCATACAAGT CCACGCCATT TGTCGCCGAA GCTCTCAAGG CTTTGCGCCA AGAAGTTGGT CCGGAAACTG CTGTATTAGG CTTTGTTGGA TGCCCTTATA CACTGGCAAC ATACATTGTG GAAGGCAAAA CATCCAAGGA ATACCTGGAA ATTAAGAAAA TGGCCTTGAA CGAACCTGAT CTTCTTCACG CTATTCTGCA ACAATTGGCG GACAACATTG GAGACTACGC CCTTTTCCAG ATTGAGAACG GCGCCCAGCT GATTCAGATT TTTGATTCTT GGGCCGGACA TCTATCACCC CGGGACTACA ATACTTTTGC AGCTCCTTAC CAAAAGCAAA TTCTTGACAA AATTAAGGAA AAGTACCCTG ACGTCCCGAC GGTCATTTAC ATCAAGCACT CAGGTGCATT GATTGAACGC ATGGCGGCGA CGGGCGTAGA TGTCGTTTCG TTAGACTGGA CTGTCGATAT GGCGGACGGG CGTGATCGCA TCGAGGCCGG ACGCAAGTCC GCTGGTCTTG AAGGACGTGG GGGTGTACAA GGAAACTTGG ATCCAGCCGT TTTATTCGCC AATCACGATG TTATTGAAGA ACGAGCGATT GAAATTCTCA AAAAAGCAGG CAGCGTTGGG CATGTCATGA ACTTGGGGCA TGGTATTGAA GCCGCTACAC CAGAAGAAAA CGCACACCAT TTTATTGAAA CTGTCAGAGG CTATCGTCAT GAGGAGTAGA CATTCATTAA ATATAGATTG GTTGATCATT ATTGTATAAC TTCAAAATAG TACAGCGATA AACACACTCT TTGGCTTCTG AAGGTTGTCT TAAACCATAA GTCTTACTTC GTTGTCTGTT CCACGAACTC CGGCTCCGTC TTCTCTGTGA AAGTCGGCGA TCCAAACCGA GCCTAGGGAT TCTAGTGCTG CAGCATAGCT TGACACGTCT TTTTCACGTT TGAATGGAAG AGCGAGCCCT CCTTGCCAAG TGCTGAACCG ATATGCCGAA ACTGTATGTT TCGAAGGATG AATTCGGATC GAATGACCTA CGTGACAAGC ATGGCTGCTG CTCGAATGAA ATTTGGAATG ACCGCACTCT AATCGAAATA CCTTGAGATT 
CGACAGTGAA TTCACGCTCT TTCGAAAGTC GTCCAATCCG ACTTTGGGAC TTGCTGAGAC TAAGCTGCCT ATACGCAATT CCGGCCGTGC GGATGCATAC CGGTAAGAGG CCAATGTAGC CAAACCAGCG GCGTACCCGT GTCCTGAAAA GACCACGTCA TAAAAAGGGT TCTCTTCAAC AAGTTGATCT ATCAAGGTAA ACAGCTTTCC TTCAATTTCG CATAGCGCCT TGTACCGATC AATGAAGACA CTAACGTGTA TTCCAGTATC CTTCAGAATG GTTACATTGG TGTCTTTGGG AGCTTTTCCT AGTTGTTCGG TATTCGTACC GCGAAACACG CAAAGAAATT GCCGTTGCCT CTCAACAATC AAAACTTCTC GGCGGCTGCT TGGATCGCTG TTAGACACGT CCGAGGTGAG AAAGCCACAG ATTTGCTGAA TGTCACTGCC GCACTCCAAA AGTGACTCCA CGATTGGTCG GTGACAATCA GCTATCTTTC CTGTAAGACT ACCATAGCTA CCACGAGCAA ACGAAGCAAG CTCCATCCGA CTTGCTCCTG TTAAGTCATT TAGATTCAGC AAACATTCCT CTTTTTGATA TGTCTCAAGA TATTGAGAGA GTCTGATGCA AACCTGGAAC TCAGTCTTGG CTGGTATGCT AACTTGCCAT TCAGCTGCTT CGTATTCAGG GGTACGAATG GTTTCCATAC CGGTCAAGAA CGATGTTGGA CGCGGCTTGG CAAGTGATGG TATGGCAAGG GAATCCTTCA CTTTCGCTGT CTTGTCGAAT TCTTTCTGCA CGTCCCACAA TTCTTGAAGC AATGGAGACG TGGGGTTCGC TTTGATGCTA GCACCGGTAG ATCTGCCTCT AGTCGCTGTG GCTGGCTGTT TGTTTTGTGT TCGTAGACCC TTTGATGTAC TAGTCGCAGT TGCAAAATAA TCGGTCTTGC GATTTGTTTT TCTTTGCCAT GGAAGAAATG TTCCTTTTTC CTTTTTCACG GGGCTTGTGT TTTTAGGAAT CACTTTCTGA TTATCATGTC GGATGGTGGT CAGAGTCTGA GGCTCTAACT GAGAACTTGA GTTGCTAGAC GATAGTTTGC TTAGTTCCAT TGCTAAGACG CGGAGTTCCG AGGCCATATC CGGGGGAGTG AATGGACGAT TATCCGGTGA TCCTTCCCCA TCTCGAGACT CTTCTGGCTG ACAAGGTGGT GTGACGCTTC CTGAGAGGTG CGTCGACCGA GCGGCTACGG GATGCTTTGT TTTCGTGGAG AAAAAGGGGT CAAACCCAGC TAAAGCGTTA TTCTGATATT TGGTGCTTCC TATCGGGACG GTGTTGTGTC CAGCTAGGTG ACTGCTGTTT GAAATAGGCA TGGTGGTCGG CTGTCTTCAA CTGTCCTTTG TTCAAAAAAT TAGTGATGCG ATAAGAGCAA CACTGGTTGG TACCTGGGAT CAAATCGGGC GTGATTAATA AAGG
|
Protein sequence | MKVSHVAFSL FLAGSQTTAF TNPVSNSAKQ QSFRLPSSAS SAADTTSSAN VAQKSTQDPL LIRAARGEKT ERTPVWMMRQ AGRHIAEYRD LCKKYPTFRE RSEIPEVAVE VSLQPWRNYQ TDGCILFSDI LTPLPGIGCE FTIDEKVGPL MEPMRSYDDI KKMHPMDPYK STPFVAEALK ALRQEVGPET AVLGFVGCPY TLATYIVEGK TSKEYLEIKK MALNEPDLLH AILQQLADNI GDYALFQIEN GAQLIQIFDS WAGHLSPRDY NTFAAPYQKQ ILDKIKEKYP DVPTVIYIKH SGALIERMAA TGVDVVSLDW TVDMADGRDR IEAGRKSAGL EGRGGVQGNL DPAVLFANHD VIEERAIEIL KKAGSVGHVM NLGHGIEAAT PEENAHHFIE TVRGYRHEE
|
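The length and composition fields in this record can be cross-checked against the coordinates and sequence fields. The following is a minimal sketch, not part of the source database: the helper names (`gene_length`, `strip_blocks`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates, which is consistent with the listed Gene Length of 3274 bp.

```python
# Consistency checks for a gene record like the one above.
# All function names are hypothetical helpers, not a database API.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(seq.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    bases = strip_blocks(dna).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Start bp and End bp from the Gene Information table.
    print(gene_length(1162567, 1165840))  # 3274

    # First two blocks of the Protein sequence field.
    print(len(strip_blocks("MKVSHVAFSL FLAGSQTTAF")))  # 20

    # First block of the Gene sequence field only, not the whole gene.
    print(gc_percent("AAACATTGCC"))  # 40.0
```

Passing the full Gene sequence field to `gc_percent` and the full Protein sequence field to `strip_blocks` gives values to compare with the GC content and Protein Length fields. Whether the database computes GC content over the same span is not stated in the record, so a small difference would not necessarily indicate an error.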