Gene Information |
Locus tag | PHATRDRAFT_16140 |
Symbol | hemE |
ID | 7198277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 201081 |
End bp | 202181 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | uroporphyrinogen decarboxylase |
Protein accession | XP_002184319 |
Protein GI | 219128227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.943071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCAGAACG ATCTCCTGTT GCGTGCCGCA GTCGGAGAGA AAGTCGAACA AACACCGCTG TGGCTCTTCC GTCAAGCCGG TCGGCATCTT CCGGAATATC AGGACTACAA GGCGCAAACG AACAAGAACT TTTTGGAACT CCTGGCGTCT CCCGCCTGCG TAGCAGAATG TACCATGCAA CCCATCCGTC GGTACGATTT GGATGCGGCT ATTTTGTTTT CCGATATTCT GGTCGTCCCG GAGGCACTCG GGATCCAAGT CACCATGCCC GGAGGCGTCG GGATTCTCGT TCCCGAGCCA CTCACGTCGC CGGAAGAAGT ACACACGCGA CTCCCCTCCA TCGACCAGAT TACTCCCGAC TTTGTGCAAA CTAAGCTCGC GCACGTCATT GAAGCAGTCC GGACGATTCG CACGCAAATG GCGGAAGAAA ACAAATCCAT TCCCTTGATT GGGTTTTCCG CAGCCCCCTG GACACTCATG TACTACATGG TGGGTGGGAG TTCCAAAAAG AATACCGAGC TCGGTGTGAC TTGGTTGGAG GACTATCCGG AGGCGTCTGG AGACCTGTTG GCGCTCTTGA CCAAAATTGT GGTGGAATAC ATGGACGCGC AAGTACTGGC CGGAGCACAC GTGTTGCAAG TCTTTGAAGC CATGGGTATG ATGATTGACG ACGTGAACTT CGAAAAACAC GCGTTGCCGT GTTTGCGAAC CATAGCGCAA GAGCTTAAAA CACGCCATCC GGATATTCCG CTCATGGTGT TTTGTCGGGG TGCCTGTCAC CTGAACAACC AACTGGTTGG CCTAGGATAC GATGTCATCA CGATGGACGG CAGTGTGGAC CGCACTACGG TAAGGCAGCA ACTAGGCAAC ACTGTCACGT TACAGGGCAA CTACGATCCG GCGGAACTTA TTGAAGAAAA CGGCAAAACG GTCGAGACGG TCCGAGCGAC TGCGAAAAAA TTGCTGCAGG AGCTGGGACC CCAGCGACTG ATCGCCAATC TAGGTGAAGG GCTGGGTGGG AAAGAAAGCC CGGAACTTGT GGACGCCTTC GTCAAGGCGA TTCACGAGGA GAGCGCCGCC ATGATTCTTC AAGATAGCTA G
|
Protein sequence | PQNDLLLRAA VGEKVEQTPL WLFRQAGRHL PEYQDYKAQT NKNFLELLAS PACVAECTMQ PIRRYDLDAA ILFSDILVVP EALGIQVTMP GGVGILVPEP LTSPEEVHTR LPSIDQITPD FVQTKLAHVI EAVRTIRTQM AEENKSIPLI GFSAAPWTLM YYMVGGSSKK NTELGVTWLE DYPEASGDLL ALLTKIVVEY MDAQVLAGAH VLQVFEAMGM MIDDVNFEKH ALPCLRTIAQ ELKTRHPDIP LMVFCRGACH LNNQLVGLGY DVITMDGSVD RTTVRQQLGN TVTLQGNYDP AELIEENGKT VETVRATAKK LLQELGPQRL IANLGEGLGG KESPELVDAF VKAIHEESAA MILQDS
|
| |
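The length fields in this record follow from the coordinates and the sequence, and can be cross-checked with a short sketch. It assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length (the gene sequence above ends in TAG). The function names are hypothetical helpers and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; if has_stop is True,
    the final codon is a stop and is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


def clean_sequence(text: str) -> str:
    """Remove the 10-character grouping spaces used in the record."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(201081, 202181)
    print(length)                           # gene length in bp
    print(expected_protein_length(length))  # protein length in aa
```

With the Start bp and End bp values above, this gives 1101 bp and 366 aa, which agrees with the Gene Length and Protein Length fields. Passing the full Gene sequence string to `gc_percent` gives a value to compare against the GC content field.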