Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20757 |
Symbol | HemE_1 |
ID | 7201630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 119247 |
End bp | 120685 |
Gene Length | 1439 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | uroporphyrinogen decarboxylase |
Protein accession | XP_002180946 |
Protein GI | 219120415 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0771142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGTTGATG TTGTTTCCTC CAGGTCTAGT GAGATAGAGA GAGACAGAGC TAGACGTCCA TATCTTGCCG TCGCCAATCA CCATGAGATT CTCTAGTACC TCGGTTTGGA CGATTGCACT ATTGGCGGTG AGCAGTAGTC CAACGGCGAT GACCAATGCT TGGATGACGA CACCAGTAAC GTCCTCGTCC TCGTCGCATC TTCGTACTAC CACCAACCGG ACTCCCCTGC GCTTGTCCTC CTCCAGCGCG ACAACGACTG CCGCGGCGGA CGAACGCGTG GTGTGGGGCA AGGTCAAATC GCACGACCAT TGCCGAGAAC CACACGATCG CGACATTTTG GTGCGGGCCG CCCGCGGGGA AACCGTCGAA CGCACACCGG TATGGATGAT GCGTCAAGCG GGTCGGTACA TGGAAGCCTT CCGCGAGTAC TCCGACGTTT TGCCCTTTCG GGAACGCTCG GAAACGCCCG ATTATGCCGT GGAGCTCTCA CTACAATGTC ACCGGGCTTA CGGCATGGAC GGCATCATTA TGTTTAGCGA TATTCTCACG CCGCTACCGA CTCTGGGTAT CGACTTTGAT GTCGTCAAAG GGACCGGACC GGTTATTACC ACCAAGGTTC GGACCGAAGA AGACGTCAAC AATATGCCAC GCCACGAATT CGACGACAAG GTTCCCTTTA TTAAGGAAAT ATTGAATCGT CTCTCACAAG AAGCCGAGGA CGCCAACACA TCGTTGATTG GCTTTGTCGG CGCACCCTTT ACGCTCGCCG CCTACACTAT TGAAGGAAAG TCGTCCAAGG ACTGCTTGAC GACCAAAAAG CTTCTCATGG CGGACGAACG TGGCGATAAT GCCTGCATCA GTTTGTTTTT GGACAAACTT GCCGATATGA TTGGCGACTA CGCCTGCTAC CAAATCGAGC ACGGCGCACA AGTCATTCAG GTCTTCGAAT CCTGGGCACA CCAATTGTCC CCACACTGGT TCGAAACGTA CGCCAAGCCT GCTGCGCAAA AGGCCATTCG TAAAATCAAG AGCCAATACC CGGACACTCC CGTCATTTAC TTTGCCAACG GCGGATCATC TTACTTGGAA TTGCAACGAG ATATGGGTGC CGACATGATT GCCGTCGACT GGGCGGTCAA CTTATCCCAG GCCCGCACCA TTCTGGGACC CGATGTGCCC GTTTCGGGCA ATCTCGATCC GACCGTGTTG TTCGGTAGTA AAGAGCAGAT CGAGCAGGCT GTACGGGATT GTATTGATCA AGCCGGTGGG CCAGGAAGAC ATCTTCTCAA CCTTGGCCAC GGCGTCATGC AAGGGACACC AGAAGAGGCC GTGAAATGGC TGGTGGATGA AGTCAAACGG TACAAGGGTA AAGCGTAACT GCAAAACGGA CGGTACAAAT GTTAACTGTT TGCTGAATCA AAAGGCATGG CAAACAGTG
|
Protein sequence | MRFSSTSVWT IALLAVSSSP TAMTNAWMTT PVTSSSSSHL RTTTNRTPLR LSSSSATTTA AADERVVWGK VKSHDHCREP HDRDILVRAA RGETVERTPV WMMRQAGRYM EAFREYSDVL PFRERSETPD YAVELSLQCH RAYGMDGIIM FSDILTPLPT LGIDFDVVKG TGPVITTKVR TEEDVNNMPR HEFDDKVPFI KEILNRLSQE AEDANTSLIG FVGAPFTLAA YTIEGKSSKD CLTTKKLLMA DERGDNACIS LFLDKLADMI GDYACYQIEH GAQVIQVFES WAHQLSPHWF ETYAKPAAQK AIRKIKSQYP DTPVIYFANG GSSYLELQRD MGADMIAVDW AVNLSQARTI LGPDVPVSGN LDPTVLFGSK EQIEQAVRDC IDQAGGPGRH LLNLGHGVMQ GTPEEAVKWL VDEVKRYKGK A
|
| |