Gene PHATRDRAFT_20757 details


Gene Information

Locus tag: PHATRDRAFT_20757
Symbol: HemE_1
ID: 7201630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 119247
End bp: 120685
Gene Length: 1439 bp
Protein Length: 431 aa
Translation table:
GC content: 54%
IMG OID:
Product: uroporphyrinogen decarboxylase
Protein accession: XP_002180946
Protein GI: 219120415
COG category:
COG ID:
TIGRFAM ID:
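
The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which the record does not state; the variable names are illustrative only. It also compares the gene length with the number of coding bases implied by the protein length (one codon per residue plus one stop codon).

```python
# Coordinates and lengths copied from the record above.
start_bp = 119247
end_bp = 120685
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus a stop codon.
cds_length = 3 * (protein_length_aa + 1)

print(gene_length)               # 1439
print(cds_length)                # 1296
print(gene_length - cds_length)  # 143
```

Under that assumption the computed length matches the listed Gene Length of 1439 bp. The 143 bases not accounted for by the protein would then lie outside the coding region of the listed gene sequence.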


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0771142
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGAGTTGATG TTGTTTCCTC CAGGTCTAGT GAGATAGAGA GAGACAGAGC TAGACGTCCA 
TATCTTGCCG TCGCCAATCA CCATGAGATT CTCTAGTACC TCGGTTTGGA CGATTGCACT
ATTGGCGGTG AGCAGTAGTC CAACGGCGAT GACCAATGCT TGGATGACGA CACCAGTAAC
GTCCTCGTCC TCGTCGCATC TTCGTACTAC CACCAACCGG ACTCCCCTGC GCTTGTCCTC
CTCCAGCGCG ACAACGACTG CCGCGGCGGA CGAACGCGTG GTGTGGGGCA AGGTCAAATC
GCACGACCAT TGCCGAGAAC CACACGATCG CGACATTTTG GTGCGGGCCG CCCGCGGGGA
AACCGTCGAA CGCACACCGG TATGGATGAT GCGTCAAGCG GGTCGGTACA TGGAAGCCTT
CCGCGAGTAC TCCGACGTTT TGCCCTTTCG GGAACGCTCG GAAACGCCCG ATTATGCCGT
GGAGCTCTCA CTACAATGTC ACCGGGCTTA CGGCATGGAC GGCATCATTA TGTTTAGCGA
TATTCTCACG CCGCTACCGA CTCTGGGTAT CGACTTTGAT GTCGTCAAAG GGACCGGACC
GGTTATTACC ACCAAGGTTC GGACCGAAGA AGACGTCAAC AATATGCCAC GCCACGAATT
CGACGACAAG GTTCCCTTTA TTAAGGAAAT ATTGAATCGT CTCTCACAAG AAGCCGAGGA
CGCCAACACA TCGTTGATTG GCTTTGTCGG CGCACCCTTT ACGCTCGCCG CCTACACTAT
TGAAGGAAAG TCGTCCAAGG ACTGCTTGAC GACCAAAAAG CTTCTCATGG CGGACGAACG
TGGCGATAAT GCCTGCATCA GTTTGTTTTT GGACAAACTT GCCGATATGA TTGGCGACTA
CGCCTGCTAC CAAATCGAGC ACGGCGCACA AGTCATTCAG GTCTTCGAAT CCTGGGCACA
CCAATTGTCC CCACACTGGT TCGAAACGTA CGCCAAGCCT GCTGCGCAAA AGGCCATTCG
TAAAATCAAG AGCCAATACC CGGACACTCC CGTCATTTAC TTTGCCAACG GCGGATCATC
TTACTTGGAA TTGCAACGAG ATATGGGTGC CGACATGATT GCCGTCGACT GGGCGGTCAA
CTTATCCCAG GCCCGCACCA TTCTGGGACC CGATGTGCCC GTTTCGGGCA ATCTCGATCC
GACCGTGTTG TTCGGTAGTA AAGAGCAGAT CGAGCAGGCT GTACGGGATT GTATTGATCA
AGCCGGTGGG CCAGGAAGAC ATCTTCTCAA CCTTGGCCAC GGCGTCATGC AAGGGACACC
AGAAGAGGCC GTGAAATGGC TGGTGGATGA AGTCAAACGG TACAAGGGTA AAGCGTAACT
GCAAAACGGA CGGTACAAAT GTTAACTGTT TGCTGAATCA AAAGGCATGG CAAACAGTG
 
Protein sequence
MRFSSTSVWT IALLAVSSSP TAMTNAWMTT PVTSSSSSHL RTTTNRTPLR LSSSSATTTA 
AADERVVWGK VKSHDHCREP HDRDILVRAA RGETVERTPV WMMRQAGRYM EAFREYSDVL
PFRERSETPD YAVELSLQCH RAYGMDGIIM FSDILTPLPT LGIDFDVVKG TGPVITTKVR
TEEDVNNMPR HEFDDKVPFI KEILNRLSQE AEDANTSLIG FVGAPFTLAA YTIEGKSSKD
CLTTKKLLMA DERGDNACIS LFLDKLADMI GDYACYQIEH GAQVIQVFES WAHQLSPHWF
ETYAKPAAQK AIRKIKSQYP DTPVIYFANG GSSYLELQRD MGADMIAVDW AVNLSQARTI
LGPDVPVSGN LDPTVLFGSK EQIEQAVRDC IDQAGGPGRH LLNLGHGVMQ GTPEEAVKWL
VDEVKRYKGK A
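
The sequences above are printed in space-separated groups of ten, so they need cleaning before any computation. The following is a minimal sketch of how the grouped text could be cleaned, its GC fraction computed, and a coding region translated. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. The function names and the `cds_start` parameter are hypothetical helpers for illustration, and the offset of the coding region within the gene sequence is not given in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a grouped sequence block and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Standard genetic code (assumed), with codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str, cds_start: int = 0) -> str:
    """Translate seq from the 0-based offset cds_start up to the first stop codon."""
    protein = []
    for pos in range(cds_start, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Small demonstration on a short grouped fragment.
fragment = clean_sequence("ATGAGATTCT CTAGTACCTA A")
print(gc_fraction("ATGC"))  # 0.5
print(translate(fragment))  # MRFSST
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value to compare with the listed GC content. `translate` raises a `KeyError` on any character other than A, C, G or T, which makes stray characters from copying easy to spot.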