Gene PHATRDRAFT_36139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36139 
SymbolFum 
ID7201289 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp653681 
End bp655161 
Gene Length1481 bp 
Protein Length462 aa 
Translation table 
GC content53% 
IMG OID 
Productfumarate hydratase 
Protein accessionXP_002180479 
Protein GI219119437 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGAGT TGCCCATTCC CGCTGGTACC TTATGGGGCG CTCAGACCCA ACGATCTCTT 
GAAAACTTTC CCATTGGGGG GATCGAATCG CGCATGCCTT TGGCAGTGGT GCACGGCATG
GCGATTGTCA AGAAATCCTG CGCTCTGTAC CACTCCGACC TTGGTGTCAT GGAAGATCAT
ATAGCCCGTG CGATTGCGCA GGCGGCGGAC GAAGTGCTGG CCGGGAAATT GGATCGACAC
TTTCCTCTGG TCACCTTTCA GACTGGAAGG TGCGTTTGTA ATAGTTGTCT TGGGGTGTAG
GAGTGGCAAT CTTTTGTAAC TCATTTTTTT ACACATTTTC TATACTAACT CGCTTCCATA
GTGGCACTCA AACCAACATG AACGTGAATG AAGTGTTGTC GAATCGCGCC ATTCAAATAC
TTGGTGGAAC TGTGGGTAGT AAAGACCCCG TCCACCCGAA CGACCACGTC AATCGCGGTC
AATCTTCCAA CGATTCCTTC CCGACCGCCA TGCACATTTG CGCTGCCAAG ACCTTGCACG
AGCGAACTCT ACCAGGATTG CGGATTCTAC AAACGGCCCT CGTTCAAAAG GTGGAAGAAT
TTGGTGCCGT GGTCAAGATT GGACGCACGC ACTGCCAGGA CGCGACACCT TTGACTCTCG
GACAAGAATT CGGTGGCTAC TTGCAGCAAG TCGAGTACGG AATTTCTCGG GTAGAGGCGT
CTTTGCCGAG TTTGTACCGT TTAGCCTTGG GCGGCACCGC GGTCGGTACC GGTCTCAATA
CGGTGGAAGG CTTCGCGGAA GAAATTGCGG CCAAGATTGC GGACGAGACC GGTCTCCCGT
TCACGTCGGC AGCGAACAAG TTCGAGGCCC TCGCGGCGCA CGATAGTATC GTGGAAGTTT
CCGGTATGCT GAATACAATG GCATGTAGTT TGAACAAAAT TGCAAACGAC GTGCGGCTGT
TAGGTAGCGG TCCACGATGT GGACTGGGCG AAATTTCGCT GCCGGCCAAC GAGCCTGGCT
CGAGTATTAT GCCGGGCAAG GTAAACCCCA CGCAGTGCGA ATCTTTGACT ATGGTATGCG
CTCAAGTCAT GGGAAACCAC GTGGCCATTT CGGTCGGCGG AGCACAGGGA CATTTTGAAC
TCAACGTCTT CAAGCCCGTT ATGATTGCGA ACTTGTTGCA TTCAGCCGTC TTGATTGGTG
ACGCCGCCGC TTCGTTCGCA ACACGGTGCG TGGAGGGCAT CGTGGTCAAT CAAGACCGGG
TAACTCAGCT GCTGCACGGT AGTCTTATGC TGGTGACAGC GTTGAACCTG CATATTGGAT
ACGACAAGGC GAGTGAGATT GCCAAGAATG CACACAAGAA CGGCACCACA TTGAAGGAAT
CGGCTATTTC GAGCGGCTAC TTGACGGCCG CTCAGTTTGA TGAATGGATT GTTCCCGAAA
GCATGATTGG TCCGAGCCCG GCGAATGTAA CTACAGAATA G
 
Protein sequence
MGELPIPAGT LWGAQTQRSL ENFPIGGIES RMPLAVVHGM AIVKKSCALY HSDLGVMEDH 
IARAIAQAAD EVLAGKLDRH FPLVTFQTGS GTQTNMNVNE VLSNRAIQIL GGTVGSKDPV
HPNDHVNRGQ SSNDSFPTAM HICAAKTLHE RTLPGLRILQ TALVQKVEEF GAVVKIGRTH
CQDATPLTLG QEFGGYLQQV EYGISRVEAS LPSLYRLALG GTAVGTGLNT VEGFAEEIAA
KIADETGLPF TSAANKFEAL AAHDSIVEVS GMLNTMACSL NKIANDVRLL GSGPRCGLGE
ISLPANEPGS SIMPGKVNPT QCESLTMVCA QVMGNHVAIS VGGAQGHFEL NVFKPVMIAN
LLHSAVLIGD AAASFATRCV EGIVVNQDRV TQLLHGSLML VTALNLHIGY DKASEIAKNA
HKNGTTLKES AISSGYLTAA QFDEWIVPES MIGPSPANVT TE