Gene PHATRDRAFT_45243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45243 
SymbolCRTISO4 
ID7200259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp585988 
End bp587784 
Gene Length1797 bp 
Protein Length598 aa 
Translation table 
GC content52% 
IMG OID 
Productcarotenoid isomerase 
Protein accessionXP_002179244 
Protein GI219116899 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0755538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTTT CGGAAAGATC ACTAATTGCG TGTGCAATCT GCTCCATTTC GAGTGCTTTC 
GTACCCATCA TCCACACGCC CCAGCATCAA TCGCCACGTA CAACCCGTCA TCAATTCACC
AGGATCTATG CGGCGGTGTC CTCCGTACCA TCCAATAGCA TTCCCGACGA AGCGGATGTG
GTTGTTATTG GATCGGGACT CGCAGGACTC TCCTGTGCCG CACTTTTAGC CCATTGTGGC
AAACGAGTGG TGGTGTTGGA ATCGCATGAT GCGCCCGGTG GTGCCGCGCA CGGCTGGGAA
CGCCGTGGGT TTCACTTTGA ATCCGGACCG TCTCTCTATT CCGGATTCGC CATGGAACGT
TCTCCAAATC CTCTCAAAAA TATCTTTCAA ATCACGGGAG AAGACTGCGA GTGGATTACC
TACGATCGGT GGGGTACCGT AATGCCGGAT GGGACCAAGT TTGCCGCCAA AATTGGACCC
GAAGAGTTTC AAGACGTATT GGAGAGCCAA GGAGGACCGG GAGCACGCGA AGAATTTGCG
GCACTAATGG AGCGCATGAA GCCTCTGTCC GATGCTGCTC AAGCCTTGAC ATCGCTCGCT
CTGCGGGAAG ATCCAGCCGT TGTCGTTACG CTTCTGAAGT ATCCCCGCGA CCTTATTGCA
ACTCTGGCAC AAGGACAAGC ACTGAACGAA CCATTCAAAA ACATCATGGA TGAAATGAAG
ATCGAAAATA AGTTTGTCAA AAACTGGCTA GATATGCTTT GCTTTCTCTT GCAAGGCTTG
CCCGCTTCGG ACACAATGAA TGCGGTCATG GCCTATATGC TCGCCGATTG GTATCGACCT
GGTGTCACTC TGGACTTTCC CAAAGGCGGA TCCAGTTCTA TTGTCAGTGC TTTGGTTCGT
GCCGTCCAAA AGAATGGGTC TTCCGTTTGC GTCAACAGTC ACGTGGATGA GATTCTGGTT
GAAAATGGTA AGACTGTTGG AGTCCGACTG ACCGATGGAC GCAAGGTGCA CGCCACACAA
GCCGTTGTAT CCAATGCAGA TCCGTACATC AGCAACAAAC TGCTCTTAAA CGCAAGAAAG
TCAGGTCAGC TCAATAAAGC TGCGACCGAT CATCTGGACG CTTTAATAAA CACCGACAAA
ACAGAGGGTG GTATTGCCGA TTTGAAATCT TTCATCCACA TTCATGCCGG CATTGATGCA
GCTGGCCTCC CCGATCAGCC CAGTGCCGAC TTTCCTGCAC AATGGGCCGT TGTCCGTGAC
TGGGATGCCC CTGAAGGAGT AGAGAGCCCG CGCAACATCG TTTTGTGCTC CATGCCTTCG
CTTATTGATC CTAGTCTTGC CCCTGAAGGC AAGCACGTCT TACATGCTTA CGTTCCTGCC
ACGGAGCCAT ACGCGGATTG GGCCGGCATG GACCGCAAGT CGGAAGAATA CACGAAAAAG
AAGGAGCAAG CTGCGGATTT TTTGTGGAGT GCCATTGAAG AGTACATTCC GAACGCTCGG
GATCGTGCTG TTCCTGGCAC GGTACAGATT GGAACACCCT TGACCCACGA ACGATTTTTA
CGACGGACAA GGGGTACCTA CGGTCCGCGT GTGGAAGTCG GTGCTGGACA GACTCTGCCC
GGTCACAAGA CTCCGTTGCC AGGTTTCTAC ATGGTAGGAG ACTTCACATT TCCAGGTATT
GGAGTACCCG CAACAGCAGC ATCCGGCGCC ATTGCGGCGA ACACGCTAGT GTCGGTGTTT
GATCATCTCG CAATGCTCGA TAAGGTCCGT CTCCCGGAAA AGGAACAAAA GTCTTGA
 
Protein sequence
MRFSERSLIA CAICSISSAF VPIIHTPQHQ SPRTTRHQFT RIYAAVSSVP SNSIPDEADV 
VVIGSGLAGL SCAALLAHCG KRVVVLESHD APGGAAHGWE RRGFHFESGP SLYSGFAMER
SPNPLKNIFQ ITGEDCEWIT YDRWGTVMPD GTKFAAKIGP EEFQDVLESQ GGPGAREEFA
ALMERMKPLS DAAQALTSLA LREDPAVVVT LLKYPRDLIA TLAQGQALNE PFKNIMDEMK
IENKFVKNWL DMLCFLLQGL PASDTMNAVM AYMLADWYRP GVTLDFPKGG SSSIVSALVR
AVQKNGSSVC VNSHVDEILV ENGKTVGVRL TDGRKVHATQ AVVSNADPYI SNKLLLNARK
SGQLNKAATD HLDALINTDK TEGGIADLKS FIHIHAGIDA AGLPDQPSAD FPAQWAVVRD
WDAPEGVESP RNIVLCSMPS LIDPSLAPEG KHVLHAYVPA TEPYADWAGM DRKSEEYTKK
KEQAADFLWS AIEEYIPNAR DRAVPGTVQI GTPLTHERFL RRTRGTYGPR VEVGAGQTLP
GHKTPLPGFY MVGDFTFPGI GVPATAASGA IAANTLVSVF DHLAMLDKVR LPEKEQKS