Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45243 |
Symbol | CRTISO4 |
ID | 7200259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 585988 |
End bp | 587784 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | carotenoid isomerase |
Protein accession | XP_002179244 |
Protein GI | 219116899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0755538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTTT CGGAAAGATC ACTAATTGCG TGTGCAATCT GCTCCATTTC GAGTGCTTTC GTACCCATCA TCCACACGCC CCAGCATCAA TCGCCACGTA CAACCCGTCA TCAATTCACC AGGATCTATG CGGCGGTGTC CTCCGTACCA TCCAATAGCA TTCCCGACGA AGCGGATGTG GTTGTTATTG GATCGGGACT CGCAGGACTC TCCTGTGCCG CACTTTTAGC CCATTGTGGC AAACGAGTGG TGGTGTTGGA ATCGCATGAT GCGCCCGGTG GTGCCGCGCA CGGCTGGGAA CGCCGTGGGT TTCACTTTGA ATCCGGACCG TCTCTCTATT CCGGATTCGC CATGGAACGT TCTCCAAATC CTCTCAAAAA TATCTTTCAA ATCACGGGAG AAGACTGCGA GTGGATTACC TACGATCGGT GGGGTACCGT AATGCCGGAT GGGACCAAGT TTGCCGCCAA AATTGGACCC GAAGAGTTTC AAGACGTATT GGAGAGCCAA GGAGGACCGG GAGCACGCGA AGAATTTGCG GCACTAATGG AGCGCATGAA GCCTCTGTCC GATGCTGCTC AAGCCTTGAC ATCGCTCGCT CTGCGGGAAG ATCCAGCCGT TGTCGTTACG CTTCTGAAGT ATCCCCGCGA CCTTATTGCA ACTCTGGCAC AAGGACAAGC ACTGAACGAA CCATTCAAAA ACATCATGGA TGAAATGAAG ATCGAAAATA AGTTTGTCAA AAACTGGCTA GATATGCTTT GCTTTCTCTT GCAAGGCTTG CCCGCTTCGG ACACAATGAA TGCGGTCATG GCCTATATGC TCGCCGATTG GTATCGACCT GGTGTCACTC TGGACTTTCC CAAAGGCGGA TCCAGTTCTA TTGTCAGTGC TTTGGTTCGT GCCGTCCAAA AGAATGGGTC TTCCGTTTGC GTCAACAGTC ACGTGGATGA GATTCTGGTT GAAAATGGTA AGACTGTTGG AGTCCGACTG ACCGATGGAC GCAAGGTGCA CGCCACACAA GCCGTTGTAT CCAATGCAGA TCCGTACATC AGCAACAAAC TGCTCTTAAA CGCAAGAAAG TCAGGTCAGC TCAATAAAGC TGCGACCGAT CATCTGGACG CTTTAATAAA CACCGACAAA ACAGAGGGTG GTATTGCCGA TTTGAAATCT TTCATCCACA TTCATGCCGG CATTGATGCA GCTGGCCTCC CCGATCAGCC CAGTGCCGAC TTTCCTGCAC AATGGGCCGT TGTCCGTGAC TGGGATGCCC CTGAAGGAGT AGAGAGCCCG CGCAACATCG TTTTGTGCTC CATGCCTTCG CTTATTGATC CTAGTCTTGC CCCTGAAGGC AAGCACGTCT TACATGCTTA CGTTCCTGCC ACGGAGCCAT ACGCGGATTG GGCCGGCATG GACCGCAAGT CGGAAGAATA CACGAAAAAG AAGGAGCAAG CTGCGGATTT TTTGTGGAGT GCCATTGAAG AGTACATTCC GAACGCTCGG GATCGTGCTG TTCCTGGCAC GGTACAGATT GGAACACCCT TGACCCACGA ACGATTTTTA CGACGGACAA GGGGTACCTA CGGTCCGCGT GTGGAAGTCG GTGCTGGACA GACTCTGCCC GGTCACAAGA CTCCGTTGCC AGGTTTCTAC ATGGTAGGAG ACTTCACATT TCCAGGTATT GGAGTACCCG CAACAGCAGC ATCCGGCGCC ATTGCGGCGA ACACGCTAGT GTCGGTGTTT GATCATCTCG CAATGCTCGA TAAGGTCCGT CTCCCGGAAA AGGAACAAAA GTCTTGA
|
Protein sequence | MRFSERSLIA CAICSISSAF VPIIHTPQHQ SPRTTRHQFT RIYAAVSSVP SNSIPDEADV VVIGSGLAGL SCAALLAHCG KRVVVLESHD APGGAAHGWE RRGFHFESGP SLYSGFAMER SPNPLKNIFQ ITGEDCEWIT YDRWGTVMPD GTKFAAKIGP EEFQDVLESQ GGPGAREEFA ALMERMKPLS DAAQALTSLA LREDPAVVVT LLKYPRDLIA TLAQGQALNE PFKNIMDEMK IENKFVKNWL DMLCFLLQGL PASDTMNAVM AYMLADWYRP GVTLDFPKGG SSSIVSALVR AVQKNGSSVC VNSHVDEILV ENGKTVGVRL TDGRKVHATQ AVVSNADPYI SNKLLLNARK SGQLNKAATD HLDALINTDK TEGGIADLKS FIHIHAGIDA AGLPDQPSAD FPAQWAVVRD WDAPEGVESP RNIVLCSMPS LIDPSLAPEG KHVLHAYVPA TEPYADWAGM DRKSEEYTKK KEQAADFLWS AIEEYIPNAR DRAVPGTVQI GTPLTHERFL RRTRGTYGPR VEVGAGQTLP GHKTPLPGFY MVGDFTFPGI GVPATAASGA IAANTLVSVF DHLAMLDKVR LPEKEQKS
|
| |