Gene Information |
Locus tag | PHATRDRAFT_54826 |
Symbol | CRTISO2 |
ID | 7203219 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 645646 |
End bp | 648008 |
Gene Length | 2363 bp |
Protein Length | 635 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | carotenoid isomerase |
Protein accession | XP_002182438 |
Protein GI | 219124285 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
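The Gene Length value is consistent with the Start bp and End bp values if both coordinates are read as 1-based and inclusive. That convention is an assumption here, not something the record states, and the helper name `gene_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length(645646, 648008)
print(length)  # 2363, matching the Gene Length row
```

Because the gene is marked as spliced, this span presumably includes non-coding sequence, which would explain why it exceeds the 1,908 bp needed to encode 635 residues plus a stop codon.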
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0698106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTCCTACCT ATCTGTTAAT CGGTATCTCT GTTGGGGTGG CGGAGGCAAA TTAGACCAGC AAACGTATTG GAAGACTGAC ATCGTGTTCA CAGTAAAGCC CACCGCAGAT TATTTGGTTA CAGTAAGCTG TGATTAACTT GATTTTCCCA GACAATGCTT GTGGAGAGCA AAAAAAGCCG AGATGGTAGT CGCACTTCGA GCCGTTCCCT CGATACCAAA ACACATTGCG TGTGCTCTAC TAGCAAACAA AACTCTGTCA GACCAGCAAA TACTCTGCTG CGATCGCGGG CAAGGCACAC GTTGATTTGG CTTGTCTTTT ACGTGTGGGA GTGGAATACC ACCACGACGG CATTTGCACC CTCACCTTCT CGAATTGCAG CTTTTCGAGC GTCGCGTGGT CGCAAGCTAA CCACATCAGT ATCGAGTCTA GTTAGCGGTG ACAAACGTGA CTGCGCGTCC CCAGCAGACG ATCTGGTGGA CGTGGCAATT ATCGGCGCAG GTCTGGGTGG ACTGTGCGCG GGCGCTATTC TTAATACGCT CTACGGTAAG AAGGTGGGTA TCTACGAGGC ACATTACTTG GCCGGAGGCT GTGCACACGC CTTTGATCGG CGCGCCGCCG ACGGTGTAAA TTTCACCTTT GATTCCGGCC CGACCATTCT GTTGGGATGC TCAAGTCCAC CCTTTAATGC CTTACAGCAA GTCTTGGACG CTGTGGGTCA AAAGGTTGAA TGGATAACGT ACGATGGATG GTAGGTAAAC AATGCTTTTC AAAATTACCC ATTACCCGGT TGTGCCGCAC GGATAAGTAG CTTTATGATT CTGACTAAAT TGAACGTATG TCCTCAGGGG TATGATCGAG AATCCAGGAA AGGACAATGA ATTGCGGTGG AAAGTCATAC TGGGACGGGA TGAGTTTCAG CGCGGTCCAC TCACACGGTT TGGGGGACCC AAAGCCTTGG AGGAGTTCGA AGCGCTACGA GAAGCAACCA AGGATCTTTT GGCTGGCGCC AAAATACCTG CCATGGCTAT GCGTCCCGGT CCAAGTGCCC TGGTACCACT CATCCGGTAT TTTTCGACAC TGGTTACCTT GCTTTCCCAG GGATCAAAGG CAACGGGAAC GTTTGCGTCA TTTATTGACG GGCCCAATTT CACTGTAACG GATCCGTGGC TGCGATCGTG GCTGGATGCT CTCGCCTTTT CCTTATCCGG GCTACCCGCT AGTCGAACCG CAGCCGCCGC CATGGCGTTT ACACTCAGTG ACATGCACCG CCCCGGTGCA GCGTTGGACT ATCCCAAGGG CGGTATGGGA GCAATCGCTG AGGCCCTTGT CAGGGGAGTA CAGCAAGGTT CGAACGGTTC TCAGGTACAC TTGCGGCAAC CTGTGGAAAA AATTGACTTT AGTGAGGACG GTACCATCGC TACGGGGCTG ACGCTACGCA ACGGAAGACG GATACTTGCT CGGGAAGGGG TGATATGCAA TGCACCAGTT TGGTCCCTGA AGAGCTTAGT TAAAAACGAG AAAGCCTTGC AAAAGCTCAA CAACGATCTT CCTATTATCA ATCCGCGTCC AAGATCTTCT TGGACGACCA CGGACGAAGG ATCATCGATT CACCAACAGC GTCCAACTCG ATCCGGAGAA GCGGACGAGA CTCTCTTGGG TGCCTGCGAT ACGGCAGAAA AGACCGGCTC CTTTTTGCAT CTGCATTTGG CGCTGGAATC TAGTGGCCTG AACTTGGACA ACTTGGAAGC TCACTACACT GTGATGGACC GATCGTTAGG AGGGGATGGG TCCTCCGTAA 
ATGGAGTCTT GGACGGACCC TGCGGAATAC TTAACATGAT TGCCGTCTCG AATCCCTGCA AGATTGACAA TAGTCTAGCT CCCGACGGAA CCATTGTGGT ACACGCTTAC AGCGCCGGCA ACGAACCTTA CGAAATTTGG GAGGGACTAG ATCGAAGAAG TGACGGGTAT ATGTGCCTTA AGGAAGACCG AGCTGAAGTG CTATGGCGCG CTGTGGAGAG TATTATTCCT GATGCGCGCA ATCGTGTCGT GATTAGTGAG ATTGGATCTC CTATCACGCA CGAGCGTTTC TTGAACCGAC CTAGAGGAAC ATATGGTAGT GCCACTGAAG ACTACCTTGC GGACGGTAGT ACATCGATCG GGAATTTGCT TTTGGCTGGC GACGGGATTT TCCCCGGTAT CGGTTTGCCC GCAGTGGCGA TCAGCGGGGC CAGCGCAGCG AATGCAATGG TGAGCGTATT TAAACAGTGG GAATGTCTAG ACGAGTTGGG CAAAAGCCAA AAGCTATAGC TTGAGAAAAT AGGCAGGAAG ATTGTTAAAT TTAGAACATT GCTTTGGAAA GTG
|
Protein sequence | MLVESKKSRD GSRTSSRSLD TKTHCVCSTS KQNSVRPANT LLRSRARHTL IWLVFYVWEW NTTTTAFAPS PSRIAAFRAS RGRKLTTSVS SLVSGDKRDC ASPADDLVDV AIIGAGLGGL CAGAILNTLY GKKVGIYEAH YLAGGCAHAF DRRAADGVNF TFDSGPTILL GCSSPPFNAL QQVLDAVGQK NPGKDNELRW KVILGRDEFQ RGPLTRFGGP KALEEFEALR EATKDLLAGA KIPAMAMRPG PSALVPLIRY FSTLVTLLSQ GSKATGTFAS FIDGPNFTVT DPWLRSWLDA LAFSLSGLPA SRTAAAAMAF TLSDMHRPGA ALDYPKGGMG AIAEALVRGV QQGSNGSQVH LRQPVEKIDF SEDGTIATGL TLRNGRRILA REGVICNAPV WSLKSLRPTR SGEADETLLG ACDTAEKTGS FLHLHLALES SGLNLDNLEA HYTVMDRSLG GDGSSVNGVL DGPCGILNMI AVSNPCKIDN SLAPDGTIVV HAYSAGNEPY EIWEGLDRRS DGYMCLKEDR AEVLWRAVES IIPDARNRVV ISEIGSPITH ERFLNRPRGT YGSATEDYLA DGSTSIGNLL LAGDGIFPGI GLPAVAISGA SAANAMVSVF KQWECLDELG KSQKL
|
| |
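The Gene sequence field is printed in space-separated blocks of ten bases, so it has to be cleaned before any statistic such as GC content is recomputed. The record does not say how its 52% figure was derived; the sketch below uses the usual definition (G plus C over total length) as an assumption, and the names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace used to group bases and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among all characters of an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustrative fragment only: the first two blocks of the Gene sequence field.
fragment = clean_sequence("GGTCCTACCT ATCTGTTAAT")
print(len(fragment), gc_percent(fragment))  # 20 40.0
```

Passing the full Gene sequence field through `clean_sequence` first gives a value that can be compared against the GC content row.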