Gene Information |
Locus tag | PHATRDRAFT_54800 |
Symbol | |
ID | 7202950 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 52388 |
End bp | 56420 |
Gene Length | 4033 bp |
Protein Length | 923 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | carotenoid isomerase-like protein |
Protein accession | XP_002182312 |
Protein GI | 219124022 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
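The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (which is what makes Start bp and End bp agree with the listed Gene Length) and assumes the coding sequence ends in a single stop codon. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 52388
end_bp = 56420
protein_length_aa = 923


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


def coding_length(aa_count):
    """Nucleotides needed to encode aa_count residues plus one stop codon."""
    return (aa_count + 1) * 3


gene_length = span_length(start_bp, end_bp)
cds_length = coding_length(protein_length_aa)

print(gene_length)               # 4033, matching the Gene Length field
print(cds_length)                # 2772
print(gene_length - cds_length)  # 1261 bp of the gene span is not coding
```

The coding length being shorter than the gene span is consistent with the "Is gene spliced | Yes" field. The arithmetic alone does not say how the non-coding remainder divides between introns and any untranslated sequence.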
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0759229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCCG TCACGCAGAG TCCACCCGTC GAGTTGTCGA CCGACCCTCC CGTCGCACTC TCCTTGGCCT TGCCTCCATT GTCTCCTACG GCAGACGGCA CGCTTCAACA CCACCACCTC ACCACCACCG ACGACGAGTC GTCGAGTCCT TGGCCAGTAG CAAGTATTTG TCTGAGCAAT TAAGCGAGCT AGTGAAAATC ATTCACAGTC ACCGAAAATA CTCCCCGCTT TGTCCCTGTT AGGTCATGCT GTTAGGTCAT GCAGTCAGGA CGCTCGATCT CGACAGGAAC GCAGTCTGGT CCTCTAGGAG CTTCCATCGA TGCGTTCCCC GGTTCCGTCT GGTCGTTCCT TTGAATCCGG GCGTAGCGCG TTGCCAAAAC GCACAGCAGG GCGTCCACTA GCCAGCACGA CGACGAGACC AAGCCGTCCC CGTCCGCTAG CCGCCATGCC GACGGTTGAC TATAACAATC ACTCAACGAG ACTTCAATCA ATCAGGCTTT TCAACAGGAA CAAAAAAAAT CCACAAACCG ATAGGTGATC CGCAGCGTGT TCCGTGGTCA AAACAATTTC AGCGATCCGT TGAATCGCGG GTGGAATCCC TGGCGGCCCG GGATAAGTTC CCGTCAGGAC AAGTGCGGCG TCGAGTACGT AAAGATGCAC GGCCAGTACT TCCCTACCAG TGGTAGTGGT TTCTCAACAG GTCCCGTTGG CCAACATGAT GGCCACGAGC TCGAGCAGCG CTGTGACACT GAACGACAAG GGGGCGGTCA CGTTAACAAG GGTAACCACG TCGAAGCAAA CGTAGCGACG AGCGTGTTGT GTGCCACCGA GTGCTGGTAA GACGACGGAC TTACAGGCAC GAAGAGTCCC GTGTCGGGAG CAGTCGTCTT CGTGGCAGGC CGTGAGCGTC TGGCGTTGGA CGATACGGGG AGACGTGTCA AGCCGAGGCT AGCATCCCAT ACGATAGCTA AAGACCGCCC CCGCCGTCGA ACAGGGAGCA TTCGTATACT TTGCTTATCT ACTTACTACT GCTCTACTCG TTTTAGGTAG AGAGAGAGTG TCATGGGGAC GATGCAACAT CCGAGCGAAC TGTAACCCGT TGGCCGAGGA GGTATCCCCG GTAGTGCGAA CGACACTTTG GACCGGATGT CCAGAACATT CCTTTTCAAA TTCTTTTTAA ATTCGCCGTT ATTTTTATGT CAACTTGCGA ACACACGGAT TGCGCCATTT TTCACCGTGT CAAGTTTACC TCACAGAGAA CCCGAACCCA AAACAGTGTG CGTCGTCGTT TTGCGTAGTC TCCAACGTGA TCCGACCTTT CAACAAGATG AGGTTTCCGC TTTCCTCGGT GCATTTCGAG GCTGGATTGG TAACGAGGTC GACCGGCTTT TCGAATGGGC TGCTACGGCT GCTAAGGCAC TGTGTATGGC GAATTGGCAA GGCAAATCCC GGCTTTTGAA GAAATTTGGC GCTACCGTTC GTCGTCTGGA CAAAGCCTGC GGTCCATTTG GTTGGACTGC ATGGATCCTT TTCCCATGTC AAACATTGAT GCAGCGCCGG GATCACTATA CAGTCAGTCG GACAACGCAA AGGGACCACT CTGTTTCCGC ATCAGCGGTA CGCGATTGGC CGATCCAGAA CTCTCGGTGA CGCCGAATCG TGTCGAGAAC ATTTAAAAAT GCGAAGCGTG GAGTCAGTCA ACCTAAGCTT CCTCCCGTAG TTCCGTCGCT TTCCCATAAA ATATCCGAAC ACTAGAAATA AAAAAAGCAA AAACAAATTC CGACGAATGC AGCAGATCAC 
GAGCATGATA TCGCCCGAAG CCTTTCTACC TCTGGCCACG ACGTTGCCGA AGATTCCTTT TTGGATACTC GCACCAGTGC TAATTTGGGT AGGTTTTCTG GTTTGGTTGT TCCACTGGCC CGCTCGACGA GTCAAACTGC ATCCGCGGCG AGCGAGTCGC TTTCGGCCGG AGCTTGTGCT GGAAAACGGC AAGCAACGTC GGTTTGACAC CATTGTGATT GGATCCGGAT CGGGAGGTTG CGCCTGCGCC AACCTGTTGG CCCAATCTGG GCAGCGCGTC CTGATTCTGG AGCAACACAC CAAAACGGGG GGATGTACCC ATTCCTTTCG CGATCGCGGC TGTGAGTGGG ACACCGGCTT ACACTACACG TCGGCAGGAA TGGGGCGATC GACCTGCCGA CCGGGTGCCA TCATGCATTT TATGACGCAA GGTCTGCAGA AGTGGACTCC ACTACAGGAT CCCTACGATG AAGTCATCTT TCCTCCGGAC GATTTCGTTA AACTTGGCGT CCCGAACGAG TCGTCGTACC GCTTCGTGAG CGGTGCGGAC GAGACAATTC AAAGCGTTTT GGCCAGTATT GACCCTGAAC ACCGGGAGCT CGAAAAAAGA GCAAGATTAT ACATGGATTT GTGCACAGAT ATTAACAGTG GCTTTACAGC ACTCGGAATC TCTCGGGTGT TGCCTTCGTG GATGCACTTT CTCGTGCGGT CCAGGATTGA TCGTCTCATG AAGTTCGCAG CCATGACTGT CCGAGATGTA CAATACGGAA TGCTCAATCT GGGGCTGACG ATAGAAGAGC TTCTCAAGGA TGGCTGCCCC CCGGCGCCTG CTGGATCCGA ACCAGATCCT TCTATTCGTC GCCTAAAGGC TGTCTTGACT CATCCAATTG GCGATTACGC AGTGCAGCCA CGTGATGCGA CCATGGCGGC ACACGGAGTT ACTATGGCGC ACTATCAAGA TGGAGCGTGC TACTGCGTTG GTCCGACGCA GCAAATTTCG GTCCGAAGCT CTAGTATGGT GCGAGAATTT GGCGGTGAGG TATTGACGGA CGCGACCGTT CGAGAAATTA TCCTCGAGCA CGGACGCGCC GTTGGAGTTC GCGTGAGCAA CACGTCGGCC TTGGCGGAAT GTAAATCCGA TGCCGAGCGC GCCCAAGTGC CAGTGACCGA GCTCCGGGCG AAAGCGGTGG TATGTGCGAC TTCAGTATAC AATTTATACA ATAACTTGTT GCCTCAAGAC CTTGCACAAG TAAAGGAGTT TCAAGATCCG GAAAAACGAA CCATCCAACA GAGCAACGGT CATATTTTCC TCTTTTGCAA GATCAAAGGG GACCCGACCG AGCTGAAGTT ACCAGCCCAC AATCTGTGGT ATTTCAATTC TTATGACATT GACGATGCCT TTGAGGCCTA CTTTACCGAT CCTGTTGGTC AGCGGCCCCC AACAGTCTAT ATGTATGTAT GAAGGAGAGA ACTGTGTAGG GGTGTACAAA CTTGGTAATT ACTTCTAACT GTCTTTTTGA CTTTTTGCCT TTTAGTGGAT TCCCCTGCAC AAAGGATACA AGCTGGAAGC AGAGATTTCC GGGAGTAAGC AATTGCATCC TCATTTCAGA TGGTCTTTGG GAGTGGTTCG AGAAATGGCA AGATAAACCA GTGCACAATA GGGGTTCCGA CTACGAGGAG TTCAAGGAAA AGCTGAGCAA ACATCTGTTG GAAATCCTCT TCGAGTTTGT ACCAGAAGTG AAGGATAAGA TTGAATTCAG CTTTCTCGGA ACTCCTTTGT CGGAGCAAAC GTACCTGAAC TCATTTTGTG 
CTGGGAGCTA CGGAACGAAG TGTCTTCCTT CTATGTTTGC TAAAAGCAAC CGCAGGTGGA CAACATCTCC TCACACATCT ATTCCTGGTC TATATCTGGC GGGGTCGGAC GCGTTTTTGC CTGCAGTTTG CGGTGCCATG TACGGAGGCT GCTTTGGAGC AATTGCAGTT CTGGGTCATT TGCGAGCCCT GAAGCTGACC TTGGCGTTCA TTGCACACTT TGCTGGGTGT ATAACAGATG AGGATCCCAA AATTGGCTGG ATCCAAGCTT ATATTCTTGC CTGGAAAAAG TTTATGAACG ACTGACATGT GAGTGCTAAC ATATTGAAGC ATGTTGTTTC TCTTTGCAGC TGCTGGTGGA CATTGACTCG AGTAGATCAA TGGAATAGAG CTATCGTTAA TGTTAATTCT GCTTTTTCAC CTC
|
Protein sequence | MTPVTQSPPV ELSTDPPVAL SLALPPLSPT ADGTLQHHHL TTTDDESSSP WPVIRSVFRG QNNFSDPLNR GWNPWRPGIS SRQDKCGVEY VKMHGQYFPT SGSGFSTGPV GQHDGHELEQ RCDTERQGGG HVNKGNHVEA NVATSVLCAT ECCLPHREPE PKTVCVVVLR SLQRDPTFQQ DEVSAFLGAF RGWIGNEVDR LFEWAATAAK ALCMANWQGK SRLLKKFGAT VRRLDKACGP FGWTAWILFP CQTLMQRRDH YTRYAIGRSR TLGDAESCFL VWLFHWPARR VKLHPRRASR FRPELVLENG KQRRFDTIVI GSGSGGCACA NLLAQSGQRV LILEQHTKTG GCTHSFRDRG CEWDTGLHYT SAGMGRSTCR PGAIMHFMTQ GLQKWTPLQD PYDEVIFPPD DFVKLGVPNE SSYRFVSGAD ETIQSVLASI DPEHRELEKR ARLYMDLCTD INSGFTALGI SRVLPSWMHF LVRSRIDRLM KFAAMTVRDV QYGMLNLGLT IEELLKDGCP PAPAGSEPDP SIRRLKAVLT HPIGDYAVQP RDATMAAHGV TMAHYQDGAC YCVGPTQQIS VRSSSMVREF GGEVLTDATV REIILEHGRA VGVRVSNTSA LAECKSDAER AQVPVTELRA KAVVCATSVY NLYNNLLPQD LAQVKEFQDP EKRTIQQSNG HIFLFCKIKG DPTELKLPAH NLWYFNSYDI DDAFEAYFTD PVGQRPPTVY IGFPCTKDTS WKQRFPGVSN CILISDGLWE WFEKWQDKPV HNRGSDYEEF KEKLSKHLLE ILFEFVPEVK DKIEFSFLGT PLSEQTYLNS FCAGSYGTKC LPSMFAKSNR RWTTSPHTSI PGLYLAGSDA FLPAVCGAMY GGCFGAIAVL GHLRALKLTL AFIAHFAGCI TDEDPKIGWI QAYILAWKKF MND
|
| |
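The sequences above are printed in space-separated blocks of ten, so they need whitespace removed before use. The sketch below strips the grouping and translates the first 30 bases of the gene sequence to compare them with the start of the protein sequence. The Translation table field is empty in this record, so the standard genetic code is an assumption here, and the helper names are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed, since the record leaves the table blank),
# written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the ten-character grouping spaces and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence and first block of the protein.
gene_start = "ATGACTCCCG TCACGCAGAG TCCACCCGTC"
protein_start = "MTPVTQSPPV"

print(translate(gene_start) == clean(protein_start))  # True
```

Because the gene is spliced, translating the full genomic sequence in one frame would not reproduce the whole protein. Only stretches that lie within a single exon, such as the opening bases used here, can be compared this way.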