Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39528 |
Symbol | CPS-II |
ID | 7195219 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 57596 |
End bp | 62683 |
Gene Length | 5088 bp |
Protein Length | 1542 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183539 |
Protein GI | 219126596 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACACG CGTTCGGTGC CAACACGACC GTTTCGGGGG AGGTCGTCTT CAACACGGGT ATGGTCGGCT ATCCGGAAGC ATTGACGGAT CCTTCCTACC GTGGACAGAT CCTCGTCTTG ACGTATCCCT TGATTGGCAA CTACGGGGTG CCAGATGAGA ATATCAAGGA TGCGCACGGC TTGCCCAAGT ACTTTGAGTC TTCACAAATT CACGTTGCCG GTCTCATTGT CAGCAGTTAT TCTTGGCAGC ATTCCCATTG GGCGGCACAA AAATCTCTCA GCAAGTGGCT GACGGACAAC AACGTACCCG CCATGTATGG TGTCGATACT CGTGCGCTTA CCAAGCGATT GCGAGAACAC GGGTCGATCC TGGGACGCAT GATTGTGGTA CGTTCTCTTG TGACGGCGAC GACACCGCTG GTGAAACGTA AGTATGGAGC ACGACAGGAA TGATTCTTAC ACATTGTTGC ATCATGGCTT TGTAGAATCC GCCCCGTATC GAAGAATCGG TTTCTTCCGA AATGTTTGAC GATCCGAATA CACGCAATTT GGTCGCGGAG GTCTCGACCA AGAAAGTCGT TGTCTACGGC ACGGGAAACT CGCCCAAGGT CATTGCCTAC GATTGCGGAA TGAAGTATAA CATTATTCGC TACCTCGTCG AGATTCACAA GGTCGAGCTC GCTGTTGTTC CTTACGACTA CGATCTCGAA GCTAATCCCG ACAACCTCGA ATGGGAAGGC CTCTTTCTTT CCAACGGTCC CGGGAACCCG ACCATGTGCG AGCAAACCAT CAAGTCGATT CAGTACGCCT TGAATCTGGA AGTAGCCAAG CCAATCTTTG GTATTTGTCT CGGTAATCAG CTGCTGGCAC TCGCTTCTGG CGCCAAGACC TACAAGCTCA AGTATGGAAA CCGTGGAATG AATCAGCCTT GTATCGACTT GCGGACGGGA CGATGCTACA TTACTCCACA GAATCACGGC TTTGCCGTCG ACGCAGCGAG TTTGTGCGAG TTTTGGAAAC CACTCTTTCT CAACGCCAAC GATCTGACCA ACGAAGGTAT CGTACACACG AACAAGCCCT TTTTCAGTGT CCAGTTTCAC CCGGAAGCGT GCGGCGGTCC GACCGACACT GCGTTTTTGT TCGACAAATT CGTCGGACAC GTAAAGAATG TGCCACAGCC GTTGGTGTTG CAAGATAGTC TCAGCTACGA GCGCAAGACG TACAAGAAAG TTTTGCTGGT TGGTAGTGGC GGTTTGAGCA TCGGACAGGC TGGTGAATTT GATTACTCCG GAAGCCAGTG CATCAAGGCG CTCAAGGAGG AAGGCATTGA AGTCATCCTG ATCAACCCGA ACATCGCAAC CGTTCAGACC TCACAAGAAA AGGAAGAAAA ACAGGCTCGT GCTGCTGACC GAGTCTACTT TTTGCCCATT CAACCCAATG TTGTCATGGA CATCATTCGC AAAGAGCGCC CTGATGGTAT CATCGTATCC ATGGGTGGAC AGACTGCCTT GAACGTTGGA GTTCAGCTTT GGCGCTCGGG GGAACTTCAG GCTGAAGGTG TGGAAGTTCT CGGTACGCAA ATTCCTGTCA TTGTAGCTAC AGAAGACCGC GAGATCTTCA GTGAAAAGCT GAAGGAAATT GATGAGACTA TTGCTCTCTC GTATAGCGCT ACCAATATTG AGGAGGCGGT CGTAGCCGCG AACAAGATTG GATACCCCGT GCTGATCCGA GCTGCCTACG CTTTGGGAGG TCTAGGGTCT GGATTCGCGG AGAACGATGC CGAGCTCAAG ACAATGGCAG CCAAGGCCTT TTCGACGAGC GATCAAATTT TGATTGACCA GGACTTGCGT GGTTGGAAGG AACTAGAATA TGAAGTTGTC CGTGACAACA GTGACAACTG TATTACGGTG TGCAATATGG AGAATTTCGA TCCTCTGGGT ATTCACACGG GAGACTCCAT TGTTGTCGCC CCATCTCAAA CTTTGACCAA CCGGGAGTAC TTTATGCTCC GCCGTACTGC TCTGAAGGTG GTTCGCCACT TAGGTATTGT TGGAGAATGC AACATCCAGT ACGCCTTGCA CCCTGAAAGT GAACGCTATT GTATCATTGA GGTGAATGCT CGTCTTTCGC GATCCTCTGC CTTGGCCTCC AAGGCCACTG GTTATCCACT TGCTTACGTC GCAACCAAGC TGTCGCTGGG AAAAAATCTT GTTTCGATTC GAAATTCAGT GACGAAGACA ACAACGGCTT GTTTCGAGCC CAGTTTGGAC TACTGCGTCC TGAAAATGCC TCGCTGGGAT CTGAAGAAGT TTTCCCGTGT CTCCAACAAG CTTGGTTCTT CCATGCTTAG TGTTGGAGAA GTCATGTCGA TTGGTCGCTC CTTCGAAGAA GTGATGCAGA AGGCGTGCCG TATGGTAAAC CCGAATCTAG ACGGTCTCGA AGGAAACTAC TCCGGGCTCG TCGATGAAAG TTTTGACTTG GAAACGCAAC TGACTACCCC CACCGACACA CGCTTATTTG CGGTCCAGAC AGCTCTTGAG AACGGCTGGT CCGTAGACAA AGTACACGAT TTGACAAAGA TTGATCGTTG GTTCCTAAGC AAATTGAACA ATATTGCTAC AATGCGAAAG GCCGCAATGG CTGCCGGTTC TCTCGACAAC TTAACGAGCA ATAATGGACG GGAGCGTATG CGTCAACTGA AAATGGCGGG TTTCAGCGAC CGACAAATTG CTCGCTACGT TGGTCTTCCA GGTACGCTTG ACGGTGAGTC GCGCGTCCGT GACCGTCGAA AGTATCTCGG AGTGACCCCT GTTGTCAAAC AGATTGATAC GCTTGCTGCT GAATTCCCGG CACAGACGAA TTACCTCTAC GTAACCTACT CGGGAGATGA GGATGACATC GAGCCATCAT CACACTCATC ACAACTCACC CCGTCTTACC GCTATTCTCT TACGGAGAAG TCTCGCTTGG ATTCGGGCGA GTTTCGTGCC CGAGCTAGAG CCTTTTCGGT CGGCCAGGCA GTGGCCAAGA ATACTTTGAA GGAGGCAAAA GAACGGGGCG TCATAGTTCT GGGCTGTGGA GCCTATTGTA TCGGGTCCAG TGTTGAGTTC GATTGGTGTG CGGTCTCGTG CATTCGACAG TTGCGCCGCG ATGGATTCAA GAGCATCATA ATCAATTACA ATCCCGAGAC TGTAAGTACA GATTATGATG AGAGCGACCA GCTCTATTTC GAAGAGCTCA GCACAGAGCG TGTTTTGGAT ATCTACGAAC GCGAAGGAGC TGCTGGTGTT ATCGTTTCGG TTGGAGGACA AATTCCGAAT AACCTGGCAG ATCCGCTTTC ACAGAGTGGC GTTAACATTT TGGGAACAGA TTCTAGCGAC ATTGGCCGTG CCGAGGATCG TCGTCAATTC TCGGACATGC TTGACACCCT TGGAATCCAA CAGCCCGAGT GGTCTGTCCT CAAATCGAAA GACGAAGCGA TTTTGTTTGC CGAGAAGGTT GAGTACCCAG TTCTAGTTCG TCCCAGTTTC GTGCTCTCTG GCGCTTCGAT GAGAGTCGCC GCCGATGAGT TCCAGTTGCG CAACTTTTTG GACTTGGCTG CGGACGTTGG TAACGACAAG CCAGTCGTCG TGACCAAATT TATCCTCGGA GCGAAGGAGA TCGAGTTTGA TGCGGTAGCT AACAAGGGAA CAATTCTCAA CTACGCAATT GGTGAGCACT TGGAAAATGC TGGTGTTCAC AGCGGAGATG CTACTGTTAT CCTTCCGGCC CAAAAATTAT ACGTCAGTAC GATCCGTCAG GTCAAGCGTT ACGCCAGTGC GATTGCGAAG TCTCTTCGCA TCACGGGACC ATTTAATATC CAGTTCATGG CTAAAGGTAA TCATGTGCAG GTCATTGAAT GCAATCTTCG CGCGAGTCGA ACCTTTCCGT TCGTAAGCAA AACGTTGAAC AACAATTTTA TCTCTCTCGC CACGAAGGCT ATGGTTGGAC TTGAAGCCGT TCCCTACAAG ATTTCGTTGC TCGATATTGA CTTCATCTGC GTAAAAGCTC CCATCTTTTC ATTTACCCGC CTTCGTGGTG CCGACCCGAC CTTGGGTGTG GAGATGGCGT CGACTGGCGA GGTTGCTTGC TTTGGAGAAG ATTCTCATTC CGCGTTCTTG CAGTCAATGC TTTCGACAAC GTTCAAGTTA CCCAACAAGA ATCGAACCAT TTTGCTTTCA ATCGCCAGTG AGGAATACCG ACGTGAGTTT GCTGAGGCTG CTGTCATTTT GCACCGTCTG GGCTATAAAC TTTTCGGAAC TCCTGGTACG GCAGCGTACT ACCAAGACAA TCACGGCATT GAGATCAAGT CTGTTTCTAA ACCGGATGCT GAAACGGACG ATGCTCCGGG AACTGCCTTG CACGAAATCA AGAACGGAAA GATTGACTTG GTGATTAATG TGAGTGAAGG AGCGACGCGT CGCGAAGAAA TCACGTCGGG TTACATTATC CGCCGGGCGG TGGTGGACTT CGGGGTCAGT CTGGTAACGG ATGTTAAGTG TGCCATTAAA CTCGCGGAGT GCTTTGACCG CGGAATGGGC GACGGTAAAT TCGTTCCACG CCACATTGGT GAGTTCTACA AGATCCCGAC GATTGGCTGG ACGAGTACCT AAGCGGACGG GACACGGAGA ACCCTCTACA TATATAAGAA CTTTAGGAAC CACAGTGAAT ACAGACGCCT TTGATTGGCG TTGCGTGTGT ACCATAGACA CCATAAATAC AAACTCAAAT AATTCCTCTG TTCATTTTTG TTGTTTTCCG TTTGTGCTGC TTGCGAGATG CAGAAAAGAT CGGAGGTAGA ACGTCGCTTT ACCAGAGACT CTAGCTCTCG GTGGAGATTG TTTTCGGGTA TGTATCTGAG TTCTCTTCAA TTGTGTATAT CTAATCTTGA TTTGCTGTTA CACGTCATGG TGTTTATTAC AGAGTTTAGT TTCGGCCCAA TCTTTCACGC GGGGCCAAAT CCAAATATGT AACGTCGGTC AGTGCGCATC AAAGACAGAA AGCTTCGGTG ACTTGCAAAG AACGAGTGCG ACCGGAAAGA CGCCATGA
|
Protein sequence | MGHAFGANTT VSGEVVFNTG MVGYPEALTD PSYRGQILVL TYPLIGNYGV PDENIKDAHG LPKYFESSQI HVAGLIVSSY SWQHSHWAAQ KSLSKWLTDN NVPAMYGVDT RALTKRLREH GSILGRMIVN PPRIEESVSS EMFDDPNTRN LVAEVSTKKV VVYGTGNSPK VIAYDCGMKY NIIRYLVEIH KVELAVVPYD YDLEANPDNL EWEGLFLSNG PGNPTMCEQT IKSIQYALNL EVAKPIFGIC LGNQLLALAS GAKTYKLKYG NRGMNQPCID LRTGRCYITP QNHGFAVDAA SLCEFWKPLF LNANDLTNEG IVHTNKPFFS VQFHPEACGG PTDTAFLFDK FVGHVKNVPQ PLVLQDSLSY ERKTYKKVLL VGSGGLSIGQ AGEFDYSGSQ CIKALKEEGI EVILINPNIA TVQTSQEKEE KQARAADRVY FLPIQPNVVM DIIRKERPDG IIVSMGGQTA LNVGVQLWRS GELQAEGVEV LGTQIPVIVA TEDREIFSEK LKEIDETIAL SYSATNIEEA VVAANKIGYP VLIRAAYALG GLGSGFAEND AELKTMAAKA FSTSDQILID QDLRGWKELE YEVVRDNSDN CITVCNMENF DPLGIHTGDS IVVAPSQTLT NREYFMLRRT ALKVVRHLGI VGECNIQYAL HPESERYCII EVNARLSRSS ALASKATGYP LAYVATKLSL GKNLVSIRNS VTKTTTACFE PSLDYCVLKM PRWDLKKFSR VSNKLGSSML SVGEVMSIGR SFEEVMQKAC RMVNPNLDGL EGNYSGLVDE SFDLETQLTT PTDTRLFAVQ TALENGWSVD KVHDLTKIDR WFLSKLNNIA TMRKAAMAAG SLDNLTSNNG RERMRQLKMA GFSDRQIARY VGLPGTLDGE SRVRDRRKYL GVTPVVKQID TLAAEFPAQT NYLYVTYSGD EDDIEPSSHS SQLTPSYRYS LTEKSRLDSG EFRARARAFS VGQAVAKNTL KEAKERGVIV LGCGAYCIGS SVEFDWCAVS CIRQLRRDGF KSIIINYNPE TVSTDYDESD QLYFEELSTE RVLDIYEREG AAGVIVSVGG QIPNNLADPL SQSGVNILGT DSSDIGRAED RRQFSDMLDT LGIQQPEWSV LKSKDEAILF AEKVEYPVLV RPSFVLSGAS MRVAADEFQL RNFLDLAADV GNDKPVVVTK FILGAKEIEF DAVANKGTIL NYAIGEHLEN AGVHSGDATV ILPAQKLYVS TIRQVKRYAS AIAKSLRITG PFNIQFMAKG NHVQVIECNL RASRTFPFVS KTLNNNFISL ATKAMVGLEA VPYKISLLDI DFICVKAPIF SFTRLRGADP TLGVEMASTG EVACFGEDSH SAFLQSMLST TFKLPNKNRT ILLSIASEEY RREFAEAAVI LHRLGYKLFG TPGTAAYYQD NHGIEIKSVS KPDAETDDAP GTALHEIKNG KIDLVINVSE GATRREEITS GYIIRRAVVD FGVSLVTDVK CAIKLAECFD RGMGDGKFVP RHIVSAQSFT RGQIQICNVG QCASKTESFG DLQRTSATGK TP
|
| |