Gene Information |
Locus tag | PHATR_33610 |
Symbol | |
ID | 7204061 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1364662 |
End bp | 1366851 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186516 |
Protein GI | 219113865 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACTTG ATTCGCGTGA ACTACGATTC TTAAGCGGCT CGTTCGCCGA ATTTACTTCA CACTGTCGCT TTGCATCTTC TTCCTCAACG ATAGCCTCTG TGCTCGCCTC TGGTGCGGCC GCCGCAATCG AATACGAGAA TTCGTTTTAC CGTGTATCGG AAGACTCTCG CCGCGTGCCG CCAACGTCAC AACAAAGACA GGCGCAACCG CAAACACAAC GGCGCAATAC GCACACGAAG AGTAGTAGTA GTAGCGCACC AGGGATTTCT GCGTTGCCCA CACAACCCCG CCAGGGCTAC GGCCCATCAA TTCAGCTCAC GGAAGAGGAA CGGGCTTTGT TTACCCTCCT GCGCCAAGTC CGTTCCGAAA CCAAATTGGA TACCACATTA CGGGTTGCTG GGGGGTGGGT GCGGGACAAG CTGCTCGCAA CCCCAGAGTT TCAGACTTAT CACAAAGTAT TGGATGTTGG TAGTCAACGC CTTACTTCAA AGTTTCAAAA ACCGTCGGCA CCGTCCATGG GACGGCAAGG CACCAAAGTA TTGGTGAATA CCGACAAAGA TGCACAACCT GTTGATATCG ACATTGCGTT GGATGATATG TTGGGACGAG AGTTCGCCGA CCGTTTGAAT GAATACCTTT CGATGGAAGG CCAGGATACC ATCGCGGTCG GAGTCGTTTT AAAAAATCCC GAAAAGTCGA AGCATTTAGA AACTGCTACT ATGAAGGTTG GTTCCTTTTG GATAGATTTC GTTAATCTCC GCGCCGAGGA ATACACACAA GCTAGCCGTA TTCCTGATTT GATGCGAATT GGTACCGCCG CCGAAGACGC CTTTCGCCGT GATTTGACCA TAAATGCCTT GTTTTACAAC GTCAATAGTG GGCAGGTTGA AGACTGGACA GGTCGTGGTT TTGATGATTT ACGTAAAGGC GTAGTCGCGA CGCCCTTACC GCCACTTACC ACTCTGTTGG ACGATCCATT GCGAGCCTTG CGCTCGGTAC GCTTTGCGGC GCGTCTACGC TTTACCATGG ACGATGAACT AGTAGCAGCC GCCAAAGATA AATCTGTGCG AAACGCGTTG GCGCAAAAAG TGTCACGTGA ACGAGTTGGA GGAGAACTGG ATCTCATGCT GCGATCGCCC GATCCAGTTG GTGCCATGCG GCTTTTGATC AATCTGAGGC TGATTGATAC AGTTTTCCCG ATTGAGCACT ATTTGGCACC GGATAAAACG ACATCTACAG CTACTCTATT TGACAAAGGG CTTGAGCTGT TATCTACTTC ACATGATCAT TTGGCAGATT GCCGATGGTC ACCGCCTTTA TGGTGCAAGA ACTCCCGTGC AGCCTTTGGA GCCGTCGAAC ACCGTCTTTT ACAAGACGAA GAAGCTCGTC GCCTGTTATG GTACGCTGCT TTTCTAAAGC CAATTCACGA TCGTACGCCA GTGAGACAGG CATCCAACCC AAAACTGACA GGGAGGAAAG CGAATCGTTC CGCCGTTGCT AAGTTGCTGG TGGATAAACT CAAGCGACCT ACCCGAGACG CCGAGGCTGT CGAGCGTATC ATCAAGGCTT CCACAGATTT TACACAATTA GTAAACGCTG GCTGTGACAT TTCAGCTACC ATGATCCTCT TGAGCGACGT CCGGATTACG TACGTTGCTA GTTGTGACGA GTTTGACAAA TCAGAGGGCC TGACCGGAAG TTTAATTGGT ACAATGAATG GACGCCTCGT TGACAGTGCA GTTGAAGAAG ATCCAGTATG GCAACATGCG ATGGAGTTCC GAATGCTTTG TGCGAAACCT 
TTACAGCGGG TTGGCCCACT ATGGCGTGCC TCATTATTTT TGAGTATTTC TGAGGCCATG GCAACCTTGG AAGATGGATT TACAAGACTG GATTACACTA TTGAAGGAGA TGTGTTTGAT GAGATACTGG AGGAGCGACG ACAAGGAGTG ATCGAACGAT ATGACGCGTT TGCGACGGCT CTACAGCAAA CTGGATTGAT AGGCATTTGG GATCAGCCGC CTCTACTAGA CGGTGATACT TTGAAAACGA GCATTTTAAA AGGTATTCCG AAAGGACCAG CATTTCGAGA CGTTATGGAC GAGCAATCAA ACTGGATCAT TACACACCCA GGCGCGGACG TCGCAGCTCT GAAAACGCAT TTGCAAGAAA CCTTTCCGGA ATACATCTAA
|
Protein sequence | MALDSRELRF LSGSFAEFTS HCRFASSSST IASVLASGAA AAIEYENSFY RVSEDSRRVP PTSQQRQAQP QTQRRNTHTK SSSSSAPGIS ALPTQPRQGY GPSIQLTEEE RALFTLLRQV RSETKLDTTL RVAGGWVRDK LLATPEFQTY HKVLDVGSQR LTSKFQKPSA PSMGRQGTKV LVNTDKDAQP VDIDIALDDM LGREFADRLN EYLSMEGQDT IAVGVVLKNP EKSKHLETAT MKVGSFWIDF VNLRAEEYTQ ASRIPDLMRI GTAAEDAFRR DLTINALFYN VNSGQVEDWT GRGFDDLRKG VVATPLPPLT TLLDDPLRAL RSVRFAARLR FTMDDELVAA AKDKSVRNAL AQKVSRERVG GELDLMLRSP DPVGAMRLLI NLRLIDTVFP IEHYLAPDKT TSTATLFDKG LELLSTSHDH LADCRWSPPL WCKNSRAAFG AVEHRLLQDE EARRLLWYAA FLKPIHDRTP VRQASNPKLT GRKANRSAVA KLLVDKLKRP TRDAEAVERI IKASTDFTQL VNAGCDISAT MILLSDVRIT YVASCDEFDK SEGLTGSLIG TMNGRLVDSA VEEDPVWQHA MEFRMLCAKP LQRVGPLWRA SLFLSISEAM ATLEDGFTRL DYTIEGDVFD EILEERRQGV IERYDAFATA LQQTGLIGIW DQPPLLDGDT LKTSILKGIP KGPAFRDVMD EQSNWIITHP GADVAALKTH LQETFPEYI
|
| |
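The lengths reported in this record can be cross-checked against its coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the displayed gene sequence ends in TAA). The helper names `strip_grouping` and `span_length` are hypothetical and are not part of any database tooling.

```python
# Values copied from the record above.
START_BP = 1364662
END_BP = 1366851
GENE_LENGTH_BP = 2190
PROTEIN_LENGTH_AA = 729


def strip_grouping(seq: str) -> str:
    """Remove the spaces and line breaks that split a sequence into 10-residue blocks."""
    return "".join(seq.split())


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


# First three blocks of the gene sequence, exactly as displayed in the record.
gene_prefix = strip_grouping("ATGGCACTTG ATTCGCGTGA ACTACGATTC")

# The coordinate span should equal the reported gene length.
coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP

# One codon per residue plus one stop codon (assumption: unspliced CDS with a stop).
codons_ok = GENE_LENGTH_BP == 3 * (PROTEIN_LENGTH_AA + 1)

print(coords_ok, codons_ok, len(gene_prefix))
```

The same `strip_grouping` call can be applied to the full gene and protein sequences before passing them to other tools, since most sequence software expects ungrouped strings.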