Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36115 |
Symbol | |
ID | 7201177 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 579145 |
End bp | 581528 |
Gene Length | 2384 bp |
Protein Length | 782 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180465 |
Protein GI | 219119408 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000280602 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCCTC ATGAGGTAAA AAAGAGTATT GCCTACATCA ATGGCCACAA AGTCCAAATT GCGCTGGATG CAGCCAAGAG CAGTGCACCA CCTAGTGTTA TTGGAAAAAT GCTCCAGTTA CAAACTGGCA ACATCTTGTC GGACTCTTCT TTGCAACACA TAAGGATACG GTTGGGAAAA GAGCAAGAAA CAGTGTTGCT CAAGGATGAA GAATTCATGA CGCAGGCAGA CAAGCTTTTG TCCTATCTTG AGAACACTCC CGAGATCAGC TTTTGTGCTA TATACGATGA CCCAGATTCA CCTCTCTTCA CAGTTTACAA GCAGAGAGCA AAAAAAGACC GCAGGTGTCT ACACACAAGT ATTAGAGTTA ACTCCGGGGT ATCTGCTGAA ACGGGAATTC TTGACAACGC TACACTAGAT GCAATGGATC CCAATGGAGA ACTTGATGAC TATGTTGACC GCACACGTTG CGCATTCAAG CTCCTTGGCT CACAAAAAAT GCTGTTAGGA GTTGCCTGGA CAAATAATGA GAGCAGAAAT GTATTCTCTC GTTTTTCGGA GATAATGGTA GCAGATGTGA CGGAAGGCAC AAACAACGCC AAGCGACCGC TATTTCTATT CTCTGGGAAG ACGTCAAACC AAAACACTTT CACAACACTG TGGGCTTTTT TACCACAGCA GGCTTGTTGG GCCTTCCATT GCGTGTGGAC TCGATGCATA CCGCAACTAC TCCCAAAGCA AGGAATTCAA CAAATGCGCC TGACAATAAC AGATGGAGAT CCAAAAGAGT ACGGGACCTT TGTAGATGCA ATACCAACTT TCTACCCTCT TTGCGAACAC AAGCTTTGTC ACTGGCATCT ACTGTATTGC AGTAATCTCA TGAAGGTGCA GACTGGAAAA TGTGGAGTTA AAGCTGCTAT TCTATTCCGT GTAGTTGTTC TTTGGATTGA GAGCTGGATG ACCAAAATTG AGACACAAGA GGAATACAAA CTTTCTAAAA GGCTCTTGGC TGATTGGCTT GCAACCCCCG AAGCTATTGA TGTCACATTG GGTGGTATGG GGCAAACTAT TGTATCGCAA ATTAATGCGT ACATGACACT GTCGCTTTTT CCTCACGAAC AGCGCTGGGC TAGATATCGC TATTTATACA CACGAGCATT CAACACATCT GCAAGCTCGT ATGCCGAGGC AGAAAATAGT GCTTTAAAAC GACGGGGCGA CGGGGTCAGG CCAAGCTTTT CTGTACCAAA AGCAACTCAG GTTATAAACA AAGGGACACA AATTAGGTCA AAGAAGAGGC ATCAAAAAGC TGTTTACAAT TTAAATGCTG CCAAGACAAG AAAGCCTGCC TACTACGCAA ACATTGGGGA TTTAGTGGAT TACATTCAAG ATTCTCTTTC CAAAGATTTT GAAGCAGCTG CTTCATTTGT GCTCTTCCGT CCAAATGCAG ACCAGTTTTG GGTCAAGCAA GCCACTCGCA AAAGCAAAAA CACGGACATT CCGAAAATCA ACGACAGTAG CTATTACAAG TACATGATTC CGCAGTTTGA ACGCACACAA ATTGTGGAGC TTGTTAATAT TGATGGTACA TTCTATTTGG TGTGTAGCTG CGGAAATTTC AGCGACAAGC TTCCCCATGT GCCCATCTTT ACAAGGTTCT TGGTCAATCA CCCACGTCAA CCGATGTCTC TGTACGCTGG ACAAAGCACT GGGATGTGTA TTTGCACCGA AGTGGCCACA GTGACCTGTC AAAGCACTTG GAAGACCTGT ACAAACAGGA GCAACCAGGT CCAGTATTTG TTGATAGTGG TCAGTGGGTG ATCGGAAAAG GTGAAAAAGG GTCATATTTT TTCGAAACTT CGCTTCCGTA CAAGCCCCCT GTCATACGAG ATTTTAATCG ATGGGCAGTG TCTTCGCAAA CGACTGGAGC TGATTTGAGT GGGACCAAAA ATACCACAAA TATGTATTTT TCGAGTGGAA TGGTGCAAGA ATCAACAAGC CTGTCCAGAG AGCATGCATT CCAGGATTCA TTGCATGAAA ATTCCACAAA TTCGCAAAAT TCGGATTGTT TTGATGATAC AGAGGCAGTG ACAACGGAAA GTGTATCTTC TACGAAAAGA ATGTTTGGCT CAAGTGCTTT TGTGCACAAT TTCCATTTCT ACCAGGAAAT GTCCAAGCTG GCAGGATTTG ATATGGAAGC TGCTGAGTCG ATGAATAAAG CTATGCAAGA AGCATTGGAA AAGGTACAAG CCAATGTTGC AAAAAGGGCT GGGAAGATGG ATTACACTAT AGGACCAGCC ATTACAAAGG ACTGTGTAGG TCTAAGATTG AAGCCAAGCT ACAGTCCAAA GAAGAGGAGG AAACCAAACT TACAAAGGAA ATGA
|
Protein sequence | MEPHEVKKSI AYINGHKVQI ALDAAKSSAP PSVIGKMLQL QTGNILSDSS LQHIRIRLGK EQETVLLKDE EFMTQADKLL SYLENTPEIS FCAIYDDPDS PLFTVYKQRA KKDRRCLHTS IRVNSGVSAE TGILDNATLD AMDPNGELDD YVDRTRCAFK LLGSQKMLLG VAWTNNESRN VFSRFSEIMV ADVTEGTNNA KRPLFLFSGK TSNQNTFTTL WAFLPQQACW AFHCVWTRCI PQLLPKQGIQ QMRLTITDGD PKEYGTFVDA IPTFYPLCEH KLCHWHLLYC SNLMKVQTGK CGVKAAILFR VVVLWIESWM TKIETQEEYK LSKRLLADWL ATPEAIDVTL GGMGQTIVSQ INAYMTLSLF PHEQRWARYR YLYTRAFNTS ASSYAEAENS ALKRRGDGVR PSFSVPKATQ VINKGTQIRS KKRHQKAVYN LNAAKTRKPA YYANIGDLVD YIQDSLSKDF EAAASFVLFR PNADQFWVKQ ATRKSKNTDI PKINDSSYYK YMIPQFERTQ IVELLRKFQR QASPCAHLYK VLGQSPTSTD VSVRWTKHWD VYLHRSGHSD LSKHLEDLYK QEQPGPVFVD SGQWVIGKGE KGSYFFETSL PYKPPVIRDF NRWAVSSQTT GADLSGTKNT TNMYFSSGMV QESTSLSREH AFQDSLHENS TNSQNSDCFD DTEAVTTESV SSTKRMFGSS AFVHNFHFYQ EMSKLAGFDM EAAESMNKAM QEALEKVQAN VAKRAGKMDY TIGPAITKDC VGLRLKPSYS PKKRRKPNLQ RK
|
| |