Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_25067 |
Symbol | |
ID | 7196961 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2423284 |
End bp | 2425574 |
Gene Length | 2291 bp |
Protein Length | 566 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176976 |
Protein GI | 219110449 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACAGTCGTGC GCATACTACC AGGGGATCCA CATACCTACA CAAAGTGACA CACACACACT CTCGTACCTT GCTGTTTGTG AATCTCCCCC ACTGGAACGA CTGTTTGGCA AAATCAACAA CAAATCCCCT CGCATCGACT GGAACAAGTG AAAACGATCT GCCCGAGTGC AATTCCGAGA ATATCTCAGT ATCAGCATAC CGCCAAATCG TTTGTAGTTC TACAGCGACT TTGTTTTTTT GGCTTTCTCG AGACACTTGA GGAAACTCGT CGATTTGTAG AACGAGTCTT TTTTCCTCTC GGCGTCGCAA TCACAAACTG GACTTCAATG GGAAACGAAC CATCGAAAAA GGCAGGTCGA AATTCATCAG CACCGTCGAC TTCGACAGCC ACCACAACAA AGACGAACAC CTCAAAGGGT AAGTCGAACG CGACATAAAG CGTCCACGAT TCGAAAGTGT ATGAGTGCGC ATTGTAGTTG CTGTTTCGAT AAATGTTGCA CGGATTGACC TCACCTCGCC ATTTCCACCG TCTATCTCTG ATTTACTTGG CATGGTCAGA TAAGCACAAG CCGGCTCACG GCTCTAAGCA TGTCAGTATA AACCCGGGGG ACATAAAGAA CGAAACCTCT ACTCTCCCGG CCGGTAACAA AACCGCCAAA CCAAAATTGC GCATGGACGA ATCCGCGCAT ACCTTTGCCC CCAGCACGGC GAGTGCCTCG GGTCACTACC GCCGGGGATC GTCGCCCGTC ATGATTACCG ACGCCCTTTC CGATGTTCGC GTGAACTATC ACATTGAACC CAAGGAACTA GGACACGGTC ATTACGGGGT GGTGCGAAAA TGTATGCACC GTGATTCCGG AGAATGGTAC GCAATCAAGA GTATTCGAAA ATCGAAAGTG TCCAAAATTG AGGTATTGAA ACGAGAAATT GCTATCCTGA AAGAAGTCCA ACACCCGCAC ATAATCGAGC TGCACGAAGT TTACGAAGAC GAACGTTATC TGCATTTGAT TACGGAAATT TGCACCGGTG GGGAACTCTT TGATCGGATT ATTGCCAAAA CGCAATCCGC CGAAGGACAC TTTTCGGAAC ACGATGCGGC CGTCCTGGTG CGAGACATTC TCGACGCTAT TCGCTACTGC CACGACGAAA AGGGCATTGT TCATCGCGAT TTAAAACCGG AAAATTTCCT CTTTCTCACG GAAGCAGAGG ATGCACCCGT CAAGATTATT GATTTTGGGT TGTCCCGGCA CGAGACAGAC ATGGGTATCA TGCAAACCAA GGTAGGGACA CCCTACTACG TCGCGCCAGA AGTTTTGAGA CGGGAGTACA CCAATTCCTG TGATATTTGG TCGATCGGTG TCATTACGTA CATCTTACTG TGCGGCTACC CACCCTTTTA TGGTGAATCC GACACGCAAA TATTTGAATC GGTCAAAGTG GGCAAGTTTG ACTTTCCGTC ACCCGAATGG GACGAAATCA GCCAGTCGGC GAAAGATTTC GTGCTGATTA TGCTCAAGAA GAGTCCCATG GATCGGTACG GAAAGGTGTC CAGTGACAGA CATCTGCCTT GGCATCCGCA TGTTATCGTC ACTCACCCTT GTATTACTCA TCTTGCCTCC TCCTCGTATC ATTAGACCTA CGGCTGCCGC TGCCCTTAAG CATCGATGGC TCAAGGAACA GCTCGGACGC AAGGAACTGG CCACCTCTAG CATTTCTCAT GCAAGCGTTC GGACGGGAGA GTTTACCAAG TATTTGGCGA TGAAAAAGTT GAGAAAGGCG GCTCTCGGTT ATATTGCGTC GAACCTGACA CAAACCGAGG TGGGACATTT GGCGGAATTG TTCAAAACCA TGGACAAAAA CGATGACGGT CACGTTTCAC TAGCCGAACT AGATGAAGCT ATTGCTAAGG GAAGCTTCAA TAAGGAAATT CGAGACGATC TCAGGGAGAT GCGGCACGAA TTGACCTTGT CGGACGAAGA GACTATTGAT TACCGAGACT TTTTGGCTGC AACCATGGAT CGCAGTCTAG CAATGCGCGA GGAGAATATG AAAATGGCTT TTGAGCATTT CAAGCGTTCT GACGCCGACT ATCTGACTCT GGAAGATTTT GCCGATTTCT TTGGTGGAGA AGCGCACGCT AAGGAGATCT TGAGTCTGTT GGATGCCAAC GGAGATGGGA AGGTATCGTT CGATGACTTT CGAAGAGTTA TTGCCGAAAG CATGGAGGAC GACGAAGATG AAACTGAGAA TGGGGAAGTC ATTGGGTAAC AGTAAATGTA ACGTAACGCA C
|
Protein sequence | MGNEPSKKAG RNSSAPSTST ATTTKTNTSK DKHKPAHGSK HVSINPGDIK NETSTLPAGN KTAKPKLRMD ESAHTFAPST ASASGHYRRG SSPVMITDAL SDVRVNYHIE PKELGHGHYG VVRKCMHRDS GEWYAIKSIR KSKVSKIEVL KREIAILKEV QHPHIIELHE VYEDERYLHL ITEICTGGEL FDRIIAKTQS AEGHFSEHDA AVLVRDILDA IRYCHDEKGI VHRDLKPENF LFLTEAEDAP VKIIDFGLSR HETDMGIMQT KVGTPYYVAP EVLRREYTNS CDIWSIGVIT YILLCGYPPF YGESDTQIFE SVKVGKFDFP SPEWDEISQS AKDFVLIMLK KSPMDRPTAA AALKHRWLKE QLGRKELATS SISHASVRTG EFTKYLAMKK LRKAALGYIA SNLTQTEVGH LAELFKTMDK NDDGHVSLAE LDEAIAKGSF NKEIRDDLRE MRHELTLSDE ETIDYRDFLA ATMDRSLAMR EENMKMAFEH FKRSDADYLT LEDFADFFGG EAHAKEILSL LDANGDGKVS FDDFRRVIAE SMEDDEDETE NGEVIG
|
| |