Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_231 |
Symbol | |
ID | 7201612 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 38474 |
End bp | 41667 |
Gene Length | 3194 bp |
Protein Length | 970 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180935 |
Protein GI | 219120392 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATCCGGAAG TGCGCGCCGA GCGGGAGCGA GTGCGGGAAG AACAACGACA GGAAACGGAA GAACGACGCG CCAAGAAACG GCAAGCGGAA CAGGCTTTGG CCAACAAACA CCGGAAACTC GACAAGGAAG AAGCTCGCAA GGCACAAGCG CGCTTGGAAT ATCTGTTGCA GCAGAGTTCC ATTTTTGCGA AACTGCAGGG TGGATCCGGT GCCATTCCGC AGGGACCGGA CGAGGCCAAA GACCAGGAGA CCGCGGCCGC CGCGGCAAAG TCGAAAAAAA AGCGCACGTC CACCGCGTCG CCGAGCTCTA ACAAGGCACA CCATCGACAC GGGGCCGAGT CCAACGACGA CTCGAATGAA GAGGAAGAAG CCGACGAAAC AGAAGTCGGA CACGTCTTTT TGACCAAGCA ACCCACTTCG ATCAAGTTTG GTACCCTGAA GCCCTACCAG CTCGAAGCTT TGAACTGGAT GATTCATCTT TCCGAAAAAG GGCTGAACGG GATTCTCGCT GACGAAATGG GTTTGGGGAA GACCTTGCAA TCCATTTCCG TCCTTGCGTA TCACTGGGAA TTTCTACGCA TACAGGGACC ACATCTGATC TGTGTCCCCA AATCTACGCT TTCCAATTGG ATGAACGAAC TTAAACGTTG GTGCCCGTCG CTTCGGGCCA TCAAGTTCCA CGGCTCGCGG GAAGAAAGGG AATATATGAT TGACAACATG TTTCACAACG AAGCAGCCAC GCACGATGGT CGTCGACCGG ATCGACAAAT TATGGACGGA TCGGGCGAAT TGATCGACGA CAATACCGAC ACACCGCGTC CCTGGGATGT CTGCGTAACT ACCTACGAAG TCGCCAACGC GGAACGCAAA ACTTTGCAGA AGTTCACGTG GAAATACCTC GTAATCGACG AAGCCCACCG ACTCAAGAAC GATGCTTCCA TGTTCAGCAA GACGGTTCGG TCGTTCCGGA CGTCGAATCG CTTGCTCTTG ACGGGGTACG TGCTGGTCGT TTAATTTTTG GCGTTGCTGG AGTTGTAGAA GCGTGGCGTT ACGGTTTTCT CACACCTCTG TATTTTCACA CCAATTTTTA CCTACAGCAC TCCGTTACAG AACAACCTCC ACGAGCTTTG GGCTTTGCTC AATTTCTTGT TGCCCGATAT CTTTTCGTCG GCGGACCAAT TTGACGAATG GTTTGATCTG GAAATTGACG ATGAAGAAGC CAAAAAGAAC ATGATTTCGC AGTTGCACAA GATTTTACGT CCGTTCATGT TGCGTCGTCT GAAAGCTGAT GTGGCCAAAG GGTTGCCACC GAAAACGGAA ACCATTCTCA TGGTTGGAAT GTCCAAAATC CAGAAACAGC TCTACAAAAA GCTCCTCTTG CGTGATTTGG ATAGTATCAC GGGCAAAGTC TCGGGCAAGA ACAGAACCGC CGTTCTCAAC ATTGTCATGC AGTTGCGGAA GTGCTGCGGT CACCCGTATT TGTTCGAAGG AGTCGAAGAC CGGACTTTGG ATCCACTTGG AGAGCATTTG GTCGAGAATT GTGGAAAGTT GAGCATGGTC GACAAGCTAC TCAAGCGATT GAAGAGCCGT GGGAGCCGTG TTTTGATCTT TACTCAAATG ACGCGCGTGC TCGATATTCT TGAGGATTTT ATGGTTATGC GTGGATACCA ATATTGTCGC ATTGACGGCA ACACCAATTA CGATGACCGC GAGAGCTCCA TTGACGAATT TAATCGGGAA GGCACCGATA AGTTTTGTTT CTTGTTGTCG ACCCGTGCGG GAGGTCTCGG AATTAACCTG CAGACAGCCG ACACGTGCAT CTTGTATGAT TCGGACTGGA ACCCACAACA AGACTTGCAA GCCCAGGATC GCTGTCATCG CCTTGGACAA AAGAAACCAG TCAATGTGTT TCGTTTGGTC TCTGAGAATA CTGTTGAAGA AAAGATTGTG GAACGCGCTC AGCAAAAGCT CAAGCTGGAC GCAATGGTTG TGCAGCAAGG GCGACTGAAA GATCAAGACA AGGTGACCAA GGACGAAATC ATGGCCGCCG TTCGATTTGG TGCGGATACG GTCTTTCGAT CCGAAGAGTC TACAATCACC GATGACGATA TTGACGTGAT TTTGGAGCGT GGGTACGTGG TAGCAGGTCG GGTTGAGAGT ACCACAAGCG GACGCGAAGG ATGCCGGAGA ATCTTCTGAC ACAGTTTCAT CTTTTCTTTT CTTCGTCTTT CAGAGCGGCC AAGACCAAGG AGCTTGCCGA AAAGATCCAG ACACGGGACA AAGGCGATCT TCTGGACTTT CGCCTGGACG GGGGAATATC AGCGCAGACG TTCGAGGGCG TGGATTACAG TGACAAAGAT CTACGCGATC ATCTCCGCAT GCTAGCTGCA GATTCAATGG GAAAGCGAGA ACGCCGACCG CCTCCCACGA GCTATAATCC TATCATCATA TCGAAGAAGT CAATGGTGGT AAATAATCGC CGGATCAAAC TACCTAAATG TCTACGTATT CCACAAATGG AAGACCATCA TTTTTACAAT CGCGAACGCC TCTTGGACTT GGGAAGGCTT GAGTTCGAGA CTTACGCTGC GCTTAGAGAG GCTGGTGAGC TTCCACCGAA AGAGTTTATG GAACGGAAGA GGACGTTGTT ACCCGACGAG CTGGGACAAG AAAAGCTAGA GCTCTTGGCT GAAGGATTTG GGGACTGGAG TAGAAGTCAA TATTACGCCT TTGTCAAGGC AGCTGCGAAG TATGGGCGTG ATGACATCAG TGGCATCGCC AACGAGTTGG ACATGCCAGA AGTCGAAATC GCTGCTTACA GTAAATCATT TTGGGCCTAT GGACCGACTG AACTCGAAAG CGAGTGGGAA CGCCTCGTTG GTAATATCGA CCGAGGGGAA AAGAAGCTGG CGAAGCAAAA GAAACTCAAG TCTCTCCTCG CAAAGTTCGT TAACACTTTT GAAAACCCTA GAGATGATAT GGTCTTTGCT AATAAAGGAA CCACTCCCTT TGCTCTAGAG CAGGATCGAG CACTGCTATC TGCCGTCGAC AAACACGGAT ACGGTAATTG GGATTCCGTC CGCGAGGAGA TTCGCACTGA TGGACGTCTC AAATTTCAGC ATTCAACCCA AGGTATGACA GTACAGGCAA TTGGGAAGCG CTGCGATTAC CGAATGAGGC AAATGGAAAA GGAA
|
Protein sequence | DPEVRAERER VREEQRQETE ERRAKKRQAE QALANKHRKL DKEEARKAQA RLEYLLQQSS IFAKLQGQRP GDRGRRGKAH HRHGAESNDD SNEEEEADET EVGHVFLTKQ PTSIKFGTLK PYQLEALNWM IHLSEKGLNG ILADEMGLGK TLQSISVLAY HWEFLRIQGP HLICVPKSTL SNWMNELKRW CPSLRAIKFH GSREEREYMI DNMFHNEAAT HDGRRPDRQI MDGSGELIDD NTDTPRPWDV CVTTYEVANA ERKTLQKFTW KYLVIDEAHR LKNDASMFSK TVRSFRTSNR LLLTGTPLQN NLHELWALLN FLLPDIFSSA DQFDEWFDLE IDDEEAKKNM ISQLHKILRP FMLRRLKADV AKGLPPKTET ILMVGMSKIQ KQLYKKLLLR DLDSITGKVS GKNRTAVLNI VMQLRKCCGH PYLFEGVEDR TLDPLGEHLV ENCGKLSMVD KLLKRLKSRG SRVLIFTQMT RVLDILEDFM VMRGYQYCRI DGNTNYDDRE SSIDEFNREG TDKFCFLLST RAGGLGINLQ TADTCILYDS DWNPQQDLQA QDRCHRLGQK KPVNVFRLVS ENTVEEKIVE RAQQKLKLDA MVVQQGRLKD QDKVTKDEIM AAVRFGADTV FRSEESTITD DDIDVILERG AAKTKELAEK IQTRDKGDLL DFRLDGGISA QTFEGVDYSD KDLRDHLRML AADSMGKRER RPPPTSYNPI IISKKSMVVN NRRIKLPKCL RIPQMEDHHF YNRERLLDLG RLEFETYAAL REAGELPPKE FMERKRTLLP DELGQEKLEL LAEGFGDWSR SQYYAFVKAA AKYGRDDISG IANELDMPEV EIAAYSKSFW AYGPTELESE WERLVGNIDR GEKKLAKQKK LKSLLAKFVN TFENPRDDMV FANKGTTPFA LEQDRALLSA VDKHGYGNWD SVREEIRTDG RLKFQHSTQG MTVQAIGKRC DYRMRQMEKE
|
| |