Gene Information |
Locus tag | PHATRDRAFT_43213 |
Symbol | |
ID | 7196577 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2333244 |
End bp | 2336538 |
Gene Length | 3295 bp |
Protein Length | 819 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176960 |
Protein GI | 219110417 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTCAACACC TATCTGCATT CCTCTTACAT CTGAAAGTTT CCTTTTCCCA TGCGTCTTCC AGCTGGCAGT AGATGCTGTT TATTGATCGC TTGCCGAGAA GGATGCGATA CGATAGCTCC ATCTATAGCT ATGTCATTCG TAGAGAATCG CACCGGATCG AAAACGAAAC GTTTAGTCTC TTTACTGCAA AAAGGAATGT TGCTTCTGAT TACTGTGGGC ATTCTTGTCA CGATCTTCGA TCTCCTAACC TTCAAAGGTC TCCGCAGCGA GCATCTTCAG ATAGCAAACG AGTCTCATCG TTGGAAAAGA TTATCGGACA CCTGTAGTGG TAAGGAACCT CTTTTGGAAC TAGTCACTCA AGCCGTTGGA AATGTTTCTA TGGCACAAGC TTATATTCAG GACAACTGCA GAGCTTTGCC GACGTGGAAA GGCGTCAGCA ACTTGTACGG ACCTACGCCC GTCATTCTAG GTCTAGAGCA CTGTGAAGAC TTTCGCAGCA ACCTTAATCA AGCGAAACCG CTCGGTGGCC TGAAAATTGC GGGCTTCTTC AACTCGGGTA CCAATGCTAT GGCACGAACG CTTATGGAAA ATTTGAACGA CGGAAAATAT GGTAGCCGTG CGACTTTGGA GTCTACTATT AGAGCGCAGG GAGTTGTGAC TCAAGTACCT TGGGGCAAGC ATCGGTGGAT CTTGGCAGAT GAACTGGCTC GCTTGTCTCA GCCTGATATT CTACCCGTGG TAGTGGTACG TGATCCGTAC CGATGGATGC AGTCAATGGT AAGCGAGATA TGCGGGAGCT ATTGGATTAT GTCACTTATT GATTTTCGTC TAACTATGTG TGTGTGCGTG CGTGTGTGTG TGTGTGTGTG TTCTCCATTT CTTGTTCTTT CCAGTGCAAG TCAACGTACA CTTTAGTCTG GGATCGGGGA CTTGCGAATC ATTGTCCAAA TCTTGTGCCG TCCAACAATA AAGAGATGGA GATGAATGGA AACCGCTCTT CTTTTTCGGT GCAATTGACG CAAAGTGTCA CCAGCACTCA CCATTCGTTG GCGTCATTGT GGTCCTCTTG GTACGGATCG TACTTCCGCT CAAGTTCTCC GCGGCTAATT GTTCGCATGG AAGATTTAGT GTTCCACGGG CCGGAACTTG TGCGTCGATT GAGCGACTGT GTTGGAATAG ACCGTGTTCA GCCCTTCGTG TTTCTTACCG AAGCTGCCAA AGATCACGGC CGGTCAACCG ACTTGTTGAC AGCCATAGTC AAATATGGTA GTTCGGAAGG GCGTTACAGT GGCATGATGG TGCAAGACTT AGTTTACGCG AGGGATGCGT TATCGGCTGA TCTTATGGAA ACATTGCACT ACCAATACGA CAGTGTCTAG CACACTAGGT AGTAGAGCTG GCCAGGGTCC GACCCTGCGA GGAGAAAAGT CATCGCTCTG TAGAGAGAGG GAACGGAACT AATAGGAAGT AATGTAACAG TAAGTAACCT ACTAGAATGC TGGTGGTAGG GGTAATAACG GACCGATTTT TGAAAAAGAA GACTCATTCA CTGCCAATCA TGTTACGCGA CGACGATGGA CAGACAGGAG TAGAATTTTT TCCGTGCACC TTCCGCGCCA CATGTGAGGT GATGTCGGCG CCCTCTGAAA TAGGATGCGC TTTCTCGAAT AACGGTTGTG TCAACCCCTC GCGATCTACC AGACGTGACG CGACTAGCGA ATCGAGAAGC ATCGAGAGGA TAGTCGCTCG CAGTCAGCTA ACATCACTTC CACAACTTGC TACGCTGGTT TTACTTCCAT 
CCCTAGGGCA CACATTACTT ACCCGTCTTT TCCTCGAAGT CATTGTTGAA CGAGAAAATC AGAATCAGAG GAATGGTGAA CGAAACGTCT TCCCTCCTTC GTCCCGCTGA CATTTCGGGT TCCGGCCCCG AGAGACAGAC GACTGGACAC GATCCCATCA AACCTGTAAG TTTCGTGCCA TACTATGAAC GTCGAAGACA AAAAAGAGGA ATCTCACCCT TGACGATTTT CAGCAGAGCT CTCAAAGCGG ATTCCCCCCA GCAACAGTAC TACTTGCGCT GTCGACAATT ATCTTGCTCG CATGCGGCTT GCGATACTAT CCCACATCCC AGCAGCAACG ACAGGTTTCT GTGGCCTTTC TCGGAAATTC CATGTTTTAT TTCAACGATT TTCCGCGCTT CCTTGTTGAG TTATCGGACA ATGGCATTCG TCAAGATTCT TGCTTGCATG GCGGGGCCAG TATCCCCAGT TTGGTACAAC AAGGCAACGG AATGTTTCCA CAGTTTCAGA CACCAGCCGC GATTTACGCC CAATCATCGG ACGGCTCTCC TATCTTTGAC TACGGTGCAT GCTCGGCGAA AATGCTGTTG ACGGGGACTA GCATTTACAC TCCTGAACAG TACGAACGTT TCAACGAGAC GGATGGGATG GTCAAGAACC CTTGTCGAGA AGATTTAGTC TTCACCAGTT ACATCGAGAA ATACTACGCC ACCAACAAGC CAAACTGGGA TTACATCCTC ATCAACGATA ATACTCGCAA TCCAGCTCGC GGCTCGACGC GCCAAGCTTC GTTGCAAACC TTGCAACATT TCTATATTCC GTGGCTCCAC AAAACTGGAG CGACGCCAGT CTTTCTGTGG ACACACGCAT ACTCGGTCGA GTCCACACCT AAACGCAACA TGACGGGTCT AGACGATGTG GCAAATTTCA CCTCTTTGAC AGGCGCTGGA TACCGCGCTT ACGCGGAATT CTTGCAAGCC CACTTGCCGC CAACCCAGAA GCCACGAATC GCACCAAGTG GACTCGCCTT TTTGACGGTC TACGAAGAAA ATTTGGAGAT GTGGAAGAAA CTCTTTCACA ACGCCGACCA TCTGCATGCC TCACCGCATG GTACCTTTTT GCAGGGATGT GTTGTTTATC ACACGTTGTT TGGAAAAATG CCGGATCGGG ACGTGGTAAT TCGACGTGAC ATGGGATCAT TATGGGCCAC GGCTCGAATG ATGCAACACT CCTGGGAACC GCCGAACCCT ATGCTTCACG AAAGGGACGC AACTTATCTG TATGATGTTG CCGAACGCGT CATGAACGGT CATGTTCCGG AGACTTACAT TGAGTACAAT CGGGGAGAAG TTGCTGATAC TGGCGATGAT TCCTGACTGT ATGTTGCTGC CGGAATATAG CTCCGCGCCA GCCTAGTACA GGAATCTGGC AATCATCCGT GTTATGCAAT TGCGTGTGAG TGTAAATTTA ACTCATTTAG ATTGATTGTT GAGTG
|
Protein sequence | MRLPAGSRCC LLIACREGCD TIAPSIAMSF VENRTGSKTK RLVSLLQKGM LLLITVGILV TIFDLLTFKG LRSEHLQIAN ESHRWKRLSD TCSGKEPLLE LVTQAVGNVS MAQAYIQDNC RALPTWKGVS NLYGPTPVIL GLEHCEDFRS NLNQAKPLGG LKIAGFFNSG TNAMARTLME NLNDGKYGSR ATLESTIRAQ GVVTQVPWGK HRWILADELA RLSQPDILPV VVVRDPYRWM QSMCKSTYTL VWDRGLANHC PNLVPSNNKE MEMNGNRSSF SVQLTQSVTS THHSLASLWS SWYGSYFRSS SPRLIVRMED LVFHGPELVR RLSDCVGIDR VQPFVFLTEA AKDHGRSTDL LTAIVKYGSS EGRYMSSTLG SRAGQGPTLR GEKIRGMVNE TSSLLRPADI SGSGPERQTT GHDPIKPTKK RNLTLDDFQQ SSQSGFPPAT VLLALSTIIL LACGLRYYPT SQQQRQVSVA FLGNSMFYFN DFPRFLVELS DNGIRQDSCL HGGASIPSLV QQGNGMFPQF QTPAAIYAQS SDGSPIFDYG ACSAKMLLTG TSIYTPEQYE RFNETDGMVK NPCREDLVFT SYIEKYYATN KPNWDYILIN DNTRNPARGS TRQASLQTLQ HFYIPWLHKT GATPVFLWTH AYSVESTPKR NMTGLDDVAN FTSLTGAGYR AYAEFLQAHL PPTQKPRIAP SGLAFLTVYE ENLEMWKKLF HNADHLHASP HGTFLQGCVV YHTLFGKMPD RDVVIRRDMG SLWATARMMQ HSWEPPNPML HERDATYLYD VAERVMNGHV PETYIEYNRG EVADTGDDS
|
| |
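The record's Gene Length value is consistent with its Start bp and End bp fields if the coordinates are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). The following minimal Python sketch shows that arithmetic, along with one plausible way to compute a GC percentage from a space-separated sequence string such as the one in the Sequence section. The function names `gene_length` and `gc_percent` are hypothetical helpers, not part of any database tooling, and the record does not say how its own GC content figure was calculated or rounded.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    # Remove the whitespace that separates the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(2333244, 2336538))  # 3295, matching the Gene Length field

# First two 10-base blocks of the gene sequence, used only as a short demo input.
print(gc_percent("CGTCAACACC TATCTGCATT"))
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content, bearing in mind that the database may round differently or compute over a different span.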