Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42429 |
Symbol | |
ID | 7196618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 30314 |
End bp | 34211 |
Gene Length | 3898 bp |
Protein Length | 625 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176507 |
Protein GI | 219109505 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACAAAGCG CGATTGCAAA CAGCAAGCGC CAACGGAAAA GCGCCAGTTG TTGGCTCGAC TCCGTTTGGT TTGCACCACT GGAGTAGCTC TCTACAGTCT TTTGGATTTG TGAGCTCTCC ATTGGTATAG AAGGTCTGGC AATTGCAAGG ATCTTCGCCA ATTCAGTTGG GAGACCTTTA GGGCGGCAAC AGTCAAACAA GGTCAAGCGC AGCGATCGAC CGCGCGAGGG TTGCTTATCA GCTGCCTTGA GTGACGCTCC CTACGTACAC CGCTTTTATC ATGAACACGA AAGAATACCT GTATTCGTGG CATCTCTCGC TGCGAGCGGA CCGTCCAGTG GAAGGTTGTG GGATGCGCCC GCACGCCTAC ATGTACGGCA AAAAATTAGA CGAACGCGAA GACAAAACGC TGCCTCCGCA TTCCAAAAAA ATGAAAGAGC CACCACCGCA GCACGAATTC TCCTATCGAT GGTTCCGCAG TCCTTTGCAT GAGCCTTGTG CCTACGAGAA TTGTCCTCGT CGAACTTCGT TCTCTCCACA CGATTGGTCC AGACATGCTT TAGGTGGAAC GGAATGCGGA TTGCAGTGTG TATCGACGCA AAGTTCCTTG TTTCGATGCA CGTTTTGTAA TTCTACTTGC TTTGTGAATG CTTGGAAGAC TCAGTACAGT GTTCCGAAGG AGGCCACTCG GACGGAAACT CATGGTCGGA CCCGTTCACA ATCTTTTGGT AGTAATGATG AAGACGTCTT TGACGATACG GGTAGTGTAC GGTCTTCGAA TGGATCTAGT CCAGCTCTCG ACACCCTCAG CTCGCCACCA CCATCGACTC CCCGTGGGTT CCTCAGTGGA TACTCGGCGG GCAAGCAGCT GAACCCCGCC TCTGGTAGCA GTATGTACCA CTCGGAATAC GATGCTGGTG ACGATTGGGT GGAATTCAGT CGAGATCAGC TTTACATGCC AGGCCCTGAA GATGTCGGAC ACAAGCTCAA AATTGAAGCT GCGGCGTATT CAACCGATAC CAGTGAACTA CTTATGTCAC GTGTTGTCAA AACAGACGTT GTTCTAGGAA GGGCTCCTGA CCCACTTAAA AGGCAACTAG TTACCACTAA GGGCGGTGGA GGAGGTGGTC CTCGTTTCCG CGTTATAACG TACAATGTTC TCGCCGAGAT TTACGCAACT CAACAGCAGT ATCCTTACTG CGACTTCTGG GCACTTTCAT GGGATTATCG ATTCCAAAAC ATTCTTCGCG AGATCATTGA TGCATCGCCA GAAGTTGTAT GCTTGCAAGA AATTCAAGCG GATCACTACG AGAATCACGT TTACGTGGCC ATGGCTGACG CGGGATTTGA AGGCGTCTAT AAGCAAAAGA CGAGACAGAG TATGGGACTT GCTGGAAAAG TCGACGGATG TGCTTTGTTT TGGAGACGTT CCAAATTTCA TTTGGTCGAA TCCTACAGCA TTGAGTTCAA CGAAGTTGCG CAGAGACAAG CGACTCAAGT GTTAGGCCTC AATCCACGAA GCGAAGAAGG TGTGGCCTTT TTAAACCGTC TATCGAAGGA TAACGTGGCA CAGCTTGTTG TTCTAGAATT CATCCAGCCT AGTCGATCGA ATCGCGAAAT ATCGCAAGTG TGTATTGCCA ATACGCATTT GTATAGCAAC AAGGACTTCC CAGACGTAAA GCTGTGGCAA ACATGGCAAC TTTTGCAAGA GCTGGAATCA TTCATTATGA GTCGCGGAAC GAATCTTCCT TTGATTATTT GTGGAGACTT CAACTCGACT CCAGATACAG CCGTCTACGA TCTACTCTCG AGACAGACAG TCCATCCCGG CCATCCTGAT GTAAATGTTA CGACTGGCGA CGACGTTCCT AACGTTCTCC CTGATGCGAT GAATATTACT CATTCGTTCC AGCTGGGCAG CGCCTATCAA ACAGTATTGG GAGAGGAGCC GTGGACGACG AACTTTACTG TCAATTTTAA GGGCGTTTTA GATTACATAT GGTATTCCGC CCAGAATTTG CGGCCGCTCT CAGCTGCCCC GATACCAGAG GAAAAGCAAT TGACAAAGAA TGGGGAAGCT TTACCTTCGA CAGAGTACAG TTCAGATCAC ATCATGCTGA TCTCAGATAT GCAAATTATT GGCAATGGAG CACGATAAAG AAAGATCTAG GAGAGAATGA AAGGATGTGT GCTATTGTGG ACCGCCTTCT TTTGTAGTAG CGCTCAATTT TGTTATGGTA GAAGTAGAAA CTTTGATGGT GGGACGAGGA AGAATAAAAT CTGGCAATAA TGGCATTACG ATCACTTGGC ACTCCCCCCC AAAAGCATGG CGTAGTCATG GCCTCCGTCG TTTTCCAAGT GAGTCACCAT CTTTTCAAAA TCTTGGACCC ACATCATTTG CTCTGGTCCG CGGCCTGATG GCATTTTGCG TTTGGATGCC GCGTCGAACT CAACGGAGCG AATAGGCTTG CCGTTCAGCA GTACACGAAC GACGTGCGAG TGTGAGTCCA CCGGTCCCTC TTTGTGCAGC CGCACAAGTT CAAAGACCAA AGAACTTCCG TATTCCGGCC AAAAGCGCCA GTCGGCACTG GTGTCGTCGG CCAAAAAGTC GGCTCCTATT CCGTACAAAA GACCGAGGAT TGTGATGTCG TGACAACTAT AAATGGTGAA TGGTCGCTTT TCTTCGGTGT TGAGGCTTGG TGCAACCTTC AGCGATTCCG TAATCTCTCT CAGTGGCGGT GCGGCGATTG CGGCCAAAAG GCGCTTATTC TGGTACCATT TTCGGAATCT CCATGACAAG TGCATCAATG TTTGATGCGA GAGTGAACAT AGCATTTGTT CAACTGAAAC ATCATGCTCG TAGTCAGAAA ACCGGGCCAA GTCCATTCCG TGTGATGATC TGCAGACAAA ATGATCAGCG GCTTCGACCC AGTTGATCCC GCTCGGTGCT CGCGAGCTGA AATCGCTCTT TCGGGGCCTC ACTAAACCAG GAAGAATATT GGCCAAACGA GCCGCTAGTG GCGCCGCAGC CCCATCGCGC AGCATGAAGT CTTCTGATGA GATGACTTCA TCAACTAGAT CAGCCATGAG ATCTGGGTTG CGATCGAACG CATTTAACGG GTCTCGGGAG AGCTCTCGGA CACGGACTTT GACAAGTTCA TCCTTGCCAA AACTCTTCCC TCGCCAAGCG TGGTTCGGTA CGCGAGCTTC TTCAAATATG TCGGGATTTA GCGTCCGTTG CGGGGAGGGC GTATAACAGT TTGCACCCAG CATGCCATCA AGAAAGCTTT GAACTGACAT AATGGTACGT AGATAGTTCG TAGAAAACAC TTTAACATCC CAAACGGACA GGAAGTCTTC GGGGTTTTCC CATCGCCATT CGCTAAGGTT CGGTGAGTGA TGCCCATAGT GATTGTATCG ATGGAAGAAT CGGTGTCCGT TTTCTTTTAA CTGCGACAAT CCCATCTGCG TTAAAAAACC GAAAGGATTA CGGCCCACAT CAAGAAACTG ACCATGATTC GTGTTTTGAT GAATGTCGGG CGGGAAGCAC CTTGAGTATG CTTCAAAGGC TGTGGCAGAA TCAGGTGATG GCAAACGTGT CATCCAATAG GCAGCTTCTT CCTTTCTACG ATGGGATGGA GAAAGCGGCC TACTTGGTGT TCTATCTCCA TGTCGGCAAA ACATCCAGAC GCCCTCGACG ACACCATCAT TCTTCAAGTA GCGATGTGGG TCTTCATCCG CTTTTGACCC AAATGCGCCG GATGAGTTGC GTCGAAACCT ACCTATAAGT CGAGAAGATC GTTTCGACCC TTCCAAGCCA CGAAGCGCTG TTGAACCCAA CTTCTTGGGT CTCATTGTGA CCCTTAAGGG CAACCTCAAG AATACGCGGG AGTGCGCA
|
Protein sequence | MNTKEYLYSW HLSLRADRPV EGCGMRPHAY MYGKKLDERE DKTLPPHSKK MKEPPPQHEF SYRWFRSPLH EPCAYENCPR RTSFSPHDWS RHALGGTECG LQCVSTQSSL FRCTFCNSTC FVNAWKTQYS VPKEATRTET HGRTRSQSFG SNDEDVFDDT GSVRSSNGSS PALDTLSSPP PSTPRGFLSG YSAGKQLNPA SGSSMYHSEY DAGDDWVEFS RDQLYMPGPE DVGHKLKIEA AAYSTDTSEL LMSRVVKTDV VLGRAPDPLK RQLVTTKGGG GGGPRFRVIT YNVLAEIYAT QQQYPYCDFW ALSWDYRFQN ILREIIDASP EVVCLQEIQA DHYENHVYVA MADAGFEGVY KQKTRQSMGL AGKVDGCALF WRRSKFHLVE SYSIEFNEVA QRQATQVLGL NPRSEEGVAF LNRLSKDNVA QLVVLEFIQP SRSNREISQV CIANTHLYSN KDFPDVKLWQ TWQLLQELES FIMSRGTNLP LIICGDFNST PDTAVYDLLS RQTVHPGHPD VNVTTGDDVP NVLPDAMNIT HSFQLGSAYQ TVLGEEPWTT NFTVNFKGVL DYIWYSAQNL RPLSAAPIPE EKQLTKNGEA LPSTEYSSDH IMLISDMQII GNGAR
|
| |