Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46499 |
Symbol | |
ID | 7201583 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 527036 |
End bp | 529276 |
Gene Length | 2241 bp |
Protein Length | 669 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180848 |
Protein GI | 219120209 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAGCC TTCATAGACT CAATCGATCT GATCTTGACA AGGCAATAAG ATTCTCAAAT ATACAAACCT TTCAGCTGCG GGGTATCGGA ACAATACTAT TGCAAATTCA AGAAATTGCG ATTTTTGCCG CCGAGGAGTT GCAACCTACA TTTGTCCCTT GGGAAGCTCC TGGAATACAG CTTTCCACAG TGACTGTGAG TAGCAGAATG ATTCTGACGT AGTACCGATC GAGCGCATCG GCACTCGAAT ATAGACCGTT CACCATAGAT ATAAAATACA GATTCATACC CGCTACCAGA ACGCGACCGG CAGCAGACGG ATGCCTCAAC CGCAACCGCA ACCCGTAGAG GAAAACACAG GGGATGCCTT TGGAGAAAGT TTGCTTCTTT GGCGCGCTGC TCATCGTTAG TGGTGCTGTC ATACATCTCG ATTCTCTAGT TCTGCTTCCA TTTCATTGGA ACTTTGTCGG TAGCTGTCCC TCCGTATAGT TTCGTACCAG GGGCGATCTG CGCTCGTTCT CCCATTTTCA GTTGTGAGCG CACTTTCGTC TGTTATCATG TCAGATTACG TCACAGAACT GAGCCCCCAC GATGTGTTGT TCGGCCGCGG TTCCGGGCCA AATGATCACG AAGGCAACGT GCGGTTTCGG CAGCTCGTTG CCTCGCGTAA GGACGAGTAC ATGGCAACGA ATCACCGCAT AACAAAAGCG AACATAGCAC GGGAAATTGT CGATCAGGTG CTACGACACA AAGGCCGATT TCTCAAAAAG ATCGAAGCCA CCGACCCGGT TAGTCTGCAT ATACCCGACG GTATTGATGC ATGGATCGAG GTTGACGAAG ACACCGTTAT GGAAAAGGCA AAACAAGCGC TTCGGCAGAA CCCGAACAAG CAAAAGGGAG ACACCGGACC TTCCGTGGGG TCGGAGAGTT CCAAAACCAC ACCGAACGCC AGCAACCCAC GGGCGACACT AGACATTGAA GCGGATATTC AAGCCTTTGC TAACATCGAA CCGATCCCAA TTCACTCCAT ATCCAGCGCA CCTTTCGCGA ATGAAGCTGT GACTTCCATG TCCACGCATT CGCCCGTGAC GTCGACCGAA GGCCATACAC TCAACTCACC CTGGCAATCG CAACAAGATC AACAGCCGCA ATTCCATCAC CGATATTTAA CCGATTCAAG TGAGTATCTC ATCTCAGATA GGGAAGAGTA CGCGATAGAC ACCAACACTG TCTCTGTGCA GCAAGATCAA TCACAGCCAA ACCAACGGCA GCAGTCCCCG CGAGGCGACG AAGATAGCAT GCTGAATATG CTACCCGGAA GCCACCGTGC TAGTTTGGCT ATTAGCGATG TCTTTAGTGG CGAGGAGCCC CGTCGTGGTA GCATGACCAT GAGCGATCTA ATTCGCATGC ACCGTGCGCG GGAAATTGGA CTCGAGCGCT CCAGCGACAA TCGCAATTCT ATGGATATGG ACGATATGCT GGATTCCTTT AGCAGGAGCA AAATCTCCAA TGAGAACAAC GATGTCCAGA ACAAAAGGTA CAATGCCAGT ACGGAAACAA TGGGTACAAT TGAACCGATT GGGACTGGTA GTGTGGCCGA CATGAGTTTT GCCACTATGA ACTCCTCGAC GTTTTCTTTC TACAAGGGCA ACGATTCCCT AGCGGTGCCT GATGGGCAGG GGCCGCCAGA CCGGCCAATT ATCGACAATC GCTTCATGGC AACCGCGTCC ACGCTTCCAG AACTCGGAGT ACCCACTTCG CGCACAGTTA ATCGGTTCGC ATCGGATAAT TCTCTGAGCA TTGCGGAGCT CCGGGGGCGA CGCAAGTCCG GTTCGACACC ATCCAAGTCC AGTTCGGAAG AAGGGTCGGG CTTTCATACA AGTGGAAATT CCACCCTTTC TTCGCATCCC ACAGTAGCAA GCGGCAACTC TGTGTCCCAA ACCATTGTTT TGGAAGATCA GTCTCTCGAA CTGGATTTGA ATTCCATGGG ACTTTCGTCT GTGGAGATGC TAAAGGGAAT GATCTTAAGT AGCAGTAACG ATATGAGCTT GAACGACATT TCCGAAGAAA GTTTGCAGCA ACTCCTATAT CAGCACCAGC AACAAACAGG CACAGGTCTA CGCGCTAGTC AACAAGATGC ACAGCAGCGC CAGAACGACG ACAAAAACAA CGATTCCTAA CACTTTTACC ACGGGCCTAC TGTTAAATAA TTTTTCTTTC ACATGGGATA CACAACCTAA A
|
Protein sequence | MVSLHRLNRS DLDKAIRFSN IQTFQLRGIG TILLQIQEIA IFAAEELQPT FVPWEAPGIQ LSTVTIHTRY QNATGSRRMP QPQPQPVEEN TGDAFGESLL LWRAAHLSYQ GRSALVLPFS VVSALSSVIM SDYVTELSPH DVLFGRGSGP NDHEGNVRFR QLVASRKDEY MATNHRITKA NIAREIVDQV LRHKGRFLKK IEATDPVSLH IPDGIDAWIE VDEDTVMEKA KQALRQNPNK QKGDTGPSVG SESSKTTPNA SNPRATLDIE ADIQAFANIE PIPIHSISSA PFANEAVTSM STHSPVTSTE GHTLNSPWQS QQDQQPQFHH RYLTDSSEYL ISDREEYAID TNTVSVQQDQ SQPNQRQQSP RGDEDSMLNM LPGSHRASLA ISDVFSGEEP RRGSMTMSDL IRMHRAREIG LERSSDNRNS MDMDDMLDSF SRSKISNENN DVQNKRYNAS TETMGTIEPI GTGSVADMSF ATMNSSTFSF YKGNDSLAVP DGQGPPDRPI IDNRFMATAS TLPELGVPTS RTVNRFASDN SLSIAELRGR RKSGSTPSKS SSEEGSGFHT SGNSTLSSHP TVASGNSVSQ TIVLEDQSLE LDLNSMGLSS VEMLKGMILS SSNDMSLNDI SEESLQQLLY QHQQQTGTGL RASQQDAQQR QNDDKNNDS
|
| |