Gene Information |
Locus tag | PHATRDRAFT_46543 |
Symbol | |
ID | 7201687 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 658228 |
End bp | 659928 |
Gene Length | 1701 bp |
Protein Length | 532 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181046 |
Protein GI | 219120623 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCAC AGAAGCAACC ACAGCGGAAG GACTCGGACG TTTCGGAAAA ATCGGACTCC GGCGGGGGCG ATCCCTACGC TTCCGCAGAC TATCAAGAAG CGCTCGAAGA TGTGCACACC CGCTTTATTC TGAATTTGCC GCCCTCGGAA CTCGAAACAG CCGACCGACT TTTCTTCCAA CTGGAACAAG CATGGTGGTT CTACGAAGAT TGGATCTGTG ATCCTCATCC AGAGAAAGTT TTGCCTCGGT TTTCCAGTTT CAAACCCTTT GCCCAGAAAA TGTTCGCCTA TTCGGAAATG CTACCGGAGT CCCACAAATT CGGATCCATG TGGGCGGAGT TTTCGCAGTA CAAGCGCGGA ATATCTAACT ACGGATGCAT TCTCTTGTCA GTGGATTACA CTAAAGTTAT TTTGTGTCAA GGTTAGTGTT GGGCGGGATC TCATGCGTCC CCGGTACGAG TTTTCTCTCT TTTGGTCGCT TTTCTGCCTG ATTTCTGACT GCCGTTTGCT TCTTTCAATT TAGTATGGAA TGGAAAGACG TTCACCTTTC CAGCTGGAAA GATCAACCAG GGCGAAGATG GATTGACCGC CGCAGCCCGT GAAACCTACG AAGAGACAGG GTTCGATCCC AACTGTGTGT TTGGACAAAC CGCTTCCTGG AAAGCGACGG ATCCTGCGAA GATTACCTGG AAATCTTTGC AGGAACAGGA CGCTCTGATT TTTCAAGAAG ACAATGGTAA ACGCCGAACC TGCTATGTCT GTCACGGTGT TCCGGAAGAC TTTCCGTTCC TGCCCGTGGC ACGTAAGGAG GTCGCCAAGG TAGCCTGGTA CCGCGTGGAT AAGATACCGA AATCTTCGTA TGCGGTATTT CCCTTTTTGT CCCAACTGAG ACGATGGATT GCTCAACGCA CAAAGTCTTC GCGCGACAAG TCGACGGGCC GATCGAATGC TCGTAAAAAG GGTACACCCA AGCGTTCCGG CAACAACTCC AGAGGTCGCG ATTCTCGCGG TAAAGTGCGG GATGGCGACG GCCTGGTCAC CAGTGGACTA GCTGCGCCCG GAGAAGTATC CAGATGGTCG GAAGACGATA TGTTCGCCGT TAACGAACGT CTCTTGGGGC GAAAAATCAC GTACGACGGG AACCCTCATT TGTTTGAACA AGGATTTCAA GGCCAGGACC CGCACGCCTT TCACGTCGTC GGTGGGTCTT TCCTCAATAC CAATGATTCG ACTCTGGCCC CGCCGCCGGC AACTTCAAAA TTGCAACCTT TGTTTCGTGG CAGTAATAAT GATACGGGGA AAGATGAGTT GCTGCCCTTT TTCTCGGATG ACGGTGCTAC ACCGTGGGGA GAAACAGTGG AAGACGCCAA AGGAGCACCG CCACCCAGAG CTCTCAAGGA CGACGCCGAC GCCTTGCTGG CACTTTTACA GCAAACGAAG GATCCACCCA AAAGCTTGTT GACAACCGGG AACGATGTCG ATGTGGCATT CCTAACGGAC GCGGAAGTAA CCGCTCGCAG TAATGAAACC AAAACAACTG ATCGGCGAGT CACTATGCGG GCGCAGTACG AAGCCGATAT GGAGTTTATT CGCGAATGGG TTGCGAATCT ACCCAAACCA GGACCTTCCA AACATTTTGG AACGTTCAAG CTTGACGCCG ACGCAATCAT GGCCAACGCG TTGGCCAGTG TATCAAAATA G
|
Protein sequence | MASQKQPQRK DSDVSEKSDS GGGDPYASAD YQEALEDVHT RFILNLPPSE LETADRLFFQ LEQAWWFYED WICDPHPEKV LPRFSSFKPF AQKMFAYSEM LPESHKFGSM WAEFSQYKRG ISNYGCILLS VDYTKVILCQ VWNGKTFTFP AGKINQGEDG LTAAARETYE ETGFDPNCVF GQTASWKATD PAKITWKSLQ EQDALIFQED NGKRRTCYVC HGVPEDFPFL PVARKEVAKV AWYRVDKIPK SSYAVFPFLS QLRRWIAQRT KSSRDKSTGR SNARKKGTPK RSGNNSRGRD SRGKVRDGDG LVTSGLAAPG EVSRWSEDDM FAVNERLLGR KITYDGNPHL FEQGFQGQDP HAFHVVGGSF LNTNDSTLAP PPATSKLQPL FRGSNNDTGK DELLPFFSDD GATPWGETVE DAKGAPPPRA LKDDADALLA LLQQTKDPPK SLLTTGNDVD VAFLTDAEVT ARSNETKTTD RRVTMRAQYE ADMEFIREWV ANLPKPGPSK HFGTFKLDAD AIMANALASV SK
|
| |
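The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: the function names `gene_span`, `coding_length`, and `gc_fraction` are hypothetical helpers introduced here for illustration. It assumes that coordinates are 1-based and inclusive, and that the gene span runs exactly from the start codon to the stop codon, which is consistent with the sequence above beginning with ATG and ending with TAG. Under those assumptions, the difference between the gene length and the coding length is the amount of sequence removed by splicing.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above.
span = gene_span(658228, 659928)   # Start bp, End bp
cds = coding_length(532)           # Protein Length in aa, plus one stop codon
non_coding = span - cds            # sequence not accounted for by the protein

print(span, cds, non_coding)       # prints "1701 1599 102"
```

The span of 1701 bp matches the Gene Length field, and the remaining 102 bp agrees with the record flagging the gene as spliced. To check the GC content field, the Gene sequence string can be passed to `gc_fraction` as is, since the helper skips the spaces between the 10-base blocks.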