Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48220 |
Symbol | |
ID | 7203339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 551733 |
End bp | 554858 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182709 |
Protein GI | 219124853 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.313872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAACT TTCCCCCTCC ACCACCACCA CCGCCTTCTA GGCCACTGGG CGTCTCTCCA TCAGCTCCAG CTGTTCTCGA GACGTCGTCG GCAACGTCCC ATTACAGTAC GAGAAGAGGG CCGCTGAATG GTGGAGCCTC AGCGTCCGCG CTCCCACAGT CCAACCTTTC CACACACGGC AGCACAAGAT TTCCACCGCC GCCTTCGCAC AGCGTCACGA CCAATGGATC CGTTCCGTCG CACGATGCTC GGACTACACC TCCGTCCGCA CACACTCCGG TACCGCCTCC ATCGATCCAT CGAATCGCTC ATCCGGGCAT GGTTTCTCAG CCGCCGTCCC GGCAGAATCC ACCACTTCCG TTCGCACCTC CCGCATCTGT ACCTCCGACT GCGAGTCGGA CACTGCAACC AGCCTATCCC GTTCCACCGC AACCCTACTC GTCAAATCCT CTTTCACCCC CGCTTCCATC GTCACGTCCT CCCAACACTA CGCACCCGCA CCCATCGCCT CCACCACCGC AGCCCGGGTA CTACCAACCG TCGCACACGC AACAGCCTCC GCCTGGTTTA CGGAAAATCG ATCCCTCACA AATTCCTAGA GCTCCACTCT TTACCCGCCC GCAAGAATCC CAGTTGCCGG TATATTTTCC GCGAGCTGCC GTCCTCAATG GCGAAACGGC CCAAAACCCC CCACCCGCCG ACTCGCGATA CATTGTCAAA GACGACGGTA ACGCTTCGCC TAATCTCGTA CGCGCCAGCG TCTACGCCTT TCCCCTTACA CGTGCTCTCT GGCATCAAAC CGGCGATCTA CCACTCGGAA TCCTCGCTAC CCCCTTGGCC TGTCACGACG AAACCTTCGT GCCACGTCCC CGCGTCCTCC CCGACCGTTC CGTCCAAGAC TGGCGTGATC CACAACGTAT CCCGTGCGTG GATGCCCGGG AACCGTCCCC GCCACCGCGC TGTGGACACT GTCACGCCTA CGCCAATCCG TTTTTTGGAA CGGATGGATC TTGTAATCTC TGTGGTACCA GCAATCGCGG GATTGCCGCC AACCTGACGG GTCCGGCCAT GCAGTGCGGT ACGGTAGATT ATCACGTTTC CGGACCCTAC GTCACCCGCC CACAGCCCGT GCCGCCCGTG TTTGTGTACG CCGTGGATTT AACGTGTCCG CATGTCACGC AGTATCTACC CATTCTGGCA CAATTGGGGG AAGACCTGGC GACTCACGTC GGGAATCAGT ACGCACCGCT GACGCCACGC ATTGGTCTCT GTTGGGTATC TTCGGCCGGA ATTTTGGTGG CCGGGCACCA CGACCGGGAA CGCTACTCCG TCATGGCGGA TGTGACCAAC GCACCCTTTT GCCCTCTACC CCTCAACGAT TGGACGTTTG ATGTGTCGGT ACCGGAAGGA TTGGCATCCT GGAAGGCCTA CTTGGATGGC CTACTCCAGA ACGATCTCGA GGATCTGCGG AAATTGGCGC GTGCGAAAAA TGCGTACGGC TTGGACGGTA TGGAACTATC GTGTGGTGGA GCGGCGCTGG CCTTTCTTGC TGACGCCTTG GCGGCGACGG GGGGTCGCGG TACCTTGATC ACCCGACGAC GACCGAATTT TGGCGTCGGT AGCCTTTCGG TACGGGAACC TGTTCCGGGT AAAGCGCACG ATCCGGACAA TATCATATCC TATAGTCCGC TACAAACCGC CTCCAAGTTG AAGCATACAG AAGATGCGGC GGCTTCTTCG TTTTATCAGG AGCTAGCCGC GAAATGTTGC CAGGACCGAA CCTGTTTGGA TGTTTTATAC CACACTAGCC CGCTCACACC ACCCGCGTAC TTGGATCTCG CCACACTCGG CGAGTTGTGC CGGAATACTT GTGGAAAATT GTTCCACGTT TCGAACAAAG ACTGGAATCC GATCATTCTG GAAGAACTGA GAGCCCAAGT CTTTTCCTTT ACGGGATGGG ATGCGGTGTT TAAGGTTCGA TGTTCGGACG GAATTCAAAT CAAATCCTTT CCAACCCATG TCGGTAACCT AGTCGATAAC GGATTGGGTA GCTCGTCGGA AATCGAATTG TCCTGCGTGA CGCCGAACAC GTGCATCGCA GTGGAGCTTG AGCATCGCGT GGGTGGTGTA CCCCCCAAAA ATCGGTACGT GTACATACAA ACGGCCTTGC TCTACTCAAC AATATCGGGC TGCCGCCGTG TGCGGGTCTC CACCTTGGCC ATCCGTAGTT CTACTGTGGT AGATGAAGTA TTCCGATCGG TTGACATGGG AACTGCTTCT GCTCTACTAG CCAGAGAAGC GCTGGATCGT ATGAAGAAGC TAGTAAGGGA GAAGGAAGGA GACGCGGCCC GTGAAAAAGC CCGTGACTTG GTGTTCCATC GGTGTCTGGA AATTTTGCTC AACTACCGCA CAAATTCGTC GGCCGCAAAC TCATCCGCAC GGCAAATGGT CCTCCCAGAA AAGTTGCAGT TGTTTCCACT CTACTGTATG TGCTTGATGA AGAGTCCGAT CTTTCGCCCG GGCATGGCCC GTCGCGATGC ACAAACTCAA GCTGTCCGCA TGTCGCCCAC GGGTGATGAC AGGGCACTAT TCGTACATTA TCTGGCCAAC GTAAGTTCCA GTACCAGCAT GCTTATGGTG CATCCCAACA TTTTTTCCGT CTTGGGAAAC GAAAGTGGTA CTGCGGAGTT CGAGTCGCAT CATGGACCGG AGCAAGTTGG GTTTGTAAGA ATGCCACAGC CCATTTTGCC GAGTATGGCT AGTCTTGAAG ACGATGGTGT GTACCTCCTG GATAGCGGCC TACAGATTTT TTTCTATGTT GGAAAGACTG CGCCGGATGA AATAAAGGAG ATGGCACGTA GTCACCAAAT CGATCAAGCA GAACTGCTTC ACAATTTTGT CTGGCAAATG AGGACGTTCA ACGGCACAAA TCAAGGAAGC GAAGGTTCCG TCCGGCCAAC TCATGTGCCT GTTGTGTCAA TTATACAGCA GGACGGTCAC GATGCTCCAA TGGAAGCGGA TGTTCTCAAT CTTTTGGTGG ATGATGCGGT TTCTGGGGAG AAAGACTACA ATGATTTTCT GTGTGGATTG CATCAGCGCA TTCAAGACAG ACTCAAAGCC AAGTAG
|
Protein sequence | MTNFPPPPPP PPSRPLGVSP SAPAVLETSS ATSHYSTRRG PLNGGASASA LPQSNLSTHG STRFPPPPSH SVTTNGSVPS HDARTTPPSA HTPVPPPSIH RIAHPGMVSQ PPSRQNPPLP FAPPASVPPT ASRTLQPAYP VPPQPYSSNP LSPPLPSSRP PNTTHPHPSP PPPQPGYYQP SHTQQPPPGL RKIDPSQIPR APLFTRPQES QLPVYFPRAA VLNGETAQNP PPADSRYIVK DDGNASPNLV RASVYAFPLT RALWHQTGDL PLGILATPLA CHDETFVPRP RVLPDRSVQD WRDPQRIPCV DAREPSPPPR CGHCHAYANP FFGTDGSCNL CGTSNRGIAA NLTGPAMQCG TVDYHVSGPY VTRPQPVPPV FVYAVDLTCP HVTQYLPILA QLGEDLATHV GNQYAPLTPR IGLCWVSSAG ILVAGHHDRE RYSVMADVTN APFCPLPLND WTFDVSVPEG LASWKAYLDG LLQNDLEDLR KLARAKNAYG LDGMELSCGG AALAFLADAL AATGGRGTLI TRRRPNFGVG SLSVREPVPG KAHDPDNIIS YSPLQTASKL KHTEDAAASS FYQELAAKCC QDRTCLDVLY HTSPLTPPAY LDLATLGELC RNTCGKLFHV SNKDWNPIIL EELRAQVFSF TGWDAVFKVR CSDGIQIKSF PTHVGNLVDN GLGSSSEIEL SCVTPNTCIA VELEHRVGGV PPKNRYVYIQ TALLYSTISG CRRVRVSTLA IRSSTVVDEV FRSVDMGTAS ALLAREALDR MKKLVREKEG DAAREKARDL VFHRCLEILL NYRTNSSAAN SSARQMVLPE KLQLFPLYCM CLMKSPIFRP GMARRDAQTQ AVRMSPTGDD RALFVHYLAN VSSSTSMLMV HPNIFSVLGN ESGTAEFESH HGPEQVGFVR MPQPILPSMA SLEDDGVYLL DSGLQIFFYV GKTAPDEIKE MARSHQIDQA ELLHNFVWQM RTFNGTNQGS EGSVRPTHVP VVSIIQQDGH DAPMEADVLN LLVDDAVSGE KDYNDFLCGL HQRIQDRLKA K
|
| |