Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44222 |
Symbol | |
ID | 7204058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1347631 |
End bp | 1349472 |
Gene Length | 1842 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | enhanced disease susceptibility 5-like protein |
Protein accession | XP_002186236 |
Protein GI | 219113305 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTCGTGTGG CTTTTTCACT GACAGTTACA GTTAGTGCAA TTTACGGATA CCTTCCGACG ACAAAACACA TTTTTACCGC CGTCAAGTAT CCGTTGCATT TACAGTTACT GTTAATCCAA CGCAGTCCAT TGTCACCGTT GTCCAACATG CTTTGGAAAC GAGCGTCTAC CCGGACGTTT ATCGTCTTAC TGCACCTGCT CACGTACCGA GCGGGCGCCT TTGCTTCCTT GTCATCCCGG CAGGCTTCGT ATCGAAGTCG CGGTCCCTTG GTTCCTTCAC AATCCACACA GCGAGGTCGA GTGCTGACTA CCACCACCGA ATCTTCCGGA GACGTCCCTC GCGGTGGAGA CTTGCCCCGC CGGATCCCCG ACCTCCCAAC GCTTGCGGAT TATCGCAAGT TCGCGCTGCC CTGTCTCGCC CTGTGGATCG CAGGGCCCTT GCTGAGCCTC GTCGATACTT CGTTCATTGG ACTTTCGGGA TCCCCGGACC TCTCGGCGAA CAATTTGGCT GCGCTGGGAC CCGCGACAAC CTTCTTTGAC GGCGCGACCT ACTTATTCGC CTTTTTGAAC GTCGCCACCA CGAACCTGTA CGCGTCCGCT CGATCGCAGT CGGGCCCCAA TAGTCCCGAA GCCGAATCCG TCGTCCGTAC CGCCTCTAGA GTAGCGGTCA ACTGTGGAAT TGGAATCATG TTCTTCCTAC TCGCCTTTGC CCGACCCCTC CTCAAGCTTT ACATGGGCGA CAAGGCAGCC AGTACTCCGG GACTGCTCGA TGCCGCGACG GATTACGTCT TGATTCGGGC CCTCAGTATG CCCACTTCTC TATTATTGGG CGTCTTACAA GCCGCCTTAT TGGGTGCCAA AGACTCCGTT ACTCCGTTAA TCGCCATTCT GTACGCCACC GTTGTCAACA TATTTGGTGA CTTTATTCTC GTCAATCGTC TCCAGATGAG TCTGAAAGGG GCCGCGATTG CCACGACACT GGCGCAGTGG GCATCCACGG CCGCTTTGAT TGCACCCGCG CGTCGCAATC TCGTCAAGGA TCATTCCCTG GGATTGGTGC GCAAACCAAA ACCTTTTCCG GGCGGAGTCA CGGGACGAAC TTTTCTGGCA TTCGCCGCGC CCGTCTTGAC TTTGATTCTA GGAAAGCTCG CGGCCTTTGG CTTCATGACG AATGCGGCGG CGGGCGTTCC GGGACAGCCC ACACCGTTGG CCGCCCATCA AATCATTCTC AGTTTACTCT TTTTCTGCAG TCCGTTCCTC GAAGTCATTA GCCAGACAGC ACAAACCTTC TTGCCCTCCT ACTTGGCTCC TATTTTTGAA CACATGGACA AACTCCGCAA GCGCAATCCC GACTACAAGC CCGAGGAAGA TCCGGCCGTA GAGCCGTGGT TGAACACGTC CAAACTGGTG GCTACACGTT TGTTGGGCAT TGGTATGGTG ACGGCGGCTG TCGTGGCAAG CATCGTTTCC CTCATTCCGG CTTTTTTTGG TAACTTGATC ACGTCCGACT TGACGGTACA ACAGGCCGTC AAGCCTTTGG CGAAGTACTT GTGGATGGGT GCCTTCTTCT GGGCCCCCGT GGCGGTTTGC GAAGGTGTCT TGTTGGCGCG TCGGGAGTTG TCTTTTCTGG CCAGTATTTA CTTGGTCAGT ACCGCTTTGT TGCCGCCCGT CTTGCTGCGC ATCAAATTTC GTGGGGGAAC GGTTGGTCAA GTATGGGCTT GCTTTGCCAT CTTTCAGCTG TTCCGAGCGG CCTGTTTTAT TGGTCGTATT TGGGGACCTG GTCTGGTACA CCGCGTTCTG GGACGCCGGC AACCGCTTCC GGCGACCGAC AGGGGCTCCT AA
|
Protein sequence | MLWKRASTRT FIVLLHLLTY RAGAFASLSS RQASYRSRGP LVPSQSTQRG RVLTTTTESS GDVPRGGDLP RRIPDLPTLA DYRKFALPCL ALWIAGPLLS LVDTSFIGLS GSPDLSANNL AALGPATTFF DGATYLFAFL NVATTNLYAS ARSQSGPNSP EAESVVRTAS RVAVNCGIGI MFFLLAFARP LLKLYMGDKA ASTPGLLDAA TDYVLIRALS MPTSLLLGVL QAALLGAKDS VTPLIAILYA TVVNIFGDFI LVNRLQMSLK GAAIATTLAQ WASTAALIAP ARRNLVKDHS LGLVRKPKPF PGGVTGRTFL AFAAPVLTLI LGKLAAFGFM TNAAAGVPGQ PTPLAAHQII LSLLFFCSPF LEVISQTAQT FLPSYLAPIF EHMDKLRKRN PDYKPEEDPA VEPWLNTSKL VATRLLGIGM VTAAVVASIV SLIPAFFGNL ITSDLTVQQA VKPLAKYLWM GAFFWAPVAV CEGVLLARRE LSFLASIYLV STALLPPVLL RIKFRGGTVG QVWACFAIFQ LFRAACFIGR IWGPGLVHRV LGRRQPLPAT DRGS
|
| |