Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47172 |
Symbol | |
ID | 7201953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 708876 |
End bp | 710771 |
Gene Length | 1896 bp |
Protein Length | 629 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181252 |
Protein GI | 219121810 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCACCATGG CGACGACCAA CGATACCGAT GACGCCTGGG GCGAAGTGTT TGCCTTGGCG GAAGGAAAGG ATACGAAAGT CTCCTACGGT AAAATGGAGG CGGAAGCGGA CCCGGAGCGG TCGGCGTTGG CGTCGCGGGC CCGACACAAA CGTAAGCGAG ACGTCGTGTC GAAGTCGGCG TCGTCGCCAC TGAACGAGGA GGAAGCGTAC ACGGCATTCT TGGAATCGCG CACTGTCATA GGATCATCGA TTCCGCAATG GATTCGCTTG GGCGAGACGT TCACTTCGCA CGCGGTGTGC AAGGGATGGG CCGTGTCCCG AAAAGAATAT AAGCAGGGCA CCTGCCGTCG TTGCCGACGT TCGGCCGTGC ATCACCTCGC CGAAATAGAT CCAAAATTTC CATGGTGGTG TATTCTTTTC GTTGGAGTCC GAAATCTGCG TTGTATAGCT GTCGGTAGTA TTCTACGGCA AGCGAAAGAG AAAGAAGGGT CTCTGGAGGG TCCAGATACC ATTCTTCGAT ACGTCGGAAG GCGAATCTGG AAAGAATTGG ATACGGGCAT GATACACGGC AGTATCATTC GCTTTCCCAC ACTACAAGCA AAGTGGAAGA TTTTCCGATC TTCTCTTCAT TCTGTGCTAC ATTCCAAGGC CGACAAAGAG TTCTTGGATA AGGAAACGGC CTTTCGATGG ATTATCCACT CGGACGCCAT TTACTACCAG CTGTACTACT TGCAACTTAC ACGACAAATA CCTCTACTTC CACATGATGC TGATACCGTT AGAAGGATTC CTCATCCCAG CGAATACTTT GGGCAATCGC AATTTGCTAC GGATCATCAA CAGGCCAAAA TAGCTTTGCG TCTCTTTCTG CAACGGACCG AGACCAATAC TCCTCGAGCA TCCGATTGGA CGGATCGCTT TGGTTGGACC CACCGCAGCA GTGCCAAGCA TGGGCACATT TTGGAGACTT TGCACGAATA TCGTATGCTG GAAACGGTGC TGCTGTTCGA CTTCTCGGGT CTAGTCTCCA CAACCGAAAC CATCACCGCC TGGTTCGCCC AAGTCTCGCC GTCGAACCAG CTAGACCAGC ACGACACTCC GGCACCACCA CTTTGGATGG CCTGGCGAGA CTCGTGCCGC GACTTTCTTT GCCACTTGTA CGCGTATGCT ACAATTTCGC AATCAGTCAT TTCTCAGCTT CCATCTCTGT TATCCAAGTA CGGTATGCAT CAAGGAATCA TTGAGGCGGG AGCAGGGACC GGATATATTG CAAATCTTTT CATCCGAGCG GGCATTTCGA CCGAAGCGTT CGACGTGCAC CCAACCAACA GTGGTTCCAA TTCATCATCC GTGCACAATG GGTATCACGG TGCAACCCCT TCATATGTCT CGGTACGCCA AGGCAAGTCC AGTGCGCTCC ACAAATATTT CTCGCACACA TTTAGCAAGG CTTTGCTACT GTGCTATCCA CCCCCGGACT CAACCATGGC CTACGACGCG TTGCGCTCCT TTGTGCAACA CGGAGGATCG CTTTTCGTGC ACGTTGGCGA ATTCCGGGGC CTTACCGGCA ATTCAACTTT TGAGCAATTA TTAATGGATG ACTTTGCTTT GCTGCAGCGT TTCCCTTGTC TACCGTGGGG TACTGACGCT GCGGAGCTCA CTATCTGGCG TCGTCGAAAG GCCAGCGACA ACACGTCCAA AAGTCGCCTA CTTCCTTGTT CGTCTTGTGG AACTAGGGAA TCGGTACAGC GCCTGCGACT AGTCCGCTAC CTTACCTACT GCAGCGCTGA ATGCGCACAG CAGCATCAAC CTTCAATAAG CGAGCATCTT CGATGGGCCT TCCTCCCACC GATGCGGATT GATTGGAAAG ACGACCGTTT TTTTGCGATT TTGTAA
|
Protein sequence | MATTNDTDDA WGEVFALAEG KDTKVSYGKM EAEADPERSA LASRARHKRK RDVVSKSASS PLNEEEAYTA FLESRTVIGS SIPQWIRLGE TFTSHAVCKG WAVSRKEYKQ GTCRRCRRSA VHHLAEIDPK FPWWCILFVG VRNLRCIAVG SILRQAKEKE GSLEGPDTIL RYVGRRIWKE LDTGMIHGSI IRFPTLQAKW KIFRSSLHSV LHSKADKEFL DKETAFRWII HSDAIYYQLY YLQLTRQIPL LPHDADTVRR IPHPSEYFGQ SQFATDHQQA KIALRLFLQR TETNTPRASD WTDRFGWTHR SSAKHGHILE TLHEYRMLET VLLFDFSGLV STTETITAWF AQVSPSNQLD QHDTPAPPLW MAWRDSCRDF LCHLYAYATI SQSVISQLPS LLSKYGMHQG IIEAGAGTGY IANLFIRAGI STEAFDVHPT NSGSNSSSVH NGYHGATPSY VSVRQGKSSA LHKYFSHTFS KALLLCYPPP DSTMAYDALR SFVQHGGSLF VHVGEFRGLT GNSTFEQLLM DDFALLQRFP CLPWGTDAAE LTIWRRRKAS DNTSKSRLLP CSSCGTRESV QRLRLVRYLT YCSAECAQQH QPSISEHLRW AFLPPMRIDW KDDRFFAIL
|
| |