Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47205 |
Symbol | |
ID | 7202191 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 810072 |
End bp | 811862 |
Gene Length | 1791 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181446 |
Protein GI | 219122215 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.551537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTCCT CCAACAGTCC CGGAGCCGCT GTTGTCATTC CTTTTGAGCC CACGAGCAAA GACGTTGTGC TTTCATCGTC TGAACCCGAA AACCACCATC TCGGGAATGT CTTCTTTCAC CATCTGCTCC AGAGCTTGCG CAATATGCCG AACTTACAGA GCACGGCTTC TCGCGTAGTC GACGCAGTCT GCGACGAACG TCAAGGGAGG TTCCTGAATG TTCTTCCCAA TACGACAGAA CAAGAAGTTC GTCTTTGTAC CGTGCTGACC CGCGATGAGG CCACTGCTCG CGTATACCAG GCCCTACAAC AATCACAGAT TGTGTCGGGT CGCACACCGC CTACGAAAAG GGCTCGTGTA CAACGTAAGG GTGAAGCGAC CAATCTTCCT GTCTCTCCTA CTAGTACTAC TACAAAAAGA GGTCCCGACG AAATTTCTCT AAAAGTTCGT TGCAGGTTTA GACGGAACGT CTACCGCATT GTCAATGGGA ACAATCCGGC GAGTAGGATT CACCCATATG CGATTGAACT TCTGGAAGCC GTCTGCAACC AGGCTTTTAG TAATGTCATA CAACATGCAT TTCAACAAGT GGGTTTGCGC TACACGAATG GCGGGGAAGA GGAGCCTGAA GAACACAGAA TAATGCGTCT TCAAATGCGA CAGCGGTTTG TAGCAGCATT GGAGGCCAAT ATTCCAACGT TAGACTTTGC CAAAACTTTG GTGAGATCGT GGGATGGGGA GTTGATTTGC CGACCTGAAG CCGAGACTAT CCCAAGCAAG GCTTTGGATG TGGAAACGGA TGTCCAAACC GTGACTTCCG AAAATCCACC AACATTCAGC AAGTCGGAAT CTTGCAATGA TCTATTGCAA GCGAGGGAAG AGGAAGTGAC AAAAACCTCG ACCTCGTCAA GAGCATCTAA AATAAAGGCT CCCAAGAATG TTTCGTCCAT CTCGATGGAA AATCCTACTA TTGCTGCGAA TCAAGTTGGG ACCTTACAGA ACTCAGTCTC GCAACATCCA CAGCAACATG CGACCAAGTC GGCCAAGAGT TTGCCCATTC CCAGGAACGC CTCCAGTGCG CAATGCCTTA TATCTCCCCT TACGGATGGC CCCTTAAAGC TGTCGCCGCT CACGTCGCGC ACAAAACCAG CTGCCTCCTC TTTCCTTTCC TCTCTGGTCA CTTCGCCTAT TCGCAAGAGT TCCAGCACCC CTTCCAAAAT GATGTTCAAG AGGGAATCAT TCGACGATTT AGATGAAAAT GATGTTATGC AAGGTCTCGA CTTTTTGGAC GACACGCATC ACGACTTGGG CGATGGATTC CTGTTGCACG ATGCACTTGC CGCAGACTTG GGATCGCCGG AGCCTTTACA TTTTGAAGGA AACCACGTAG GCTCGACCAC ACCACCATTA CCGATCTCGC CCCTGGCAAC ACTAGACCCG AATACGACGG AGGAACGAAC CGGAAAGTGG ACCATTGTTG AGAAAAGGGC ATTTTTGCGA GGCCTGGAAC GGTATGGAGC CGGGCGTTGG AAGCAAATCT GCGATATGAT CCCTACTCGG TAAGTCGTTA GGAAATGACC AACAACCGCG TACGCGTGTG CCGTTTACAC AGTATCATCC GCATCCTTTG CTGACCGTTC TCACGATTCA TTTGCTTCTT TGCGCTTTGC GGTCGGCGAC AGCTCGTACG GACAAGTAAA AAGTATGGGT CGCTTCGTTG TGAAACGCTA CAACCTTTCC AAGAATGAGC GGCCGACGGG ACCCGTCGTT CATTTCCTGC AAAAACCTTA G
|
Protein sequence | MASSNSPGAA VVIPFEPTSK DVVLSSSEPE NHHLGNVFFH HLLQSLRNMP NLQSTASRVV DAVCDERQGR FLNVLPNTTE QEVRLCTVLT RDEATARVYQ ALQQSQIVSG RTPPTKRARV QRKGEATNLP VSPTSTTTKR GPDEISLKVR CRFRRNVYRI VNGNNPASRI HPYAIELLEA VCNQAFSNVI QHAFQQVGLR YTNGGEEEPE EHRIMRLQMR QRFVAALEAN IPTLDFAKTL VRSWDGELIC RPEAETIPSK ALDVETDVQT VTSENPPTFS KSESCNDLLQ AREEEVTKTS TSSRASKIKA PKNVSSISME NPTIAANQVG TLQNSVSQHP QQHATKSAKS LPIPRNASSA QCLISPLTDG PLKLSPLTSR TKPAASSFLS SLVTSPIRKS SSTPSKMMFK RESFDDLDEN DVMQGLDFLD DTHHDLGDGF LLHDALAADL GSPEPLHFEG NHVGSTTPPL PISPLATLDP NTTEERTGKW TIVEKRAFLR GLERYGAGRW KQICDMIPTR SYGQVKSMGR FVVKRYNLSK NERPTGPVVH FLQKP
|
| |