Gene Information |
Locus tag | PHATRDRAFT_45757 |
Symbol | |
ID | 7200782 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 157771 |
End bp | 159563 |
Gene Length | 1793 bp |
Protein Length | 530 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179986 |
Protein GI | 219118427 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.846436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATG ACCGGATAAG TGACCGTATA GCCTTCGGAT CGTGCAACGA ACAGGATCTG CAAAACAACC TCTGGCCAAT TATAGAATCG CGTCAACCCA CTGCGTTTGT TTGGGGTGGC GATGCCATTT ATGCCGGTAC GTTTGCCGTT CTGTGCCGAC TCGGCGAGAA TCTCAGCAGA TTCTACGTCT GGTATAGATA CCTGCTGGCT TCTTACATTT TTCTTTCTGT TTCATCACCG TATTCCAGAC ACCTTGCTGG CACCAGACTG GTCTTCCTTT CCACCGAAAG CGCAACATCG ATGCGCCACT CCTTCCCTGT TGCGGAAGCT TTACCAAACA CAACGAAATG TGCCTGCCTA TCGGAGCCTA TTGCGCCAAA ATCTGACAGT GTTTGGAACT TTTGATGGTA CGTACGTAGG CCTGACCGGC TTAAGTCCTT TTGTATTCGA AAGTGGTCGA TTTCTCATTT GGACACTGTA TTCACCTTAC ACTAGACCAC GACTACGGCT GCAACAATGC CGATCGTACG TATCCATACC GATACGAAAG TGGTATGGCC TACGTGGATT TTTTGGGCTT GGTTTCCTCC TCGCCCATGT ATCAGCGTGC CCAAGCTGGA CATGGAGTAT ATGGAGTCAA GCTTTTTGAT TTTGCCCGGC CGAAGGGAGA ACGCGAAGTA TCCGATGTCG AGGCCGGTCT CGACCCGGAT TTAAACAGTT TTGGCCAAAA GGGCATGCGG ATACCAGCAA TGCACTCCTA TTCCAATCGT ACAGTCGCAG TGTTTGTTCT CGATGTGCGC ACCAACAAGA CACCATGGAA AAAGGGTTCG GCTGCGTACC AACCCGACCT AGAGGGAGAC TTTCTCGGAG AGCGCCAATG GCAATGGTTT GAAGCCGCCA TTGCGCGCTC TACAGCCTCC GTCAACGTCG TTGTAAATGG GTTGCAGGTG CACGCAAATC GCTATCCTGA TGGTAATATT GCGGAAGCGT GGGGCAAATA TCCCAAAGCA CAACAACGAT TATTCGATGC AATTTTGCAA GACGGCGTGG AATCGCCTGT CCTCGTTTCG GGTGATGTGC ACATGACACA ACTCATGCGA AAGGATTGCT ACAATCCTGA AACCGGCAAT CAGCGGTCTT TGATCGAAAT GACGACCTCC GGAATGACGC ATTCTTGGGG AACTGTTAGT AGCCAGCCAC TGGTGGATCC TACTCGCAAG CCTTCGTGGA GGGATTGGTT CGAGTCATAC GCAGCTACTC GTATGGTCCA TATCCTCCAC ACTGTTTGCC CTTGGACCGA CTTGATGGTT TCCGAAAGTC AAACTTCAAG CGGGGAGGAC AGCTCTCCCG AATCGGGGGG TGGTGAAGGA TCCAAGCAAG GCTTGCAGTA TTCGTTGGCC AAGAATTTTG GCGAGCTCGA GTTTGACTGG GATGCCCGAA TGGTCTCAAT TCGCAGTATA GGGGAAGACC CCGCTTCTCC TCCGCTTCTG ATGGCCCGAA CGTCTATGGA TCGACTCTCC GGACGGGATG GCCTCATGCC CGGTGGTACC CTAACGGACT CAGATTTTGC CGTAGCGCAC GAGGCTATAG TACCAGGTAC CGCCGAGTGG GTCTGTGTGA ATCATAGAGG TGCCGAAAAC TACGTAACGC ACATGTTTGG CCATGTGGTT TCAGCTATGA TTTTGTTGGC TGTGGTGCCT TTCCCCTTGC TCCTTCCGGC GTTGCTGGCC TGCTATCTTG TACGCTGCCG TCGGAAAGCC AAAACGGTTC CGTCCACCAA ATTATTGATG TAA
|
Protein sequence | MNDDRISDRI AFGSCNEQDL QNNLWPIIES RQPTAFVWGG DAIYADTLLA PDWSSFPPKA QHRCATPSLL RKLYQTQRNV PAYRSLLRQN LTVFGTFDDH DYGCNNADRT YPYRYESGMA YVDFLGLVSS SPMYQRAQAG HGVYGVKLFD FARPKGEREV SDVEAGLDPD LNSFGQKGMR IPAMHSYSNR TVAVFVLDVR TNKTPWKKGS AAYQPDLEGD FLGERQWQWF EAAIARSTAS VNVVVNGLQV HANRYPDGNI AEAWGKYPKA QQRLFDAILQ DGVESPVLVS GDVHMTQLMR KDCYNPETGN QRSLIEMTTS GMTHSWGTVS SQPLVDPTRK PSWRDWFESY AATRMVHILH TVCPWTDLMV SESQTSSGED SSPESGGGEG SKQGLQYSLA KNFGELEFDW DARMVSIRSI GEDPASPPLL MARTSMDRLS GRDGLMPGGT LTDSDFAVAH EAIVPGTAEW VCVNHRGAEN YVTHMFGHVV SAMILLAVVP FPLLLPALLA CYLVRCRRKA KTVPSTKLLM
|
| |