Gene Information |
Locus tag | PHATRDRAFT_45104 |
Symbol | |
ID | 7200180 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 205304 |
End bp | 207205 |
Gene Length | 1902 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179156 |
Protein GI | 219116723 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.902407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACCCGAAA GTGGGAGAGC CACAGCACCA TGATGATGTC GAAGCAGCAC CATCAGCTGA AAGACATTGC CGTACCACAC GACCATGACG TACTGTCAGG GAGAGGCAAT TTCGTTAATT ATCACGCCGG CAACGAGTTT TTCCGTGCGC TTGTCCGAAA ACATAAGGTT GCATACGTTG CGTGTCCCAA GCCTCAAAAG GGCAAGTTTT CTCGCATGAT TCTGGACGAA ATTAGGTGCC TTAATCCCCC GGGACGATTC CTGAAGCAGG ACTCTGTTTC AAAAATGTGG TACGATATTG GGGAAAAGAA AGCTCTAGAT AAAACGAGAC AAGCCTTGAG AGAAGGAGCA CCGGAAATCA TGAAGGAGAT TGGGGACGAC GAGAGCAGCG AAAACCCTGC CTCGTCCCCC CTTTCTCGTG AAACCACGCC AAAGGTACGT GTTGTGCGAG ACCATGCGAA CAGGCCGATG ACTATGGGGT TTCGCCGGAT GCACACGCTT GAATGCCGAT CCCTTATATA GAGGTCATTC GAGGAAGGTT CCGCTGGTTT TTGACATTAT ATGCCCTTCA TTCACAAATA TTATGCCCCT TTTTCCTTTT CTCCTTCTGA CAGGTGTCTA TGTCACACGA TCAGGCACCA CCTCCTCCGC CCCCGATGTA CAACAGCACG CCTCCTATGA ACAACCGATC ATCCCGCTTC GGAAATGGAT ACGAACACTA CACTCCAGCT GCCGCGCCTG CGTTGTCTGC ACCTACAGGG CTTTCCCACT GCCCTACAGA AGGCACCCGT TCAACTGCGA TGAGGCATCC GGGCCAAATG ATGCTTCATC AACAAATGCA ACACTACCAG CAAAGCAATA TGAGAAACAA TCTATTTTCG ACGCAAACCA TGGCAGGACA GCAGGGCAAT CCTCAACATA CCGTATCTAG ACCCTTGCCA CATAACCATA TGCCAACCGA AGGTTTTCAT CAGCAAGAAC AAGAACAAAG CCACAGAAAT CATCATCTTC ATATCAACAC TGTACGACAA GTTACTGAAT GCGGGCCGAA TCTACAGGGA CGCATGGGAG ATAGTAGTAT TCATTGCCAT GAGTATTTAG CTAACGGGAG AGAAACTTTT GACAGCAAAA TTTCGTTTGA TGATTTCCTG GAGCCTCGGC CTATTTTTTC TGGTCATGTC CAACAGCGGC AGCAACATCA ACAGGACCAA ATACAGCTGA ACGAGTTTCA GCAGCGACAA GAAGAGCAGC ATATTCAGCC ACCACAACTT CCATTACGAC CGCATCCGTT AAAAGAGATT TATTCGAAGC TTTACGAGGA ATTACCAGAA AGCCTAGGTC CTTTGCCCAA CTGCAAAAGG CAAGTGCGAA AAGGTCTCGA ACGAGACAAT AGTGAAAAGA GTATACAAGT TGACAGTATT TTTCAGGAGA TGAAGCACTC CGAATCTGCC GGACAAGTAA ATGGCGGAGC TTCGGCCCAA AATTTATCTA TTATGAGCTT GTCTATTGGT GATATGAATG TAACTTCAGA GCCAACAAAT GCAGATAGTT TGGCGGCTAT GCTGAATAGC TCACTGCGCG TGGGATCTCG GAAAACAGGT CGTCAAAGTG GTAGCGTTGG TGAAGCGAAC AACTCTGACC TGGCACATGT GATGGACATG AGCGTGGCCA CACTCGGTGA CCGTCTTTCG GACTTTGGAG ATACGAGTTT ACCGCGGATG TCGGAGTCAC AATCAAACAT GTCTTTCGTA AACGTTTTTG AAGAAACCGA AAAGGATCTG TTTGCTGGAA GGTAGTTGCT 
ATCGGGATCC AGGTTTACGA TTGTGGCACT GTCCGGTCAA TCGGAATTTT CACAAACATT ATCGCATTAG TGGGGTGTAA TTAATCGTGA CTTTTTGCCT AC
|
Protein sequence | MMMSKQHHQL KDIAVPHDHD VLSGRGNFVN YHAGNEFFRA LVRKHKVAYV ACPKPQKGKF SRMILDEIRC LNPPGRFLKQ DSVSKMWYDI GEKKALDKTR QALREGAPEI MKEIGDDESS ENPASSPLSR ETTPKVSMSH DQAPPPPPPM YNSTPPMNNR SSRFGNGYEH YTPAAAPALS APTGLSHCPT EGTRSTAMRH PGQMMLHQQM QHYQQSNMRN NLFSTQTMAG QQGNPQHTVS RPLPHNHMPT EGFHQQEQEQ SHRNHHLHIN TVRQVTECGP NLQGRMGDSS IHCHEYLANG RETFDSKISF DDFLEPRPIF SGHVQQRQQH QQDQIQLNEF QQRQEEQHIQ PPQLPLRPHP LKEIYSKLYE ELPESLGPLP NCKRQVRKGL ERDNSEKSIQ VDSIFQEMKH SESAGQVNGG ASAQNLSIMS LSIGDMNVTS EPTNADSLAA MLNSSLRVGS RKTGRQSGSV GEANNSDLAH VMDMSVATLG DRLSDFGDTS LPRMSESQSN MSFVNVFEET EKDLFAGR
|
| |
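The Gene Length field above agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption inferred from the numbers in this record, not something the record states. The sketch below shows the arithmetic, plus a simple GC-percentage helper of the kind that could produce a GC content field. The function names `gene_length` and `gc_percent` are hypothetical, and the record does not say which sequence span its 48% value was computed over.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence.

    Whitespace is ignored, so a sequence printed in space-separated
    10-base blocks (as in this record) can be passed in directly.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Coordinates taken from the record above.
print(gene_length(205304, 207205))  # 1902
```

To check a GC value, pass the sequence text to `gc_percent`. Note that the Gene sequence field as displayed may not cover exactly the same span as the coordinates, so its result need not match the listed figure.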