Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37243 |
Symbol | |
ID | 7202029 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 501002 |
End bp | 502088 |
Gene Length | 1087 bp |
Protein Length | 332 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181388 |
Protein GI | 219122093 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.533431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGCCA GTAATTATAA TGGTGGCGGT CTGCGCTGCG GCTTTTGCAG ATGTCTCATC GGATTTCTCG CGCTGACTGG AATTGGCATC TTCCTGGCAG TCCACTTGAC GGACGCTAAC ACCCCGGCAG ATCTGATACC CAACAACTTT AAACCTTCCG ACTTCATTCC GAACTGGGAA GAATTCTTTC ATGAAGATCC GTTCAACGCG ACAGGCCCAG ACGACGCCCC GCGATGGCGA AACAGTGGCA ACGGACTCAA CCTCGAAATT GTCAACGCTC TCGACGAGCG CTGGAATGTC TATTTCGACC GCGCAGTTCA AGAGTGGGAT GCCGGGAATC CGGATGTTCT AGACCTGACG ACGTCCGCCA GCTCCGTTGA TTCCAGTTGT CAAGCAGTTG AAGGTGTCAT GAAGGTTTGT AACGGAAACT ACGGCGACAC GCGCTGGAAA GGTATCAATG AGGTCATTAT TGAAAATGGT GTTCGCATAA TTTCGAGTAC GGCTAAAATG AATGAGTTTT ACTTGTCCGG TGGTAACGAC GCACAGAAAC AATACACGAT GTGTAAGTTA AGGGCGGGTT GGCTTTCTGT TGTTCCTTCT TTATTCCAGT CGTCCTCACT TTTGCATTGA GTTTGGTCCA TATTGGTTAG GTCACGAAAT CGGGCACGGC TTTGGATTGC CTCATACTGA CGAAAATTTC AACAACGCCA ATACAGGAGA ATGCATGGAC TATACATCTC AGCCCGAAGC AAATATGCAA CCCGGAGAGG CCAATTTCTT GTTTTTGCAA GAAATGTATG GAACACAAAC CGGCGGGAAT ACGGTAGCTA GTGCTTCAGA AGTGCCAACC GATCGACGCA TCCCTGCACT CCGCGGTAGC GCCACTATTA GTTGGTTTGA GGATGTTGTT CTGTCGGTCT GGGACGACCT TATACCCAAA ATAGAGGGTA GAGTTGACGA GCGAGAGCAT GAGGATGGAT GGCAGCTAGT CCACCGATCA CTGCATGGCA GTGCTCACGA AATGTATTTG GGAAAGGGGC TGGTAGTTCG TGTTCATAAA CTATTCTCGA GCGAGCGTTT GGAGTAG
|
Protein sequence | MGASNYNGGG LRCGFCRCLI GFLALTGIGI FLAVHLTDAN TPADLIPNNF KPSDFIPNWE EFFHEDPFNA TGPDDAPRWR NSGNGLNLEI VNALDERWNV YFDRAVQEWD AGNPDVLDLT TSASSVDSSC QAVEGVMKVC NGNYGDTRWK GINEVIIENG VRIISSTAKM NEFYLSGGND AQKQYTMCHE IGHGFGLPHT DENFNNANTG ECMDYTSQPE ANMQPGEANF LFLQEMYGTQ TGGNTVASAS EVPTDRRIPA LRGSATISWF EDVVLSVWDD LIPKIEGRVD EREHEDGWQL VHRSLHGSAH EMYLGKGLVV RVHKLFSSER LE
|
| |