Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42809 |
Symbol | |
ID | 7196420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1196965 |
End bp | 1198146 |
Gene Length | 1182 bp |
Protein Length | 323 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176740 |
Protein GI | 219109975 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.767843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTGAACGGA GACGGCACTG CAAGCCAAGG CGGGCAAAAA GATCCTTACG GATTGTGCTA ATGCAATAAG CTGAGCAATG GTTGCATCGC ACGAAAGCGA GGGTAGCATG GCGTCCGTCG CCCCTACTGC AAAGATTGAT AAAGATATCA GTCGACCGGC TTCGCCCCGT GATGTAGCTT GCAGCTCGAT GGAGATGTTG GCGCGGTCCA TGGAGCCCTC ATCAAAATCT TTCTTGCCAA CCGCGAAACC GTTTTTTGTC ATGGATTGGT TGGACCGAAT TGATGAAAAG GACTTGGAAT TAGCGCGCCA GATTATAACG ACTCCAGGTA GGGCTCTTTT GACCGATTTC GGTAAGGAAA CCAGCACGCG AACAGTGTCG CTGGCAGGAG CTTCTCGAAA AGATTTTGCT GGTGAGCCTT CAAAACCTTC GGAGCATCCT CCCAAACAGA AGTTTACGCC CCGATCGCAT TTCCGCAAGC GAGGAATCGC CGTTGGAAAT GGATGGAACG CTAAAGGCTT GCAAAAGGCC AAGGAGGGGA ACTGGGAAGA TGCGCTGTCA TGCTGGGAAA ATGCTCTCGA AATTCGTTCG CAAGTTTGCC TGTCTCTGGT AGATGTGGCC AATACTTGCA ATAATATCGG CATTGCCTTA GGAAAGCTGA ACCGATTTGA TACCGCAGTT GAACACTTGG AGCGTGCTCT CGAAGTGAGG GAAGCGCTTG CGGAAGGAGC AGATAACCAG GCAGAAATTG CCACAACGTT ACACAACATC GGAAATGTGT ACCAGCAGGC AGGCGAGTTT GGTAAAGCGG AAGAGTACTT TGTAGAATCT AGGGACATGC AAATCAAAGT CCTTGGACGA GATCATATAC ATGTAGCCCG AACCTTGGCA GCATTGGGCA ATGTTCGCTA CCAGGCCAAC CGAATCCCGG AGGCTCGGAA AGCGTACTGG GAAGCCTTGA CCATCTTCCA ACACGTTGGA CTCCCTGAGG CTGATATTGA GGTGCAGTGT GTTTTAGGAA ACGTGCAAGA GATTGACAAA TCGAAATAGA ATTCACACGC GGAAGAATGA TTATTGAGAG TTGAGTTTCC TAACTTTTAC GTGAGGAAGA ACATCAATGT GACAGGACGC TAAAATAACG ATTAGAACTA GTAGAGACTC AAAACATTAA AGTTGCATTC TA
|
Protein sequence | MVASHESEGS MASVAPTAKI DKDISRPASP RDVACSSMEM LARSMEPSSK SFLPTAKPFF VMDWLDRIDE KDLELARQII TTPGRALLTD FGKETSTRTV SLAGASRKDF AGEPSKPSEH PPKQKFTPRS HFRKRGIAVG NGWNAKGLQK AKEGNWEDAL SCWENALEIR SQVCLSLVDV ANTCNNIGIA LGKLNRFDTA VEHLERALEV REALAEGADN QAEIATTLHN IGNVYQQAGE FGKAEEYFVE SRDMQIKVLG RDHIHVARTL AALGNVRYQA NRIPEARKAY WEALTIFQHV GLPEADIEVQ CVLGNVQEID KSK
|
| |