Gene Information |
Locus tag | PHATRDRAFT_43169 |
Symbol | |
ID | 7196766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2220989 |
End bp | 2222289 |
Gene Length | 1301 bp |
Protein Length | 404 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177472 |
Protein GI | 219111441 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0856686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCACTGTCA AGGCTGTGTG GACACTCGTT ACTACTAGAG CGAGCAATTA TACAATAGAT TGGTTCGCAC CGTTCGAAAG CGCATTATGA GTCTAACTCG CTGGGCTTTC GCTTCTACTC TAGTAACGCT ATTTTCCGCG TCGGCGTTTG TGCCGCAGCA TAGTCGTAGC AAGTCGCATC ATGGATCGGC CGCGACCACG CACCTGTACA TGAGCGCTGC CTTGATTGTC CAAAATAAGG GCGGCGGTCA CGGAGAGCTC GGCTTTCAGC TCGCCAAGGT TTTGTCGGAC AACGACAAAA TCACTTCCGT GACCATTCTA CAAGACGATG CCTGCAAAGA TTCCGCGGAA CCTTTCCAGT CCTACGCCAC GGACCTACCG GACGTGAAAG TCGTCAAGGC TAGTCTCGGC GACGAATCTA TGACCGCCAC TGCGCTGCAA GACATTCTCG GGAAAGACGC CGCCTTTGAT TATGTTTGGG ACAATGCCTC CAAGTCCCCC AAGGGTGCTG GACAGGCCAT CTGCGATCTC GCCAAAGCTT GGAACGTCAA ACTGTTCACG TACGTATCTT CCGCTGGAAT GTACCAACCT ACGGCGGATG CTCCCTTTCC CATGCCAGAA ACGACACCGA TCAAGGAAAG TGCCGGACAG AATCAGTTCG ATCAGTACGC GATTCAACAG GGATTGCCCT TGGTCACCTT CCGGCCACAG TACATTTACG GTCCCAAGGC CAACAAACAC GACTACATCG ACTGGTACTT TGATCGACTC GTACGAGAAC TGCCGCTGCC TATTCCCGGT GACGGGACGC AAAAGCTTTC TCTCACTAAC GCCGAAGACG TGGCCTCACT CTTGGCGGCA CCCTTGAATG ACGAAGCCGC CGCGATTGCC CAACGTGTTT TCAATTGCGG TACCGATCAA CTCGTCAGCT ACGACGAAGT TGCCTACCTG TGTGCCGAAG CAGCGGGTAT CGATAAAGAC AAGGTCATGA TCGAACACTA TGATGCCGAC ATGTTCGGCA AAGCGACCTT TCCCTTTCGC ATGACGGACT TTTACGTGGC TCCCGACACG GCCAAGGAAA AGCTCGGCTG GTCCGGGCCG CTACACTCCC TGAAGGACGA TCTGCAATCA TTTTACTACG AATCGTACGT AGCACGGGGT GGGCCAACCA AGAAGATGTC CCTTATCAAG GATTGGGAAA TCACGGTTGG TAGTAAAACG TCTCTGCCGG AATACGGATC CAGTATTTAC GACAAGTTCG ACCCAATTAT CCTCGAAGTT CCTGCGGCAC CTGCCGAATA G
|
Protein sequence | MSLTRWAFAS TLVTLFSASA FVPQHSRSKS HHGSAATTHL YMSAALIVQN KGGGHGELGF QLAKVLSDND KITSVTILQD DACKDSAEPF QSYATDLPDV KVVKASLGDE SMTATALQDI LGKDAAFDYV WDNASKSPKG AGQAICDLAK AWNVKLFTYV SSAGMYQPTA DAPFPMPETT PIKESAGQNQ FDQYAIQQGL PLVTFRPQYI YGPKANKHDY IDWYFDRLVR ELPLPIPGDG TQKLSLTNAE DVASLLAAPL NDEAAAIAQR VFNCGTDQLV SYDEVAYLCA EAAGIDKDKV MIEHYDADMF GKATFPFRMT DFYVAPDTAK EKLGWSGPLH SLKDDLQSFY YESYVARGGP TKKMSLIKDW EITVGSKTSL PEYGSSIYDK FDPIILEVPA APAE
|
| |
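
The coordinate and sequence fields in this record can be cross-checked with a short script. The following is a minimal sketch, not part of the source database: the function names (`gene_length`, `strip_blocks`, `gc_content`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported Gene Length of 1301 bp.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(seq.split())


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    bases = strip_blocks(seq).upper()
    if not bases:
        return 0.0
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(2220989, 2222289))
    # First two blocks of the Gene sequence field, used as a small example.
    print(gc_content("CTCACTGTCA AGGCTGTGTG"))
```

Passing the full Gene sequence and Protein sequence fields through `strip_blocks` and taking `len` gives values to compare against the Gene Length and Protein Length fields.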