Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50229 |
Symbol | |
ID | 7199010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 52872 |
End bp | 54593 |
Gene Length | 1722 bp |
Protein Length | 520 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185195 |
Protein GI | 219130066 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGTCGATTT TTTGCTTGTC TGATTAATCG AAGACTAGCA AAGTTTGCAA AAGTTGAAGC AAGATCCCTG ACTTACAAAA GTAACTATGG ACGACTTCAA CTCGCAATTC GGGAAGCTCT CCACTACGGC GGCAGAATGG AAGCCAAGCG GAAATGCAGA CAACGATTCT GATGTACAGT TAAACGCCAA GGTTGTCAAG GAATTTGTAC CAGGCCAAGG ATGGATCGCC TCGTCGACCA AGCCTGCTGC GCGTGAAGCG AATATCCCTT CGTCGGTGAG GGAATATACA TTGGATGCGG CAAATTCCTC CGCGAGGTTT GAAACGACAG TTCCAACAGG GCCTTTCCCT TCTCCGATGC CAAGCTTCCG GGCCTTGCAA ACTCTGGGAT TAGGAGACGA TTTGTGGCGA CATTACAGAG ATATTTCTTT AGAGAGCTGT AGGCAAATGG ACCCCGATGA CCAGCGGCAG AAGGCAGTCC CTTTACCTTA TTGTAACGCC TACTGTCTAG ACGATATCTC CCAACGGGGA CGCTCCTCGT TCGGTTATCC ATCGGCAACT TTCCATGTCA CTTCCAGAGA AGACGGCAAT GCTTACTGTC TGCGCCGTTT CGACAATGTG AGGTGTGTGT CACCCAAAAT AGCCTCCACC GTATCAGACC GATGGACCAG TGTCGCAGTT GTACAAGAAC ATCCCGGCAT TGCACCCTTT TACCAATGCT TTATGGCCCA ACGTGCTGTG TTCTTTGTGC ACCAGTACAT ACCTGGAGCG CGTAGTCTCA AGGAACGCCT CGGGGGTCCG TTGTCGGAGT GCGTCCTGTG GAGTTGTATC TCCCAACTTG TGTCGGCTTT ACGAACAATT CACGGAAGAA ATTTGGCCGC TCGTACTTTG CAACTTCATC ACATCTTATC GAATACGGAT TCGGCTGCAA GTCGGCTGCG TGTGCGTTTG AACTGTTTGG GCATTGCAGA CGTTCTGGAG TTTGAAGCGC GCAAAAAGGT GGCCGACCTG CAGCGACATG ATGTTCGGGA TCTTGGAAGG TTGATTCTCT CACTAGCGTC AGGTACCGAA ATCACCCATT CCACCGACAT GGAAACGGTA GGATCATGTG AACAATTCTT GGCACAGAAC TACTCGCCAG ACTTACACAA TTTAGCCATG ACATTGATCA GGAGTACACC TCAGCCGCCG TCGATTCTTG ACGTAAGTAG AGTTGTCGCT CAACGGGCTT TCGATGAGCA AGATGCAGCT TATCAGTCCT TTGATCGCAT GGAACGAGCG CTATCCGCAG AGTACGATTC AAGCCGGATG TTACGAATCT TGCTGAAACT AAGTTACGTG AATGAACGAC CCGAATTTGG CCCAAATCGA AGATGGGCGC AGTCGGGAGA TTGCTATGCT CTGACGCTAT TTCGTGATTA CGTCTTTCAC CAAGCTGATG GTGGTGGCTA TCCAGTTATG GATTTAGGGC ACGTCATATC AGCGTTGAAC AAGCTTGATG GCGCAGATGA AGAAAAGATT GTTTTGTCGT CTCGGGATGG GAAGAGCCTA ATGGTAGCAA GCTACGCAGA GATAGCTCGA TGCCTGGAAA ATGCGTTCCA GGAACTATGC GTGGGCGCAG TATCGCATGA TGCGTTGCAT TACTGTTGAC TGAGAGCGAA TCAGATACAT CACTTATTGT CACAAATTAA ATTGATACTG TAAATGGGAT GCTCAAGACG TG
|
Protein sequence | MDDFNSQFGK LSTTAAEWKP SGNADNDSDV QLNAKVVKEF VPGQGWIASS TKPAAREANI PSSVREYTLD AANSSARFET TVPTGPFPSP MPSFRALQTL GLGDDLWRHY RDISLESCRQ MDPDDQRQKA VPLPYCNAYC LDDISQRGRS SFGYPSATFH VTSREDGNAY CLRRFDNVRC VSPKIASTVS DRWTSVAVVQ EHPGIAPFYQ CFMAQRAVFF VHQYIPGARS LKERLGGPLS ECVLWSCISQ LVSALRTIHG RNLAARTLQL HHILSNTDSA ASRLRVRLNC LGIADVLEFE ARKKVADLQR HDVRDLGRLI LSLASGTEIT HSTDMETVGS CEQFLAQNYS PDLHNLAMTL IRSTPQPPSI LDVSRVVAQR AFDEQDAAYQ SFDRMERALS AEYDSSRMLR ILLKLSYVNE RPEFGPNRRW AQSGDCYALT LFRDYVFHQA DGGGYPVMDL GHVISALNKL DGADEEKIVL SSRDGKSLMV ASYAEIARCL ENAFQELCVG AVSHDALHYC
|
| |