Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_33120 |
Symbol | |
ID | 7204253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 98419 |
End bp | 99600 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185997 |
Protein GI | 219112827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.765371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTGGT CGACGCTTCT CTTTGCGTTG GTCATGGCGA CTTTTCCATG GGACACGCGC CTCGCCTACG CCGCTCCCGG CTTGTTGATC GCCAACACGC GGGGGGACAA CGCCATCGTG CAGTACACCC TGCCCTTTGC CGACGGGGAC GTGCCGACGA CACTGTTGGG GAGCGATGCC GTCGAAGGAC CGGATCACCT TTTGGTCGAG GGCGATTATT TGTACATTTC ACACGGCGAC TCGCCCGAGA CTTCCGCTAT TGCCCGGATG AACGTGGCGG ACGGGACGCT CGAGGCTGAC TTTGCCACCG GGGGTGCGCT CCACCGACCC TACGGATTCG ATTTCTACAA CGGCACACTC TACGTGGCCA GTTTCAAGTC CGATCAAATT CTCATGTACG CGCAAGAGAC CGGGGACTAC GTGGGCGTGT TTGCCGAGAC GAACGGTACC GAAGAAGGAC TCTGCAACGG ACCGAACCAA ATCTACATCA GTGACGACGA TGGTCTTTTG TACATGACCA CGCAGGGAAG TAACTTCAAC GGCACGGAGT TAAATTTTCT CTTCAATAGT CAAATTGTCG TCTACGACTT GGAGACGGGC GATGGCTCTG TCTTTTTGCC ACCTCCGATG CCTTTACCGG GTGGTGCAGG ATTTGTCAGT TTGCTGGGCG TCACCGTTGG GTGTGGCACC GCGAGCGCAC CCGCTGCGGG GGAAGCCTGT ACCATGTATA CAACCGATTT TGCCGGCGGT TTGCGAGTCT ACGCCATGGA TCCTGATGGA CCCACGCTCC TTTACGCCGT GGAAACTTCT TACCAAGCCG GCGTTTCCAC CGGATCTCTT TCCTTCGATA CAGCGGAGAG TCAGTTGTAC GTGCCCGTTT GGCTCAACGA AACATCGGGA GCGGTGGTAT TACGCTTCGC GGCGGACGAC GGAGCGAGTG CCGGGCTACT GGAGACCGAC GATTCGGCCA TTTACATTGA AACTGACGAT GGCTCCACGG ATCTGAGCCG TCCAATTGGT GTCTTATTTC TGCCTGAAGC TGTTGATTTT TCCACAGAGG TACCGGCCGC ATCCCCGGGA GGGTCCTCGG CCGCAACGGC GTGCGGCAGC GTGGCACTCG GGTGCACAGC CATGATGACA TGGTTCGCCT TCGTTACGGT TGCCGTATCG TCGGGTTTCT GA
|
Protein sequence | MKWSTLLFAL VMATFPWDTR LAYAAPGLLI ANTRGDNAIV QYTLPFADGD VPTTLLGSDA VEGPDHLLVE GDYLYISHGD SPETSAIARM NVADGTLEAD FATGGALHRP YGFDFYNGTL YVASFKSDQI LMYAQETGDY VGVFAETNGT EEGLCNGPNQ IYISDDDGLL YMTTQGSNFN GTELNFLFNS QIVVYDLETG DGSVFLPPPM PLPGGAGFVS LLGVTVGCGT ASAPAAGEAC TMYTTDFAGG LRVYAMDPDG PTLLYAVETS YQAGVSTGSL SFDTAESQLY VPVWLNETSG AVVLRFAADD GASAGLLETD DSAIYIETDD GSTDLSRPIG VLFLPEAVDF STEVPAASPG GSSAATACGS VALGCTAMMT WFAFVTVAVS SGF
|
| |