Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35733 |
Symbol | |
ID | 7200995 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 628440 |
End bp | 629567 |
Gene Length | 1128 bp |
Protein Length | 346 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180280 |
Protein GI | 219119027 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAGT GCTGCGGCTA TTTCGCCGCG TTCGTCTCGT GTGTCGCCTT TGGAACCTTC GCAGTTCCGA TCAAATGTGC AGCCGTCCGC AAGGTCGATG TGGATCCTCT CGGTACGTGG GCAGTGAAAC TGAAAGTAAA CCAGCAATCA TATCTGCTGA TCTATTGTTT CGCTCACTCG CTCCCTTTTG TTTGCACAGT CTTGCAGACT TACAAGATCG GCATGACGTT GCTTACGAGC TGGTTGGTCT TGCTCTTTGG TGTACCCTTC ACTTTTACTC CTTGGGGTTT TGTTTCCGGC TTGTTTATGG TCCCGGGGGG CACTGCGGGG TACTTTGCCG TCCAGAACGC AGGTATGGCT GTAAGTCAAG GCATATGGTC GAGTCTTAAA GTATTGGTCG CCTTTTGTTG GGGCATTTTG ATTTTTCATG AGCCTGTCCA TTCCAAGCTG GGGACCACCC TAGCGATCGC GCTGCTCATG GTGGGATTGG CCGGCGTGAG CATCTTTGCT GCTCCACGGA CTTCAACGTC GTCACCACAA GAAGAGCCGC TACTCCCGGA TGTGGAAGAA CAAAACCAGC CGGAAATTGT TGACAATAAG GACTATTTGG GCTTTCTGAA ACGGAGACAC GTTGGCTTAC TTGGTGCCGT AATCGATGGG GCTTACGGTG GCAGTGTTCT GGTACCGATG CACTATGCGG GCCCCAAAAC AACGAACGGA CTTTCGTACG TTATGTCCTT TGCCATTGGT TGCTCATCCG TCGTGACCAT GGTTTGGGTT TTGCGTCTCC TTTTCAACAG CGTTCAGGGG CAATCTCTCC GCGTTGGGTA CGATCGCTTG CCGTCGTTGC ACGTCACAAC AATAGGGCCG TATGCAGCCT TGGCGGGGCT AATATGGAGT TTGGGAAACG TGAGCTCAAT CTTGACGGTG GCGTTGCTGG GCGAAGGTGT GGGCTACAGT ATTGTGCAAA GCCAGCTTTT GGTGGCCGGT CTCTGGGGCG TGTTTTGGTA CAAGGAGATT CGTGGCATGC GAGCCATTGC GAGTTGGTTC ACCTTTGCGG TGATCACGGT TGCGGGTATT GTGATGTTGT CTCGGGAGCA TGTACCCGTA CCAGCGGAAG CCCCGTGA
|
Protein sequence | MEECCGYFAA FVSCVAFGTF AVPIKCAAVR KVDVDPLVLQ TYKIGMTLLT SWLVLLFGVP FTFTPWGFVS GLFMVPGGTA GYFAVQNAGM AVSQGIWSSL KVLVAFCWGI LIFHEPVHSK LGTTLAIALL MVGLAGVSIF AAPRTSTSSP QEEPLLPDVE EQNQPEIVDN KDYLGFLKRR HVGLLGAVID GAYGGSVLVP MHYAGPKTTN GLSYVMSFAI GCSSVVTMVW VLRLLFNSVQ GQSLRVGYDR LPSLHVTTIG PYAALAGLIW SLGNVSSILT VALLGEGVGY SIVQSQLLVA GLWGVFWYKE IRGMRAIASW FTFAVITVAG IVMLSREHVP VPAEAP
|
| |