Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48103 |
Symbol | SNAP |
ID | 7203269 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 214519 |
End bp | 215599 |
Gene Length | 1081 bp |
Protein Length | 313 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182638 |
Protein GI | 219124705 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.274876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCA TTGCGAAAGC GCAAAAAGCC AAGGGCCAAG AGTACCAGGC CGAGGCAGGA CGGGCGCTGA CGAAAAAGTC GTGGTTTGCC AGCGCGCGTG ACCGCAACGT CGAGGACGCC GCGGAATTGT ACCTACAAGC TGCCAACGCC TACAAAGTCG GTGGACTCAA TCAGGAAGCT GGTGACGTCT ACAATGTCGC GGGTGAACTT TATCGTGACA AGTTAAAGCA AGCCAATGAA GCGGCCAAGT GCTTTACACA AGCAGGTAGG TTGCGTTGCG TTGGTGGAAC TATTCCTTTT GTAGTGCGTT CAAGAAGAAA GAAGCCTGTT GTGGACCACT ACACTAACTA ATTGGTTTTG TGCGTTCCTT TTAAGGCTCT TGCTATAAAA AGAGCAATCC GGTGGATGCT GTTTCTAGTT ATCAGTCCGC AGTCAGTCTC TTGACGGACG CCGGTCGTTT GACGCAGGCC GCCAAACTGT GCAAGGAATG TGCCGAGCTA TACGAGTCGG AGGAAGTCGC TGAAGCTGGT GGAAAATCCA ACGTGGTTGC CGCCATTGAA GCGTACGAAC AAGCCGGCGA ATTGTTTGGT ATGGAAGACA GTAAATCGCA GGCTTCCCAA TGCCGGGCCA AAGTGGCGGA ATTATGCAGT GCCGCGCTCG ATCCGCCAGA TCTACTGCGG GCGGCAGGAT TGTATGATGA ACTCGGACGA GCATGCTTGG ACTCGAATTT GTTAAAGTAC AACGCAAAAT CGTACTTTTT GCAAGCCATC TTGTGTCATT TGGCCAACGG GGACGCGATT GGGGCGGAAC AAGCCTTGGG CAGATACGAA GGTGTGGATT ATACGTTTGC TGAATCGCGA GAAGGCAAAT TTTGTCGACA ACTAGTGGAA TGCGTGGAAG GTTACGATGC GGAAGCGTTC GCGACGGCGT GCTACGAATT TGATCGCATT TCCAAACTCG ATCCTTGGAA GACTTCCATG TTGGTAAAGG TCAAGCGCAG TATTCAGGAT GACGGTGCCG GCGAAGAAGA AGATGATGTC GATCTTACCT AAAGTATTTA TCTATGTAGT TTTGGTATAC C
|
Protein sequence | MSAIAKAQKA KGQEYQAEAG RALTKKSWFA SARDRNVEDA AELYLQAANA YKVGGLNQEA GDVYNVAGEL YRDKLKQANE AAKCFTQAGS CYKKSNPVDA VSSYQSAVSL LTDAGRLTQA AKLCKECAEL YESEEVAEAG GKSNVVAAIE AYEQAGELFG MEDSKSQASQ CRAKVAELCS AALDPPDLLR AAGLYDELGR ACLDSNLLKY NAKSYFLQAI LCHLANGDAI GAEQALGRYE GVDYTFAESR EGKFCRQLVE CVEGYDAEAF ATACYEFDRI SKLDPWKTSM LVKVKRSIQD DGAGEEEDDV DLT
|
| |