Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_36970 |
Symbol | |
ID | 7204450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 777259 |
End bp | 778503 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185950 |
Protein GI | 219121453 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCTC TTATCAGTAC ATACGGTTCG CTCCGTCGCT TTGAAGATGC CCGACGGGTT TTCCTCACGA TTGAAGGGCC CGTGGATGCT GCTTGTCTTC GAGCTATTCT TTTTGCCTGC AGCACGGCGA GTCCACCACA ATGGAACGAG GCCCTCGTGC TACTACACAC GTGTGACGTG ATCGGTGGCA CCCGTTCACC AACGCACATG GATTCAATCG CCGTCAGCAA CGCCATGCTG GCTTGCTCCA AGGCGGATCA GTGGGAAGAG TCGTTGCAGC TTTTGCGCTT GTACGGCGAT AATGCCACGT CGCCGTTGGC ACTGAATTCA CTGATTGCTT CCTGCGGCCG CGGGGGTCGG CCCGATATGG CGATAGAAGT ATTGAACGAA ATGCGTGTAT ATGGCTTGCA GCCCGACACA GTGAGTTACC GGAATGCTGT GATCGCCTGC AATCAAGCCG AACACGAAAT GCATCGTGCG GCACGTAACC ATCGCGAGGT CCGGTTGGCT CAGGACGAAG TAGGTTTTGC GTGGTGGGAA TGCGCCTTAT CTTTACTGCG CCGCATGCAA GAAGAAGGTC AGCAACCCGA TACACAAACC TTTTCATCGG CAATCAGTGC GTGCGAATCG GCTGGTCAAT GGCAACGCGC CTTGCGTGTT CTGCAGTCAA CTCTGGATGA CGGCGACGAT TCCAAGCTGA ATTTGTATTG TTTCAACGCT GCCATTGCAG CTTGTGAAAA GGGAGGGGCC TGGGTGGAGG CCTTGGAAAT ATACGAACGC ATGAAAGCTC ACGGAGGAAG CCTTCGTCCT AACTTTGTGA CGATCGCGAG CTTGGTATTG GCGCTGGATC GTGGCGGTCA AAAAGAGCTG GCTCAAGAAG TCTACCGGGA AGGGGCACAG CAATTGCGAG TGGTGCAGCC CTGGCGCTAC ACTCAGAATG CACAAGAAGA ACGTGTGCGT GCCATGGACC TCCACACATT TTCGGCGGCC ATGTCCAGAG CTGCTCTGCG CAGTTACTTG GAGCGTCTGT TGGCGAAGGA AACCGTGGTG CCGGCGGACG ATTGGATTAT AATTGTTGGT CAGGGTCGGC ACAGTGTCGA GGAACCAGTC CTACTGCCAA CGGTGTGGAG ACTATTGCAA TACGAGTACA AACTTCCCGT CACCACGAAT CCAGTCAATC CGGGCCGCCT TGTGGTGCAA TCCCGCGACT TAAGCGTGTT GGTGAAAACG AAAAGTTGGC GATAA
|
Protein sequence | MASLISTYGS LRRFEDARRV FLTIEGPVDA ACLRAILFAC STASPPQWNE ALVLLHTCDV IGGTRSPTHM DSIAVSNAML ACSKADQWEE SLQLLRLYGD NATSPLALNS LIASCGRGGR PDMAIEVLNE MRVYGLQPDT VSYRNAVIAC NQAEHEMHRA ARNHREVRLA QDEVGFAWWE CALSLLRRMQ EEGQQPDTQT FSSAISACES AGQWQRALRV LQSTLDDGDD SKLNLYCFNA AIAACEKGGA WVEALEIYER MKAHGGSLRP NFVTIASLVL ALDRGGQKEL AQEVYREGAQ QLRVVQPWRY TQNAQEERVR AMDLHTFSAA MSRAALRSYL ERLLAKETVV PADDWIIIVG QGRHSVEEPV LLPTVWRLLQ YEYKLPVTTN PVNPGRLVVQ SRDLSVLVKT KSWR
|
| |