Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37547 |
Symbol | |
ID | 7202412 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 384796 |
End bp | 386035 |
Gene Length | 1240 bp |
Protein Length | 405 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181715 |
Protein GI | 219122776 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGAG ACGAAAAGGC GTTAGACCGA GTAAAGTTTC TTACCTACAA TGCTCTACTT CTCTTCTGCC ATAAGCCAAA CGGCAATCTG GCCTCATCTC GTCACCTTCT GGTGGAGTGG CACCAGCAGC TGTATCAGGT CGCACAGAAA GCCGTCTCAT GGACGACAAG GCAGTCAGAT GCTAATTTCA TGAATGGAAA GCCTTGGAAT ATTGAGGTGC GCGTTGCTCC GGCTCTGTAT CATCGGTACA AAGACAGGTG CGGCGAATGG AAAATAACTT CGAGTTTTGG CTTGTCACGA TCGATTCGTT CGCCTAAACA ATTTGCACAA TTAATTTGCT CCGAAATTGA GGACGGAATG GATTGCCGGG TTTCAGAATC GGGTTTTCTC TGTATCACAA CTCGAGATCG AATCGAGCAG CTGCGTGCGT CAGGAAAGCT ACCATGTCCA AAGTGTATAC AGTGGTGCAA GGGCGAAAAG GGTCTGTGGT GGCATAGTCA GCAGCAGCAC GGCGTAGGTC ATCATGATGC CATGGATGCT GCTGTTATGG TTCAAAACGT CAACGCAATA GTCGTCTATG GGGACAGTAC GAGTGACCTA GATTTGTTGA TTCGGCCGAA GAAAATCGGA ATACAGACTC CGAATACAGC AAGTTTGGAT GAGGATCCTT TTCGACTGGC TCAACATGGA GATCTTGAGT CCCTTCAAAA ATTAGTAGAG CATGGCTTCG ACCCGATTCA CACGGCAGAT TCTCGCGGTG CCAATGTCTT GCTTTGGGCA GCCGGTGGTG GTCACTTGGA CATGCTGCGC TATCTGATTG AAACCTGTGG TTGTGATCCT GCATGGAAAC AGAAGGCAAA GCGCTCCTTT GGTGGACGTA CGGCGCTTCA TTGGGCTGCT CGGAAAGGGC ATGTCGAAGT TTGTCGCTAT CTCGTGAACT CCTGTAACGT AGACTTGGAA GCTATCACCA CAGACGGAAC AACGGCTTTC AGTTGGGCAT CGTGGCAAGC TCACCGATGC GTGATGCAAT TTCTGCATTC ATCTGGCTGT AACGTGACGA GCTCGAATAC ATTTGGCTGT AACGCTGCGC TCTGGGCGGC TCAAGGTGCG GCTGACTCCG CTGTAATGGA GTGGCTGGAT CGAATTGGTT GCTCAGTGTT TCAAGTAAAC TCGAATGGAC ACGGAGTTTT GCACAAAGCA GCGCAGCGTG GTAGAGAAGA TGTGTCAAAA TGGTTTGTAG
|
Protein sequence | MTRDEKALDR VKFLTYNALL LFCHKPNGNL ASSRHLLVEW HQQLYQVAQK AVSWTTRQSD ANFMNGKPWN IEVRVAPALY HRYKDRCGEW KITSSFGLSR SIRSPKQFAQ LICSEIEDGM DCRVSESGFL CITTRDRIEQ LRASGKLPCP KCIQWCKGEK GLWWHSQQQH GVGHHDAMDA AVMVQNVNAI VVYGDSTSDL DLLIRPKKIG IQTPNTASLD EDPFRLAQHG DLESLQKLVE HGFDPIHTAD SRGANVLLWA AGGGHLDMLR YLIETCGCDP AWKQKAKRSF GGRTALHWAA RKGHVEVCRY LVNSCNVDLE AITTDGTTAF SWASWQAHRC VMQFLHSSGC NVTSSNTFGC NAALWAAQGA ADSAVMEWLD RIGCSVFQFC TKQRSVVEKM CQNGL
|
| |