Gene Information |
Locus tag | PHATRDRAFT_35832 |
Symbol | |
ID | 7201045 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 840514 |
End bp | 842090 |
Gene Length | 1577 bp |
Protein Length | 473 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180330 |
Protein GI | 219119127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
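The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the coding sequence ends with one stop codon; the variable names are illustrative and are not fields of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 840514
end_bp = 842090
protein_length_aa = 473

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus a single stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Part of the gene span not explained by coding sequence. For a spliced
# gene this would correspond to intronic sequence, assuming the gene span
# contains no untranslated regions.
noncoding_length_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)       # 1577, matching the Gene Length field
print(coding_length_bp)     # 1422
print(noncoding_length_bp)  # 155
```

Under these assumptions the 155 bp difference is consistent with the record marking the gene as spliced.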
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.495105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTGG CCGCTCCCAA AATTGCAATC CTTGGGGGAG GAGCAGCAGG GCTAGCGGCG GCACGTGTCG TACACCGCCA AAGTGGCGGA AAGGTTCGTC CCGTTGTATT GGAACAAGGA GATGCAATGG GTGGCGTATG GAATTACCGA GACGAGACGG CAGCTACCAA ACCCATGTAT CGTCACTTAC GGACCAACTT GCCCAAAGAA ATCATGGCGT TTCGGGAACT ACCCTGGCCT AATTTGGAGC CCAATGCAAG TTTTGTAACA CACCGTCAAG TACTGGACTA CCTACATCAT TACCGCGATC GTTTCAAGTT GGAACCATTC ATTCGCTACC GCACACGTGT CACGCATCTA CGGATCCTCT CCGATCGGGT TGGGGATACC GTGGAATATA TTTCGCAAAG CCGGCTTTCG ACTCGGGAAG AACCGTTGCC ACGTGTCGAA CTCACTACAG ATACCGACGG AAAAGAGTGT AGCGAAGCGT TTGATGGAGT CTTTGTATGC AACGGACACT ACGGAGTACC GGCGATTCCT GCTTTGGATG GATTGGAACA ATACTTTCGG GGACAAACTC TACACGCCAT GGCCTACGAT AACCCCGACG CCTTTCGGGG ACAAACGGTG CTTTGCGTTG GTGGACGAGC GAGTGGATCG GATATTGCTA GGGAACTGTC CGGAGTTTGT CGCCACGTTT TCTTGTCCGA TTCAACGGCG CCCGACGACG CTCCCATCAC CGAGTTTAAC GTTACCTGGG TGCCGCCAAC GGTTAGGGTA CGGGAAGACG GTGCCGTTAC GTTCGCTCGC ACTGATTTCG TTGCGAAAAA GGTCGATACC ATTATCTTTT GTACGGGGTA CGACTACAAT TTTCCTTTTA TTAGCGAGTC CACGTCCAAT CTGGATTTCG ACGCTACAAT TGGAACACGG CGAGTCAAAC CCTTGTTTGA GCAACTGTGG CATGCCACGT ACCCAAACCT GTGCTTCGTA GGATTGCCCC ATTCGGTCAT CCCCTTTCCG CTTTTTGAAT TGCAAGCGGA AGCAGTCTGG TCCTCTTGGA CCAATTCACC CAGCGTTTTA CCGGATCAGA GTGCACGTCA ACAACATGCG GAAGAAGCAG CCGTGTCTGG CGGAGAGGGT AAAGTTGACG ATGGTCGGGT ACCGCAGGAT TCGCACTATT TAGGATCCGC ACAATGGGAT TACTGCCGAC GTTTGGCAAG TTACGCCGAC ATATACGATA ATCGTATGGA AGACTTCCTT GTCACCAACA AGGTACGTTG TCAACACGTG TGAAATGCAG ATTAATCGGC TGTGAAACGG TGTCGGTCTT TCTTGTTGTG GTTTGTTGAC GGACGTCGTA TTTTTGGATC CACCCACGTC TGACCATCAC TTTGGTACAC TTTTGCTACT GTTACGACTA TGTGCAGACA ATTTACGACC ACACTTGGGT GCAGCGTAAG AATGTCTTTC CCGCCGGCCC CGATACATAC CGGGATTATT GCTATCAGCG TCTCGAAACG CAGCGATCGT TTCGACAGTA CAGGAAATGT GAGGCACGTG AATTGATATC ACACTAA
|
Protein sequence | MSLAAPKIAI LGGGAAGLAA ARVVHRQSGG KVRPVVLEQG DAMGGVWNYR DETAATKPMY RHLRTNLPKE IMAFRELPWP NLEPNASFVT HRQVLDYLHH YRDRFKLEPF IRYRTRVTHL RILSDRVGDT VEYISQSRLS TREEPLPRVE LTTDTDGKEC SEAFDGVFVC NGHYGVPAIP ALDGLEQYFR GQTLHAMAYD NPDAFRGQTV LCVGGRASGS DIARELSGVC RHVFLSDSTA PDDAPITEFN VTWVPPTVRV REDGAVTFAR TDFVAKKVDT IIFCTGYDYN FPFISESTSN LDFDATIGTR RVKPLFEQLW HATYPNLCFV GLPHSVIPFP LFELQAEAVW SSWTNSPSVL PDQSARQQHA EEAAVSGGEG KVDDGRVPQD SHYLGSAQWD YCRRLASYAD IYDNRMEDFL VTNKTIYDHT WVQRKNVFPA GPDTYRDYCY QRLETQRSFR QYRKCEAREL ISH
|
| |
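The sequences above are displayed in space-separated groups of 10 characters. A minimal sketch of how they could be cleaned and summarized follows; `clean_sequence` and `gc_percent` are hypothetical helper names, and only short excerpts of the record's sequences are used here. Applied to the full Gene sequence and Protein sequence fields, the same helpers could be compared against the Gene Length, Protein Length, and GC content fields.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-character display grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Excerpts copied from the record: the first three groups of the gene
# sequence and the last two groups of the protein sequence.
gene_prefix = clean_sequence("ATGTCGCTGG CCGCTCCCAA AATTGCAATC")
protein_suffix = clean_sequence("QYRKCEAREL ISH")

print(len(gene_prefix), round(gc_percent(gene_prefix), 1))
print(len(protein_suffix))
```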