Gene Information |
Locus tag | PHATRDRAFT_39360 |
Symbol | |
ID | 7195068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 282010 |
End bp | 283467 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183458 |
Protein GI | 219126425 |
COG category | |
COG ID | |
TIGRFAM ID | |
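The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS is unspliced (as the record states), and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinate span, gene length and protein length.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends in exactly one stop codon.
    """
    # Inclusive span covered by the gene on the replicon
    span = end_bp - start_bp + 1
    # Number of complete codons, and any leftover bases
    codons, remainder = divmod(span, 3)
    # The final codon is assumed to be a stop codon, so it adds no residue
    expected_protein = codons - 1
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "expected_protein_aa": expected_protein,
        "protein_length_matches": expected_protein == protein_length_aa,
    }


# Values taken from the Gene Information fields for PHATRDRAFT_39360
report = check_cds_lengths(
    start_bp=282010,
    end_bp=283467,
    gene_length_bp=1458,
    protein_length_aa=485,
)
print(report)
```

Under these assumptions the span is 1458 bp, which divides into 486 codons: 485 amino acids plus one stop codon, in agreement with the Gene Length and Protein Length fields.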
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00278594 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCATC CCCGGCTGGA AGAAATTGGA AGTGGAAAGT CTTTGCTTGA TGTAAGCGGT ATTCCAATTT CCACAGCCTG TTCTTCGACG CCGTCCGCTT CGCTTTCGGA TGAAGAAAGG AAAGCGTCAT TCCGCGATCG CTTCGGGCCA GACCCTGAAC CAACAACAGA GAAACGAGCT TGTAGGCCTT TTTCTGTTAT GCGAGGGCGA TCCGGGCGTT TTGAAAAGCA GGCCCGAGTT GCTAGTACTG CTGAAGAGTT GGCTATGTGC CTTCCCGATT CGCCGCCAAC TCTTGGAGGC TCTCCAAAGC TGAAAAAATC CTCAATGTGG AGTACACTCT TTTCTTCTAG TGCCAGACTG GAAGAATCTT CAGCTCATCG CCCGACGTCC TCTGCGTGGA TTACTGCAAT GCTTCCACAT CGTAAGGGTG CTAAGTCTTC TCCGTCTGGT AGTAACATTT TGGCTCGATC TTCTACCGCC TTTTCTAAAA ATGAGGACGA TGTACGTTCC ACTGACCGTG AGTCACTTGT GTTCAAGAAT CAGCCGCTAC ATCGCAAGGT CATGAAACGT GAAACCTACA CAGTGGAAGA AGACACGCAA ACGCGACATC CGAAAGCATC ATTTTCCCGT CAGAATGGTT CCTCGGTCGC AAAATCCCCG CACAGTATTG CCCCGTCGAT CGTAGAACAA GACGCATTCG ATCAATCCAG GTCTTATCTT GGTTCTCCGC TTTGGACCCC TGATTCTCCG AACTCCGAGA AACACGACCG ATGGACACCG CCACACAAGA ATACGGGATT GTTACGGAAA GAATTTGGAA TTGAATCTTC CCAAGAGCTC ACTTGGCTCA AGCACCCTCT TCGAGACTTG CTGGGGAAAT CAAAACAGAG TAACGAGACA GAGCCGAGGC ATGCCATGGG AAAGAAGCCT TCGCATGACA ACTTTCAAGT ACAAGATGGC GATGGAGGCG CGTTCGGCCT CCAAGGATCG TCCGTTCTTC TCGGTAACAG GCTTTCACAG ATGCGTGCAG CGGGCGATCC AATTTCACTG AAAGTTACAT TATCATTGGA TAAGGATCTG AAAGATGAAG ATCTCAACAA TGTAGGCAAA TTGCTCTCAC AATTAGAGCA ATATCTGTCA ATTGCACCAA GACAAGGACA CTTTCTCAAT CAGCACTTTT CTACGAAGAA AATATTCAAT GAAGCCGGAA GTATTGACCG TGAGTCAACG GGAAAACATG ACGCAATGCG CCAGAAATTG TTCGAGCTGG AAGCTGAAAT TGATCGTACC GCGTTGGAAA GCGCAGCCGC AGCTATGTTG CTCGAGGGGT GCCAGGATGA GAAGTTATCG GATCACAACT TGATGGTTGC TTCTTCTCCG GTTGCTGCGT CTGAGAAGGA TTCATCCTCG TTAAGTAGCG ATGCCGATGA ACTTGTCGCC CAGCCCGCAC TCTTTTAA
|
Protein sequence | MNHPRLEEIG SGKSLLDVSG IPISTACSST PSASLSDEER KASFRDRFGP DPEPTTEKRA CRPFSVMRGR SGRFEKQARV ASTAEELAMC LPDSPPTLGG SPKLKKSSMW STLFSSSARL EESSAHRPTS SAWITAMLPH RKGAKSSPSG SNILARSSTA FSKNEDDVRS TDRESLVFKN QPLHRKVMKR ETYTVEEDTQ TRHPKASFSR QNGSSVAKSP HSIAPSIVEQ DAFDQSRSYL GSPLWTPDSP NSEKHDRWTP PHKNTGLLRK EFGIESSQEL TWLKHPLRDL LGKSKQSNET EPRHAMGKKP SHDNFQVQDG DGGAFGLQGS SVLLGNRLSQ MRAAGDPISL KVTLSLDKDL KDEDLNNVGK LLSQLEQYLS IAPRQGHFLN QHFSTKKIFN EAGSIDREST GKHDAMRQKL FELEAEIDRT ALESAAAAML LEGCQDEKLS DHNLMVASSP VAASEKDSSS LSSDADELVA QPALF