Gene Information |
Locus tag | PHATRDRAFT_30365 |
Symbol | |
ID | 7195815 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 304372 |
End bp | 306859 |
Gene Length | 2488 bp |
Protein Length | 677 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184225 |
Protein GI | 219128028 |
COG category | |
COG ID | |
TIGRFAM ID | |
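
The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 2488 bp given above; the function name `gene_length` is illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(304372, 306859))  # 2488
```

The strand (here `-`) does not affect the length, only which strand of the replicon the sequence is read from.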
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.660335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCCTGGC TTTTTTGCGA AAGAACAACC CGCGTACCGA CGAGTATCCA TCCGACCCTC CTCACTGTCA ATACTCGAGT ATTGGTGGGA CAGTCAATAG GTCAGATCCT TCTTCTCCTG TTTGACCAAG TTGCGAGATA TTCAAACATG CCGGTCTACA ACTTCAAGGG CATGAAGCCC GTGCCGACGG CTCCGGAACT CGTCGACATT GTCTTGATGC GGACCCAGCG ACGGACACCG ACCGTCGTCC ACCCAGGATA CAAGATCACG CGGATTCGGT CGTTTTAGTA TGTGCGCTGC GGAAACGCGA GAGCCATGTC TAGATTGCAA TCACTAGTCA TTCTAACGAA TCTTTGCAAC CTTTCCACCT TTCCAGTATG CGAAAGATTA AGTTTACCCA ACAAACCATT TCAGAGCGAT TGAGTGGGAT GCTCACCGAC TTTCCCCGCT TAAACGACAT TCATCCGTTT TATGCAGATC TTTGCAACAC TCTCTACGAT CGCGATCACT ACAAACTGGC CTTGGGACAG ATCAACACGG CTCGTTCGTT GGTGGATTCC ATCGCCCGCG ATATGATTCG CATGGTCAAG TACGGGGACT CGCTTTACCG GTGCAAATGT CTGAAACGTG CCGCACTCGG TCGCATGTGT ACGGTCCTCA AACGCCAAAA AGCCTCCTTG GCGTACTTGG AAGAAGTTCG TAAGCATCTC TCCCGTTTGC CCGCTCTCGA TCCCAACACG CGTACCCTGC TCATGTGTGG ACTTCCCAAC GTGGGCAAGT CATCCTTTAT GAACAAGATC ACCCGCGGCA ACGTCGATGT ACAGCCCTAC GCATTTACGA CCAAGAGCTT GTTCGTTGGA CATTGCGACT ACAAGTATCT GCGTTGGCAG GTGATTGATA CTCCCGGCAT TCTGGATCAT CCGTTGGAGG AACGGAATAC CATTGAAATG CAGGCCATCA TTGCCCTCGC GCATTTGACC TGCAGTGTGC TCTACTTTGT GGATATCTCG GAGCAGTGTG GGTACACGAT TGATCAGCAG TGCTCGCTCT TCCGTTCGAT CAAACCCTTG TTTGCGAACA AGCAGCTCAT TGTGGTCGTC AACAAGGTCG ATCAGCAACC TTGGGAATCG TTGGAAGAGG CCAAGCGTGC CATGGTACAG GGTTTAGCGG ACGACGCCAA CTGCTCGCTC ATGACCATGT CCAACTTATC GGAACACGGT GTTTCCGATG TCAAAGCCGC GGCTTGTGAC AAATTACTGG CGACTCGCGT GGACGCTCGC GTCTCGGGCA AGAAGATTGA AGGCGTCATG AATCGTTTAC AAGTCTTTAA CCCGGCACCC CGTGACGGCG TAACACGCGG AGCGTTTATC CCCGACTCTG TCCGCATTGC CCAGGAAACA GGACAGGACA AACCAGGGCG TTCGCGTACG GGTTACGCGC CGTCGGTCAA GGATGGTGTC ACCGACGGCG ATGTCGACAT GGACGGTGGC AACATCCGCA AGACCGCACG AGAACTCATG TGGGAAGGTG GCGGACCCGG AGTATGGGCT CCTGATTATC GCGACCAGTA CGACTTGGCG GACGATTCGT GGAAGTTTGA CAAGATCCCG GAGATCATTG ACGGCAAGAA CATTGCCGAT TTTGTGGATG CGGATATTTT GGACCGCCTT GAAGCTTTGG AACGGGAAGA AGAGCAGCTG GTGGCTGAAT CCGACGCCGC TCGCATGGGT GAAGAGCCGG AATCAGATCT CGAAAGCGAG GAAGAAGCTG CGGTTGAAGC GATTCGGGCC CGTAAGAAGA 
TTATCAAAGA AAAGAAGCTG GCCACGAATA CCCAAAACAA ACCGATGCTT CCCCTATCGA TCCGCGGAAA GAGCAAAGAC AAGCACGATG CCGGTACGCT ACTAGCAACC GAGATCAGAA AGACTATGGA CAGTATCGGT GTGGATTCGA GCAAGATGCT AGAGCGTGGA CGTTTGCTGG AACGTGGCAG AAAACGCGAA CGCTCACTGA GTCGCAGTCG TCGTGGTGCA GATGACGAAG ATGCGGCCAT GGATGTGGAT GTGTCCGGCT TGAGCAAGGC CGCATTGAAA AAGATTAAAA AGGAAAAGGG AGAAAAGGCT CGTCGGGATC TCAGCCAAGC TCGCAGTCAT TCTCGCCCTC GCGAACCTTC ACAGATGGGT TTGAAAGATT CGGAGTCCGT CAAGTCGGCC CAAAAGCTGG ACCGACTTGG ACGCAAGGGA TGGATGGGCG CTTCTGGAGA AGGCGATACA CGAAAGAGCG TACATTTGGT GAAATGGATG AACACTGGCA AGAAACGCAA CGGCACGCAC TATCAGCGAT AGACATACGT GCCCATGTTA TGTAAGGATT TGTTGTTCCT AGTGCTGTAT TGGGTACAGT ATTTCGGTCG TCTAGGATTG GAGCAATCGC TGCTGCACGG TATCTCTAGA TGTAACACAT TACACCCCTA GTGCAACTAT CTATCGGA
|
Protein sequence | MPVYNFKGMK PVPTAPELVD IVLMRTQRRT PTVVHPGYKI TRIRSFYMRK IKFTQQTISE RLSGMLTDFP RLNDIHPFYA DLCNTLYDRD HYKLALGQIN TARSLVDSIA RDMIRMVKYG DSLYRCKCLK RAALGRMCTV LKRQKASLAY LEEVRKHLSR LPALDPNTRT LLMCGLPNVG KSSFMNKITR GNVDVQPYAF TTKSLFVGHC DYKYLRWQVI DTPGILDHPL EERNTIEMQA IIALAHLTCS VLYFVDISEQ CGYTIDQQCS LFRSIKPLFA NKQLIVVVNK VDQQPWESLE EAKRAMVQGL ADDANCSLMT MSNLSEHGVS DVKAAACDKL LATRVDARVS GKKIEGVMNR LQVFNPAPRD GVTRGAFIPD SDGVTDGDVD MDGGNIRKTA RELMWEGGGP GVWAPDYRDQ YDLADDSWKF DKIPEIIDGK NIADFVDADI LDRLEALERE EEQLVAESDA ARMGEEPESD LESEEEAAVE AIRARKKIIK EKKLATNTQN KPMLPLSIRG KSKDKHDAGT LLATEIRKTM DSIGVDSSKM LERGRLLERG RKRERSLSRS RRGADDEDAA MDVDVSGLSK AALKKIKKEK GEKARRDLSQ ARSHSRPREP SQMGLKDSES VKSAQKLDRL GRKGWMGASG EGDTRKSVHL VKWMNTGKKR NGTHYQR
|
| |
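
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown above (blocks of 10 bases separated by spaces) and that GC content is defined as the fraction of G and C among all bases; the function name `gc_content` is illustrative, and the database may use a different convention (for example, different rounding or handling of ambiguous bases).

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is ignored, so the blocked
    layout used in the record can be pasted in directly.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    # Denominator is every remaining character, including any
    # ambiguous bases such as N (an assumption of this sketch).
    return gc / len(cleaned)


# First two blocks of the gene sequence above, as a small example.
example = "TTGGCCTGGC TTTTTTGCGA"
print(round(gc_content(example) * 100))  # 50
```

Passing the full gene sequence instead of `example` gives a value that can be compared with the GC content listed in the Gene Information table.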