Gene Information |
Locus tag | PHATRDRAFT_23306 |
Symbol | |
ID | 7195748 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 547167 |
End bp | 550091 |
Gene Length | 2925 bp |
Protein Length | 908 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184157 |
Protein GI | 219127886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
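The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon (an assumption, though it is consistent with the reported Gene Length). The function names `gene_length` and `coding_bp` are hypothetical helpers introduced here for illustration only. Whether the annotated span includes the stop codon is not stated in the record, so both cases are computed.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bp(protein_length_aa: int, include_stop: bool = False) -> int:
    """Bases needed to encode a protein, optionally counting a stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 547167, 550091
protein_length_aa = 908

length = gene_length(start_bp, end_bp)

# Bases in the span that are not accounted for by the coding sequence.
# For a spliced gene these would be intron bases (assumption: the span
# contains only the CDS exons and the introns between them).
extra_without_stop = length - coding_bp(protein_length_aa)
extra_with_stop = length - coding_bp(protein_length_aa, include_stop=True)

print(length)              # span length in bp
print(extra_without_stop)  # non-coding bases if the stop codon is excluded
print(extra_with_stop)     # non-coding bases if the stop codon is included
```

Under these assumptions the span length agrees with the reported Gene Length of 2925 bp, and the span is longer than three times the Protein Length, which is what one would expect for a gene marked as spliced.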
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.103886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGATA TTGCCAGCGC TCCTTCCCTA TACAGTCGCC CCGCATTTTT CAGACAAACC CGGACCGGAA AGATTCTCAA AACCGTCAAC GAGCGATATT ATCGAGACGA TTTGGGCTTT GGCTGTCACT TTGTCGATGA TGCAAGTGTC AAGCGACATA AACAGGTTGA CGAGGTTGTA GGAAAACCCG AAACGATAGA ATCAGCATCA GAATTGCTCG CATTGCTTCA GCCCTGTCAA CCCCATGCGT TAATAGTCTG TGATGCCAAT GTTTTGTTGC ACAATATAGA TGTTTTGGAA CAAGCGGACT CTCTCATGCC GAATATTGTC ATTCCTCAGA CGGCTTTAAT GGAATGTCGG GCCAATCGAA TGGTAGCTTA TGACCGCACG GTCGAGCTGT TGCGAGCAGT AGGGGGCGGG AAGACCAAAA CTACCAAACG TTGCGGCATC TTCTTTGCTG ATCCGCATCA TGCTGCAACT CAGCTCGAAC ATGATGAGAC CAAAATTGAG CGCAAGGGAA ATTCCATCAA CGATGAAAAC GATGCCCGCA TTCGAAAAGT AGCCGACTAT TTTGGCACAG CCTTAAAGAA CACGGGTGTG CGAGTGATCT TGCTGACCGA CGACGCCGGC TCTCGCACAC TCGCGGCAGA AGAATCCTCA ACGTACCAAG CCAAGTCGGT GCGAGAATGG GTAAAAGAAT TAGAAAGGTT CAATCCAGGT CTATCTCTAC TCGATCAAGT CGCACAATTT AATAATACTA GCCCCACGGG TGGTATCAAC GAAAAGGATT ACTTTGAAGC TCATCTAGAA GCCAAGCTTT TGTTACGAGG AGTACAAGCT GGAATGTACC ACCGGGGAGT GCTACGATCC GCAGGAAGCC ATTCCGCTAT GATTACGATT AGACAAGGCG ATGAACGAGT AGCTGTGACG ATACCAAGCT TTACGGATCG GAATCGTGCC GTCGACGGCG ATGTCGTTGC TGTGGCTTTG CATCCTTTGG ACAAGTGGAT TACTGCGAGC GTCGATCTCA AAGCCAGTAA GGCTGAGGCA AACAGAGCTA TTGCGCCAGG TATCGCTAAT GAAACAGCTG AACCAACTAT AAGTGAAATG AACAATGTCG CCGACACCTT TGCTTTGGAG GATGACGCTG AATCACTGCG TCCCACTGGT AAGGTTGTTG GAATCATTCG GCGCAACTTT TCCACTTACA GCGGCTCCAT TTACGCCATC AAAAGTGACT CTACAGAGCT GACGGATCGA GAGCGGACTG CATCAGATTA TGAACGCGAG CATCCGGATG GGTCAATCAC CTGCGTATTC TTTCCCGTCG ACAAGAAGAT TCCTCCCATT TTAATTCGGA CAACGCAACG GGATCGCCTT TTCGGTCAAC GCATAGTTGT GGCTATGGAT TCCTGGCCCT CCACATCTAT TTACCCCCTA GGACATTACG TACGGGTGAT TGGGCCAGCC GGATCTAAAG ACGTTGAAAC CGAAGTGCTG CTTCAAGAGC ATGACATACC TCACGAACCT TTCCCTGCTG CTGTTCTTGC TTGCTTGCCA CCTGAAGATT ACCGCATCGA TGTAGACAAT AGCCCCGGAC GCCAGGATAT CCGGCACATT CCTGTTTTGT CAATCGATCC GCCCAATTGC AAAGATATCG ACGACGCACT ACACTGCACT GTGCTACCCA ACGGAAACTT TCAGGTTGGT GTGCACATTG CGGACGGTAC GTTAGGGAGA AGGATTGTCT ACAGTTAGCC CCTATGTTCG ATCTGACAAC ACACTATGGA TTCTTTTGCA 
GTGACACACT ACGTCCAGGC AGGTACCGCG ATTGATCTAG AAGCAGCGAA TCGTTCGACG TCGACGTATT TAGTAAATAA GCGACTCGAC ATGCTTCCCA GCCTCTTAAC AACAGACCTT TGCAGTCTGA AAGGAAATGT AGATCGGTAC GCCTTTTCGG TACTGTGGGA GGTCACACCA GAGGCCGAGA TTCTCAACGT TGAATTTCAA AAGACCATCA TCCACTCGAT TGCCGCCCTT ACTTATCAGC AAGCGCAGAC AATGATTGAC CAACCCGACG ACCCGAACGA TATTCAGTCG AATGCCGTGA AGCGCCTCGC ATCTCTTGCA CGTAAGTTTC GAAAACGTCG AATTGATGCT GGGGCGTTGA CTTTGGCATC ACCAGAAGTT AAGTTCGTGT TGGACAGCGA GTCCTTGAAT CCAACAGACG TCCAAGCGTA CGCACTGTTG GAAGCAAATG CTGTCGTAGA GGAATTCATG CTACTGGCCA ACGTTACCGT ATCGAAAAAA ATTCTTCGGC ACTTTCCGAC TTTGTCAGTA CTTCGACGGC ATCCTGCTCC TAACCGCGCT ATGTTTGATA GCCTTATCAG TAAAGCAAAG AGCAAGGATT TGGATATCAA TATCGACGAC TCGAAGCGTC TAGCGGATTC GCTGGATGCC GCTGTTGTAG AGTCTGACCC TTACGTGAAC AAACTTCTTC GTATTTTGTC GACCCGATGC ATGAGCCCCG CGCAGTACTT TTGCTCGGGA GAGTTTCGCC CAATGGAGTG GCATCACTAC GGTTTGGCGG CGCCTGTCTA TACACACTTT ACGTCCCCAA TTCGACGTTA CGCGGATGTT TGCGTCCATC GATTGTTAGC TGCTGCTGTA GGGGTGGCCC CTTTACCACC TCACCTCTCA TCGAAATCTT ACCTGCATGA TCTATGTGCC AACATGAATA GACGCCATCG TGCGGCGCAG CTTGCAGGTC GAGCCAGTGT GCAGCTTCAT ACACTCATTT TCTTTGCCGG TGATGGGGCC AAAGAAGAAC AAGCTTACAT ATTGGACGTA GAAACTGCAG AAGGAGTCGA GCCTTCCTTT ACTGTGATTG TTCCTAGATA CGGAATCGAA GGGAGAGTGA AGCTA
|
Protein sequence | MGDIASAPSL YSRPAFFRQT RTGKILKTVN ERYYRDDLGF GCHFVDDASP CQPHALIVCD ANVLLHNIDV LEQADSLMPN IVIPQTALME CRANRMVAYD RTVELLRAVG GGKTKTTKRC GIFFADPHHA ATQLEHDETK IERKGNSIND ENDARIRKVA DYFGTALKNT GVRVILLTDD AGSRTLAAEE SSTYQAKSVR EWVKELERFN PGLSLLDQVA QFNNTSPTGG INEKDYFEAH LEAKLLLRGV QAGMYHRGVL RSAGSHSAMI TIRQGDERVA VTIPSFTDRN RAVDGDVVAV ALHPLDKWIT ASVDLKASKA EANRAIAPGI ANETAEPTIS EMNNVADTFA LEDDAESLRP TGKVVGIIRR NFSTYSGSIY AIKNYEREHP DGSITCVFFP VDKKIPPILI RTTQRDRLFG QRIVVAMDSW PSTSIYPLGH YVRVIGPAGS KDVETEVLLQ EHDIPHEPFP AAVLACLPPE DYRIDVDNSP GRQDIRHIPV LSIDPPNCKD IDDALHCTVL PNGNFQVGVH IADVTHYVQA GTAIDLEAAN RSTSTYLVNK RLDMLPSLLT TDLCSLKGNV DRYAFSVLWE VTPEAEILNV EFQKTIIHSI AALTYQQAQT MIDQPDDPND IQSNAVKRLA SLARKFRKRR IDAGALTLAS PEVKFVLDSE SLNPTDVQAY ALLEANAVVE EFMLLANVTV SKKILRHFPT LSVLRRHPAP NRAMFDSLIS KAKSKDLDIN IDDSKRLADS LDAAVVESDP YVNKLLRILS TRCMSPAQYF CSGEFRPMEW HHYGLAAPVY THFTSPIRRY ADVCVHRLLA AAVGVAPLPP HLSSKSYLHD LCANMNRRHR AAQLAGRASV QLHTLIFFAG DGAKEEQAYI LDVETAEGVE PSFTVIVPRY GIEGRVKL
|
| |