Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31322 |
Symbol | |
ID | 7199351 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 106216 |
End bp | 108388 |
Gene Length | 2173 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185522 |
Protein GI | 219130753 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCCGGATCC CAATCCAACC CATCTTGCAC ACCGGGACCC TCGTTCCAAG GGCCAAGCTT GATTCACTAC TTCCACCACC AACACAACCA CCACCGTATC ATCGTCCATC CATTCTTTGA TGCGTAACAA TACCCAAGAT CTAGATCACA GAAACTAACG AACGAACGAA CGAACTAACG AATTCCTAGC GAGCGATAAA CGGATCGATA CCAGTGGTGA CATGATTGGA CCCGCCATTG TAAGTACAAC CCGTCCTGTC GCACCTTGCT GTACGAACGA ACGAAGGTTT ACTGTACTTG GATTCTCACC CATTGCCTCA ATTGTGTGTC ATCGATCGGT TCCGCACTTT GTTAGGATCC AGCGGATGTC GGCTTGACGG GCCTCTTCTG GCTTTTTTTG AGCTACGGCT ACGTCTTGTA CTCGAGCAGT AATCTCATTT CGGAAGGATC CGAGCTGCTC CTTCTCATCC CCAGCATGGC GGGGTTGGTC GGCGGAGTGG TCCTCCCCTT GTTAGGAGCC GTCCCCGACG GCGCCATCAT TCTCTTTAGC GGACTCGGAA GTCTTGAAGA CGCCCAAGAA ACATTGTCTG TCGGAGTCGG GGCCCTAGCC GGCTCCACCA TCATGCTACT GACCGTTCCC TTTGCTCTTT CCGTCTACGG AGGTCGCGTA GATCTCGACG CCAACGGCGT ACCCGACTAC CTTGTCAAAC CCAAACTTTC CACCAAAACG TCCTGGAAGG CCGAATTCAC AAAGACGGGC GTTACCTTGT CCGATGCCGT GCATCACGGT GGTGTCTTGA TGGCCCTCAC TACCGTTCCC TACTTCCTCA TACAGGTGCC CGCATCGATC TACGCCACGC CCGAGAATTC GGAAGACGTC GTTGCGGCAC AGGAGCACTG GTGGGCGGCG GCTGGATTCA TTCTCTGCTT GCTGGGCCTG ACGGTTTACA TGAGACTGCA GTTGCATATT TCCCAACAGG GACAGGACAA GGGCAAGCGC ATGGCCGTCA TGAAGAAACT GCTCAAACAG GGACAGGTTT CACTCAGTGG AGCCATTGCC GCACAAGTCA ACGCCAAAGA GTCCGCGCTG CAGGCGCAGG CTGCGTCCGA ATACCAGTCC ATCCACGATG TCAAAGATGG TTACCCCAGT CCGGCCATTG CGGCCTTTCT GAAAGAAATT CTGGCCGACG CCTTTTATTC CTACGATTCC GACACCAATG GACAACTCGA CAAAACCGAA GTCTTTGTCT TTTTCCGAGA CTTTCACGAA AGCATATCCG AAGAAGAAAT GGATAAGCTC TTTGCCAAGT TCGATACGGA CGGCTCCGGT ACCATTTCTT TGGACGAATT TATCGGCCTC GCCTACACGC TCATCAAGGC GCAGGACCAG CAAACGGCGC CGCGTCACCT GGACGCGTCC AGTCGCGGTA CCCGTGCCGC CCTGGTACAG GCTGCCTTTG GCGAAGACGA AGACGAGGAG GAAGAAACCG TACCGGAAGA ATTCACCTCG CTCACGCCCG ATCAACAACA ACGCGCCATC AAATGGAAGG CCTTTCGGAT GTTGGCGTTA GGAACCGGCC TGGTCGTGCT CTTTTCAGAT CCCATGGTGG ATGTCATGCA AGAGATTGCG GTGCGGTCGG GCATATCGCC CTTTTACGTT TCCTTCGTGC TGGCACCATT GGCGTCCAAC GCCAGCGAAG TGATCGCCTC GCAATACTAC GCCAGCAAGA AGACGCGCAA AACGATTACC GTGAGTTTGA CGGCGTTGGA GGGTGCCGCT TGCATGAACA ATACGTTCTG CTTGTGTATT TTTATGGGGC TGGTCTTTGT GCGCGGCTTG GCTTGGCACT ACACGGCCGA GACGGTAGCC ATTGTGATTG TGGAATTCAT AATTGCATTT ATCGTGATTC GAGAAACTAC CATGACGACG GGAATGGCCA TGTTCATCTT GGCGTTGTTT CCGCTGAGCA TTGTGCTCGT CGCCGCTCTA GAAGCGTTTG GTTTGGATTG ATGGATCGGT TCACTATTTG AGTGGGTCGG TGAGTCACAC GCGTGTTGAA AGTATTTGTT ACGTGTCGGC GCCGGGACCA CAACCTTGGC TCCGTGCCGA ATGCACTTTT TGGAATGCAA TGTTCCGTTA ATGAAAACGC TAGTAGAGAG AACAAATGTA TGG
|
Protein sequence | MIGPAIDPAD VGLTGLFWLF LSYGYVLYSS SNLISEGSEL LLLIPSMAGL VGGVVLPLLG AVPDGAIILF SGLGSLEDAQ ETLSVGVGAL AGSTIMLLTV PFALSVYGGR VDLDANGVPD YLVKPKLSTK TSWKAEFTKT GVTLSDAVHH GGVLMALTTV PYFLIQVPAS IYATPENSED VVAAQEHWWA AAGFILCLLG LTVYMRLQLH ISQQGQDKGK RMAVMKKLLK QGQVSLSGAI AAQVNAKESA LQAQAASEYQ SIHDVKDGYP SPAIAAFLKE ILADAFYSYD SDTNGQLDKT EVFVFFRDFH ESISEEEMDK LFAKFDTDGS GTISLDEFIG LAYTLIKAQD QQTAPRHLDA SSRGTRAALV QAAFGEDEDE EEETVPEEFT SLTPDQQQRA IKWKAFRMLA LGTGLVVLFS DPMVDVMQEI AVRSGISPFY VSFVLAPLAS NASEVIASQY YASKKTRKTI TVSLTALEGA ACMNNTFCLC IFMGLVFVRG LAWHYTAETV AIVIVEFIIA FIVIRETTMT TGMAMFILAL FPLSIVLVAA LEAFGLD
|
| |