Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38979 |
Symbol | |
ID | 7194694 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 21444 |
End bp | 25049 |
Gene Length | 3606 bp |
Protein Length | 1165 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183014 |
Protein GI | 219125495 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCTC TTGAACATGT TCTTGTGAAC CTTTTGGGAG CGACGACACC GGATTCGTCG TACCGTCGGT TCTTTGAAGA GTACGGTATT ACTCAGGCCA GCGAGTTGGC CTCAATCACC GAGAATCGTC TTGCAACGGT GTCATATGGT GTTTTGACCC CTTCTGTGGG AGATACCCCT GCCACCATTG TTCGTATGTT TCTTCCGCCT GCCCAGCAGG ATCGGATCTT GAAGATTGTC AAATGGTTCC TCTCGAAAGG TACCGACGTG ACAAACGAAA CCTGGTTTGA ACTTACCCCT GAAGTCCTTG AGTATTGGCA ACCAGCCTCT GCTATTGTTG CCCCTGCTAC CCCTGTTGGA TCGGATGCTC GGAGTTCCTT TGTCGAAAGT GCTGCCGCAA AGTTTCGGAA GACAATCAAG AATCACTCCG TTCCGTACCC AAAGTTCAGT GAAGACCGTT TTTGGGTCAC TTGGAATACG AATATTCGTA TCAAGCTCCG TATCCATGGC GTCCAGTTGG TTCTTGACCC GGATTATTTG CCTGAGACCG TCGACGAGAC GGATACATTT GTCGAAATGC AGAACTTTGT CTTTGGCGTG TTCAACGATA TATTGTTGAC CCCTCGTGCG CGTGGAATCC TCCACAAGCA TGTGGATGAA TTGGATGCTC AGGCTGTTTA CCGCGACCTT GTTGCCTCGT ACGGTAAAGG TATCAATGCG CAGATCACTG CCACATCCAT TGAAACAAAG CTCACTTTGT ACTCATTTGC GACTTCAAAG AGCAAGACCT GTGTTGCTTT TTTGACGACT TGGCGCAATT TTATTTACGA TCTTGAACGG ATCAACGAGT TCCCCTTGCC GGATCACCAG AAGAGCGTAC GACTCAAGTC AGCTGTCCGT TCCCATCCGC AATTGAAACT TTTCCTCGGA AATGTTCAGC TTTACTCTCG GACCCATGTG GGTAAGAGTG CCGACGATTC CGATTTTGAG TATGTTTATG ATTTGATGCT CAAACATGCG ACTGATATTG ATCAGACCGA TTTGGAAGAC CGCGGTAACA ACCGCGGTGG CCGCTCAGCA AACAACGCGA AGTCTCAGTC TTCTTCCAAG TCTTCTTCCA AGAAGAAAAC TAACAAACTG ATTGGTAAGA AGCACAAGAA TTATGTGCCT CCTGAAAAGT GGAATGCTCT CTCTCCCGAA GAGAAGCGGA CCATTATGGA CCAACGAGGA CCTCGCCCTG CTCCAGCTCC TGCCCCTGCC TTATCGGTGA ACGCCGCTGC CACTCAGCCT CCTCCTACGG TGTATGTCAG CGACTCGACG GTTGTTGACA ACCAAAGCTC GCTTCGACTC ACGTCCCGCC TGCTGCCGGA CCTGGTCAAC TGCTTCGTTC GCTCATTTCG AATTCAGCTG CCCGCCAGCA CCCTGCTCCA TCGAATGGAG CCACGTCTGA CTCTTTTTCG GTCAATGGGA CCACCTATCG CCGCGAAGTG AACCGTGCTT CTGTGCAGTA CCGCCTTTCC ACTCACGATG TTTCGTTGAA CAAGGACTCT TTGATCGATG GTGGTGCCAA CGGTGGCCTT AGCGGCTCCG ACGTAACCGT TATTTCGCAA TCCCTGTTGG AGGCCACAGT CTCTGGAATT GGAAATTCGG AATTGACCAA CCTCCGTTTG TCAATGGTGG CCGGACTCAT TCACACGACG GATGGTCCCA TTATTGGTGT GTTTCACCAG TATGCTCACC TTGGTACTGG CAATACCATT CATTCGTGCA ACCAAATGCG CTCCTGGGGA GTCACGGTTG ACGACGTCCC TCGTACTTTT GGTGGCAAAC AGCGTATTGT CACGTCCGAT GGTCGTTTTG TCATCTCGCT TTCGGTTTCT GGCGGACTCA CTTACTTGTC TATGCAGGCT CCTACCGAGG AGGACCTGGA CACTTTCGAA TGGGTGCCTT TTACCGCTGA CAACGAGTGG GACCCAAATG GGGTCTCTTC TCCTGCCGCT GCCGACGATG ACCTCAGTTT GCAGCTTCCT GCCGGCCATG TCCCGTTCCG TGATGAACGC ATCAATAACT TTGGTCTCCT TGCACATTCC GCGGCTGTTA GTCGATCCCC TTTGAATGCC GATGCTTTGC AACCCAATTT TGGATGGGTT CCCAGTGCTT GTATCTCTCG CACGTTTGAG AATACCACTC AATTCGCTCG TGCCGATGCC CGTTTGCCCC TGCGCAAACA CTTCAAGTCG CGTTTCCCTG CTGCCAATGT TTCTCGTTTG AACGAAATTG TGGCAACCGA TACCTTTTTC TCGGATACCC CTGCGGCCGA TGACGGCATT TTTAACCATG GTGGGGCTAC GATGGCCCAA CTTTTCGTTG GCAAAAGTTC GCAAATCACA TCTGTCTTCC CGATGAAGCG TGAATCCCAA TTTGCCCATA CTTTCGAGGA CTTTATTTGT ACCCATGGCG CTCCCGATGC CCTCCTCAGC GACAACGCTC GTGCTCAGAT CGGTCAGCAA GCACTTGAGA TTTTGCGTAT GTATGCAATC GACGATATGC AGTGCGAGCC GCATCATCAA CACCAAAATT ACGCGGAACG CCGCATTCAA GAGGTGAAAA AGATGGTGAA CACGATCATG GATTGTACAA ACACTCCTCC GGAATATTGG TTGCTCTGCT TATTTTATGT GACCTATTTG CTCAATCGCC TTGCTGTTGA AAGCTTGAAT TGGCGTACCC CGCTTCAGGT TGCCCATGGA CAGCGTCCTG ATATTTCTGC TTTGCTCCTT TTCCGTTGGT TTGAAACCGT TTATTATTAC AATCCTGACC ATGCGTCTTT CCCATCGGCT TCTCGCGAGA AAACTGGTCG TTGGATTGGT GTTGCTGAAC ACAAAGGTGA TGCGCTGACT TATTGGATTT TAACCGACAA TACTCACCAA GCCATTGCTG GTTCTGTTGT TCGTTCAGCC AATGTCGATA ATGGTTTGAA AAACCATCGT GCTGCGAATT CCTCTCCCGA TGGTGGGGAG CCTTCGAATC CTAAGCCCAT TGTCTTGGCT ACGAGTGACC TACGCCATGA TGCTACGGTC GATCCATCTT TTGAGAAATC CCCTGCATTC TCTCCTGACG AATTGATCGG CAGGTATTTG ATCCGTGAAG CCCCTGACGG CCAGAGCCAT CGAGCCCTTG TTGCTCGTAA AATTATTGAT GCCGACTCCG ATAACCATCA GACGATTTGC TTCTTGTTGC AAATTGATGA AAAGGATGCT GACGAGATCA TTTCGTACAA TGAACTTTCC GATTTGATGG AAGCCCAACA ATCAGAGCCC GCTACGAACG GAAATATCGA AGATCATTTC AAGTTTACTA GTATTATTGG ACACCAAGGC CCTTTGCAAC CGACCGATGC TGGTTACAAG GGATCCTCTT GGAATGTTTT GGTTCAATGG GAAGATGGTT CCCAGTCGTA CGAACCTCTA ATTGAAATGG CTAAGGACGA TCCAGTCACA CTCGCGATGT ACGCGTCTGA CAACGATCTC CTTAACGTGC CCGGGTGGCG CCGCTTCAAT CGTTTGCTTC GCAACCGTGA TGACTTCAAT CGATCTGTTT CGTTAG
|
Protein sequence | MDPLEHVLVN LLGATTPDSS YRRFFEEYGI TQASELASIT ENRLATVSYG VLTPSVGDTP ATIVRMFLPP AQQDRILKIV KWFLSKGTDV TNETWFELTP EVLEYWQPAS AIVAPATPVG SDARSSFVES AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQLVLDPDYL PETVDETDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQAVYRDL VASYGKGINA QITATSIETK LTLYSFATSK SKTCVAFLTT WRNFIYDLER INEFPLPDHQ KSVRLKSAVR SHPQLKLFLG NVQLYSRTHV GKSADDSDFE YVYDLMLKHA TDIDQTDLED RGNNRGGRSA NNAKSQSSSK SSSKKKTNKL IGKKHKNYVP PEKWNALSPE EKRTIMDQRG PRPAPAPAPA LSVNAAATQP PPTLASTHVP PAAGPGQLLR SLISNSAARQ HPAPSNGATS DSFSVNGTTY RREVNRASVQ YRLSTHDVSL NKDSLIDGGA NGGLSGSDVT VISQSLLEAT VSGIGNSELT NLRLSMVAGL IHTTDGPIIG VFHQYAHLGT GNTIHSCNQM RSWGVTVDDV PRTFGGKQRI VTSDGRFVIS LSVSGGLTYL SMQAPTEEDL DTFEWVPFTA DNEWDPNGVS SPAAADDDLS LQLPAGHVPF RDERINNFGL LAHSAAVSRS PLNADALQPN FGWVPSACIS RTFENTTQFA RADARLPLRK HFKSRFPAAN VSRLNEIVAT DTFFSDTPAA DDGIFNHGGA TMAQLFVGKS SQITSVFPMK RESQFAHTFE DFICTHGAPD ALLSDNARAQ IGQQALEILR MYAIDDMQCE PHHQHQNYAE RRIQEVKKMV NTIMDCTNTP PEYWLLCLFY VTYLLNRLAV ESLNWRTPLQ VAHGQRPDIS ALLLFRWFET VYYYNPDHAS FPSASREKTG RWIGVAEHKG DALTYWILTD NTHQAIAGSV VRSANVDNGL KNHRAANSSP DGGEPSNPKP IVLATSDLRH DATVDPSFEK SPAFSPDELI GRYLIREAPD GQSHRALVAR KIIDADSDNH QTICFLLQID EKDADEIISY NELSDLMEAQ QSEPATNGNI EDHFKFTSII GHQGPLQPTD AGYKGSSWNS HSRCTRLTTI SLTCPGGAAS IVCFATVMTS IDLFR
|
| |