Gene Information |
Locus tag | PHATRDRAFT_18197 |
Symbol | |
ID | 7197570 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 744347 |
End bp | 748677 |
Gene Length | 4331 bp |
Protein Length | 1168 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177677 |
Protein GI | 219111851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
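As a quick consistency check on the coordinates above, the following minimal sketch recomputes the gene length from the start and end positions and compares it with the number of bases needed to encode the listed protein. The function names are hypothetical, and two things are assumed rather than stated in the record: that the coordinates are 1-based and inclusive, and that the coding region ends with a single stop codon.

```python
# Values copied from the Gene Information table above.
START_BP = 744347
END_BP = 748677
PROTEIN_LENGTH_AA = 1168


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa, include_stop=True):
    """Bases needed to encode the protein (assumption: one stop codon)."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return codons * 3


length = gene_length(START_BP, END_BP)
coding = coding_length(PROTEIN_LENGTH_AA)
non_coding = length - coding  # positions in the gene span that are not translated

print(length, coding, non_coding)
```

Under these assumptions the computed length matches the listed 4331 bp. The translated portion accounts for 3507 bp, leaving 824 bp of untranslated positions inside the gene span, which is consistent with the gene being marked as spliced.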
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.44305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGATACTCG GAAGGCTCGT ATTTGGCTGG TTTAGTCCCC TCCTGGTCCG CGGAAACGAG AAGCAAAGAC TGGATCAGGA AGATCTGAGC CTGATCCCGT TTCCGAAAGA CTGCCAAACT TCGACAGTGG CGGCCGCTTT TGAAAAAGCC TGGGGGGAGG AATTGAAAAG GGAAGACCCT AGTCTGTTTC GTGCTCTCCT CGCCTCGTAT GGTACTGAAT TTGCGAGGGC TGGCTTATTT AAGTTGGTGC ATGACTGCTG CGTCTTTGTT GGACCCCAAG TTCTTCATGC TATGATAATG TTTTTACGTA ACCCTGACTC TCCACTTTCA TATGGCTTAG GCTTGACGAC GCTCGTCACA TTGTCACAGC TCACAATGAG CCTGTGCCTC CGACATTATT TTTTCAAGTG CTACACTACG GGTTTGCGTG TGCGCACAGC CGTTGTCGTG GCCATCTATC ATAAAGCTTT GAAGCTGTCA GCAAGTGAAC GGCAAACGCG CTCGTCGGGT GAAATTACGA ACCTCATGTC GATTGATGCC CAGAGACTAC AAGGTGAGTT CTACTAGTTG TTAATTATTT GGCGAAGCCA TTTGGGCTCA TCACTCAAAA CTTTATCATC CAGACTTGAC AACGTACTTG CACGCAATTT GGTATAGTCC CTTGCAAATC AGTCTAGCGC TTTTGTTTCT TTGGAAACAG CTTGGCGCTA GTTCTTTAGG AGGTGTGCTA GTCATTGTGA CAATGATTCC AGTGACGAAA ATTGTGGCTC AGTGGATGGG ATCTATGCAA AAGTTGTTGA TGCGAGCTAA GGATCAGCGA GTAGACTTGA ACGGAGAGGT TTTGGCCAGC ATGAAAGTCG TCAAGTTTCA GGCCTGGGAA GAGCCCTTCC AATCTCGGAT CCTTGCCCTT CGTGAGGTGG AACTTCATCA ACTTCTTCGC TACTACATTG TGTTGTCGCT TTCTCGAATG CTGTGGACGT TCACGCCGTT GATGGTGGCG CTGGCGACAT TCTCCGCCTA TGTATGGTCT GGCCATGTTC TTGACGTGGC GTCGGCGTTG ACATCACTAG CTCTTTTTGA AATTTTGAGG TTTCCTCTAT TTATGCTTCC TCAAATCATT AGCAATATTG TGGAGGCTAC GGTGGCATTG AAACGCATTC AATCATTCTT GCTTTGTAAA GATCATAAAC CAGTTGAAGC TGGTAATCTC GACAACATCG GCATCAGAAT GGAAGGAGTT TCAGCGGCGT ATGATTCGAA GCGTCCGAAG GTGAATCCAC AAGCTGATCC AGTTGCTTTA GACCTGTTGG ATAAACAGTG GGAGGTCAAG CTACTGAAAT CTCAGCTAAA TGAGGCCGAG CATCTCATAA ACCGCCATAC GGGCAAGGCG GTTTACGGTT CGAAAAACGA AGGCGATGAG GAAGCTGGGC ATAGCCTTCT TTGTCTTAAG CGAATTGAAT TTGAATGCAA ACCAGGCGAA CTTGTAGCTG TCATCGGTAG TGTTGGCTGT GGAAAATCGT CGTTTATAAA CGCACTTTTG GGCGAGGTCA GGGCATTGAC TGGATCTACT TCCGTTTGTG GAAAGATGGC GTATTTCTCG CAAGTGCCTT TCATTATGAA TGCCAGCGTG CGGGACAATA TTCTCTTCAG TCATACAGAC GAAGAAGTAG ATGAAGCGAT GTACCAGAGG TGTTTGAGAT GCTGCGCCCT GAAGCACGAT TTGGATCTTC TCCCTAATGG CGACCGCACC GAAATTGGAG AGAAGGGAAT TACATTGTCT GGAGGTCAAA AGGCGCGAGT 
AGCACTAGCT CGTGTAGTAT ATCATCGGGC TGACCTTTCT TTGATTGATG ATGCTCTGGC AGCTGTTGAC GCTCATGTTG CAAAGCAGCT TTTCGAAGAG GCCATCGTGA ACGAACTCCT GTCTTGTGGC GCTGCTGGTA TGGAGAGCCG CAGCGTTATC ATGGTCACAA ATGCTCTGCA ATATTTATCG CATCCCCGAG TAGACAGAAT TATTGTTTTA CAGGACGGAC ACATTGTAGA AAGTGGGACT TACAATGAAT TGAAGAACGG AGACTCCGTT TTCGCTGGAT TCCTTGCAGT ACTACGTGAC ACTGGTACAG ATCTGAGTGG ACATCTTGTA GAGGGTGTTG CTAGCAGCGA TAGCAACGGC GTTTCTGATG AATCGGGTAA TTTAGTATGC ACCGGACGAG AAGCGGATAT TGAAGCGGAA CTCCCGGTGA AACTAATGAC CGACGAGTCT CGGCAATCGG GACACGTGAA GCCGAGTGTC TATCTCTCTT GGATCAAAGC AGCTGGTGGG TTGTTTGCCC CTGTCGCAAT TCTCTTGGCG TTCGGCTTTG CAGAGGGGAT TTCTGTCCTT TCGAATTGGT GGATTACCTA CTGGTCCGGA CACGGTAGCT TATCAAGCCA ATCCCGATTT CTTGCCATCT ATGCCCTCAT CAACGGGACG GCTGCTCTGT TCGGCCTTTT TAGGACTCTG CTTGTCGTCA TTTTTGGTCT AAAAGTCTCC CGAAAGGTGA GACGCCTGTG GAGACAAGAA TCATGCACCT ACGCTTGCTT CTCACTGTCT CTTTCGTTCA CAACAGCTTT TTGCCAACCT ACTGTCGGTA ATTCTGCACG CTCCCATGTC ATTTTTTGAT ACGACGCCTG TTGGGCGTCT TGTGAATCGA TTTAGCAAAG GTAAGAGAAG CTTGTATTTC GGGCCCTGTC GCTAAGCATC TAGGTGATCT GTTTCCCTCA TCGTCTTTAC AGATATGTAT ACAATAGATG AGCAGCTAAT GGGCACCCTC CGAACATATC TCCAGACACT GTTTGGCGTG TTCAGCACAC TGTTGGTGAT ATCATCTGTG ACTCCATTGT TTCTGTTGTG CTTGGTGCCC ATGCTCATAT TCTATTTGAA AGAGCAATCT TTTTTCACTG TAAGTCAGTC TGGCCGAAGT TCTAAGACGC CCAGTACAGC CTTTCTGACA TTCTGCTGCG TAAACGAAGA TTTCTTACAG AGAGCTGAAA AGACTGGACT CGGTGAGCCG GAGTCCAATT TACGCGTTGC TTGGCGAAAG CGTAGATGGC GTTGCTGTGA TACGGGCCTT TGCTGCCCAA AAAAGTCTTC TCTGTCGCCT AACCGATATG TTGGATATTC AGCAACATGC GTACTTTCTT ACTTGCGCAG CGCAGTCCTG GCTCGCTGTT CGTCTTGAGC TGATTGGAAC ACTAATTGTA ACATTTGCGG CTTTGAGTGC AGTATTGGAG CACACCAGAT CCGGAGCCGA CGGAACCTTT GCGGGTTTGG CAGGGCTGTC GATCTCGTAT GCGCTATCGG TTACGCAATC GCTGAACTGG AGCGTTCGAA TGGCAAGTGA TATGGAGGCG AATATGGTGG CTGTTGAACG AGTTGAAGAG TACTCCAACA TTCAGTCCGA AGGCCTGAGA TCAACTCCTG TTGACGCAAA ACTCCCTCAA GTGTGGCCTC CGAAAGGGGC GATTGAGTTC ACAGAAGTCA GGCTACGATA TAGACCGGGT TTGCCTTTTG TTCTAAAGGG CTTAAATTTG ACTATTCCTC CCGGGAGCAA AATAGGAGTT GTTGGTCGTA 
CAGGTAAGAG GAATCATAGG AAGTCAACCG TTCTTACATT GACGGTAATC TTACGATGTT TCTAATCGTG TAAGGGGCGG GGAAATCGAC GCTAATGATA GCTCTGATGC GCATTGTTGA TGTAACGGAA GGAACCATAA AGATTGATGG TACAGATATA TCGGAGATTG GATTGGCGAG GCTCAGACGA ACTCTGGCTG TTATTCCTCA GGACCCTGTC CTTTTTAGCG GATCCGTCAG ATCAAATCTT GATCCGTTCC ATGAATACGA AGACGATGCT TTGTTAGATA TTCTAGACCG CGTTGGGCTG TACGCACGGT CTAGGACCTC TTCAACTCAG AGTCTCCCAT CCCTTGGCCA AATCTGTATT CGAACCCTTA CGGATGTCAT AGCTGAAGGG GGCATTAACT TTTCTGTCGG CCAGCGTCAG TTGCTGGTAA TAGCTCGTGC ACTTTTGCGA GGGGCGAAGA TTGTTATAAT GGACGAGGCA ACTGCAGCAG TTGATGCTGG AACGGACGCT GCAATCCAGA AAGTAATTCG GACTGAATTC ACAGAGGCGA CATGCATCAC GGTGGCCCAT AGAATCAACA CTATTTTGGA TAGCGATTAC ATCCTGGTTA TGTCGGACGG AAAGGCGGAA GAATTCGACA AACCTGATAT GCTACTAAAA AAAGGAGGAT TGTTTCGTGA TTTAGTAAGA GCTTCTGCTG ATAATACCTA A
|
Protein sequence | MIMFLRNPDS PLSYGLGLTT LVTLSQLTMS LCLRHYFFKC YTTGLRVRTA VVVAIYHKAL KLSASERQTR SSGEITNLMS IDAQRLQDLT TYLHAIWYSP LQISLALLFL WKQLGASSLG GVLVIVTMIP VTKIVAQWMG SMQKLLMRAK DQRVDLNGEV LASMKVVKFQ AWEEPFQSRI LALREVELHQ LLRYYIVLSL SRMLWTFTPL MVALATFSAY VWSGHVLDVA SALTSLALFE ILRFPLFMLP QIISNIVEAT VALKRIQSFL LCKDHKPVEA GNLDNIGIRM EGVSAAYDSK RPKRIEFECK PGELVAVIGS VGCGKSSFIN ALLGEVRALT GSTSVCGKMA YFSQVPFIMN ASVRDNILFS HTDEEVDEAM YQRCLRCCAL KHDLDLLPNG DRTEIGEKGI TLSGGQKARV ALARVVYHRA DLSLIDDALA AVDAHVAKQL FEEAIVNELL SCGAAGMESR SVIMVTNALQ YLSHPRVDRI IVLQDGHIVE SGTYNELKNG DSVFAGFLAV LRDTGTDLSG HLVEGVASSD SNGVSDESGN LVCTGREADI EAELPVKLMT DESRQSGHVK PSVYLSWIKA AGGLFAPVAI LLAFGFAEGI SVLSNWWITY WSGHGSLSSQ SRFLAIYALI NGTAALFGLF RTLLVVIFGL KVSRKLFANL LSVILHAPMS FFDTTPVGRL VNRFSKDMYT IDEQLMGTLR TYLQTLFGVF STLLVISSVT PLFLLCLVPM LIFYLKEQSF FTISYRELKR LDSVSRSPIY ALLGESVDGV AVIRAFAAQK SLLCRLTDML DIQQHAYFLT CAAQSWLAVR LELIGTLIVT FAALSAVLEH TRSGADGTFA GLAGLSISYA LSVTQSLNWS VRMASDMEAN MVAVERVEEY SNIQSEGLRS TPVDAKLPQV WPPKGAIEFT EVRLRYRPGL PFVLKGLNLT IPPGSKIGVV GRTGAGKSTL MIALMRIVDV TEGTIKIDGT DISEIGLARL RRTLAVIPQD PVLFSGSVRS NLDPFHEYED DALLDILDRV GLYARSRTSS TQSLPSLGQI CIRTLTDVIA EGGINFSVGQ RQLLVIARAL LRGAKIVIMD EATAAVDAGT DAAIQKVIRT EFTEATCITV AHRINTILDS DYILVMSDGK AEEFDKPDML LKKGGLFRDL VRASADNT