Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39223 |
Symbol | |
ID | 7194925 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 652780 |
End bp | 656377 |
Gene Length | 3598 bp |
Protein Length | 1161 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183133 |
Protein GI | 219125743 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCTC TTGAACATGT TCTTGTGAAC CTTTTGGGAG CGACGACACC GGATTCGTCG TACCGTCGGT TCTTTGAAGA GTACGGTATT ACTCAGACCA GCGAGTTGGC CTCAATCACC GAAAATTGTC TTGCAACGGT GTCATATGGT GTTTTGACCC CTTCTGTGGG AGATACCCCT GCCACCATTG TTCGTATGTT TCTTCCGCCT GCCCAGCAGG ATCGGATCTT GAAGATTGTC AAATGGTTCC TCTCGAAAGG TACCAACGTG ACAAACGAAA CCTGGTTTGA ACTTACCCCT GAAGTCCTTG AGTATTGGCA ACCAGCCTCT GCTATTGTTG CCCCTGCTAC TCCTGTTGGA TCGGATGCTC GGAGTTCCTT TGTCAAAAGT GCTGCCGCAA AGTTTCGGAA GACAATCAAG AATCACTCCG TTCCGTACCC AAAGTTCAGT GAAGACCGTT TTTGGGTCAC TTGGAATACG AATATTCGTA TCAAGCTCCG TATCCATGGT GTCCAGTTGG TTCTTGACCC GGATTATTTG CCCGAGACCG TCGACGAGAC GGATACATTT GTCGAAATGC AGAACTTTGT CTTTGGCGTG TTCAACGATA TATTGTTGAC CCCTCGTGCG CGTGGAATCC TCCACAAGCA TGTGGATGAA TTGGATGCTC AGGCTGTTTA CCGCGACCTT GTTGCCTCGT ACGGTAAAGG TATCAATGCG CAGATCACTG CCACATCCAT TGAAACGAAG CTCACTTTGT ACTCATTTGC GACTTCAAAG AGCAAGACCT GTGTTGCTTT TTTGACGACT TGGCGCAATT TGATTTACGA TCTTGAACGG ATCAACGAGT TCCCCTTGCC GGATCACCAG AAGAGCGTAC GACTCAAGTC AGCTGTCCGT TCCCATCCGC AATTGAAGCT TTTCCTCGGA AATGTTCAGC TTTACTCTCG TACCCATGTG GGTAAGAGTG CCGACGATTC CGATTTTGAG TATGTTTATG ATTTGATGCT CGAACATGCG ACTGATATTG ATCAGACCGA TTTGGAAGAC CGCGGTAACA ACCGCGGTGG ATGCTCAGCA AACAATGCAA AGTCTCAGTC TTCTTCCAAG AAGAAAACTA ACAAACCAAT TGGTAAGAAG CACAAGAATT ATGTGCCTCC TGAAAAGTGA AAAGCTCTCT CTCCCAAAGA GAAGCGGACC ATTATGGACC AACGAGGACC TCGCCCTGCT CCAGCTCCTG CCCCTGCCTT ATCGGTGAAC GCCGCTGCCA CTCAGCCTCC TCCTACGGTG TATGTCAGCG ACTCGACGGT TGTTGACAAC CAAAGCCTCG CTTCGACTCA CGTCCCGCCT GCTGCCGGAC CTGGTCAACT GCTTCGTTCG CTCATTTCGA ATTCAGCTGC CCGCCAGCAC CCTGCTCCAT CGAATGGAGC CACGTCTGAC TCTTTTTCGG TCAATGGGAC CACCTATCGC AGCGAAGTGA ACCGTTCTTC TGTGCAGTAC CGTCTTTCCA CTCACGATGT TTCGTTGAAC AAGGACTCTT TGATCGATGG TGGTGCGAAC GGTGGCCTTA GCGGCTCCGA CGTAACCGTT ATTTCGCAAT CCCTGTTGGA GGCCACTGTC TCTGGAATTG GAAATTCGGA ATTGACCAAC CTCCGTTTGT CAACGGTGGC CGGACTCATT CACACGACGG ATGGTCCCAT TATTGGTGTG TTTCACCAGT ATGCTCACCT TGGTACTGGC AATACCATTC ATTCGTGCAA CCAAATGCGC TCCTGGGGAG TCACGGTTGA CGACGTCCCT CGTACTTTTG GTGGCAAACA GCGTATTGTC ACGTCCGATG GTCGTTTTGT CATCCCGCTT TCCGTTTCTG GCGGACTCAC TTACTTGTCT ATGCAGGCCC CTACCGAGGA GGACCTGGAC ACTTTCGAAT GGGTGCCTTT TACCGCTGAC AACGAGTGGG ACCCAAATGG TGTCTCTTCT CCTGCCGCTG CCGACAATGA CCTCAGTTTG CAGCTTCCTG CCGGCCATGT CCCGTTCCGT GATGAACGCA TCAATAACTT TGGTCTCCTT GCGCATTCCG CGGCTGTCAG TCGATCCCCT TTGAATGCCG ATGCTTTGCA ACCCAATTTT GGATGGGTTC CCAGTGCTCG TATCTCTCGC ACGTTCGAGA ATACCACACA ATTCGCTCGT GCCGATGTCC GTTTGCCCCT GCGCAAACAT TTCAAGTCGC GTTTCCCTGC TGCCAATGTT TCTCGTTTGA ACGAAATTGT GGCAACTGAT ACCTTTTTCT CGGATACCCC TGCGGCCGAT GACGGCATTC TTAACCATGG TGGGGCTACG ATGGCCCAAC TTTTCGTTGG AAAAAGTTCG CAAATCACCT CTGTCTTCCC GATGAAGCGT GAATCCCAGT TTGCCCATAC TTTCGAGGAC TTTATCCGTA CCCATGGTGC TCCCGATGCC CTCCTCAGCG ACAATGCTCG TGCTCAGATC GGTCAGCAGG CACTTCAGAT TTTGCGTATG TATGCAATCG ACGATATGCA GTGCGAGCCG CATCATCAAC ACCAAAATTA CGCGGAACGC CGCATTCAAG AGGTGAAAAA GATGGTGAAC ACAATCATGG ATCGTACAAA CACCCCTCCG GAATATTGGT TGCTCTGCTT ATTTTATGTG ACCTACTTGC TCAATCGCCT TGCTGTTGAA AGCTTGAATT GGCGTACCCC GCTTCAAGTT GCCCATGGAC AGCGTCCTGA TATTTCTGCT TTGCTCCTTT TCCGTTGGTT TGAACCCGTT TATTATTACG ACCCTGACCA TGCGTCTTTC CCATCGGCTT CTCGCGAGAA AACTGGTCGT TGGATTGGTG TTGCTGAACA CAAAGGTGAT GCGCTGACTT ACTGGATTTT AACCGACAAT ACTCACCAAG CCATTGCTCG TTCTGTTGTT CGTTCAGCCA ACGTCGATAA TGGTTTGAAA AACCATCGTG CTGCGAATTC CTCTCCCGAT GGTGGGGAGC CTTCGAATCC TAAGCCCATT GTCTTGGCTA CGAGTGACCT ACGCCATGAT GCTACGGTCG ATCCATCTTT TGAGAAATCC CCTGCATTCT CTCCTGACGA ATTGATCGGC AGGTATTTGA TCCGTGAAGC CCCTGACGGC CAGAGCCATC GAGCCCTTGT TGCTCGTAAA ATTATTGATG CCGACTCCGA TAACCATCAG GCGATTTGCT TCTTGTTGCA AATTGATGAA AAGGATGCTG ACGAGATCAT TTCGTACAAT GAACTTTCCG ATTTGATGGA AGCCCAACAA TCAGAGCCCG CTACGAACGG AAATATCGAA GATCATTTCA AGTTTACTAG TATTATTGGA CACCAAGGCC CTTTGCAACC GACCGATGCT GGTTACAAGG GATCCTCTTG GAATGTTTTG GTTCAATGGG AAGATGGTTC CCAGTCGTAC GAACCTCTAA TTGAAATGGC TAAGGACGAT CCAGTCACAC TCGCGATGTA CGCGTCTGAC AACGATCTCC TTAACGTGCC CGGGTGGCGC CGCTTCAATC GTTTGCTTCG CAACCGTGAT GACTTCAATC GATCTGTTTC GTTAGTGA
|
Protein sequence | MDPLEHVLVN LLGATTPDSS YRRFFEEYGI TQTSELASIT ENCLATVSYG VLTPSVGDTP ATIVRMFLPP AQQDRILKIV KWFLSKGTNV TNETWFELTP EVLEYWQPAS AIVAPATPVG SDARSSFVKS AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQLVLDPDYL PETVDETDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQAVYRDL VASYGKGINA QITATSIETK LTLYSFATSK SKTCVAFLTT WRNLIYDLER INEFPLPDHQ KSVRLKSAVR SHPQLKLFLG NVQLYSRTHV GKSADDSDFE YVYDLMLEHA TDIDQTDLED RGNNRGGCSA NNAKSQSSSK KKTNKPIALS PKEKRTIMDQ RGPRPAPAPA PALSVNAAAT QPPPTVYVSD STVVDNQSLA STHVPPAAGP GQLLRSLISN SAARQHPAPS NGATSDSFSV NGTTYRSEVN RSSVQYRLST HDVSLNKDSL IDGGANGGLS GSDVTVISQS LLEATVSGIG NSELTNLRLS TVAGLIHTTD GPIIGVFHQY AHLGTGNTIH SCNQMRSWGV TVDDVPRTFG GKQRIVTSDG RFVIPLSVSG GLTYLSMQAP TEEDLDTFEW VPFTADNEWD PNGVSSPAAA DNDLSLQLPA GHVPFRDERI NNFGLLAHSA AVSRSPLNAD ALQPNFGWVP SARISRTFEN TTQFARADVR LPLRKHFKSR FPAANVSRLN EIVATDTFFS DTPAADDGIL NHGGATMAQL FVGKSSQITS VFPMKRESQF AHTFEDFIRT HGAPDALLSD NARAQIGQQA LQILRMYAID DMQCEPHHQH QNYAERRIQE VKKMVNTIMD RTNTPPEYWL LCLFYVTYLL NRLAVESLNW RTPLQVAHGQ RPDISALLLF RWFEPVYYYD PDHASFPSAS REKTGRWIGV AEHKGDALTY WILTDNTHQA IARSVVRSAN VDNGLKNHRA ANSSPDGGEP SNPKPIVLAT SDLRHDATVD PSFEKSPAFS PDELIGRYLI REAPDGQSHR ALVARKIIDA DSDNHQAICF LLQIDEKDAD EIISYNELSD LMEAQQSEPA TNGNIEDHFK FTSIIGHQGP LQPTDAGYKG SSWNVLVQWE DGSQSYEPLI EMAKDDPVTL AMYASDNDLL N
|
| |