Gene Information |
Locus tag | PHATRDRAFT_40968 |
Symbol | |
ID | 7198902 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 13080 |
End bp | 16387 |
Gene Length | 3308 bp |
Protein Length | 1059 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185029 |
Protein GI | 219129718 |
COG category | |
COG ID | |
TIGRFAM ID | |
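The Start bp, End bp, and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the check, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values taken from the record: Start bp 13080, End bp 16387.
print(gene_length(13080, 16387))  # 3308, matching the Gene Length field
```

Note that this is the genomic span. Because the gene is marked as spliced, the span includes any introns and need not be a multiple of three.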
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCTG CCACCCGGCA AATGACAAGT GTAGCCGTCT ATGCCCATTT TCTGGACAAT GTACTCCTTC TTCCCCAAGG ACATCCTATT CGTCTTGCCT TTGATCAACA AGGATATGAA TCGGCTGACA ATCTCTTGTG CATCTTTGAG AACGAACTCG ACTCTCTTGA GTACACTCCT CCTGCCATTC CTGACGGTCC CGAAAATCCG TCGTGCATCC CTCTAATCAT GGCACATCGA CAGATCATAC GTCACTTCCT ACGTTGGCAA GCATCCTTAA AAGACCAAAA GGGGGCTCCT TTGAAGAACT CCGAGCTCGT TGCACTCAAC AACGAGGACT TTGTCCTGTA CCGTCGGTCA GCACTTGGTC AGGTTTCGAC GGCCACTGCA CCTGCCACTG TTCCCCCGAC TGTTCAGAGT CCCACAGGAA AGACACGTTC GGCTGTCGAG GATTTCAAGC GTGGGATAAA ACGTGATAAA ACTCACTATC CTGTGCTCAA AGATGACCGA TACTGGGACA ACTTCTATTG TTCGTTTGTT GTTACTGCCG TAACACATAA CGTTGACAAG GTTCTAGACC CGAACTACAT TCCTACCGAT CCTTTGGAAA AGTCCCTCTT TGAAGAACAG AACAAGTTTG TATATTCTGC TCTAGAGCAT ACTCTTCAGA CGGACATGGG AAAGAACATT GTCCGTGAGC ACAGTTTTGA TTTCAATGCC CAAGAAGTTT TCCGTAAAAT TGTGAAACAT TATACGGAGT CAGCCAGCGC AAAGATCAGT TCGTCTACTA CTCTGGGGTA TCTCACAACT GCAAAATACG GATCGTCATG GACAGGTACA GCAGAAGGAT TTATTCTCCA CTGGAAGAAT CACTTGCGTA TTTACAATGA CACTGTACCT ACTGGTGAGC AACTTCCTCA GCAATTGTGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT GTACCTGAAC TCCGACAGGT TAAAATCACA GCAACACTCG ACTTAGCCAA AGGTGGTAGT CCCATTAGCT ACGATAGTTA CCTCAGTCTC CTCCTTGCAT CGGCATCGCT CTACGACAAC GGTAATAATC TATCTAATTC TCGCAGTGGC AAGAACAAGC GCAATATCTA TACTACTGAA CTAGCCTATC ATCCGACGGA TTTTGAAAGC GAACCAGATG TAGACTATGA TATAGATGTG TCACCGACTG CCATATACGA AGCCAATGCC CACGTCCGTA ACAACAGTAC CCGTAACCGT CCCCTGGCAA CTAATCGCGA ATGACCTTAC ATTCCTCGTG AAATGTGGAA TTTGCTCTCT GATGATTCCA AGGCCATCCT CCAAGGTTTG GCTGCACCCG GCAAGCAGGC ACCATTAAAT GGTAGCCCGC CTCATCAAAC GCTGCAGGCC AATACACACG AGACCATTGG CACGGAACAT ACCGCAACGG ACACCTTCCA TGATTGCGCA CCTGAAACTG AATTACTCGC ACATCTTACT GAGCGTGTCA GTCGCATGAG CAGCGGTGAT ATTCGTAAGG TACTCGCCGC ATCACGTGAC GTATCAGAAA AGCCCAAATC ACTGCAATCT AACGTACTGC AATACCAAGT CTCTTGTCAT ACTACCAACG AGACTTCTGC ATCCCTTGTT GACCGTGGCG CTAACGGAGG GCTTGCCGGT GGTGATGTCA TTGTCCTGCT CAAAACAGGA CGTTCGGCAA ACATCACAGG TATCAACGAT CATACCTTGC CAAACTTGGA CATCGTCACT GCCGCTGGAT GTGTTGAATC CCAAAATGGG 
CCCATCATTC TCATTATGAA CCAGTATGCT CATCTGGGGA AGGGTAAAAC CATTCATTCA AGTGCGCAGT TGGAACACTA TCGTAATCAT GTCGAAGACC GTTCACGCAC GGTAGGGGGT AATCAGCGCA TTGTCACTTT AGATGACTAT ATCATTCCCC TCCATATTCG ACAGGGACTT CCATACATGG ACATGCGACG CCCCACTGAT GCTGAACTAG CGTCCCTCCC GCATGTTGTC CTAACCTCAG ACGTCGATTG GGACCCCTCT GTACTCGACA ATGAAATTGA CCTTGCGACT TCATGGTACG ATGGCATCCA TGACTTGCCC CAGCCCCCAT ACGTTGAACC ACGTTTTGAT CATACAGGCC AATACCTTCA CCGTCACATT TCTCTATGCG ACTACCGTGA TGACGCCATT GCACGTATCA TGCAGTGTCA ACAGCATCAC GTCACACGTA ATGTGCACGA TTATGAAGCC CTTCGTCCTT GCTTTGGCTG GGTCTCTGCC GACACCGTTC GGAAAACCAT CATGGCCACC ACGCAGCATG CCCGTGAAGT CTATCACGCA CCGTTACGTA AACATTTTAA GTCTCGTTTC CCAGCCTTAA ATGTACACCG TCGTAACGAA CCGGTCGCTA CTGATACCAT ATGGTCCGAC ACTCCCGCCG TAGACAATGG TGCAAAATTT GCACAACTCT TCGTTGGCAG ACGGTCTCTT GTCACTGATG CCTACCCCAT GAAAACTGAT AAAGAGTTTG TCAACACCCT TGAAGATCAC ATCCGTTTCC GTGGTGCAAT GGACAAACTA ATCAGTGATC GCGCTCAAGT TGAGATCAGT AAAAAGGTCA CTGATATTAC ACGCGCATAT AATATCGATC AGTGGCAGAG TGAGCCTAAC CATCAGCACC AAAACTTCGC CGAACGTCGT ATCGCCACCA TCGAAGCCAA TACGAACAAC ATTCTCAATC TTACTGGTGC TCCTGATAAC ACCTGGCTTC TTTGCGTGAC ATATGTTTGC TATGTCTTCA ACCATTTGGC GCATGAATCT CTCGATCATC GCACCCCCCT CGAAGTGCTT ACTGGTTCTA CACCTGATAT CAGTGTTCTC CTTCAGTTTC ATTTTTGGGA ACCGGTCTAT TATAGAATTG AAGATGCGAC ATTCCCCTCT GGTGGTACCG AGCAACAAGG ACATTTTGTC GGCATCGCAG ACTCCGTCGG TGACGCTCTC ACTTATAAGA TCCTCAACGA CCGCACCAAC CGCATTCTAT ATCGATCTAG TGTTCGTTCT GTGGCCATTT CCGGGCAAAC CAACCTACGC CTTGCGTCAC AGGATGGGGA GAATGGTCCT AAGCCCATCA ACTTTATCAA GTCGCGTAGA ACCGAAAATC AAAATTCCTA TGCCATTAAG GAGTTGCCTG GTTTTACACC TGATGATCTT ATCGGTCGCA CGTTTCTCAC CGACACTCGT GATGATGGAG AGCGTTTTCG GGCACGAATC ACCCGTAA
|
Protein sequence | MVPATRQMTS VAVYAHFLDN VLLLPQGHPI RLAFDQQGYE SADNLLCIFE NELDSLEYTP PAIPDGPENP SCIPLIMAHR QIIRHFLRWQ ASLKDQKGAP LKNSELVALN NEDFVLYRRS ALGQVSTATA PATVPPTVQS PTGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYCSFV VTAVTHNVDK VLDPNYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA QEVFRKIVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP TGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGS PISYDSYLSL LLASASLYDN GNNLSNSRSG KNKRNIYTTE LAYHPTDFES EPDVDYDIDV SPTAIYEANA HAILQGLAAP GKQAPLNGSP PHQTLQANTH ETIGTEHTAT DTFHDCAPET ELLAHLTERV SRMSSGDIRK VLAASRDVSE KPKSLQSNVL QYQVSCHTTN ETSASLVDRG ANGGLAGGDV IVLLKTGRSA NITGINDHTL PNLDIVTAAG CVESQNGPII LIMNQYAHLG KGKTIHSSAQ LEHYRNHVED RSRTVGGNQR IVTLDDYIIP LHIRQGLPYM DMRRPTDAEL ASLPHVVLTS DVDWDPSVLD NEIDLATSWY DGIHDLPQPP YVEPRFDHTG QYLHRHISLC DYRDDAIARI MQCQQHHVTR NVHDYEALRP CFGWVSADTV RKTIMATTQH AREVYHAPLR KHFKSRFPAL NVHRRNEPVA TDTIWSDTPA VDNGAKFAQL FVGRRSLVTD AYPMKTDKEF VNTLEDHIRF RGAMDKLISD RAQVEISKKV TDITRAYNID QWQSEPNHQH QNFAERRIAT IEANTNNILN LTGAPDNTWL LCVTYVCYVF NHLAHESLDH RTPLEVLTGS TPDISVLLQF HFWEPVYYRI EDATFPSGGT EQQGHFVGIA DSVGDALTYK ILNDRTNRIL YRSSVRSVAI SGQTNLRLAS QDGENGPKPI NFIKSRRTEN QNSYAIKELP GFTPDDLIGR TAFSGTNHP
|
| |
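The sequences above are printed in space-separated blocks of ten characters. A rough sketch of how a GC fraction could be computed from such a string follows. The helper name `gc_fraction` is hypothetical, and the record does not say how its own GC content figure was derived, so this is only one plausible way to do it:

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (such as the 10-base block separators) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input, not taken from the record: 4 of 8 bases are G or C.
print(gc_fraction("ATGC GCAT"))  # 0.5
```

To apply it to this record, the full Gene sequence field would be passed in as a single string.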