Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49122 |
Symbol | |
ID | 7195203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 659220 |
End bp | 662396 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183649 |
Protein GI | 219126825 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0406753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACGG TCAAGCCGAG CGAAATGAAA AGGCGCGAAG CCGAAAAGAA GCACGTGTAC TCGGCATCCG GAGCACCGCT GGGTACGATT TCTTCCCCCC CACGCACGAC CGTCTCCACG CCGCCCTTGG GATTGAAAAC GACCATTGTC TTCTCCGGTA CCGGTGTCCC GATCGATACG CGTCTCGAAT CACCCCCCGA GCTACCACCG GCCAAAACTG GGACGAGTCC CAACACGCGA CCCCCGTTGC CACGCAAAAG ATCCAAACGG AGTCAGCGGA GTAAAGGCAA AAGCCGAGAC AAGAATACTT CCTCCACACC CACGATTTTG TGTTGGCTGA TTCAGCGAGG CAAGTATGAG GATGCTACGG AACGCTTGCA CGAAACTCCG CAGGAGGCTA GTATTTGGTG GGTGGAACGA CCTCCGGGAG ACGATGCCGC TGTTTCCCGG GCCTTGCCCA TTCATTTGGC GTGTCGGAAA CTCGCGGAAG AAACGGACGA AGCGGCTCGG GCTCGCCTCG GCGACTTTCT TTCCCACCTT CTTTTGATTT ATCCACAAGG GGCAAGAATG CGTGACGACG GACACGTCGA CCGGACTCGG ATCGAGGACA GCCTACTGTT TTGCAACAGT AGTAACAACA GCAACAATAA CAGCATTCGA TCCGCCCGAC GTACAGCGAT TCCCGTCACG GGTCGATTGC CCGTGCACGA TGCCGTGGCG GGGGGTGTCG ACGAAGAAAC ACTCGCCTTA TTCCTCACTG TATATCCGGA ATCGATCTAT TCCGTAGATG AGCGACGATT GTCTCTGTCG GAACTCAATC GACTCGCCAC AAACGATCCG AATATTCAAG GTGTTCTGGA TTTAGGCTAC GAAGACTGGA AGTCGGCCTA CGAATCGTCT CCGCTATCTG GAAGGTCCGC CTCAAAAATG GATGATTTGG TAAGTTGCGC TTCCGAGGCT CGGAACGCGG ATATCAGCGC GTCAACGCCG ATTTGCCTAT TAGTCGACGA AGGCGACGAA CTCTTTCCGG ATAATGTCAG TGCCTTAACC ACACCTGACG AGCTTTTACC TTCCGGTAAC AATGTGTGGG GGATAGACCG ACATGGCGAC AAGATTGAGG ACAAAGACAA AGAGGAAGTC TCTTCCGAAC ACGACGTTGT CGAAGAGGCA AAGCACGAAA CTAAACCAGA ACAGAACTCG CCGAGCTCCG ACCCGGCTCC TATTGTGACC TGGGAACAGA TTGAGGAACG CGCCCTCGCT TTGGAACGAG TACTTGGCGA GATGAAGACC AAAAATTACG ACTTGCACGA GAAGATTCAA GTTTTATCCA AAGACCAAGG GAGGGAGATC ATTTTGCGCG TAGATCGATC TCAGAAAACC GATTTGTACG GAATGGTGGA TGTGTTACAA CACCAGAATT TCGCTCTTGA TCAAAATATT TATAAAACGG AAACATTGCT TCACTACTCA GTTTTTCCGA GCGACGAAGA GTCGGTGGGA CGGCAACGGC GACGAGGAGA AATCGCACGT ATGCTAGGCT GGCTGGATGT TGAAGAAAAT GATAAAAGCA GCACTATCGA GGACAGCAGC GACGAGGAAA AGGTTAATGA CACCACGTTG CAACAAATAT ACAATGAGCT CCACGAATCT TATGATCAGC AAAAAGCTGC CATCAAACAG TTTGGTTTCG TCTTTGAAAA ACTCGGGATC AATCGTTTTG TGGACGAGGA CAATGCATCT GCTGATACGG TGCCCCGGAG TGTCGTTTCC AATCTGACCG TGAACTCCGA CGACTGGAGC TTTGGCTCCG ATCATTGTGA CGATGTCTCG CGTCCAGAGC GGTCTCGGAA TGTCTTGCGC GAAGTAGAGA TTGAATGGCC TGAAGACAGC GTAGATGCAT GCCAAGTGAG TGACAATTTA AGTACCATTT TCCGCCACGC TGCCGCCATC ACCGAAGAGA ATGAAGATTG CCTGATGCCA ACATCCTCTG GGGGGCACAA TTTTGGAGTA GATAATCTAA GCCAAATCCT ACGGTCGGCA GCTGCCAAGG AAACGAGGCG GAAAAAGGTC AAGCGTATGA AACCTGTCAG TCAAGAATTG ACCATTCCGG CGCTCATGCC TGAAGGTGGT GCAAAGATGT TACCAGCAAT AGCGGGCAGC CTGCAACACA CCAAGTCTAT TGGCTCTTTT CGATCGGCAT CCAAATCCTC CTTGAAACAG AGCTTTTTTA CATCAACTAC CAAGATTTAC GAGGTGCCTT CTGTCTTGCC GGAGGCTCCT GTTAAAAGTC CATCGTTAAA GTCTACGATT TCGGCCGAAC GCTCCTGTCA TGTAGCGTCG GCGTGTATAT CAGACCCAAT CCGGCCTCGT AGCATCAAGT CCAGCGAGGA TCCGAAAAGA AGTGACAGTG AAAGTAAAGG TAGCGGTGGT GGATTCCGGC GGCAACAGCG GCGCCCGAGC TTGATGGATG GTGTGACGGC CTTGGCCGAA AAGGGGCGAC TCCATACACG CGAGCACAAT GATCAACAAT CTCAATCTCC TTCGCATTTG CAGTCCGACG TTACAGATCT CCTGACTCCC GATGGTGACA AAAACGGCAC GGACAATGAC CATTGTGAGC ACGACGAATT GAAGCTGTCC GACAGAAAAC CTCCTATTGT GTCGTTTAAC ACTGTCTCTA TACGAGTGTA CGATCGTATT CTCAGCGATA ACCCGGCAGC TGCTAGCGGC CCGAGTCTTG GTATTGGGTG GGTCTTTGTA CCTCAAGATG TCAAATCAGT TGACGACTTT GAGATTTTGC GTGAGCCTAT GCGCGCTCCG GAGAGGTTGC TGCTGACTCG CCAAGAGCGT GAACAGGTCT TTTTTGACCT GGGGTATACC CAGAAGGACG TGGCTGTTAA CGTACGCGAG CTCAACAAAT TGCGATCACA GCGTCGAAGA ACGATTGTGA ACCTTGGTTC CACAAGAGTT GAAGAAACGG TGGAAGTCGC CAAGCGGAAA CTCAAATCAA TCTTGCGACT GAAACGTAGT AGTCCTCTGG AAACATCCGC TAAGAATTTG CTTTCATCTA CTTCCACAAA TTCGACTACA TCGTCCTCGA AAGACAAGGA AAAGCTCACA AGTGCCATCG AACAGAGTAC GAGCAAGCCT TTGCAGCTAG CAATCAACCC AATTTAA
|
Protein sequence | METVKPSEMK RREAEKKHVY SASGAPLGTI SSPPRTTVST PPLGLKTTIV FSGTGVPIDT RLESPPELPP AKTGTSPNTR PPLPRKRSKR SQRSKGKSRD KNTSSTPTIL CWLIQRGKYE DATERLHETP QEASIWWVER PPGDDAAVSR ALPIHLACRK LAEETDEAAR ARLGDFLSHL LLIYPQGARM RDDGHVDRTR IEDSLLFCNS SNNSNNNSIR SARRTAIPVT GRLPVHDAVA GGVDEETLAL FLTVYPESIY SVDERRLSLS ELNRLATNDP NIQGVLDLGY EDWKSAYESS PLSGRSASKM DDLVSCASEA RNADISASTP ICLLVDEGDE LFPDNVSALT TPDELLPSGN NVWGIDRHGD KIEDKDKEEV SSEHDVVEEA KHETKPEQNS PSSDPAPIVT WEQIEERALA LERVLGEMKT KNYDLHEKIQ VLSKDQGREI ILRVDRSQKT DLYGMVDVLQ HQNFALDQNI YKTETLLHYS VFPSDEESVG RQRRRGEIAR MLGWLDVEEN DKSSTIEDSS DEEKVNDTTL QQIYNELHES YDQQKAAIKQ FGFVFEKLGI NRFVDEDNAS ADTVPRSVVS NLTVNSDDWS FGSDHCDDVS RPERSRNVLR EVEIEWPEDS VDACQVSDNL STIFRHAAAI TEENEDCLMP TSSGGHNFGV DNLSQILRSA AAKETRRKKV KRMKPVSQEL TIPALMPEGG AKMLPAIAGS LQHTKSIGSF RSASKSSLKQ SFFTSTTKIY EVPSVLPEAP VKSPSLKSTI SAERSCHVAS ACISDPIRPR SIKSSEDPKR SDSESKGSGG GFRRQQRRPS LMDGVTALAE KGRLHTREHN DQQSQSPSHL QSDVTDLLTP DGDKNGTDND HCEHDELKLS DRKPPIVSFN TVSIRVYDRI LSDNPAAASG PSLGIGWVFV PQDVKSVDDF EILREPMRAP ERLLLTRQER EQVFFDLGYT QKDVAVNVRE LNKLRSQRRR TIVNLGSTRV EETVEVAKRK LKSILRLKRS SPLETSAKNL LSSTSTNSTT SSSKDKEKLT SAIEQSTSKP LQLAINPI
|
| |