Gene Information |
Locus tag | PHATRDRAFT_49642 |
Symbol | |
ID | 7198295 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 284651 |
End bp | 287921 |
Gene Length | 3271 bp |
Protein Length | 705 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184337 |
Protein GI | 219128265 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
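The Gene Length value above is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal Python sketch of the arithmetic follows; the function name `feature_length` is hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
gene_length = feature_length(284651, 287921)
print(gene_length)  # 3271, matching the reported Gene Length
```

Because the gene is marked as spliced, this genomic span is expected to be longer than three times the protein length.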
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATGG ATCGGCGAAG GTGAAACCTC TATGGATGGA ATCCAGAGTG GTATTAGTAC CAGACTCCCT GGACTATATT CTAGCTAGTA GGTAGCATAG CGGACCTGAC TGTGCGCTTT CGACTCCACA GAGTGCCCAT GTTAGTAGTA GGCAACGAAC ACGGACCCCC CCCCCCTCCC CTACCGACGG GTGGGGTTGG GCACAGTGGA TGGGTGGACC GACGAAAAAA AAACGTATGC GCGGACCCCG GGAAGCACCC TCTTCATTAC ATGAGTAGGT GACTGTACTT CTATGGGCTA ATGTTGGCCG TACCAACAAT TTACCTACCT ACCTACCTAC CTACTTGTTG GATTTTACTA TACGACTCTT GTGTCATGGA CTGCGTGATG ACTTGCGAGT CACCACCGTC CAGGCGCCAA CCGACAAGAC GGACAAACGC CACGAGAGAT ACCTGGATAG AATAGAATGG CAGCCGCCGT TCATTCTTCA CAGACACGAA CCAACCACTT GAGGGACGCG TTTTGGGTGG AAACGAGTCC AATAGTGAGA GAATCTACAC ACACCACGTG ATACGACACG CGGAACCATC CAGTGTTGGC CCCCCCCTCA CGGAACATTT TCCAGTCGGC TTGGGAGAAT AAATATCGAA CGCGAGACAC CAAACACCTC CTACCCAAAT ACACCCACCA CCACACAACA AGCCACACGC AACAAACACT TTTCATTCAT ATCATTATCG ATATACTACT TACAGTTAAA ACACAACACA ACAAAAACAC TATTGCCTAT ATTCCTCTGT GTGGTGGTAA CATTGGTATT GACATCATGC GGACGTTCGC CTTACTGTCC ATTCGATCCG CACACGGCCG GATGTTGTGG AGAGGGACAG TCCTACTGAC CGTGATGATA CTCCTGCACC GTCCCGAGCC GACGCACGCC TGGACAACGA CAACAACAAC GACGACGACG AGGTGGCTGC TTTCGTCCCC CACGACGTCT TGGACGAGTA CCACTACACG TCCGACCGCC GCCGAAACAA GAACGGCAAC GACCGGCACA ATCACGACCC GCCGGCACCT GTCCAGCAGT ACCTTGCCGG ATGCCAAACT CCCGAACGAT ACCGTCTCTC CCGACCCTAC TCGCCCGGAG ACCGACGAAA GCAGTCGTCG GCAACAATTA GTCTTCAACC AAGGACTCAA CGATCTCGCG GAATCCTGCA GTAATCCTAA AGTACAGGTC GTGACCAAAG CACAAGCGTG TCAAGACCGC TACACCGAAG CGGTACAGTC CCACACACAT CCGACCGACC TCATTTCCTT CAATACCGTC CTCAAGGCCT GGACCAAGGC CGGAGCTTGT CTGGCGGAAC ACGCACAACA CGGACACATG CTGGACGCGA ACGTGCCCGT CTACACGCCC CGCGATTGTG CGCAACGGGC GCAGGATCTC CTCCAAGCCA GGGTCGCCCA ACAACAGGAC GTCGATACCA TGTCCTACAA CAGCGTCATG GACGGTTGGG CCAAATCACG TGCCGTCGAA GCACCCTTGC GCGTGGAAGA ATTGCTCGCG CAACTACAGC AAGGCAGTCG ACACGGTTTG TATCCCGATA CACTCTCCTA CAACGCCTTG GTCGACGCCT ATGCCTACAG CAACAAACCG GAGCGCATGG ATCGACTCGA ACAGATTTGG CAAGATATGC AGCGCATGGA TCAGCAACAA ACGGATTCGG ACGACGCCGC GTCCGTTCCT CGCGTGCGAC CAACCGTACG GTCCATCAAT TCTATTCTGC ACGCCTACGC 
ACGCCAAGTA CCCGAAGACG CCACCTACGC ACCCAAAGCC CTACAAATCC TGGTCGACAT GAAACGGCAG CACGAAAAGG TCCCCGATCC CGCAGTCCAA CCCGACGTGA TGACCTACAC CACCGTCATG GACGCCTTTG CCCGCGTCGG CAACGTCCAA GCTGCCGAAC AGGCCGACCA ACTCTTTAGC GAACTCCAAT CGCTCTACGA ATCGACCAAG AACGATCGAT TCCGGCCGAG TGTCTACACT TACGTCACTT TACTCATTGC CTGGTCAAGG TCGCACGCTC CCCAAGCCAC TACCCGCGCC TCGGAAATTT TGGAAGCCCT CCTGGCCGAC CCACACGTGA CCCCCAATGC CCGCGCCTTT ACGGCCGTCA TTGCCACCTG GGGACGCAGT AGGGATGTCC GCAAGGCCCC CAAAGCAGTT CAAATTCTGC AACGCATGAA AGCCTTGGCG ACCACCAATC CTGAAGTGGC GCCGAGCCTG TACAGTTACA ATAGTGCGAT GGATTGTTGT GCCCGGGTCC GCGGCGATTC GGTTCAAAAC ACTGCGGCCC TGAAAATGGC CTTTGCTATT TTTCAGTCCC TCAACGCGGA TACGGCCGTC CAAGCGAATC ACGTCACGTT TGGTATATTG CTGAAAAATG CCGGGGCTTT GCTACCGGCC GGTGACGAAC GGAACAAAAT TGCGATTGCC GTGTTGAAAA AAGCCATGGC CGCTGGTCAA GTCGATCCGT CGGTTTTAAT CAACTTCCAA AAGGCGGCCG ATGCGTCGGT TGTATCGGTA ACGTTGGAGC CACTGGCGGC CGGGCAAGGG CATTTGGATT TCAACAAGAT TCCGGCAGCC TGGAATAAAC ATGTGCAAAA GTAAAACAAA GATGGGCTTT GGCGAGGCGG AAACGTTTGA CGAAAGTTGG GAATGAGTTC GTTCGTACGC ATTTCCAAGA GGGGTTGCGC GTTCCTTTCA ATGCTAACGA TTGGCATTCT GGCACCTGCG CATTGCCCGC GGGCGCTGTA GTACGGTATC ACCGATGCAT TCTGTCTGAT CTCGCCGCAA TTCGTTTCAC CCGAGAGGAA TTCGTCGAAC CGTAGGCCAA CGCAGAATAT ATCAAACCAA TATCGGAACG CCGTCTTTTT GATCAATACT ATTAAATTAA TACGCTGGGA TGTATCATGA ATAGGCAATG TAGTTTACTT TCCTCTCCCC ACGAGAAGAG CGCCTTAGTT GCTATTCTGA TCCAATGTCG CTAGCGAACG CATAGAACAC CGCTGACTGT AAACACAATT TGACATGTAG TAGCGCAATC GTGCATCCTT ACTTGAAGTT CGGGAGATTG CGCATCCCGG ACGAGCCTGC TGGGGCACCT GATAAGTGTC CGCCACCTTG TTGTTGTTGT TGTTGTTGCT GCTGCTGCTG CTGTCGCAAA ATCAACTGCA TCAATTGGGC GTGTTCCTGT TGCTGTCGCC G
|
Protein sequence | MGMDRRSIAD LTVRFRLHRV PMLVVGNEHG PPPPPLPTGG VGHSGWVDRR KKNVCADPGK HPLHYMIKTQ HNKNTIAYIP LCGGNIGIDI MRTFALLSIR SAHGRMLWRG TVLLTVMILL HRPEPTHAWT TTTTTTTTRW LLSSPTTSWT STTTRPTAAE TRTATTGTIT TRRHLSSSTL PDAKLPNDTV SPDPTRPETD ESSRRQQLVF NQGLNDLAES CSNPKVQVVT KAQACQDRYT EAVQSHTHPT DLISFNTVLK AWTKAGACLA EHAQHGHMLD ANVPVYTPRD CAQRAQDLLQ ARVAQQQDVD TMSYNSVMDG WAKSRAVEAP LRVEELLAQL QQGSRHGLYP DTLSYNALVD AYAYSNKPER MDRLEQIWQD MQRMDQQQTD SDDAASVPRV RPTVRSINSI LHAYARQVPE DATYAPKALQ ILVDMKRQHE KVPDPAVQPD VMTYTTVMDA FARVGNVQAA EQADQLFSEL QSLYESTKND RFRPSVYTYV TLLIAWSRSH APQATTRASE ILEALLADPH VTPNARAFTA VIATWGRSRD VRKAPKAVQI LQRMKALATT NPEVAPSLYS YNSAMDCCAR VRGDSVQNTA ALKMAFAIFQ SLNADTAVQA NHVTFGILLK NAGALLPAGD ERNKIAIAVL KKAMAAGQVD PSVLINFQKA ADASVVSVTL EPLAAGQGHL DFNKIPAAWN KHVQK
|
| |
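The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that in Python and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the GC content reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGGGGATGG ATCGGCGAAG"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.6
```

Applying `clean_sequence` to the full Gene sequence field, including the line break inside it, should give a string whose length can be checked against the Gene Length field.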