Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46935 |
Symbol | |
ID | 7204759 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 861823 |
End bp | 866768 |
Gene Length | 4946 bp |
Protein Length | 689 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185798 |
Protein GI | 219121136 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0448198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGAA CTGCGCCCAA TTGTTAGTAA ATCAATCCTC ATGATGGGGT TTCCCGAGCT CACAACCCGA TGAGGGCTCT TCTCCATAGC GTGTTGACTG CGACTTCGAC TGTGACTAAC AACAGTAACT GTAAATTGGC ACCTTCCCCG GGTGCTCACT AACATGTAAC TGCTAAAGTC TGTTCATACG ACCAAAATAG TATCTACAGG CGTTAGCACT TCTGACAATG AAAGGTCTAT GATCTGCTTT GCTTTGAAGT GTCGGGGGAG AGTAACTATT GAACATGATT CGTAGGCGGA GAATTTCGAG CAATGATCCA GGAAAAGACG AGGTGGGCGT CCAAGACACA ATTACCTTTT TCACCACTGA CATCACAGCT CTCAATACTT TGGGCGCTTC GATTTGGACG TATTTGGCTA GAGCCGCTGG GAAATTGCAG GCTACGATTC GTATAGCTAG CTTTCTATTT ATGGGGTACG GCTTTTTTCT CTCGCAAACT CTTTTGTTCA CTTCGGAAGA ATGCGGCATG ACTTATTCCT GGCGCCGTTT TCTTGAGCTG GATATATCCT CCATTCATCC TGTAGGGCGT TCTCCATATC GACTGTACAA ATTCTATGAT CAGCGCGACC CCCGACATGA ACGCTTTTTA CAGCAAGAGA GCGTGACGAC TTCAAGAAAG GCTTCCACGG ACTGGTGCCT AAACGCCGCC TTCCCGACTG CTGTTGTGTA TATTCCAGGT CACGGCGGAA GTTATCAGCA AAGTCGAAGT TTGGGTGCGC ATGGAATACA GCTCACGCGA CAGCGGGATG TGACGCAAAA CTACGTTGTG CAAGCGTTAC AAAAGGGAAT GTGGCATGGA AACGCGACGC AGCTGGAAAA CTTTGTTTAT GACGTGTATG CTTTGGATTT TGCTGAAGAA GGTGGTGGTA TGCATGGAGA TTTTTTGGTG GATCAGAGTC GGTTCGTGTC GAAAGCGATT CATTTTTTGA GCGAAGCATG TGGCTTTTCC AGTATCACAG TTGTCGCCCA CTCCATTGGT GGCATTTCGA TCCGCTTAGC TTTAGTTCGT GATGAAAAGC TGCGCCTTTT GGTTACAAAT GTTATTCTAC TAGGATCACC TCAAGCACGC ACCGTTCTAG CCTGGGATCC CTCTTTGGAA AAAATTCAGA CAGAAATTGT TGAAAATCAC GTAAATGGTA CTGCTTTTGT TGCCATATCA GGCGGCCTAC GCGACGAAAT GATTCCTCCC GCAGCTTGTG AACTCGTTCC TAAAGATAAT AACACCTTGA CACTTTTGGC TGTTGATATC ATGCCTAAGG AGGCGTCAAG CCCTTCGTTT GGAATGGACC ATCGCGCAAT CGTGTGGTGC CACAATGTTT TGGTACCACT GCGGAAAATA ATTTTTGCTC TAGTCAGGTC GGAACGCGAT GGAGAGGCTG CACCAGCAAG AATAGGAGCA GTACAATCGC TGTTTGATCG AAGTAAGACG CAAAACTATA ACACTGCACT TCAACGTATG ATGACGACGT TTCGGGTAAG AATTGCTTTG AGGTCCGTTG TTCTTTTAAG GCGTGGTCCT CTCATTCAAA AGCTGCTTTT ATTCTTAAAA GAAAGTGCAC GGACCAGTCG CCAGTTTAGC CATGGTAACT GGTCTCCTTC ACAATGCCGA ATTGCTACTG GGTTTATTTG CTTACATCTC CCTGTGGAGG TACGTATTCC GCTTTTCGGC AATGCTGCCA ATAACTTTTC CTTTCGGGTG CGGCTTGTTT TGCTGGGTAA CAGCAAAGCT GGATTTGCCT CTTGCTTCAA TTCTCATTTT GGCGTTTTCA GCGGACGCAA TCCGAGCTAC TTTGTTGTGG ACAGCACATC AGTCATCAGC ACTGAAGCCG ACGACGTTCC AAAGCAACGG GATAAGTTGG CGCTGGGCGG TATGCTCCAT CGTTACCTCA GTCAGCATTG TTCATGTCAT CTTTGGCGTC GTCCGTCTCT TACGTCCCAA TGATTTTGCC ATTGAAATGT CAAATTCAAT CAATATTGCA TTGATCGCCT CGATCTATCC ACTGGCTCTC CGACGCATCC ATAAGTTTGC ACAGAAGGTT GGTAGCTCCC GCTTTTCTTT CATTGACCTT GATCTATTGA CGATTGTAGT GGTCCCGTTT TTGGGCGCTG GAGAATTTGC TTATGTGCTG TCTAAAGGCT CTGTGCAAAG GTCAACACTA CCGATGCTAG CAGCGCCTTT CCTCATTCGA TTGGTCTTAA CCTCGAGCGA CCCAAGCATT CCACCGCATT CGTCTCGAAA ACGGTATATC TCAGATGTCA TCCGCACACT TCAGGTATGC ATTCTCTTGG TGGTTGGTCC TAGAGTTCTA CAAACGGGAT CAGGCTTGGC GTATAGTTTT AATTTACCAC TCGGCGGACT GGTGGGTATG ATGATGTGGA CGGATACGTT ATGGTCATTA ACGATTAGCG GACTAGGTTA GTTATTGTCA ATGCAATTGT TTTTGCGTAG CACATCCCTG GATTGTATCC GAATATGTTC CATCCGTATA GTAGTAACCG CAGAATCAAT CACTTCATGG GAACATGGAA GCACTTGGGT CCACTTATCG TACACCTATG GCATCAAATC GTTTCCAAAA GCACCGAGGT CTTGGTTAAA ATCTTTGTCG TACTTTATTA CTGGTGTGCC GTCCTTCATT GGCTGCTATG CTCAGCTGTC GTTTAACTGC CCTCTCATCC AAATGGTCTG GAATTTGTTC TGTATATTCC TGGGAATGAA CTGATTCAGC AAGGTTGAAT GGGTAAAGGG AGCGCCGCTT CGAAAAAATA GATCCCTCGA CACAGTACGG AATAACCCCA TACTGGTTAC TATCGATGAA TCCTACCCAT CCCAGACTGG CAAAAGAAAC GTCCATTACA TATCTTCCGG ATGCAGAAAC AAATTCCTTA TAACCCGCCA CGAATCTCCC ATCCGGGGCA GTCTCAGTCA AAAATGGTAG TAAGGGAAGC GAATATTCAT CGGCCAGCCC CCTCACTTCG TTTCTAGTTG CTTCAAAAAT TCGTTCTTTT ACTCTTGCAA TGTGAAATGA TGGGATGGTG GCTCGATCCG GAGCTCTTGA TGTTGGAACA ACCCGAAGTC TCAGAGATGG ATGTAAAAAA GCCTGTGCAT TTATATGATG CTTCGCCTGC ACTACATCTA TACGGCCCAA AACGCAGGTC TCCTCGTCTT CGTCCCATAT GCCTTTCGTG TTTTCTTCAT CCTTGCCCAT CCAGCTGGCT TCGATTAAGA GGCTCTGTCC CTCCCGTAAC GATACTTTCA ACCCATTCCT TGAAGCCGGA ATAGGAATTG CTTCTGGGCG AGTGAGTGGT TCCATCAAGT GTGCAGGGAA AATTTTGTAT TGAAGCGCAC GTGGACTAAT AACACCGGGT GTGTCCCACA AAGCGTGAGA GTCCGAAGGA AAACATGGCA CGCGAACTGC TTGCAGCGTG GTACCGGGTA GATTCGATCC GGTGACTTTC AAATTCTTGA TCGTGGCTCT CCGTTTTACA GCGAATCGAT TTTGTCCCTT TAAATACACC GATTCAGCAA TTAAAGGTGA CAATGTTTTC ACCAAACTTG ATTTTCCGAC GTTGGCAGTG CCGATGACGA ACACATCTCT ACCTCCTAGC TGCAGGAGTA TGCTTTCAGC CAACCGCACC AATCCGACGC CATTTGTAGC ACTAACATCG AAGACGGATG TAAATCGGAC GCCGGACATT GCCTCAATTC TCCGAGTTAT ATTCATCACA TCACTTTCGC TGCAACGAGG CAACAGATCA ATTTTGTTTA TCACCAATAT CACCGGAATG CTTCCAATAG TTCTACGCAG ATGCTTAACG ACAGTGTGTT CCGGATCAGT GGCATCCACC ACCATTATAC ACATTCCAAA CTTGCGTCGG GCTACAATGA AGCGTAGCTG CTCGCTAAAG ACTTTGGGTT CAATATCGCG CAAGGCATCG TAGGCTCCCC AAATATCATT TCTTTGTAAC GATTGACAGC GACTACAGAG AAAACTATCC ATTGGGCGTG TCGCATAATC TCCAACATCC ATGTAACGAG TTTTCTTCTG TATTCTTTTG CTTAAAGATG ACGTATGCTC CATGGTATCT TCGCCACCGA CAAGGCGAGT TCCTGTTATG TTCGCTGAGT CTGTATTGTT CGACCTTCTA CCGGATACCT TGGCTGAAAC AACTTGTGTC CCGCAGCCAG AGCAAAGTTT CGGTACAGCA TTTGATATTC GTTTGCCAGC ACTGTGCTGC TGGAGAACTA CCCGGCCTTT CACTCCTCTT GGAGGCGAGC CGCCAGCTTT GTTCGGTTTG CGCGTGGCGA TTTTGCTCGA GGACGAAATG GTTCCGGGAG CACCCGGGTG TTTCTTCCCT TTGGAGCTGC TCGGTTGTTT CTGTACCGGC GACTGCTGTT TCTTTTTACT GCTGCCCTTC TTTTTTGACT TTGGGCTTTT TGCAGCAACG GCAAAACGGC GAGATACTGT TGCTGTGGGG GCTAGTTTGG ACGATAGGAA CGTATACGAC CTTCCTAATT CCGCTAGCAA AAAGGGACCA GCGGAAAGAT GTTGCTGGGC GGACCAAGAA AACGCTGCCG TTGCTGTTTG GTAAAGGTAT TGCTGATTGT TGTAGCGATC TATCGTTGTT ACGACACTGC TCGCTGGGAA CACAGGAGTC GGCTGTTGTA ATTTCATCGC GGTCAAGCCT TTTCGCGGAG AGGCGCCTAG TCGTATCCCT GTTCTAATCG ATCGCAACAT GCTTGCATAA CAGCTACCCT GGAAAAAGGT GATGGGAGTG CAAGGAAAGG ACGCTAGACT GTGTCACTTG AGCTTCTCTA GTTGTGCATG AGGAGAGAAG CCAGATCAAC AAAAGAAACT TGTTTGGTAG TAAGGGTGAT TGAAGG
|
Protein sequence | MSGTAPNLST GVSTSDNERR RISSNDPGKD EVGVQDTITF FTTDITALNT LGASIWTYLA RAAGKLQATI RIASFLFMGY GFFLSQTLLF TSEECGMTYS WRRFLELDIS SIHPVGRSPY RLYKFYDQRD PRHERFLQQE SVTTSRKAST DWCLNAAFPT AVVYIPGHGG SYQQSRSLGA HGIQLTRQRD VTQNYVVQAL QKGMWHGNAT QLENFVYDVY ALDFAEEGGG MHGDFLVDQS RFVSKAIHFL SEACGFSSIT VVAHSIGGIS IRLALVRDEK LRLLVTNVIL LGSPQARTVL AWDPSLEKIQ TEIVENHVNG TAFVAISGGL RDEMIPPAAC ELVPKDNNTL TLLAVDIMPK EASSPSFGMD HRAIVWCHNV LVPLRKIIFA LVRSERDGEA APARIGAVQS LFDRSKTQNY NTALQRMMTT FRKVHGPVAS LAMVTGLLHN AELLLGLFAY ISLWSKAGFA SCFNSHFGVF SGRNPSYFVV DSTSVISTEA DDVPKQRDKL ALGVSIVHVI FGVVRLLRPN DFAIEMSNSI NIALIASIYP LALRRIHKFA QKVGSSRFSF IDLDLLTIVV VPFLGAGEFA YVLSKGSVQR STLPMLAAPF LIRLVLTSSD PSIPPHSSRK RYISDVIRTL QVCILLVVGP RVLQTGSGLA YSFNLPLGGL VGMMMWTDTL WSLTISGLG
|
| |