Gene Information |
Locus tag | PHATRDRAFT_48799 |
Symbol | |
ID | 7195107 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 266415 |
End bp | 269613 |
Gene Length | 3199 bp |
Protein Length | 892 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183332 |
Protein GI | 219126161 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCTTCACACC AGGTTCGTTT CGATCGATCG GCGACACTAG TAGTGTCCAC GAAATGGACA AGACAGGTGA TGCAAATTTT CCTGCATGCA CAAACAACAA TTACCTAAAT ATCATTTAGC TCGGAAGAAA GAGTCGGTGA AACTTAGATT TTGAGTGAGA CTCTGCTAGT TCAGCATTAA CAACATTGCA TTGCCGAGAG AATTGCCTTC CGATTGCCAG AAACGTCCCA CTAGAGTTCT CAAAATCTCA GGTTGACGAA ACCGGCATGG AGATTTTGAT TGATGTTCCA CCGTATTTGC ATCCATACGT CGAATGGCTG CTGGATTGGA CCAGTTCTAT TTGGTCGTCT CTATCTATGT ATTGGGCGAA TGTGGGCCAC TGTTGGAGTA TGCTCATTAC CAATGCGGAA CGAGTCTTTC AGATCCCCCG GAACGCACTC GTACCACTTT TGGCTTTCAT TTTCTTCTTT TGGCCTATTC TGCTCAGCCT CATCATGACC CTGGTCACTG CCTGGGCTTG GATCTTCTGG ATTGTCACTT CTGTCTCGTT CGGCTTGATC CAACTCGTCT ACGTGTCCTA TCAGTTCATC ATGATCACCT GCGATATTTT TGGACTAAGC TGTTTGAAAA CCTACAGTAT GCTACGCCAA CAGCTTTTGC ATATTGTTGA CAAAACCACC AGTGCGGTGG GAGTAGAAAA CGCCACGCAC CGTCGGGGTG GCAAATCCCG GCGTCGACAG TGGCGTCAAG ACGTGGAACA GGCGCAAACC TACGAACGCT TTTTGCAGAT ACGCATCCAG TCCAAAGAAC ACGCACAGGT GATTTCGAAA CAGACTCTTG TCAAGAAAGC GTCTCTGGAC ACGGCGTTGC CGCCGCCGAT TCCACGGAAT CGATCCTTTT CCGTAGAACA CTCGCCAGCC AAACATGCTC TGAGGCGAAA TCAAAGTTTT GCTTCCGCCG ATGAACGTCG AATCAAGTCG AAAGGATCGT CTTCATTCCA AAATCGCGGT ACTAGTATTG ATGATTTGGA TTCAGTGGTC GTGGATGAAT TGGGCGAGAA ATTATCCGAT TTACTCGTGA GCACAACGCG ACGCTTGCGC GAAGCCCGCC GTTCGGCACA GAACACACCC AATGACGCGA ACGCAGCATC CTTGTGTTAC TTGCTTTCAG GCGTTCTTAA ACGTAATCAT TTACAATTGG ATGATTTATT GATTGAAAAC GCCCGAGCTG TTGCCGAACG GGGTCAATAC GGCCTGACGA ATGAATCGCG GAGTGTGGTC CGGGCCTATT TTCAACAAGT AGAGGAGGGC TTGGACTGGA TTGCGGAAGC GCCTGTTCTA CAAAATTTAT CATCGCACCA GTGCTCAGAA GGTGAGAATG GAAAGGAGGC AAAACATATG CACGAGTCAT CTCGAAGCAG CAGTGCGGAG CTCACGGGCT GGGCCGAAAG TAGCAGTAAA CACAATGACC TTTTGGAACG TGTAACTTTG ATACGGAAGA TGAAACAAAA TATGGGTCGG ACAGCGCTGA TGTTGAGCGG CGGAGGAGCA CAAGCCATGT ACCACCTAGG TATAATCCGA ACTCTGCTCG AATCAAAACT ATACCAAGAT ATAAAGGTGA TTTCGGGAAC GTCGGGAGGT AGCATTATTG CCGCAATGTG TGCTACTAAA ACGCCTGAGG AACTTTATAA CAATATATGC ATTCCAACAG TGGTTGACGA TTTCACCAAA ACAGTAAGCC ATTAATGTGA ATTTTTAAGG CCTGGTGGCT CTGGATGGTA TCCTAACTTT CCTATCTCTC 
ACTCAGGGCG AGCAACGACG AGAGAATATT CGATGGTTTC CTCCGGTTAC AGAAATGGCA GCATATTGGT TGAAGCACAA ACTTCTGGTG GACAGTGCAT ATTTTCGACG TACATGCGAC TTTTACTATA GCGACATGAC TTTCGATGAA GCTTTCGAGC GGACAGGCAA GCACGTTTGT ATCACTGTGT CGGCCAGCAG AGCAAGCGGT GGAACCGCGC AACGCTTACT CTTAAACCAC ATATCCACTC CACATGTAAC TGTAGCAAGT GCGGTTGCTG CTAGCTGCGC GCTTCCCGGA GTCATGGCCC CGGCTAAGCT GCTTGCCAAA AACAGCTCTG GAGTGTTGGA ACCGTTCGAG GTTGATGGTG TTGAGTGGAT TGACGGTTCC GTTCAGGCTG ATCTTCCGTT CCAGCGAATT GCAACTCTAT TTGCAGTATC GTCTTTCATT GTTTCACAGA CAAATTTTCA CGTTTTGCCA TTTCTCAATA AAGAGTATCA TCCGAACCAA AAAAGCTTGT ACTGGCAGCT ATTTCAAACC CTAGAATGGG ACATTCGAAG CCGTGCCCTC AAACTGAGCC GACTTGGACT CTTTCCTCGA CTTTTCGGAC AGGACATCAG CAAGATCTTC AAGCAAAAAT ACTATGGAAA CCTGACAATC GTTCCCCGCT TTACGACAAT GCAAACATTT GGTCTAAAAT CTCTTTCCAA TCCGACAATA AAAGATATGG AGGGGTATCT CAAGTACGGC CAAATTGCTG CATGGCCCTA TCTAAACGCC ATACGCGATA TGATCCGACT AGAAAAAGCT CTGGACGATT GTCTTATGCG CTTGGAAGCA CGAGTTCGAG CGCTGAATCC CGACGTTGAC TGGCTCAACC CTGACGATGT TGAGTCTATA GCAAGTTCGT CAGCTGTGTT TTCCAATTCT CGAGTACGAA TAATAGGACG ACCCCCAATG GTTGATTCCG CAAGGCAGCG GGAAAGTGAT TTAGTTCGAA AACTCGAAGA CGAGAACCAG GTGCTAAAGG AACAAGTACA GCGACTTCGA GCTGAACTGC TGGCACAAGT AGGCACTGAT GAAAATGCCA ACAGCAAATT GGATGAATCA AGCCACTATC CCGTAGCTCA ACGTTATCTA ATACAAAGCT CAGAAGGACG CCAACCATCA CTGAAGAATG AGCAAGAAGT ATTGACTCCA AGAGGAGCTT TGATCTGACT TCCCTGGCAA TATCTCCATT GTCTAGTTTG TTACTCAACC GTTTGCAAGA TCGACTTACT GTTAGTACCT CATCTGCCAC TGTGTCTGAT GTCTACATCA GCATCGGTCT TTATGGCTTT CGATTTGTAA TCGAAAAATA GCTTTCAAGT TCGCCAGAAG CTTACTGTTA GATCTCATT
|
Protein sequence | MEILIDVPPY LHPYVEWLLD WTSSIWSSLS MYWANVGHCW SMLITNAERV FQIPRNALVP LLAFIFFFWP ILLSLIMTLV TAWAWIFWIV TSVSFGLIQL VYVSYQFIMI TCDIFGLSCL KTYSMLRQQL LHIVDKTTSA VGVENATHRR GGKSRRRQWR QDVEQAQTYE RFLQIRIQSK EHAQVISKQT LVKKASLDTA LPPPIPRNRS FSVEHSPAKH ALRRNQSFAS ADERRIKSKG SSSFQNRGTS IDDLDSVVVD ELGEKLSDLL VSTTRRLREA RRSAQNTPND ANAASLCYLL SGVLKRNHLQ LDDLLIENAR AVAERGQYGL TNESRSVVRA YFQQVEEGLD WIAEAPVLQN LSSHQCSEGE NGKEAKHMHE SSRSSSAELT GWAESSSKHN DLLERVTLIR KMKQNMGRTA LMLSGGGAQA MYHLGIIRTL LESKLYQDIK VISGTSGGSI IAAMCATKTP EELYNNICIP TVVDDFTKTG EQRRENIRWF PPVTEMAAYW LKHKLLVDSA YFRRTCDFYY SDMTFDEAFE RTGKHVCITV SASRASGGTA QRLLLNHIST PHVTVASAVA ASCALPGVMA PAKLLAKNSS GVLEPFEVDG VEWIDGSVQA DLPFQRIATL FAVSSFIVSQ TNFHVLPFLN KEYHPNQKSL YWQLFQTLEW DIRSRALKLS RLGLFPRLFG QDISKIFKQK YYGNLTIVPR FTTMQTFGLK SLSNPTIKDM EGYLKYGQIA AWPYLNAIRD MIRLEKALDD CLMRLEARVR ALNPDVDWLN PDDVESIASS SAVFSNSRVR IIGRPPMVDS ARQRESDLVR KLEDENQVLK EQVQRLRAEL LAQVGTDENA NSKLDESSHY PVAQRYLIQS SEGRQPSLKN EQEVLTPRGA LI