Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49028 |
Symbol | |
ID | 7195284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 350407 |
End bp | 353656 |
Gene Length | 3250 bp |
Protein Length | 1064 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183596 |
Protein GI | 219126715 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCAAG ATGAGAATGA AATTATCTTA ACGCGCGAAG AGAATATACC GTTGTCTGAT GTCGAAGAAT CGCCGCACTT TATGGAATCC TCGTTAACGA CACAGCGAAA AAAGGCCGAC TTGGCTGCGA CCGCTGCTCT AGACCACAAT GATGGCGACG CTACATCGCT CCAAAGTGCG CAAAACAGAT TATCCATCAA GCTATCACAG AAACCAACTC CCTCGTGGAG ACAGGCTTTA GCAGATACGT TGTTTGAGCG GAATTATGAA AAGAAGAAAC TTTTTAATGC GTATTCCTAC TCCAGCAATA CTTCGACGAG CACCTTCACG CTGATTTCTG GCCAGACCGG CAGTGGCAAA ACGCGCCTTG CACAAACGTT GCGGAAGCCT GTCGAGGATG CGGGCGGTTA TTTCCTGACA GGGAAGTTCG ACCAACTTCG CCGTCCTGTT CCTTATATAG CGTACGGATC CGCGTTTACA GAATTTACAC ATCAGGTTAT TGCCCGTGGA AAGGATACTA CACAATCGAT GCGTGACCGG ATCAATATCG CAGTAGGCAA CGAAGCACAC GTCTTGACAA GCGTCATACC TGCTTTGGAA ATTCTGATCG GTAAAAAAGA CGACGAAGAA TCCAGCATGC AACGGGATGA AGCTATACAA CGTTGGTTGT TTACTTTTCG ACGTTTTACG AGAGCGGTAT GTTCGCCTGA AGAACCTGTG GTTCTTTTGC TAGATGATCT TCACTTTGCC GACAAGTGCT CGCTCGACAT TCTTGCCTTT ATGGTGGCAG ACTCTGAAAA TCGAGGCTTG GTGGTCATTG GAACGTGCGA CGACAGCGAG ATTACGGCCG AGAGCTACTT GTCTACAAAG CTTCGAGAAA TGGAAGATAA ATCCCAAGCC GAAATAACGA ACATTGCGCT CGGCAACATG GGCAAAGAAG CCGTAAATCA TGTGATTTCG GAAATAATGG GATCGAAGGA CAGCAAAGCA GTGACCGAGT TTGGAAATCT TGTTTGGCGT CAAACAGAAG GAAATGTTTT TTATCTCATC GAATTGCTAC GATGGCTTCA CAACTCAGAC CTTCTCTATT TTGACGACAA GTCCGCTGAA TGGGTGTGGG ATGCAGAGGA AATCGATATA ACACTATCGG ATCGGAAACT CAATGGCTTC TTGTTCGATA GGCTACAGCA GCTTCCACGA CATTTAAAAG ACGTCCTTAA AGTCGCGTCT TGCCTGGGTC CGTATCCTGA TGCTGTTCTT ATAGAGCACG TCCTTGACCT TTCGGTTGGC CATTTGTTGG AAGAAGCTAA GGAGCTTGGT ATCTTAAGTT ATGATGAACA TACTAGAAGT TACATGTTCG AAAACGATTG TGCACAGCAA GCTGCGTACG AACTGATTCC TGTACACGAA AAAGAACTTT TCCATTTGGA AGTGGGACGT CGCTTGTGGC GAAAGCTTTC AAGTAAAGAC GAGCTCGACA GAAATATCTT TCAAATCCTG TCTCAAATGC AATTCGGTCG TCGATTGATA GCAAGAGACA ACGAGAAAAT AAGAGTGGCA TCGCTCTGTC ATGTCGCTGG ACTAAAAGCT GCAAGATGCT CTACTTTTCG AGTCGCAAAT GCCTTCCTTT CGTTTGGTGT CAGCTTGTTA AGCTCACGAA GCTGGAGAGA CAATTACGAC CTATCACTAG CACTTTACAA TGCAACTGCT GAAACAGAGA TGTGCCTCGC AAACTTCGAG GCCATGGAAA ATCTGCTGAA GGACATATTT ATCCATGCTC GTTCGTTTCG AGACAAGCTT CCGGCATACA GCACACAAAT GTATGCTTTG AATGTACGGG ACCGGCAGCG CGAATCATTG GATCTTGGGG TCGAAGTTCT CAGAGGACTT GGTGAAAAGT TTCCTCGTCG TTACTGCAAA GCGAGGCTGT TATCGGAATT GCGAAGTGTC AAGATTTTGC TTCGTGGCAA GAGCGACGAA CAATTGTTGC GCTTGCCTGC GATACAAAGG AATGACAAGC TCCAAGCCCT GCAAGTTCTA CAGCTTATGG TCCTTACTGC TCTTTCCACT CATCCAGACC TTGCTCCGTT TGTTATTTCT CGTATGGTCA AAATCACGCT GGAATACGGT ATGAGTCCTT TCGCTTCCGG CGCCTTTGCA ACATACGCAA TGATACGTAT CCCATTGGGA CCGTATGGTA ACGTTGACGA AGCCATTCGA TTCGGAAACC TGGGGATGGC TGTACTCGAG AGGTACAACA TATTAGAATA TGCGCCGCGC GTGTACGCTG CATACTACGG ATGCGTCTGG TGCTGGAAGT TTCCTTTGAA GGATTCAATG GAACCCTTGC TTCGAGCTCA CCGAATTGGT ATTCAAACCG GCGACTCTGA GTTTGCGGTT CTTTGCGCCG ATCTCTACCT CATGAATGCA CTCGAAGGGG GCGTACCACT TGATGCTATC GATCGTGAGT GGACTGGTTT CTTTGATCTC ATGGTGTCGC GGCGACACGA GACAGCCATA GCATTTACTC TTCCTTGGGC TCAAGCGATT CACCACTTTA TGGGGTACAC GGACAATCCA TTGCTTTCTA AGGGTGATTT GGTTGATTAC GACGAGGCTA TGGAGCGTAG CGTTCAACGG CAAGCGTTCA TTCAAGTTGT TTCGATCAGC TGCACTCGTA TGATGGTTTC CTACGTTTTT AATGACTATG ATCAAGCCGC AAGATCGGCT GAGACACTTC CTGATCTGCT AAAAATTCCG CCTAGTTTCG AACGAGTATC AACACTTTTC TATTCTACGT TGACATTTCT CGCCGTTGCA CGGACTGGTA AAAATGTGCG ACGACACGTT GGCAAGGCAA AGGAAGCCAT CAAGACGTTT CGACGTTGGG CGACGGATTC GCCCAAGAAT TGCCTTGACA AGCTCTTTTT ACTGCAAGCT GAGCTTTTTT CCGTCCTCGG AAAACATTCC CGAGCATACG AAAAATACAT TGCCTCAATT GCGTGTGCCA AGGACCAAGG ATTTTTGCTG ACGCACGCCT TAGCCAACGA GCGTGCGGCT CGCCATTTGT ATGGTCTTGG GCGTACCGAC GAAGCTTTTC TGTTCTTTGA AAATGCGTGC AAGTGCTACG GCGAGTGGCA TGGGCATGCT AAAGTCACAC GACTCAAAGC CGAAGTTGAA GAACTTTTTT CTTGACTGGG GAAAGAAGTA GATACTGTAA CATACATTAT CAACATTATT CTACAATTCG
|
Protein sequence | MMQDENEIIL TREENIPLSD VEESPHFMES SLTTQRKKAD LAATAALDHN DGDATSLQSA QNRLSIKLSQ KPTPSWRQAL ADTLFERNYE KKKLFNAYSY SSNTSTSTFT LISGQTGSGK TRLAQTLRKP VEDAGGYFLT GKFDQLRRPV PYIAYGSAFT EFTHQVIARG KDTTQSMRDR INIAVGNEAH VLTSVIPALE ILIGKKDDEE SSMQRDEAIQ RWLFTFRRFT RAVCSPEEPV VLLLDDLHFA DKCSLDILAF MVADSENRGL VVIGTCDDSE ITAESYLSTK LREMEDKSQA EITNIALGNM GKEAVNHVIS EIMGSKDSKA VTEFGNLVWR QTEGNVFYLI ELLRWLHNSD LLYFDDKSAE WVWDAEEIDI TLSDRKLNGF LFDRLQQLPR HLKDVLKVAS CLGPYPDAVL IEHVLDLSVG HLLEEAKELG ILSYDEHTRS YMFENDCAQQ AAYELIPVHE KELFHLEVGR RLWRKLSSKD ELDRNIFQIL SQMQFGRRLI ARDNEKIRVA SLCHVAGLKA ARCSTFRVAN AFLSFGVSLL SSRSWRDNYD LSLALYNATA ETEMCLANFE AMENLLKDIF IHARSFRDKL PAYSTQMYAL NVRDRQRESL DLGVEVLRGL GEKFPRRYCK ARLLSELRSV KILLRGKSDE QLLRLPAIQR NDKLQALQVL QLMVLTALST HPDLAPFVIS RMVKITLEYG MSPFASGAFA TYAMIRIPLG PYGNVDEAIR FGNLGMAVLE RYNILEYAPR VYAAYYGCVW CWKFPLKDSM EPLLRAHRIG IQTGDSEFAV LCADLYLMNA LEGGVPLDAI DREWTGFFDL MVSRRHETAI AFTLPWAQAI HHFMGYTDNP LLSKGDLVDY DEAMERSVQR QAFIQVVSIS CTRMMVSYVF NDYDQAARSA ETLPDLLKIP PSFERVSTLF YSTLTFLAVA RTGKNVRRHV GKAKEAIKTF RRWATDSPKN CLDKLFLLQA ELFSVLGKHS RAYEKYIASI ACAKDQGFLL THALANERAA RHLYGLGRTD EAFLFFENAC KCYGEWHGHA KVTRLKAEVE ELFS
|
| |