Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50246 |
Symbol | |
ID | 7199020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 91559 |
End bp | 94030 |
Gene Length | 2472 bp |
Protein Length | 823 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185123 |
Protein GI | 219129916 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAAC GCAAGCAGCA TCAACAGTTA CCGAAAATAG CAGATCACGT ATCTGAATCG GAGGATGAGG AAATCGAGGA GGACGAAGCG TTCAATTCAG AAGATGAACG CAAATACGGC GGATTCTTCG AACGAGGTTT AGCACCAGAA TCTTCCAAGA CAGCAACAGT CGATAGTGAC GCCGAATCGG AAGAAGATGA GGATAGCGAC AATATAGCAG ATAGAAACGG GTCGGAGGAA GGCGATGGAG GTCAATATAT GCTTGATATG CTCGATATAC TTGGTGAAGA CAGCTCCAAA AAGAACTCGA GAGAAATCAA AACACCTCAA ATGGCAATCA GTGTCAAAGA ATCCGAGTTT TCTGCATCGG TCTTGCCATC GGCAAACCTT ACGCTAGACT CACTGATGAA GGGATTGGAG GACACCAAAG GATTCGGAGT GGTTCAAAAA ACCATGAAGA AAATTGCGCA GGGCCATGCC ACGGCGGCAC CGGTTGCCCG TGTTGTTTCC GAGCGTGCGC AACGCAAGGT TCACTATGAG CAGCAAACGA AAGAAGTTGA TAAATGGATT GACGCGGTGC AGGAAAATCG ACAGGCAGAG ACTCTAGATT TTCGCCCGAA AGAACGATTA GAGATTTCCC GTGACGTCTT GGTGGACAAG TTTGTGCCGA CGACCGATTT CGAAAAGCAA CTTCACGAAG CGTTACAAGA AGCCGGGCAA CTAGACGAAG AAGATATGCT CAGGGCGGAA GAACGGGCTC TACAGGATGA CCTTGGTGCG AATGAGATTA CCATGGAAGA ATACAAGCAG AGAAGAGGGC AACTCGCCAA GATGCGTGCT CTCATGTTCT ATCACGAACA AAAGCGCCAC CATATGAACA AGATAAAATC GAAGAAATAT CGTCGAATTC GGAAAAAGCA ACGCCTTCGC GGGAAAGAAG GCGAACTAGA AGCCGAAATG GAGGAGAACC CTGATCTTGT CCGAGAGCTT CAGGAGAAAG AAGAAGTTGA CCGAATGAAG GAACGAATGA CGCTCGCTCA CAAAAATACA AGCAAATGGG CGAGGCGGAT CTTGAAGCGA GGCAAAAACG TTGATGTTGA TACTAGACGA GCCTTGTCCG CACAGAATAA ACGCGGAGAC GAACTTTTAA AGAAAATGTA TTCAGGATCA GGCGAGGAAG ACGGAGATGA CTCAGACAGC GAAGATCTCA TCGAAGCGGC TAGGAAAGTT CTGCAAGATA CAGAAGAAGA AGAAGTTGCA GGGTCTTCTA AAGGCAAAGG GCTCCTGAAC TTATCCTTTA TGCAACGGGG AATTGAAAAG CAGCGGGAAA AAGCCAAAGA AGAGGCTCGT CAGCTTTTGC TCGAATTGGA GGCAAACGAG CGTATCGAAA CAAGCGACAA TGATGGTGAC ACTAATATGA ACTCAAAAAA GAAGAAGAGA GTCGCCGGCG CTGCTGAGAT GAAGGCTGTA CTCAAGGAGG GAGCGCTTGT TGTTTCTTCC CTTCAAACTG GCGGTTCAGC TAGTGTAGCC ATGAGTGGTG GCATAGACAT CAATTCTGAC TTCGCAGATC AGAATGAAGC AAAGATGTCA AGCTACGCCA GTGAACATAC TGCGGCCCTC TCATTGGGAA ATTCGCCTAA ATACATTCAG CCAAGGCAAC TTGTGAAGCC GATGGAGAAA AAGGGCTCAA ACACACAGGA TCTCTGCCCA CAACCCGATA ACGAAGTAAA TCCCTGGCTA CTTTTGAAAT CACAGGGAAA CGAAGTCTCA GATACTGCTA GCATGACATC CAGACCGGGA ATAGGTGCCA AACTATCGTT ATCAAATCAA GCGTTGGTGA TTGACCCTGA GAAAGCGGTT TATATGATGG AACAGAAAGG AGACACGGAG CTTTCTGTAA ATAAGATATT CACGAACGAT GTTGTGACCT CGACGGAGAA GAAAATAACT ATGCTCACAC AAGAAGAATT GGTGAGAAAG GCGTTTGCGG CTCCGTCGGA CAAGGAAATT GAAGAAGAAT TTGCAAACGA AAAAGATGCC ATTCAGGACT CTGAAGACCC TACTCGCACA AGAAAGAAAG ATAAGCTTTC GAATACAGTG TCGGGATGGG GTTCTTGGAC TGGGAAGGGA GCCCCTCCAC CTAAGCCTCC GAAAAAGATT CCAAGGCACT TGTTGCCTCC TGAACAGAAG CTTTCGAAAA GAAAACGTGA AGATGCTACG AAGCCAAATG TGATCATCAG CGAAAAGCGG ATAAGGAGAA CCGCCGACAA GTTTATGATA TCACAGATTC CGTATCCGTA CACTTCGCGT GAGGAGTACG AACGAGCCAT GGTTGGGGGG TTAGGAAGGG AGTGGAATGT TACAAGCAGC ATGAAAGACA TGACACGTCC AGAAATCATG ACTCGATCGG GCAAAGTGAT TCAGCCAATT TCGAAGAAAG TGAAGCAAAA ACGCCCAGCT GCAAGATTTT AG
|
Protein sequence | MGKRKQHQQL PKIADHVSES EDEEIEEDEA FNSEDERKYG GFFERGLAPE SSKTATVDSD AESEEDEDSD NIADRNGSEE GDGGQYMLDM LDILGEDSSK KNSREIKTPQ MAISVKESEF SASVLPSANL TLDSLMKGLE DTKGFGVVQK TMKKIAQGHA TAAPVARVVS ERAQRKVHYE QQTKEVDKWI DAVQENRQAE TLDFRPKERL EISRDVLVDK FVPTTDFEKQ LHEALQEAGQ LDEEDMLRAE ERALQDDLGA NEITMEEYKQ RRGQLAKMRA LMFYHEQKRH HMNKIKSKKY RRIRKKQRLR GKEGELEAEM EENPDLVREL QEKEEVDRMK ERMTLAHKNT SKWARRILKR GKNVDVDTRR ALSAQNKRGD ELLKKMYSGS GEEDGDDSDS EDLIEAARKV LQDTEEEEVA GSSKGKGLLN LSFMQRGIEK QREKAKEEAR QLLLELEANE RIETSDNDGD TNMNSKKKKR VAGAAEMKAV LKEGALVVSS LQTGGSASVA MSGGIDINSD FADQNEAKMS SYASEHTAAL SLGNSPKYIQ PRQLVKPMEK KGSNTQDLCP QPDNEVNPWL LLKSQGNEVS DTASMTSRPG IGAKLSLSNQ ALVIDPEKAV YMMEQKGDTE LSVNKIFTND VVTSTEKKIT MLTQEELVRK AFAAPSDKEI EEEFANEKDA IQDSEDPTRT RKKDKLSNTV SGWGSWTGKG APPPKPPKKI PRHLLPPEQK LSKRKREDAT KPNVIISEKR IRRTADKFMI SQIPYPYTSR EEYERAMVGG LGREWNVTSS MKDMTRPEIM TRSGKVIQPI SKKVKQKRPA ARF
|
| |