Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23497 |
Symbol | |
ID | 7198365 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 312385 |
End bp | 319259 |
Gene Length | 6875 bp |
Protein Length | 1891 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184600 |
Protein GI | 219128816 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGAAAAGAT TCAGTGTGAC TTGCACCGAC TTTCCCTCTG CGCGGTTGCT TCATTCTTGT TTCGGACCAA GCTGCTTCAG CATTCGTGCG AAGCGACATT AAAAAAATCT CTCTCCAGCC TGGGAGTGCC TTTGCCTTTT GGATCACAAA AAACCAAAGA GGCGACCCAA CAACACTTGT TTTTGGAGTT TGACAGTGTT CGTCGGCTCG TCTACCTCAA ACCTTACGCC GTACTCGCGA CATCCTCCGC CGGCAGATTG ATCGCCAACG CTCGCTGTAA CCGTCACTGG AACCGTCACT GCACAAACAA TGCTGGGTTC CATTGCCTGG CGCGCATTGA TGCGCCGGAA TGTGATTTAT CGTCGTCGCA ACTGGATCGG ATCGGTATGT ATGCTATAGC ACCTTCCTCG CGTCGGCATT TACCACTACT TACCATCATC CCCCTTCTCA CCTCGTGCGT TCTTATCCTA TATCAGCTTC TCGAGATTGC CTTGCCCGTC TTCTTTGTCG GTATATTGGT CTTGACCAAA TGGGCCGTCG AAAACTCGGG TGAGAACACC GGAACGGAGG TGATTCCGGA ATTCATTCCG GACGGCAGTC GCGCTCTGAC CCCCTTGACC TTTCAAGATT ACGTCACCGC GATGCAAGCC AAGCGCGTGT GTCAGACAAG CGAGAGTTTC TTTCCCGGCA GCGGGGACGG ACTCTCGATC ACCGGGATGC CCGCGGAAGG CTTCAACTGG CAGGTGCCCT TCGTCAAATG TGACGTACGC TTTTGCGACC AAGACGGACA GGATGCCAGC AACTTTTGTG AGTACGCCAT GGTGGCCGTC GCCGGATCTT CCTCCACGGA TCAAGGCGGC GCCGCCCGGG CCTCCGCCTT TGCCTCCTGG ATGTACGAAC GCTATCCCGC ACTCCGCACG TCGATGCCCT TTTTGTTCGA TTTCGTCCAG GTCTTTGATT CCCCTGAAGA AATGGACACG TACGTGAGGG ATACTCGCTA CGGGGATTCA GAATTTCCCA AAATTGGCAT GGGCATCGTG TTTGAAGGAA ACGCGGCCGA CTCGTATTCG TACTGGTTGC GCCAGGTACG TCAATCGTTA TCGTGACGAT GCAACATGTA TATGTGCTCG TATCCATCTA CCTATATATA TACGTATTTC CGAGCCTTCC TTTACACTAA GCTAACATCT TGTATCTTGT TTCTCACCTT TCTTCAGAAC TCGACTAATT TCAACAATCC CAAAGAAGAA GCCCGTCCTG CGGTCCGTAC TACTCCAGCA ACGGATCAGT TCTTGGCGAA ATTTGCCAAG GAGGACGACG TTTGCGTTCC GGAAGATGGT TCACCGAACC AGGGTCCTTT CCAAGATTCC TGTACCGGAC AGTATGTCTA CAACGGTGTG CTAACCATGC AACGTTTGGT CAATGACTTT ATTTTAGCCG ATTCCGGTGC CGAAGCTCAG GGAATTTTTG TAGCCGAAGC TGGGGTTCAG TACGTGCAGT TTCCCGCCCG ATCTTTCGAA ACAACGGGTT TCTTTGGAGA CATTGCCGGT ATGTAGCGGC TGGTACGGAT AAGGGGTCGT GGACTGAAAG ACCTTTTGCT CACTTTTTGG CTGTTCTTTT TTTCTGCGTG CTCATAGAGG TGATGCCGTT GCAGGTCATT CTAGGCTTTC TCTACCCAGT TGCCTCCATG ATTGGATTCA TTTGCCGTGA AAAGGAACTT CGGCAAAAGG AGCTCATGAA AATGATGAGC GTGACCGAAT CAGACATTGG CTGGTCTTGG TTCATTTCGT TCGCCGTCTT CCATATCGTC ACGGCTACCA TTGTCTCCGC TGTTTCGGGA GCCATGTTTA AGAACTCAAC AGGCTTTTAC CTCTGGATCT TTTGGGTCTT GACGCTCATG TCGACTGTCG TATTCTCAAT GGCAGTTGCG ACGCTGGCGT CTAAAGCGAC ACGCGGAATT TTGATCGGGC TGCTTTTGTT CTTTATTGGT GTTTTCTTTT CAATTTCCAT TGACTACCAA GACGCCAGTA GCGGTTTACT CTCTCTTTTG AGCCTGCACC CAGCCGCCGC CTTTGGGTTT GGTTTGCAAG AAATTGGTAA TTTAGAGGAC CGAGGTGTCG GCGCACAAAG CAGCAGTGTG GGAGAGTCGG ACTACCCTTC AGGTTACACA TTCAACAGCG CGATAAATTC TTTGATTGGA GATATTATTC TCTGGGGATT ACTCACTTTC TACCTCAATC GTGTTATTAA ACCGGATTAT GGCCAGGCGC AACCTTGGCA CTTCCCGTGT ACAGCTCTTT TCAAGTGCTG CGGATTTGGT CAAGGTGACG GTATGGACGA CATGGACCAT GCGCACCATG CGGAAATTGA AGATTCTGTG CCGAACGAGC CTGTCGGTGA CGCACTGCAA CGCCAATCCG AAGGGAAGAA TATTGAAATT CTTGGATTGC GTAAGGATTT TGGAAACAAG ACGGCGGTAG ACAATTTTAG TCTCAGCATG TACAGCGGAC AAATCACGGC GTTGCTGGGA CACAATGGTG CGGGAAAGGT ACGCTGGTGG AAGGGTACTG CAGCTCGGCT CGTTGCAATT ACTTTTGATT CTCACAGAAA TACTTCCTTT CCCAGACGAC GACCATTGGA ATGCTCACCG GTGCTCTCGC GCCGACAGCA GGATCCGCAA CCGTGGCGGG CAGAGATATC CGTCGAGACA TGACCAATAT CCGTAAGGAT ATTGGTATTT GTTTGCAACA CGACTGCCTC TTCCCTATGT TGACCGTACG CGAACACGTA CAATTCTTTG CTCGTTTGAA GGGACAGTAT AAGATAATGT CGAAGGAAGA CGCGGAAGCG CAGATTGACC AAGTCATTCA GGACGTTGCC CTTTCAGAAA AGCGCAACAC TTTCTCGAAA AACCTAAGTG GCGGCATGAA ACGCAAACTA AGCGTAGCAA TAGCCTTCTG TGGAGGAAGC AGCGTTGTGC TCTTGGATGA ACCTACCAGT GGAATGGATC CTTTCTCTCG TCGGTTCACC TGGAATGTCA TCCGTCAATA CCGCCAAGAT CGCTGCATCA TTCTCACAAC TCACTTTATG GTACGTACGC TTTTGGAACG ATAGTAATTA CAAGTTCCCT CTTTCTCGCG TGTCGTTAAC ATTCGTGGTC TTTGCTGCAG GATGAAGCCG ACATTCTTGG AGATCGGATT GCAATCATGT CGGAAGGTCG TCTGCGATGT TGCGGAAGCT CTTTGTTTTT GAAAAAGACG TACGGAGTCG GTTATCAGCT TGTAATTGAG AAGCTCGCCG CCAAAGCTGC AATCAAGAAT GGAGATACTG GTGCGAGCGC AAGCACGATG GATGCTCTTC ACGGTAATGA CGATAAGTTG AAACGAATTG TCACGGATAA TGTTCACGAG GCCTCTTTAT TGAGTAACGT TGGATCTGAG ATGAGTTACC AATTGCCAAT GGGTGCGGCC TCCAAGTTTA CGCCCATGTT TGAAGGACTA GATGAGGAAA TTGACAAGGG CATCATAAGT TCTTATGGAG TCAGCATCAC AACACTGGAT GAAGTGTTTC TCTTGGTGGC TAGAGGAGAG TCTACTGAGA AGGCCGAGCT TGCTTCCTCC AGGCAAATTG GCTCGAATGG TGCCACTCCT TTAGCCGCGG ATGCCGACAA GAGCCAACGC TCTCGAATGG ATTTAGAGAA CGACCGTCTT TTTACGACTC ATGTCAAAGC TTTGTTTCGG AAGCGTGCCG CGAATTTCCG TCGTGACAAA AAGGCGTGGG TGTGCACTAC GATTGTCCCA TGCTTGTTTG TGCTGATTGG ACTCATTATC CTTACTTTTG CTCCCGTCGA TCGAGATTTA CCACCTATTG AGCTGACTCT AGACGATTAC AATGTTGATT TCACGGGAAT GCCGCGGAAT CCTATTGTCT TTAACAACCC TCAGAGTAGC TTTACCTGCC AACCAGGAAG TTGTGCGTAC AGTTTCCCAG TCAGTACGAT TGCTGATACT GATGAGACCT ACTTCTTCTG TGGATATCAA GCTCGTCTAG AGGATGAGAC CACCAATTGT TCAATTACCG AGTCAGATCA AGTAACGAGC ACATTGAATG GCATCAATGG CGCATCTGCA GAAGGAATTG AAGCCAGCAC TGTGTTTGAG GTAAGCATCG ATTGCTGTCA CGGTAAATTT CGATTCTTTC GGTCGCTAAC ATGTTATTTT TTTTTACAAA GGCGTCTTTG AGTTTGTTCA ACTCCAGCAC GGTGTTTCCT GCTTCTCAGT ATGGCGCAAT TTTCTATAAA CATGAAGTGG GTAGCGTCAC AGATTCCAAC ATTGCTTACA ACGAATCAGT TTTTAGTCAA TGTGTTGCGA ACACGGTCAA TTATACAAAT GTGGAGGATT GTGGCAGATT TGGAGGAGTG GGATACATTA TTCAGTACAA CTTCACTGCT CTGCATGTGT CGCCTTTATT CCAGAGTCTA GCCGACCAGG CTTTAGCCAG AGAAGCTTTG AATTCTGATA CTTTTACAAT CCAGACGAAG CTGGCGCCGC TGCCTATTAC TAAACTCGAA GGGAATTTCG GAAAAGCTGA GGATGCTTTT TCCGCATGGT TCCTTGTTGT CTTGAGTTTT CCATTTATAT CTGGAGCTTT CGCAACCTTT GTTGTATCTG AGAGGGAATC GAAAGCGAAG CATTTACAAA CGGTTGCCGG AGTTGAACCG TCGGCCTATT GGATTTCGAC CTTTTTTTGG GACGTGATGA ACTACCAATT TCCACTTTGG ATTACCGTAA TTTTGTTTTT CGCGTTCGGG GTTGACATCC TTACGACAAC CGAGCGTGGC GTGGTTGGCG GAGTCATTGC GATTCTGTTC TTGTATGGAC CTGCATCTGC TGGGTTCACG TACTGCCTCT CCTTTGCTTT TTCATCGCCG AGCTTGTGCA ATGTCTTTAT GATCATCAGT GGGTTTTTGA TTGGCATGGG CGGACCTCTC ACAGCTTTCA TTTTGACCTT GCTTGGCAAC GAAAACCCTG CTGAACCGAA GCAGAATTTG ATTGACGCTG CCAATATCGT TATCTGGGTC CTTCGTTTCA TCCCCGCGTT TAACCTTGGC AAAGGTCTTT TCTATGCTAT CAACATCGAA ACACTCGACT TTTTGGAGAA CGAGCGTGTT GTCGCATGGT CTGAACCTGT CCTTCTCATC GAAGTCATTT TCTTGGCGTT GGAAAGCGTC CTTTACATGC TACTTGCGAT TCAAATTGAC AAGTGGTCCA GCAACCCCCG TGCTGTTTCT ATCTGGCGTA AATTTGTCCG ATTCATCACA TTCCAGTGTT TCTGTGGTCC TAAATCAAAG GATGCAATGG ATATAACTAC CGCAATCCCC GATGACGACG ATGTTCTTGC TGAACAGGAA CGTGTCCTGT CTGGCGGGGC CAATGAGGAT TTGATTGTAA TCAGTAAGCT GACCAAATGC TACGACAATG GAAAGCTGGC TGTCAATAAC ATGTCGCTAG GTATTCCTCC TGGACAATGC TTTGGTCTTT TGGGTATCAA TGGCGCGGGA AAGACGACAA CGATGCAAAT GCTTACTGCT GAATTCCCAC CAACTACTGG GGACGCGACA TTAGCAGGCT TTAGCGTGGC GAACGAACCC GAAAAGACTC GTCGTCGCAT TGGATATTGT CCTCAGTTTG ATGCTCACTT CGATAATATG ACGGGTCGAG AGCATGTTGA ACTGTATGCC GCCATCAAAG GCATCCCATT GGAGTTTGTA AAGGAAGCTG CAGCTACGAA GCTCACAGAA GTTGGTCTGA GTGATAAAGA CAGCGACCGT CTCGCGGCCG GTTACAGTGG TGGCATGAAG CGTCGTCTTT CTTTGGCTTG CGCCATGATT GGTCAACCCC AAGTTGTCTT CCTGGACGAA TGCTCTACAG GAGTTGACCC GGTTGCCCGT CGCGAGATCT GGCAGCTCAT CAGCGACATG GTCACAGGAG CCAACGTTGC CGCTGACGAG AAGACGTCGG TCATTTTGAC GACTCACTCT ATGGAAGAAT GCGAAGCGCT CTGCCCACAG ATCGGAATCA TGGCGAACGG TCGTCTGCGC TGTCTCGGAT CTGCGCAGCA TCTGAAGAAC AAGTTCGGCC AGGGCTTCCA GGTCGAATTA AAGGTCAAGA TTTTGCACAA TGAAGATATT GACTATCGCA AGAACCTCAC CAAAATAGCA GAAAGTAAGG GCGCCCATAT CGACGAGGAA ACGGGAGATG TGAAGGACGA CGTCTTTTTT AACGTGGACG AGTGCCAGAA GACCCTGCAG CTTTTAACGG GCGACACGTA CCTGTCCGAC ATGATCGGGA CTCACAATCC TTCTGGCTAC TTGGTTTACA AGAGTGCCTC GTCCGGACTA GCAACTTTGG AAGAGATTGC TGCCTTTGCC ACGAATGAAC TTCGGATGCG TGACCTCGAT GCGTTTATCA AGGAGCAGTA CCCGCATTCA GTTCTTCGTG AACGTCAGGA TTCCAAAGCT CGTTACGAAG TGCCGTCCCA AGGTATTCGT ATTTCCCAGA TATTTTCGAG TATTGAAGAA AACAAGGAGG TCCTCATGTT GGCGGATTAC GGTGTCAGCC AAACCAGTCT CGAACAGGTC TTCAACATGC ACGCGGCGGA AGCCGAAAAG CTCAAGCAAG GTCGGGACGA CTCTTAAAGA TTTTCTCGAT ACGTTTTAAC GTTTACGCAA TTGCCCCATA CACACGCGCA CGAACGCACA CACGACTTAT CTACAACAGT ACACACGATT TCACGCGGAT AGGTATTTTT ACATCAAGGC AAAAGTAATC AATTCCGACT TTTTAAATGG TTAAAAATTT ACACATTGGT GAACC
|
Protein sequence | MLGSIAWRAL MRRNVIYRRR NWIGSLLEIA LPVFFVGILV LTKWAVENSG ENTGTEVIPE FIPDGSRALT PLTFQDYVTA MQAKRVCQTS ESFFPGSGDG LSITGMPAEG FNWQVPFVKC DVRFCDQDGQ DASNFCEYAM VAVAGSSSTD QGGAARASAF ASWMYERYPA LRTSMPFLFD FVQVFDSPEE MDTYVRDTRY GDSEFPKIGM GIVFEGNAAD SYSYWLRQNS TNFNNPKEEA RPAVRTTPAT DQFLAKFAKE DDVCVPEDGS PNQGPFQDSC TGQYVYNGVL TMQRLVNDFI LADSGAEAQG IFVAEAGVQY VQFPARSFET TGFFGDIAEV MPLQVILGFL YPVASMIGFI CREKELRQKE LMKMMSVTES DIGWSWFISF AVFHIVTATI VSAVSGAMFK NSTGFYLWIF WVLTLMSTVV FSMAVATLAS KATRGILIGL LLFFIGVFFS ISIDYQDASS GLLSLLSLHP AAAFGFGLQE IGNLEDRGVG AQSSSVGESD YPSGYTFNSA INSLIGDIIL WGLLTFYLNR VIKPDYGQAQ PWHFPCTALF KCCGFGQGDG MDDMDHAHHA EIEDSVPNEP VGDALQRQSE GKNIEILGLR KDFGNKTAVD NFSLSMYSGQ ITALLGHNGA GKTTTIGMLT GALAPTAGSA TVAGRDIRRD MTNIRKDIGI CLQHDCLFPM LTVREHVQFF ARLKGQYKIM SKEDAEAQID QVIQDVALSE KRNTFSKNLS GGMKRKLSVA IAFCGGSSVV LLDEPTSGMD PFSRRFTWNV IRQYRQDRCI ILTTHFMDEA DILGDRIAIM SEGRLRCCGS SLFLKKTYGV GYQLVIEKLA AKAAIKNGDT GASASTMDAL HGNDDKLKRI VTDNVHEASL LSNVGSEMSY QLPMGAASKF TPMFEGLDEE IDKGIISSYG VSITTLDEVF LLVARGESTE KAELASSRQI GSNGATPLAA DADKSQRSRM DLENDRLFTT HVKALFRKRA ANFRRDKKAW VCTTIVPCLF VLIGLIILTF APVDRDLPPI ELTLDDYNVD FTGMPRNPIV FNNPQSSFTC QPGSCAYSFP ASLSLFNSST VFPASQYGAI FYKHEVGSVT DSNIAYNESV FSQCVANTVN YTNVEDCGRF GGVGYIIQYN FTALHVSPLF QSLADQALAR EALNSDTFTI QTKLAPLPIT KLEGNFGKAE DAFSAWFLVV LSFPFISGAF ATFVVSERES KAKHLQTVAG VEPSAYWIST FFWDVMNYQF PLWITVILFF AFGVDILTTT ERGVVGGVIA ILFLYGPASA GFTYCLSFAF SSPSLCNVFM IISGFLIGMG GPLTAFILTL LGNENPAEPK QNLIDAANIV IWVLRFIPAF NLGKGLFYAI NIETLDFLEN ERVVAWSEPV LLIEVIFLAL ESVLYMLLAI QIDKWSSNPR AVSIWRKFVR FITFQCFCGP KSKDAMDITT AIPDDDDVLA EQERVLSGGA NEDLIVISKL TKCYDNGKLA VNNMSLGIPP GQCFGLLGIN GAGKTTTMQM LTAEFPPTTG DATLAGFSVA NEPEKTRRRI GYCPQFDAHF DNMTGREHVE LYAAIKGIPL EFVKEAAATK LTEVGLSDKD SDRLAAGYSG GMKRRLSLAC AMIGQPQVVF LDECSTGVDP VARREIWQLI SDMVTGANVA ADEKTSVILT THSMEECEAL CPQIGIMANG RLRCLGSAQH LKNKFGQGFQ VELKVKILHN EDIDYRKNLT KIAESKGAHI DEETGDVKDD VFFNVDECQK TLQLLTGDTY LSDMIGTHNP SGYLVYKSAS SGLATLEEIA AFATNELRMR DLDAFIKEQY PHSVLRERQD SKARYEVPSQ GIRISQIFSS IEENKEVLML ADYGVSQTSL EQVFNMHAAE AEKLKQGRDD S
|
| |