Gene Information |
Locus tag | PHATRDRAFT_50157 |
Symbol | |
ID | 7198941 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 216186 |
End bp | 221538 |
Gene Length | 5353 bp |
Protein Length | 1494 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184992 |
Protein GI | 219129641 |
COG category | |
COG ID | |
TIGRFAM ID | |
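The Gene Length value above is consistent with reading Start bp and End bp as 1-based, inclusive coordinates (an assumption, since the record does not state its coordinate convention). The sketch below checks that arithmetic and shows one common way to compute a GC percentage from a spaced sequence string such as the one in the Sequence section. The helper names `feature_length` and `gc_percent` are hypothetical and not part of any database API. The 5353 bp span is longer than three times the 1494 aa protein length, which fits the record's statement that the gene is spliced.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates copied from the Gene Information table.
gene_length = feature_length(216186, 221538)
print(gene_length)  # 5353, matching the Gene Length row
```

To compare against the GC content row, the full gene sequence can be passed to `gc_percent` as a single string; the spaces between 10-base groups are ignored.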
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATCGAATC AGCTTTCTAG CGGGTTGGCT CATTGAAAGT CCGTTCGAGA GTGAAACTAT CTCGCATTAT ATTTGCCAGT GTGTAGCTTT TGTCCTTCCT TTGTCGAGCC CAGCCCACGA ACTGTTCGTC CGGTTTCGAA AATTCGAAAC CAAAACAACC TAAAGACATT TGTATCCGAA CCTGAACACC ATGTCGTCCT ACTTTCGTCA ACAAGCTCCG GCTCCACTCC GCACCGTGCA TCGCGACTAC GAGTATACGT CAGCAATCAA CGGAACAACT CCCACACCGA CGTCGGCTTC GGCGCATGGT GTAGTCCAGA ACCAGTTTTA CCGCACCAGG GCGCCGCCTC CATCGCGGTC TCCCCGGAAC GCTCGTGTCG AACCCGACGT AGACGCCGAG GAAGCCTTGC TAGATCACGC TCATTTACGA GAGCTGCACG AAGAGGCGGA GAAAATGAAA GCACTTGGGA ACAAACACAT GGCAGCACAG GTACGTCCCT CGCAAGCTCC AGATGCCAGT CTTGAGGCGC GCGCCATTCC CAGCACTGGT TGCACTCGCT CACGTCCCTA CTCTTTATTT TTCTTCCGTT GTTGACAGGA ATATGCCAGA GCCTACAATG CCTATTCGGC GGCGCTTCAG CTCGCTCCGG TCGGACCCTC TTCCCACGTG TTTTTGTCGA ATCGAGCCGC CGCCTTATTG AGTATGAAAC GGTACGAGGC CGCGGCCACG GACGCAAAAC GAGCCATAGT CCTGGCTCCA ACTTTCGGCA AGGCCCACGC TCGCCTGGGA CAGGCTCTCT ACTTTTTGAA AGACTACTAG GCCGCGGTCG AAGCCTACGA AGAAGCGGTA GCCTACGAAC CCGACAACGC CACCACGCTC ACGTATTTAG AAAAGGCCAG GGCCAAAGGT GAGCGCTACA ACACTCGAGC CCGCGGCGAC GACGGCTCCG TGGGGGGCGA TGCGTCGACT GCCTACTCCA TACAAAATAG TGTTGCTACG GATCACTACC AAAAGGGCGT CGTCGAGTCT GGTTATAGAG GAATTACCAA TCAATCCGTT TTGAACGCAG CCGTCAAGTC GCCCAGGGAG AGAGCCACCG GGTCGTATCG TGCCAGTCTT TCGCCATCGT ACCAGCAGTA CGACATGAAC GAAGATGACC CTGACTTTGA TGAAGCCCTG CGCATTCAAC AGCGCGCCGC TAAATTCCTC ACCAACAAGG CGTACAGGGC AGCTATTGAA GAATACACGG CGGCGTTGTT CTTGGTTCCT GACGACCCCA ACCTTTCACC AGAATTGCAT TTGGGTCGAG CGCACGCCTT GAACGGATCA CGACGGCACG AATCCGCCAA AAACGACGCC CGTATGGCTA TTCGGCTTAA CCCTCAGCCC GCTGCCTTTT CGACAATGGC CAAGTCACTA TTTTACATGA AAGACTATCG AGGTGCCGTT GAGGCCTTTG AGGAATGCGT CAGGCATCTA CCTGCAGGCG AGACCCTTGG CATGTTTGAC AAAGCGTATT TACAAAAAGC CCAGGCTGCT CTCGATGAAG AAGAATTCAG TTTGCGGATG GCGGGAACTC CAACGCGCCA GCCAAAAACG CCTATTCCCA AACTCCCCCC ACCCCGTTTT GTTCCACGGG AACAAGCCAT GCAGTCATCG CCACAAGTGC CTCCCATGCC CAAACAGTGG CCTCAGCAAT CGTCGCTCGC CCCTTCCACC CTGCGTTGTG GACCGGAACG GCAGGTTTTC TTCTTGTCGG AAGGTCTAGG CATCAAACTG AACCGCGGAC CCGACGGTAT 
TGTACGGGTC TTGTCGGTGA CTTCGAATAC TCCAGCGGCT CCGGTTGCCC GTAGAGGCAT TATTGAAGCA GGTGACGTGG TTCGTGAAGC CGCTGGCGTC GACATTCGTC GGCCTATTAC AAACATTATG TGGGGCGACA CGGTCGCACT CATCAAAATG GCAGCCCGGC CAATTGTGCT CGTCGTTGCG AAAGAAGTCT CCAAAGTGCC TTTGTCGGTA TTGGAAGAAC AAATGAAGGC CTTGTCGCCT TTTGGATCGA CATCAACAAA ATTTGGTGGG AACCACGTTT ACCGTCCGTC GAAATCGAGT GGCGACGAGA CAGTCCGGTA TGTCTTGGAA GAATCCATGG GTACGCCAGT AAGTATGGAG TGTTCCTGTT TCTTGTGCTT TGTGGATACA GTAGGACGAC TCCAGTTTCG CTGACGTCTT TGTTGTTTCA CTCAGAGTAG CTAGCTGCTG GTCTGCCAGG TGAGGATATA GTAGACGAGG AGGGCACGGG TGTAGAGATA ACTGAGGCTG GACCGTTGGA AGAAGAAGAT ATCGAAAGCA ATGATGAGGA AGTTGAATCC GACTCCGCTT CGGACGTAGC TGTTCTAGCC GGCGAACCTG AAAAAGAAGA CGTTAGTGAT ACTGTGGATG CTTTATTGGA CGAATTAGAG AAGATGGAAG TGAGAAAAAG TAACTCGGAT GACGTGGAAG GCTCTGGTAC ACGCCGCATG TCAGCCGCAG ATTATGAGCT GGAAATGCTG TGCACCGAGA TTGAAGCGAC CAACTCGGAA AGGAAGTCAG TCGAATCACC TCCGACAGTG GATGCAATGC CTCTTGATGG CGATGACGTG CCTCCGGAGC GTGTGATTAC TGTCAAGCCA GGCAGATGAG GGGGAAGGAA ATGGGGGAGC TGCCAAGAGC CTATCTTTAC GTGATCGAGA AGAGCAAATG GTTGGAGGGG AGATTCTTTT TGGCTCGGAA GCAAATTTGT CTACCGGGAG CTGGGACAAT TTGCGTTGGA TGTCCTACTC GGGGTCCCGC AAAATACGAT TTTGTCAGAT GATTTATCGC CTTCTCACTC CAGAAAAGAA GAACATGTTT TGGGTGACAT CGGGTAGAGC ATATGAGAAG CGGGGGCTAG CCATTTATGA AGAGCCGCGA TTAATTCTGG TTCTGCGGAG GGTGGTAGAT ATGCAGGAGC TCCGACTACT TCTAGGTCTA CCTGACATCG CCGAAATAGA CAACCCAGAC GTTGCTTTAA CGCGTTATTG GGTTGTCGAA AGTGCTGTGG ACCCTGCGGT CAGCAGGCTA CGTCTGTCTC CTCTCACAAC TCCAACATCA TGGGGAAGCG AACAAGCGGA CACCAGGGAG AAATCCTGTT TTGAACTTTT GTCGCCGGCG GAATCGATCA TGCTCTCGGC CGTACGAGTA CGTGAAGGAA TCAAGAAGAA AGAACGATCT TTCGTTGACA GTGGTGCTTT CCTGGAAACG ACTGCAGTCG AAACCGCTCT CACAAAAGCT CTTTGTGATG CTAACGACCA CGCCGGTAAA ATTGGATCTC TGGATGTAGA CATGACGTGG AAGCACCAGG TTATTTTGGG AACGCTTCAC TCGATTGTCC TCTCCGGAAA TCTTAAAGGA TTGGAGGAGG CAATACAACG ATTGCGAGTT TCCGTGAAAG ATGGTAATGG GTCATCAAAA TTTCTTCCAA CTCGTGTAGT CGACCCGCTT GATGAGAACG GCCGCACTCC TTTGTACTAT GCTTGTACTT GTCGCATGAG CACTGCTGTA GCATGTCTTA TTAACTCTGG GGCAAGGATA 
AACGTCAAGA CAACGTCGGG CGGTATGGCT TTGAGTCACA TTTGTGCATC AAACCTCGAC GATAAGAGTC TTTCGATTGT GCTTTCGGCG ACACGTCCCT CTAGGCTTGA TCCGAACGAG CTTGACACCA TGGGAAGAAC GCCGATGTAT GTAGCCCTCG TCAACGGTCG TTCAGTGGCC GGAACACGAG ATGCCCGGGC TCTGAGTCGA TGTCTTGTCG CTTTGGCAGC ATGGGGTGGT CGGATAATTG TGACCGAAAC GACTTCATTA GCAAACCCGG TGAAAGTGTT AGCATCTGAG TGGCGATCAG AGGACCTTTC TGTACTTCTG GATCATATTG GTTTCCGGTA TCCTCTTCGG AAGCCACAAT CCTCGGATCT ATCACCGATC GCGCTGTCTC TGGGTGCATT CTATAACTTT CCAATACACA GTGCGTTGAT TTCTCTGCAT GGTCAGTTGG AAGCAGTAAC TTGTCGAGAC GAAGCTTCTT CGTATACTGG CGTTCAGCGA ACAATCCGAA CTCTCTTACT GAAAAGTTTC GAGCCCAATG AGCGCTTGGA TTTTTGTCAA TCAACGATGA CCGCTGCTCC TGAGCTGGCA AATTTCGCTG GTTTTACGCC CCTCCAAATT CTCGCGGCTT CCGCTCTGCA GCTGGACGCG GTTGAGGCGC AGATCGACGA CGACATCTAC CTTAGTCTTG TTGCTTTGCT CGCTGAAGTT GGTGAGCTGC TGGTGAAGAA CGGGGCTCGA ATATCTCTTG ATGCGCCATC GTTTAAGAGA ATACGTCGAA ATGCGTCTAC CGAGGGTGTT ACTACTAGCA AGAGTCAAAA GGGCGATTCA GTTGTGGACG TTTATCGCTC ATCTTTGAAA ATTGATTCGA ATAAGAAAAT AACTAAGATG CTGGGAGGCG CAGAAAGACT CTCACGGGCC CGCAAAGAGT TTATGCAGCT AACAGCGGTG AATGCTTCGC CGGATATGAC CGTCAATTTA AACCTTGGTG ATGCTTTGCC TCTGGAAGAT ACCAGTGAAG CTGGTGGTAA TAACGAAAAG TCTTGCGCCA TTTGCTGGGT TGTTTTCGGC GCTCTCATGA ATCGCAAGCA CAAGTGTCGA GTCTCTCGAC GTTATATCTG CGACGAATGT TCCACCAAAC GAATTCTTTG CGATGGTAAG GAATACCGAT TAAGTGACGG TCAATTTGCT TTGGCCAGAG CAGACGCCGA CGAAGTTGCC AACGAGCGTG AAGCTGACTT AAATGCGAGA GCGCGCGATA CGTCCATGGA GAGTCGGGTA CCGTTTGCTC AAGGATCTGA GAGATTGCCG GAGAAGAAGC CTGCCGCCCG AAAGTCTTTA AAACAACTTC GTCTCGAAAG GCTTGAAGCG GAGGGGGAAG CAGATCGTAA TTCGTTGTTT GGGGGAATCA TGGGATCCGC AGCCAAATTA TTTGGTACTG AAGGAGAACC GCAAACGCCG ACTCAATCGG ACGAAGTGAA GGGGTTAAGC GATTCGTTAG GACAAACACG TAACGCGTTG TTGGAACGCG GCGACAAATT AGCGACACTG GACGACAAAT CAGCAAAAAT GGTGGACGCA AGCGCGGACT TTGCTCGAAT GGCGAAAGAG CTTCGCAAAA AGTCGGAAAA ATCATGGTTC GGCTAATGTG TCAGTGGCAA ACGTGTGAAG TATGTGAACT TTCTGTAAGC ATTCTATAGT AGACCATTGC ACATTTATAT AGCTTCGAAA CGT
|
Protein sequence | MSSYFRQQAP APLRTVHRDY EYTSAINGTT PTPTSASAHG VVQNQFYRTR APPPSRSPRN ARVEPDVDAE EALLDHAHLR ELHEEAEKMK ALGNKHMAAQ EYARAYNAYS AALQLAPVGP SSHVFLSNRA AALLSMKRYE AAATDAKRAI AAVEAYEEAV AYEPDNATTL TYLEKARAKG ERYNTRARGD DGSVGGDAST AYSIQNSVAT DHYQKGVVES GYRGITNQSV LNAAVKSPRE RATGSYRASL SPSYQQYDMN EDDPDFDEAL RIQQRAAKFL TNKAYRAAIE EYTAALFLVP DDPNLSPELH LGRAHALNGS RRHESAKNDA RMAIRLNPQP AAFSTMAKSL FYMKDYRGAV EAFEECVRHL PAGETLGMFD KAYLQKAQAA LDEEEFSLRM AGTPTRQPKT PIPKLPPPRF VPREQAMQSS PQVPPMPKQW PQQSSLAPST LRCGPERQVF FLSEGLGIKL NRGPDGIVRV LSVTSNTPAA PVARRGIIEA GDVVREAAGV DIRRPITNIM WGDTVALIKM AARPIVLVVA KEVSKVPLSV LEEQMKALSP FGSTSTKFGG NHVYRPSKSS GDETVRYVLE ESMGTPLAAG LPGEDIVDEE GTGVEITEAG PLEEEDIESN DEEVESDSAS DVAVLAGEPE KEDVSDTVDA LLDELEKMEV RKSNSDDVEG SGTRRMSAAD YELEMLCTEI EATNSERKSV ESPPTVDAMP LDGDDVPPER ADEGEGNGGA AKSLSLRDRE EQMVGGEILF GSEANLSTGS WDNLRWMSYS GSRKIRFCQM IYRLLTPEKK NMFWVTSGRA YEKRGLAIYE EPRLILVLRR VVDMQELRLL LGLPDIAEID NPDVALTRYW VVESAVDPAV SRLRLSPLTT PTSWGSEQAD TREKSCFELL SPAESIMLSA VRVREGIKKK ERSFVDSGAF LETTAVETAL TKALCDANDH AGKIGSLDVD MTWKHQVILG TLHSIVLSGN LKGLEEAIQR LRVSVKDGNG SSKFLPTRVV DPLDENGRTP LYYACTCRMS TAVACLINSG ARINVKTTSG GMALSHICAS NLDDKSLSIV LSATRPSRLD PNELDTMGRT PMYVALVNGR SVAGTRDARA LSRCLVALAA WGGRIIVTET TSLANPVKVL ASEWRSEDLS VLLDHIGFRY PLRKPQSSDL SPIALSLGAF YNFPIHSALI SLHGQLEAVT CRDEASSYTG VQRTIRTLLL KSFEPNERLD FCQSTMTAAP ELANFAGFTP LQILAASALQ LDAVEAQIDD DIYLSLVALL AEVGELLVKN GARISLDAPS FKRIRRNAST EGVTTSKSQK GDSVVDVYRS SLKIDSNKKI TKMLGGAERL SRARKEFMQL TAVNASPDMT VNLNLGDALP LEDTSEAGGN NEKSCAICWV VFGALMNRKH KCRVSRRYIC DECSTKRILC DGKEYRLSDG QFALARADAD EVANERLKRR GKQIVIRCLG ESWDPQPNYL VLKENRKRRL NRTK