Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44935 |
Symbol | |
ID | 7199836 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 681095 |
End bp | 688037 |
Gene Length | 6943 bp |
Protein Length | 2187 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178826 |
Protein GI | 219116062 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACAC TAGAAGATCA AAAAGTTATG GCGGAAGAAG CATTGGCTGG AGAAGCCCCG GTGTCTTCGG TGGGCAAAGA CCAGGAACCG CTGTCGATTT CTTCACCTTT GCCTACCGTG CCGGAAGGAG CTCTCGACGA TCGCCCGTCG GAAACTCCTT CGGCACTATT TGCTTCGCCA GGTCAGAAAA ACTTTGATCC AATAATCAAG AAGCTACGAC GAGGCGATCT GGTGCGCATT TCGGAAGCGT AAGTGAATGA GTTAGCGGGT CCGGTTGGGA CTGATGTACA TTTCTTGTCT CATTTTATCT CCTCTTTCTG GACTATATTT CTAGTGGCAT TCATTCCGAG AGGCGCTTCC GCGATCGCCA AGGCCAACTT GGTCGGGTGA GTGTATGTCA CGATACGGTA AATGAGGGAA CCGTGGATGT GGAATGGATC TTGGGAGGCA AGTCGACCAA GTTGCCTAGA AACGTGCTGC TCTACACCAC AGCGACTCAA GAGGGAATGA CGGACGAGTC GATGGTGGGG CCCTTGGGTT CTCGACAACG CCGGAAACGC AAGCAACCGG ATACTATTGA TCCAAACCCG GCGCTGATGC AAAAAGTCGT CAAAAAGGCA GCCCCTACAA AGAGCAAAGC CACGAAAGTG TTGCCGACAA AGGCGAAAAA AGCAAGCTCG GTGAAGGAAA AGCCAGCGAT ATTACCAAAG ACGGTTGCGT CGAAAAAGAA ATCTAAGTCG GCAGGATCCA AACGCAAGAC GGCTCCCGCT CCGAAAAAAA CTGTCGAAAA GAAGCTGAAA CAGAAGACTT CTGCAACGCA AACTCCGCAA CCGACAACGA ATATTCTCGA GCTGTATGAG AAACACCGGC GAGAGTTTGA ACGCATCGTG GCGCGGCTAG AAAAGGTGGA TCAGTTCGGT TGGTTCTGGG ATCCGGCCCC GGCGGAGTAT GACGAGCAAT ACGACACTGT ACCCGATCCA GCGAACGCTC CTGCGGAAGA AGCGGTCAAT GAGCCGAAAT GGAACAATAT GCGACCTATG CAAGACGAGT GCGGTACAGC TCAGACTTCT TCAAGGAAAG ATTCCAATAC TGTAGTAGAC GCAACACTGA CATCGAAATC GGGGGATGTC ACTGTTGAAA AACCAACGTC ACAAAATTTA GTGTCGTCCA ACGCGACTGT GTCCGAATGT AACAAATCAT CTTCAAGCCT TACTCCCCAA CCCACAGGTC CCATAACGAT CTATCCGTCA CATCCTCCTT ACAATTGGGA GATGGTTAGA CGACGGATGG CAAATGGTCG ATACGTTCTG GATCGCGAAC GGAAAGAAGC AGAGGAACGT GTTGCTCTAC TTCGACCATA CTACAAAGCG ATGGGGAAGA AGACGCCCAA GAGGAAAAGG AGTGGAAACA ATACTCGTGT GGTACATCCT AAGGGGGTTA ACTGGGATCT TTTCAAAGAG GATGTGCTGA GTATGTGTAG GGCCGCAATA GAGAGGGAAG GCGAAGACGA CTCAGAAGCT CGAGGCTCAG TGATGGCGTC GGCGACTAAA GTAAAGGACG CCGTCTGCCA AGCTTTTGAG AGAACAGGCT CTCGGCAAAT GGGGGAAATG GAAATTGCCG ATCTCCGATA CAAGTTTTCC TTAGCTCTTG AGAAAAGCCT CAACGAGGAA GCGGCGATGC AGAGCTGGAG GAAATCACCA TATCCGGAAA GATGCTACGA TCGACTGTCG CATGACGTGG TATGTGCAGG CCTATCCGAG TTGGACGAGC ACATTGCTAC CTACGAGCTT CGGACGAATC TTCCAGACAG GTTTGTAGGA ATTTCGTATC GATACGATGA TACTGGACAG AGCGAAGCCT GGATGAAGTC AGTTGTCGAC GAAACAGAGT CGCTTGACCG AAAGAGGAAA AGTGCAAACG AGGAAAAACA AGCTGCCTTG GCTTTGGCTG GAGACGACGG AGTAACTAGA GCTCAGGTAT ACGCATCAGT GCAGTCGCTG CTTATCGGTG TGCAAGACAA AGTCATGACA GATTTAGGCG TTTTGAAGCA GCCTGAACTA CGGAGTGCCA ATTGGTTCTC ATCCGGGACT AGCAGTGGGC ATTGTCCGAC GTCAACACAA CCGACAAACG ATCAGCATCG TGCAAATGGC GTGTGTCTTG ATGAAAAAAA GTCTCCGGAA GTAGTCGAAC AACCTGTATG GGGAATGGAT TGCTACACGA GGCGAAACAT TGCATCGTGC CTTGAGACGG ATTTTGACCC AGCGACTGCG CTGCACTTCA TCGAAAAGTG GTTACTACCG GCGATTAATG CCTGTCCTAT TGATCTTGCG CACAAAATAT CAAATGCAGC TCGGATACTT GAAGGATTGC CGTTTGAATC GATGGAGGAT GGGGAATATG GAGAGAAAGA GAACATCAAT GATCGCAAAA CTCCGGAGAA ACTGTGGGCG TACTCTCCGC TTGGCAAGGC GCTGCGTGAA AAAATTAAAG TTGCTGCTCC AGTCTGGTTG ACCGCTGCAG CATATCTTTT GCGAAAAGCT TACACCGCTT TAGGTCCTGA CTTTTTCCGA GTACACCCAA AAGGCCATGG GTCAGTTTTA CTGAATTCCA AAGTTGGCGC AAACACGCTT GTAACGTTTT ATCGTGGAGA GGTATATCCA TCTTGGAGAT GGGGTGAAAA AATGGACGCA ATTGAGATCA CACAAAGTAG AAAAGCGCTA AAACCAGCTC TGCCTGATTT TTATAACATG GCCTTGGAGC GACCCCAGAT CGACCCTCGC GGCTACGGTC TCCTCTTTGT CGATGCTTCA AGGAAAGCAG GTCACGGATC TTCTTTGTCC CATAGCTGTG CGCCGACATG CGAGGTTCGA GTTACCGCTG TTAACGGCGA GCTCACCCTT GCGATGACAA CATTGAGAGA GCTAGAAATG GGGGAGGAGC TGACATTTGA TTACAATGCT GTCACCGAGT CTTTAAATGA GTATCGATCT GCGGTCTGTC TCTGCGGATA CGGGAAATGC AGAGGTTCTT TCTTACATTT TGCAACCGCA GATTGCTACC AATTAGTTTT GAATCGAAAT GCCCCTATTG CAACTCGTTT TGCAAACCTT GTAAAAGGAA GTATGAAGCA GGTCATGTCC GACGAGGACA CTCGGGTATT GCACAATCAT GGCTTTCTGA CTGCGGCCTT CGGCGCGATC AGTGTAAACC GCCGCAATTT ATTGGAAGGG GGGCAAAAGG GTGTTTTGGA TACTTTGGAT ATTGTCCCTG TTTGGTTACG AACTTTTGTT GCTGACACCT TGCGCTACAT TGAATACGAG CGCAGGGCCT TGCCAATCGC GTTGATTTGT GATCATGTTT CTTCGGCAAA GCGAAAATCT ACTTTGGAAA CGGCTTCCAG GAGTTCAGGG AAGGCGCCAA CAAAGCCTGA GCCCCCATTT TTTTACTTTT GTCGCTCCGA AGTCGATCTA CTGAAAGCTT TGCTGAAGAA AGATGGATTC CCAGACTCGG TGTCTGGGAT GCAGCTCAAT CATGCCATCA AGAAAGTTGG CTCAAACTAC TGGCAGGGCC TTACGGAAGA GAAGAAGGAG TATTGGAAGA AGCTTGCTGA AGCCGATTTC CAAAGGCGGA AAAAAGTATG GCATAAGTCT CAAACTGTAA GTATTTCCAA AGGTCTAAAA ACATCGGGGA AAACTGAGAA TACTGAATCA AAAACTTTAG ACATGAACGA CCTGCTGCAT GCTTCTGATG TCTCGTTCCC AGACGCGGAT TCAGAGGGTG TCTCTGCAAT GGAGCAGCGC ATTCAGCAAC TTACTCAGGC GCTTAGTCGA ATAGGGAGAG TTCTGGATCG ACACAGAGAG CTCGTTTTAG AAGAAGCTAG CAACGATCCT GTTGAACCAA GTGCCAAGTC TTTATTGGAT GTTGTCCATG CACCACTGAA AGTGTGCTCA GACTTGAAGG TCATCGGGTG GTTGTGGAAT GGTACGAATG GTGTCGTGCC GTCTTTATTT GAGTGTATTG AGAGCGCTCG ATACGCAAGA CCTGAGCTTC TGGAAAAGCT GATATTGGTT CGAACGAAAT ACGGCCGCCT CGACTCTTTC AGTGCCTACG AATTGGATGA TCTGTGTTTG AACAACACGC AAATCATAGA GGGGCGCCGA GAGCTGGCAG AAGCCTTGAT GGAATTCCGG AAAACCATAC TTGATGAATT GAGAATTATG GCCAAGGAAT TTAGAAATAA CAAGATTTGT GTGACTCCGG CCTCGAAGTT GGCACCAGAC CATGTTCTAG ATTACGCTAC ACAGGCAACG TCAGAGTTGC CAGAATTTGA TAAAATACTT TTCTCTGGAA GCAGAGAAAA CAATCAAGAA AGCACGGAGC AGAGCAGAGC TTTGGACGAA GAACCCATTG TTGAGGAGAA TGCAAGCTCC GCTATTGCTA GAACGATGGA GTATTTAGTC ACAGAAGTTG AACGAAGAGG TCAAAAAGCG GAGCGCCAAT CTAGCTTTGA AATCGACAGA AAGCCTCCAA AGATCGACTC ATGCAAGTCA AAGTCAATGA GTAAGACGCA ACAATTTGAG TCCAAAGGAC TTTTCGAAAA TAGTGGCTGG CTTGAGCATT ACAACGAGCG TTTTGTGTTG CATGCCTGTG CTGACTTGCT ACTGTTCTAC GCCCGTACAA AAACATTTTT TGAGATGAAT TCCTACAAAT TATTGGAGTC CACTCCGATT GAGGTCTACG CAAGAGAACT CGGCAATGCT GTGCCGCGAT CTGCAATCGA TGCTAATATT ATAGATCGAA GAGGTGTAGC CTTTAAAGAA ACCGCCTTCG TCAGCTCGGA CGATGGGAGA GTCGCTACGT CTATCCACAC TGTAAAAACC TTTGAGGATG CTACTTCCAG AACGCAGAAA ATTTCGAAGC TTTGCTCACC TGATGATATT GTTGCCGAAG TTACTGTAAC ATACAATGAA GACTACGTTC TGTCCCAGCT TCTGCAATGG TACAACGGTG GTATAGGCTT GAAGCCCGGG ATGCCTGACA TGATTGGCTG TGTTGTATTG CCATCGATTC CGGAATGTTT TTCCAGCGAA ATGCTTGAAA AGTCAAACGT GAAACCGGAT AAGAGGACTA CGTATGAAAC AAAAATTCGC CCCCGTCTGA TCGAGTGGCT ACAAGATCCC TACCAAAGGG GAAATCCATG GCCTGACAGC GTCAAAAAAG CATTCCTCCG ACCTGGATCT TCGTCCGCGC AACTTTTGCT TTTTGGATCC CCTGTCATTG ACTTTCTTGC CACAGGAGAC GAATCCAGTA TCCTAGACGT CTTAGGGGAT TTGAATTCTG ATAACAAGAT TGAGTCAAAG AGCGCTTCTG CCGTTATGTT GTCTTCTGTG GACAAAGGTC GGCCTGCTCA GGCCATTTCA ACCTGGGTAC AATGTGAAAA TCCTGTTTGT TTGAAATGGA GAAAAGTACC ATGGCATGTT GACGTAGACC TGTTACCGGA GAAGTTCTAT TGTAAGGACA ACGCCTGGAA CCCCAATGCT AACTCTTGTA CGAAAGCTGA AGACGATTGG GACAATGCGG ATGCCATGGT TGGTAGAGAC GGCAAGGTCG AAGGCTCACC GATACGTAAA AAGAAGCATT CAGAGCTTTC GCCTATTGAA GAGAGTAGTT TCACAGCTGG TGGTAGGTAT ATGAGAGCTT CATTACGCCA CTATGGGGCC TGCCAATGTG ACTAACTGTC TCTGTTTCTG CTATTTAAGC TCGTTTTGAT ATTTTGCGAA AGGAGAAGTA TGTAGTTGGT CGGGTTGTGA AAGTGGATTT CTCCGGCAAA GTAAAGCGTA TCTTGTTTCA TTTTCTGAAA AAGCATTCCA AGTATGATGA ATGGATCGAA TTTGGCTCGC CCCGTATTTC TTCTCTGCAC TCCAAGATTG CTCCACGTGC AGCAAAGGCC GCCGTCGATG TTTCCTACGT TTCTTCTTTG CGCGAAGTTG GCCAGGATGT GGCGAAGGGA AGAACACCCG CAAAACAAGC GAAAGGCAAG AAACTTAATG CAAAACCGAA AGTTCGTAGA CGAGATGTTC CACATAAGCG GCTAATCCCC TCACACGGGG ATCGTCCCTC GGAAATATGC CAGCAGATCG ATACAGGAAT GAGGTCTGCT AACGATATGA AGTCGCTGGT GGATAGAGTG CAGGAGAAAA TGGAATTGAC AGTGAATGAA AACTTGAAGC AATTGAAAAA AACGAAGAAA GCTGGAGGTA TAGCAGATGT GGAGGTTCAA CCCGAGAGTA TTTTAAGACC TGAAAAGAGA AAGAAGATAG ATTTAACACG CTCGGCATCA AGAACACAAT TCTCCGGTGG GGAGTTTCGG AGTTTGAAGA GTACCTTGGT GGAGAAAGTC AACGGACCTC TGCTGGCGGA TTCGTTGCCT TCTCAAACTC ACTTGACTGT GAGTAAACCG TCCCAAAACG GATCAGGTGG TAGTGATACT TCGGTGACCA AAGACCTGGA TTGGACTACA ACGACCCAGA ACTCTATATC AAACGGATTT CATCGACTTA CTCCCGGTAC TTCAGAGACT CCTGGGGCCA CCGACGGTAC CTGTAAATGT GATGACCAAC CCTTGACAAC AATCATCAGA CCACAGTCCC TTCCTTTTCA GGAAGATACA AGCAGAATGA GCACGGATGC TCAATGTAGC ACCGGTAGAG GAGTAATTGG AACGTAGCTG GCTCAGCCAG CACCTCAAGC TTGAATGCAT AATAGCGATC TCAGTCCTAG ATGAATAGGT GTACAGGCTT CGCTTCACCC TACAAAGGGA GATTTGCAGA AAATCTCACA GTCGTACCCC TGCGGAGTCA TCAGAAATTC GGAATTAGTC AGAACTGATA TTTTTAACGA TCTAAACCCA AATGTTTCGC TAAAATAGTA AAGATGTAAG GGG
|
Protein sequence | MQTLEDQKVM AEEALAGEAP VSSVGKDQEP LSISSPLPTV PEGALDDRPS ETPSALFASP GQKNFDPIIK KLRRGDLVRI SEAGIHSERR FRDRQGQLGR VSVCHDTVNE GTVDVEWILG GKSTKLPRNV LLYTTATQEG MTDESMVGPL GSRQRRKRKQ PDTIDPNPAL MQKVVKKAAP TKSKATKVLP TKAKKASSVK EKPAILPKTV ASKKKSKSAG SKRKTAPAPK KTVEKKLKQK TSATQTPQPT TNILELYEKH RREFERIVAR LEKVDQFGWF WDPAPAEYDE QYDTVPDPAN APAEEAVNEP KWNNMRPMQD ECGTAQTSSR KDSNTVVDAT LTSKSGDVTV EKPTSQNLVS SNATVSECNK SSSSLTPQPT GPITIYPSHP PYNWEMVRRR MANGRYVLDR ERKEAEERVA LLRPYYKAMG KKTPKRKRSG NNTRVVHPKG VNWDLFKEDV LSMCRAAIER EGEDDSEARG SVMASATKVK DAVCQAFERT GSRQMGEMEI ADLRYKFSLA LEKSLNEEAA MQSWRKSPYP ERCYDRLSHD VVCAGLSELD EHIATYELRT NLPDRFVGIS YRYDDTGQSE AWMKSVVDET ESLDRKRKSA NEEKQAALAL AGDDGVTRAQ VYASVQSLLI GVQDKVMTDL GVLKQPELRS ANWFSSGTSS GHCPTSTQPT NDQHRANGVC LDEKKSPEVV EQPVWGMDCY TRRNIASCLE TDFDPATALH FIEKWLLPAI NACPIDLAHK ISNAARILEG LPFESMEDGE YGEKENINDR KTPEKLWAYS PLGKALREKI KVAAPVWLTA AAYLLRKAYT ALGPDFFRVH PKGHGSVLLN SKVGANTLVT FYRGEVYPSW RWGEKMDAIE ITQSRKALKP ALPDFYNMAL ERPQIDPRGY GLLFVDASRK AGHGSSLSHS CAPTCEVRVT AVNGELTLAM TTLRELEMGE ELTFDYNAVT ESLNEYRSAV CLCGYGKCRG SFLHFATADC YQLVLNRNAP IATRFANLVK GSMKQVMSDE DTRVLHNHGF LTAAFGAISV NRRNLLEGGQ KGVLDTLDIV PVWLRTFVAD TLRYIEYERR ALPIALICDH VSSAKRKSTL ETASRSSGKA PTKPEPPFFY FCRSEVDLLK ALLKKDGFPD SVSGMQLNHA IKKVGSNYWQ GLTEEKKEYW KKLAEADFQR RKKVWHKSQT VSISKGLKTS GKTENTESKT LDMNDLLHAS DVSFPDADSE GVSAMEQRIQ QLTQALSRIG RVLDRHRELV LEEASNDPVE PSAKSLLDVV HAPLKVCSDL KVIGWLWNGT NGVVPSLFEC IESARYARPE LLEKLILVRT KYGRLDSFSA YELDDLCLNN TQIIEGRREL AEALMEFRKT ILDELRIMAK EFRNNKICVT PASKLAPDHV LDYATQATSE LPEFDKILFS GSRENNQEST EQSRALDEEP IVEENASSAI ARTMEYLVTE VERRGQKAER QSSFEIDRKP PKIDSCKSKS MSKTQQFESK GLFENSGWLE HYNERFVLHA CADLLLFYAR TKTFFEMNSY KLLESTPIEV YARELGNAVP RSAIDANIID RRGVAFKETA FVSSDDGRVA TSIHTVKTFE DATSRTQKIS KLCSPDDIVA EVTVTYNEDY VLSQLLQWYN GGIGLKPGMP DMIGCVVLPS IPECFSSEML EKSNVKPDKR TTYETKIRPR LIEWLQDPYQ RGNPWPDSVK KAFLRPGSSS AQLLLFGSPV IDFLATGDES SILDVLGDLN SDNKIESKSA SAVMLSSVDK GRPAQAISTW VQCENPVCLK WRKVPWHVDV DLLPEKFYCK DNAWNPNANS CTKAEDDWDN ADAMVGRDGK VEGSPIRKKK HSELSPIEES SFTAGARFDI LRKEKYVVGR VVKVDFSGKV KRILFHFLKK HSKYDEWIEF GSPRISSLHS KIAPRAAKAA VDVSYVSSLR EVGQDVAKGR TPAKQAKGKK LNAKPKVRRR DVPHKRLIPS HGDRPSEICQ QIDTGMRSAN DMKSLVDRVQ EKMELTVNEN LKQLKKTKKA GGIADVEVQP ESILRPEKRK KIDLTRSASR TQFSGGEFRS LKSTLVEKVN GPLLADSLPS QTHLTVSKPS QNGSGGSDTS VTKDLDWTTT TQNSISNGFH RLTPGTSETP GATDGTCKCD DQPLTTIIRP QSLPFQEDTS RMSTDAQCST GRGVIGT
|
| |