Gene Information |
Locus tag | PHATRDRAFT_46566 |
Symbol | |
ID | 7201706 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 739193 |
End bp | 745963 |
Gene Length | 6771 bp |
Protein Length | 2173 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180888 |
Protein GI | 219120293 |
COG category | |
COG ID | |
TIGRFAM ID | |
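The reported gene length is consistent with the start and end coordinates if both are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal sketch of the arithmetic, using a hypothetical helper named `feature_length`:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above (Start bp / End bp).
gene_length = feature_length(739193, 745963)
print(gene_length)  # 6771, matching the "Gene Length" field
```

Under a 0-based, half-open convention the same coordinates would give 6770 bp, so the match with the 6771 bp field supports the inclusive reading.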
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTTGCCGTG GTTGCTTCTC GGTTGCTGAA CTGACTGTGA AATCCAACGA GTTCCACCGA GTAGAATCGA TAGACCCTCA AGGCCTTGCA CTGCAGTGCC ATACACTGGT AAAAAAGGTG AGCGGAGTAT TCATTCGAAC GATTGGACAT GTCGTGGTTT AGTAGCCGTC GCGCCAAGCC AGCCAAAGAG AGCGCGTTCA ATACCGACGA CAAACCACAA AATGCTTTTG GAGGATTCGG CGCCGTGGCG CACTTTGACG ACGGTCCCTT GCGAGACCCT CGGATGGCCA ACGACTTTCT CGAAAGGGGA ACGAAAAACG AAAATATTCA TACGCATTCG GACACCAGTA GCGACGGTGA CGACGCTGGT TCCGAGGCAG AGTCGGGTTC TCTCAGCGAC GATTCCTTCG TAGGAGGTCC GCTCATGTAC GAGTCAACGT GTTCCGACGA TTCTATCACT GTAGACACTG CCGATCCAGA TTCTCAGGAT TGGAATCTAC GAAAAGCCAA TAACTTTCTG CAGGATTTTT ACGACAAGGA GGACCTGGAA CGGGTGTCGC AAGAACAGCA CCGTTTAAGG TCGACTGACG ACGAGAACGA TGCAGAATCG GCATCGTCTT CCGACCAGTC TTCGGAAGAA GGCGAGTCCG ATTCGGAAAC GCTAGAAAAA TCCCACTCTC CTCTGCATAA AACATGTAAA GATGCCGATA TTTGTCACGA AGACCAGGAT CTGGGTACGG CCAAAATAGA CACTGCTACT ACCTTATCAA GCCACGATGC AGAAAAATTT CCGATAGCCC CGGAAAGCTT TGACGAGGGA TATGGCGCGG AGCATGATGG CTATGATGTT GCTCCCAACG AGTCCCAGAT ACATGAATGT TCTGAAACAG AGACGAGTTC GCAGCATGTA GTTGTCCACG ATTTTTCTAC AGCAAGCGCC TTGAGACCGG CTGTAGCTGA AGCAAGGTCT TTCCGCAACC TTGACAACTC ACATAGCGAT GTCAATCTTG ACGCGACAGC TTCTTTCACA ACTAGCCTCC AGGAATCATC CACTGGTAGC GAAACTTTTG ATAGGATGGA TCCGAAAATT GATGCTTCCA AACTGGGCTG CGATAGCGCT AGCTTAGATC ACGAAGCAAC TTTTGGCTCT GGAAACGACC GCGATCCATC GCGGCACATG CGAGGAGACT TTCAGGGGGC ATCGGCTGTT GATAGCGCTT ATGGCAGCTT GAGCGACGGA TCTCACAATA GTAAATGCAA TCGAGGTAAT CATTCCGGGT CTGGCTTGGG CGAAATGTTT GGTGTTGGAA ATGCGCTCGC CGCGAACGGT GGCTTTGGAG TTTCTCGTTG CGATGAAGAC AAAGAAACTT CCTCCAATGA TTCAAGCTCT GATCTCTTGG TCTCTGGGAA TTCACCTGTC CACAATAAGC ATCGACATGA AGCTGACTCG AGTTTAGAAC TTGATGCACT AGACTGTGGC GGTAGTAGAG GCGATATTTA TCCAAATCAA AATGAACTTT TCTCTCTCGA AAGAAGGGAA GCCGAATCCA GAGAAGTAAG CGAGTCAACC ACTCCATTGC AAACTGAACA TTCAGAAAAG AAGGTGGTAT CCACCTTCCA CGAAGAATAT GTTGTCAACA TACACGACGT TCGTGTTGCG CTGGATTCCG AGAATGAATT ACCTTCGAAA GATCCACTAA ACGACCAGGA TAAGCCTGAT AAAGGCAACA CTGATTGCGT ACCTACTGCC ATCGTACAAG ATAGATTCTC GGACAAGCAT TCGTCTGCAG AAATTGAAAG 
TCTTCTGTGC TCTGGTGATG AAAGTACGGG TGATGGTTTG TTGGGAGAGA GATCGTCTTC CCATTCCTCG TCGGAACCTC GAAAGTCATG CGATCAATCA CTACAAGGAG AGAAAGAAAT CTTTATTGAC GCTAGTCGGG AGCACATCTG TTCTGATGGT AACTTTCAAC AAGATAGGCA AGCTTTGTAT ATCACGAATT GCGTTGAATG CTCTGAGGAT GGAGTTCCTT TCTCAGGCCG ATTCCAATCG TGTCTATCTC AATGGAAGAA GCAATCAAGT CCTCCTCAGT CTAATCGGTC AGGACGTGGT CTACGAGCAT TTGATGTACT TGAAGAGATT CCATTGAAAG ATGGTCGTAT TGAGTCGTTT TCCTTGAGAG GTTCCGATGG TCTTCATGAT TCCCCAAATA CTCGTTCAGG ATCGGCTGCA ACCAGTTGCT TCCATCCTTG TCATAGGCGA CAGGAATCAG GTGGTATGGA AAGCGAAGTT AAGCGGTCTT GCGTTTTCCA AGACCATTCA GACAATTGTC ATGATGCCTC CACTTCTCAT CTATCTCGAG ACGAAGACGA AAGTTCGCGA CCAGAGGAGG AAGGTTTCGT TCCAAGAGGA CTCGTGGATG ATGACGACGA CGAATGCAAA TCAGATGACT CAAATGTTGA CTACGTCAAT AGTCCAACGA CTGCTTTTGA GGATGATATT TACGCTAGCC GATCAGGTGA TTTTGAAAGG GAATCATTTA TGTCAGCTGC GCCTGCTGTG GAATCCTCTC GAAGGACAGA TCTTTCTCCA GCAATGGAGA AAAGGAGCAA CTCCTTAAAC GCAGTTGTCG CACCGGATAC TGCACGCCTC CCTGCATTGC AGACTAGATC AGAACAACGT AGTCAGCAAA AAGGCGCCTC CGCAGAAGGT ACAGTTTTGG ATCAAACTTT TGCCGTCAGG CTCCTGTGGG TGGCAAAAGA GAGGCTATCG ATCAATGTTG TGAACACGAA TTGACTTTCT GGCGACCAGA TCCCCCTGAA TCTGACCGTG ATACGAAATC TTCAAAATCT TGTGTTTCGT CGGGACTAGT GAGCACTTCG AATAGTTGTT TGTCAGAACG TTCGCCGGTC TTGGAGGCTG GCTCTGGCTG GAAGCATTCC AGCATCTCAG CTAGAGAAGT TATTTCTGAA AGGAACGAAA CGATTGCTTC GTCTCACCTT ACCAACCACC AAAAAACCGA CCTTTCCTTT AAAAGCTCGT TTGTTGCAGT TGAGGAAGCG AATTGTGAGG CGAGGTCAAC ACGAGAGTCG AAGGCTGAGC AGGATAAACG AAAGCAAAGC CCGGAAGACC ATCCTTCTGA AAATGAAAGC GAAGACGTAT GGTATAGCCA AAGTAAATCT CTGTCCGGCC AACAGAAAGG GTTTTTTGGC TTATTTGTTG AGAAGGATGA GAGCTCATCA CAGAGCGCGG CAGCCTGCAG CACAAAGAAC GAAAATCTTG AGTTAGCAAG AAAGCTACTT GCCAGTGGTT TCGACGACGA TGTCTCAGAC AGCAACAACA ACACAAACGA CTACTCCGAA ATAAATAACT TCCGGGGATT GGTGGGAAGT AAGAAATTAG GAGGTGACTT ACATGTTGAC GACGATCGAT CATCCTGCGA TCAATTTGAA CCACTTTTCC GGAGGAACGA ACACCCCTTG AAAGGAAGGT ACCCGGTGGA TACCAGGCTC GAAAGTAGTT CAAATTCTTC TTCCGGTTCA CAATTGAAAG TCGACCGATT GGAATCACTC AAAACACTGA CTCCAAGGAA TGAATCAAAT GGTAGAGAAC 
CGAAGAAACG TCTGACATTG GCAGACGAAT TAGGCATCGA AGGGTTTGAC GACAGGGAAA CCGCTGAGGT GACCGCACAA GACCTAACCA GCGACAATTT AGAAAGTAGC TGCGCATCTC CTCACGATGA ACACACAGCT GGAGAGACAA AGAGAGTTCT TTTTCGCGAT CAAGACGACC AAGAAGCGAA CTCTTTCGAC TCTACCTCCA CCTCCTCTGA AGATTCCGAT TCCGACTGCG GCGACGAAAA CGATGCTAAA ATACAGTACT CAGCTACCGT TGTAGTAAAA GAGGCACCAC TTCTACGACT TCAAAATTGC ACTTCGGTTA CACGTAGTCA TATCGCCGAA AAGAAGGTCA GCCAGAAACT CAATGGCAAA CGATGGGGCT GGTTTAAATT CGGAAAGAGC AAAGATAGCC AAACATCATC GCACGACTTC GTAGAAAATG AGCAATCGCA TGCAGATCTC GAAGCGGAGG CAACTGCCGA GTCCAAGAGC GACGAAAAGG ATGGGGCAAA ATTGAATGCA AGTTCACCTC CGTTAAAGAA AAATTCTGAA CATGGAACCA ACGACGACGA CAATAGCAAT CTAAGTCAAC CTTTGCACAC CAATGCACCA GAACACCCAT CACAAGCGAG TCAAAGCTCG GTAGCAACTG CAAGTACACG TGGAAGTGCT GATCAAAAAA GTGTAAAACC CGTCGCCGAC TTTTTCAAGC CTGACGATCC AACAACTACT TTCTTTCAAA GCCAAACAGA TTCATCCAAA TCATCGATGC TACACAAACC TACGTGGAAG GACGAAGACA GTGTAGTTTC AAAAATTTCT GCAGCAAAAT CCACGTCCAA TGCAACTAAT ACTAGCGAGG CGACACCGTC TTCTGACGAG CCTGTGATTG CAGATGTGGC TGTGATACAC AATCCCGAGG TAGATGCAAA AGTAATCGGA CAAAAGAGGA AGAAGGAAAG AAAATCGAAG AAGCAGAAGA AAACAGGCCT TTCGGCGCAT TTCGGAAATG CTCGTAAAGT CAATCGGCAC GGGGACGAAG TCAGCGTCGG AACTATGAAT TTCAAGACAA AAGCAATAGA GCATCAGCAT GTTGTATTGT CGCAGACCAA TTTCGACGAT TTTGTTGAGT CGGTATCCCA GCGGCGTCCA TGGGAGACGG CGCTGAAAAG GAATAGAAAG CAACCCGCAA AATCATTAGG AGTGAATTGC GTGATTCCCG AGGGTGATGA AGACGAGGAG GACTTGACAG ATGCAGAAAG TGTCACGGGT CTCCTTCGAA TGGAAAAAGA AATTCCCAGT CTCTTGGAAA CCGGTAGTAG CCAAAACGCA AAGCCTAGCG AAATAGTTTC TATGGTAGAG CTGAGGAAAA TATCATACGC GTCTGATGAC ATATCGGCTT CGAAAGGCAG ACTTCTGAGT TCCGACAATG AAAGTGACTA CGACGAAGAA ACAATGGATG TCTTAGCTGA TATGATGTCG CTGAGCGAGT TAGAAAGCAA GTTGTTGGAA TACGGAGACT GCAAAGAAAT TGACTTTGAA GACATGTGGG ACGAGGTTTC AGCCGATTTG TCAACCGCCG TGGAATTCGA AAGGAAGAAG CGTCAAAGAG ATCGCCGAAC CTTCAAAAAA GCCATGCACC GGCTCAAGAA GAATGAAGAA AAGCACGAAA AGAAGAACAA GCGACTCAGA CTCTTCGAAC AGAGCCTGAG TACGGAGGGG AATCCTAAAA ATGCGAAAGC ATCATTTTCA TCAGAGTTTC TAACTGCTCT CCAACAAGTG TTTGAAGATA GCATATCGTC 
CGAAGATGAA GGCCTGTCTT CTGGCAAAGG CAACTCCTAT ATTGAAGTGG AAGACGGGGC GTCTGCTCTC AAAAATGACA ACAACGTTTC CTCTGAAACG AAAAAATCGA TTTTGCAAAA GTCTCGGCCA GAGAAGAAAT CCAGGAGGGT AGACAATCTT TGCAACACCA TTTATACAGA GGATGAAGAA CGCAATCCCG GTACCGTTGG TGATACCATC TCATCCAGCG GTAGCAGTTT CCTAGAAGCC AGCAGAAAGT CTGGACTGTC TTCGACTGGT ACAAACTGCA CAAGCTTGAA AGCCTCCAGT CTAAAAAGCA AGACCACAAA ATCAAAAGCT TCTCGCACAC ATAAAATCAA TCCAGCCGAA ATTTTCCAAG TAGAGCTCAA GCGACAACAA GCTGCCAAAA TTTTGTCCAT ATCCAACTTG CGGCAGGAAA TGTATGACCG ACGTGGAGTT TCGCCAGATC TACTCAAACG GGAATATGAC CAACACCGGC GCCAGTGCCT GGCCAAACGA AATGAGTCCG ACAAGCAGCT GCAACGTAGC AGTATTGACA ATGCGCAAAA GATTAACCTA CAGGCTCGCA CAGTGGAATT TGGCAAACCA GAATCCGCTT CCGATGTTTT TGTAGACAAG ACCAGGTACC AAAATGAGGG TGAATTCATA GACGCGAGCC GTGGCCACGG TCATCCGCAA CCCCTGCGGT CTCGCTGGGA TTCGAGTGAA TCAGTCAGCG AACTTGATGA TTTGCGGACC GTTATGGAGT CGCCACCGAC CAGCCGCCCC AGTCGCGTCA TAGAGGGTGC ATCGGGATTG GCTAACGGAG CGCTCAACAC AAGCAATGAC ATCTTCGTAT TCGCATCGAA CGCAGCGCAC AAGTCGATGG GCGTGGCGTC GAATGTCACC CATGGAACCG TGGGTGCCGC ATCTAGTGCG TTGCGGGTAG TGTCGACGAG GGCCGAAAAC GCGGCAATTC ATTTGCCCAA TTTCGCAACG TTGTCGGAAG TTGGTGGCAT GCCCTTCCGT GCAAGTTTTG ACGACATGCC ATTGACAACG ATCGGGGAAG TTGAAGACTA CGAAGATGAA AATGAGCAAG GGTTACTGGG AGCTTCCAGC GACGACAAGT GGGACGATGC CACTTTGGGT TCTAAGTCTT ATGTATCGAG GTCGAGCAAA TTCTCTCTCG GCGCAAAGCT CCCCAAAGTG GATCTAAAGA TCCCGAAAGT CGGCCTCAGC GTTGGCGCCG GCCTCAAGAA GATGAAACAG ATATTGCCCA GGATGAGCAA AGATCGACAA GGTTCAAACG GAATGATGAT GGGAGACGAT GGCCACGGAC TTTTGGGTTA G
|
Protein sequence | MSWFSSRRAK PAKESAFNTD DKPQNAFGGF GAVAHFDDGP LRDPRMANDF LERGTKNENI HTHSDTSSDG DDAGSEAESG SLSDDSFVGG PLMYESTCSD DSITVDTADP DSQDWNLRKA NNFLQDFYDK EDLERVSQEQ HRLRSTDDEN DAESASSSDQ SSEEGESDSE TLEKSHSPLH KTCKDADICH EDQDLGTAKI DTATTLSSHD AEKFPIAPES FDEGYGAEHD GYDVAPNESQ IHECSETETS SQHVVVHDFS TASALRPAVA EARSFRNLDN SHSDVNLDAT ASFTTSLQES STGSETFDRM DPKIDASKLG CDSASLDHEA TFGSGNDRDP SRHMRGDFQG ASAVDSAYGS LSDGSHNSKC NRGNHSGSGL GEMFGVGNAL AANGGFGVSR CDEDKETSSN DSSSDLLVSG NSPVHNKHRH EADSSLELDA LDCGGSRGDI YPNQNELFSL ERREAESREV SESTTPLQTE HSEKKVVSTF HEEYVVNIHD VRVALDSENE LPSKDPLNDQ DKPDKGNTDC VPTAIVQDRF SDKHSSAEIE SLLCSGDEST GDGLLGERSS SHSSSEPRKS CDQSLQGEKE IFIDASREHI CSDGNFQQDR QALYITNCVE CSEDGVPFSG RFQSCLSQWK KQSSPPQSNR SGRGLRAFDV LEEIPLKDGR IESFSLRGSD GLHDSPNTRS GSAATSCFHP CHRRQESGGM ESEVKRSCVF QDHSDNCHDA STSHLSRDED ESSRPEEEGF VPRGLVDDDD DECKSDDSNV DYVNSPTTAF EDDIYASRSG DFERESFMSA APAVESSRRT DLSPAMEKRS NSLNAVVAPD TARLPALQTR SEQRSQQKGA SAEDPPESDR DTKSSKSCVS SGLVSTSNSC LSERSPVLEA GSGWKHSSIS AREVISERNE TIASSHLTNH QKTDLSFKSS FVAVEEANCE ARSTRESKAE QDKRKQSPED HPSENESEDV WYSQSKSLSG QQKGFFGLFV EKDESSSQSA AACSTKNENL ELARKLLASG FDDDVSDSNN NTNDYSEINN FRGLVGSKKL GGDLHVDDDR SSCDQFEPLF RRNEHPLKGR YPVDTRLESS SNSSSGSQLK VDRLESLKTL TPRNESNGRE PKKRLTLADE LGIEGFDDRE TAEVTAQDLT SDNLESSCAS PHDEHTAGET KRVLFRDQDD QEANSFDSTS TSSEDSDSDC GDENDAKIQY SATVVVKEAP LLRLQNCTSV TRSHIAEKKV SQKLNGKRWG WFKFGKSKDS QTSSHDFVEN EQSHADLEAE ATAESKSDEK DGAKLNASSP PLKKNSEHGT NDDDNSNLSQ PLHTNAPEHP SQASQSSVAT ASTRGSADQK SVKPVADFFK PDDPTTTFFQ SQTDSSKSSM LHKPTWKDED SVVSKISAAK STSNATNTSE ATPSSDEPVI ADVAVIHNPE VDAKVIGQKR KKERKSKKQK KTGLSAHFGN ARKVNRHGDE VSVGTMNFKT KAIEHQHVVL SQTNFDDFVE SVSQRRPWET ALKRNRKQPA KSLGVNCVIP EGDEDEEDLT DAESVTGLLR MEKEIPSLLE TGSSQNAKPS EIVSMVELRK ISYASDDISA SKGRLLSSDN ESDYDEETMD VLADMMSLSE LESKLLEYGD CKEIDFEDMW DEVSADLSTA VEFERKKRQR DRRTFKKAMH RLKKNEEKHE KKNKRLRLFE QSLSTEGNPK NAKASFSSEF LTALQQVFED SISSEDEGLS SGKGNSYIEV EDGASALKND NNVSSETKKS ILQKSRPEKK SRRVDNLCNT IYTEDEERNP GTVGDTISSS 
GSSFLEASRK SGLSSTGTNC TSLKASSLKS KTTKSKASRT HKINPAEIFQ VELKRQQAAK ILSISNLRQE MYDRRGVSPD LLKREYDQHR RQCLAKRNES DKQLQRSSID NAQKINLQAR TVEFGKPESA SDVFVDKTRY QNEGEFIDAS RGHGHPQPLR SRWDSSESVS ELDDLRTVME SPPTSRPSRV IEGASGLANG ALNTSNDIFV FASNAAHKSM GVASNVTHGT VGAASSALRV VSTRAENAAI HLPNFATLSE VGGMPFRASF DDMPLTTIGE VEDYEDENEQ GLLGASSDDK WDDATLGSKS YVSRSSKFSL GAKLPKVDLK IPKVGLSVGA GLKKMKQILP RMSKDRQGSN GMMMGDDGHG LLG
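Both sequences are displayed in blocks of ten characters separated by spaces, so the whitespace has to be stripped before the text can be used elsewhere. The following is a minimal sketch, assuming the sequence text is copied verbatim from this page. The function names `normalize_sequence` and `gc_percent` are hypothetical and are not part of any database tool.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = normalize_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The first two ten-base blocks of the gene sequence shown above.
first_blocks = "TTTTGCCGTG GTTGCTTCTC"
print(len(normalize_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))               # 50.0
```

Applied to the full gene and protein strings, `len(normalize_sequence(...))` can be compared against the "Gene Length" and "Protein Length" fields, and `gc_percent` against the "GC content" field.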