Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21388 |
Symbol | |
ID | 7202042 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 576403 |
End bp | 583008 |
Gene Length | 6606 bp |
Protein Length | 2189 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181230 |
Protein GI | 219121764 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0338841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAAC GTGAGGAACT CCAAAAGCGC TACGCGTACC ATGCCATGTC CAACAAAGTA GAACAGGCCG ATCGGTCGTC TCGACGGGTG CGGGGCAGCG AAGGCACTGG TGAGGTCGAA ACCTTACGCG GACGGCGGGA TATTGGACGC ATGGGCGATC GGGTGGAGGA GGGACAACCG CCCCCGCCGC CGACGAAAAA AGCCAAACAC GTACCTTCGG GACCCCCGTC CAGAAGGGTC GCTCACGGAA ACGAGACTAT TCTCGATTTG GGCAATCTCA CCGGTTATCA GCCAACGACG GAACAAGCCA AGGCGGCGTA CGAAAGTATT TTGACCATCA TTGGCTCCAA GGCGTTGTTG GGGAATCAGG GACCGCAAGT GTTGCGGGAC GCTGCCGAAG AAGTCCTCGC GACACTCAAG GATCCCAATC TACGGGATCC GGAACGTCAC GAGACAATAT CGGTCCTGCT CACGGGTAAG AGTCCGCGGT TATCCGGGGG ACTTTCCACC GAACACTTCA CGGCCTTGCT GCAGTACGGT AAACAATTGG ATGACTACAA TAAGGACCAA CAGCTGAAGG ATGATGACAA GGTGGACGAC GAAATGGGGG TCGCTGTGGT CTTCGACGAA TCCGAAGACG AAGCCAACAA ACATGACGAC AGTGAGATCG ATCAGAATGT GGTCGTTGAC ACATCATCGT CCTCTTCTGA AGACGAGGAG GAGGGCCCGG AAGCGGAGGA GGATTCCGAG GCGTTCCGAG ACGTGGACGA GGAGATCATA GTCCAGGGCG CCGATGACGT AGGGGCCAAG AAGCAGAGTC GTGCAGGCGA CCGCATATTG TCGGTACATG AGATTGACGC CCACTTTTTG CAGCGTCATT TGGCCAAGCA CGTTGACGAT GCGGACGAAT CGGCCAAGCT GGCGGCACAG GTTATGGGAA TTATCGATTT TCGTACCAAC TCGGACATGC GGGAGTGCGA AAATCGACTG CTCGTACTCC TCGGCTTTGA TCTTTTTGAT ACCATCAAGC TCATTCTCCG TAACCGTGTA CGAGTTTGGG CATGCATTTC CATGAAACGC GCACAGTCGG ACGAGGAACG TAAAGCAATC GAAGAAGCAC TGGCGAACGA ACCGACCGGT GAGGGGAAGT GTGTCTGGGA AGAGCTGCAT TCCAAGGGAA GAGCCGAAAA CTGGACGCGC GATCGTATGA AGGGCCTGGC GGATGCATTC AAGAGTGAAG CTACAGGGGA TCTTACCAAA GCAATCGACT CGGTGGGAGT CAAGCAAGAA GACGACGAGA CTGCAATGCA GGTCGAAGTA AAAGAAGAAG CAAACGAGTT GGATCTTGAC ATTCTAGCAT TCCCAGAGGG AAGCCACACA ATGACTAATA AGAAGTGCGA CCTTCCCGAT ACATCCTGGC GGGCCATGAA GAAAGGTTAC GAAGAAGTCC ATGTTCCTGC GGTCCGGAGT GTCATTCCCA AGGACGAGCG ACTTATTCGT ATCGACGAGC TTCCGTCTTG GACGCATGCC GCTTTTAAAG GAATGGAAAA GCTGAACCGG GTACAGTCCA AGTTGTGTGA CGTAGCGTTG CGTTCCTCGG AAAATTTGCT ACTGTGCGCC CCAACTGGTG CTGGAAAGAC CAATGTGGCG TGTTTGACCA TGATGAACAT CCTGGGCCAG TATCGTCGAG ACCGTCAAGT CGATGACGAC CCCGACGCGA AGGACTCGTT TGACTTGAGC TCTTTCAAGA TTGTGTACGT CGCACCCATG AAAGCGTTGG TGCAAGAAGT TGTCAAGAAT TTTTCCGAAA GGCTTGAGGA TTACGGAGTA ACGGTCAGAG AGCTTTCGGG TGATAGCAGT TTGTCGAGAC AGCAAATTTC TGAAACGCAG CTTCTAGTGA CTACACCTGA GAAGTGGGAC GTTGTTACGC GCCAAGGAGA AGGAAGAGCT TTTACGCAAC TCGTCAAGCT TGTCATTGTG GATGAGATTC ATCTTCTGCA CGACGAACGT GGACCAGTCC TTGAAAGCAT TGTAGCGCGA ATTATTCGCC AAGTGGAAAC TACATCCGAA CCTGTACGCC TTGTAGGCCT TTCGGCAACA CTTCCGAACT ATACCGACGT CGCTACTTTT CTTCGCGTAG ACCACAATAA AGGTCTCTTC TTTTTCGACC ATTCGTATCG GCCCGTTCCT CTACAAATGC AGTACATTGG ATTGACGGAG CGGAATGCAT TCCGGCGATT TCAGTTGCAG AACGAAATCT GCTATGAGAA GGCTATTGAG CAGAGACGCA ATGGAAATCA GATGCTGATT TTTGTTCACA GTCGCGCCGA GACTGGAAAA ACTGCCAAAG CGCTGCGGGA TCTTGCTCTG GAACGGGATG AGTCGACTAA TTTTGTCCGG GAGAAGGGGG CAACACAGGA GATTCTTCGA GAGGAGTCTT CGGCGGTGAA GAACGCCGAT CTTAAAGATG TTTTACCTTA CGGTTTTGCG ATTCATCATG CTGGAATGGC TCGCGAAGAC CGAGAGCTTG TGGAAGATCT TTTCGCGGAC CGTCACATTG CCGTATTGGT TTGTACCGCG ACTCTGGCTT GGGGAGTCAA CTTACCTGCG CATGCTGTCA TTATTAAAGG TACACAGATT TATGATCCGT CCAAGGGCCG CTGGGCCGAA CTAAGCCCAC TCGATGTGTT ACAAATGCTT GGCCGGGCAG GACGCCCCCA GTACGATAGC GAGGGCGAAG GAATTATTCT TACGCAACAT AGCGAATTAC AGTACTACTT GTCGCTCACA AACTTGCAGC TTCCGGTCGA AAGTCAGCTC ATCAAGACTC TTCCAGACCA CTTAAATGCT GAGATTGTGT TAGGAACGAT TCAGACCATC TCCGAAGCTG TCGATTGGCT AGGTTACACG TTTCTTTTCG TCCGTATGCT GCAGAATCCA AATCTCTACG GAATATCAGA GACCTCGTTT CTAGATGACC GCACTCTGAA GAAGCGGCGG CTTGATCTTG CCCACTCTGC CGCGTCGATT CTTGAGAAAA GTCATCTCGT TCGCTACGAC AGGAAAAGCG GCGCGTTGCA AGCAACCCCC TTGGGCCGCA TATCAAGCCA GTTTTACATT TCACACTCGT CTATGGCTGT GTACAGTCGA CACATGAGAT CTAATATGTC TGATATTGAG CTTCTTCGTT TGTTCTCGCT TAGTGGGGAG TTTCATCACA TAACTGTCAG AGAGGAGGAG AAATTGGAGC TCACTAAACT TTCCGGGCGT GTCCCAATTC CAGTAAAAGA AAGTCCAAAC GAAGCTTCGG CCAAAGTTAA TATTCTTCTG CAAGCCTACA TATCGCGGCT TAGACTCGAC GGCTTTGCCC TGGTCGCTGA TATGGCCTTC ATTCAGCAAT CAGCTGCACG TATCATGCGT GCGTTATTTG AAATTGCCCT TCGGAGGAAT TGGTCCTCTC TCGCTAAGCT TTGCCTTGAC ATGTCGAACA TGGTGTCGTA TCGGATTTGG AGAAGTCAGT CTCCTTTGAG ACAATTTAAG AACGTCCCGG AAGTTGTCGC GAGGAAGCTT GAGCGTAAAA GTGATATAGA ATGGGCGAGA TACAACGATC TGACCTCAGC CGATCTCGGG GAGCTCGTCG GCGTTCCTAA AATGGGCCGT GTTTTGCACA AACTTGTTCA GCAGTTTCCT CGTCTCGAAT TGTCGGCTCA GATTCAGCCA CTAACGAGAT CTATGTTACG CATTGAGGTG ACCCTACTGC CATCGTTTAA TTTCGATGTG ACGATACACG GATATGTGCA GCTTTTTCAT GTTATAGTCG AAGACGTCAA CGGCGATACA ATTCTGCATC ATGAGTTATT TTCGCTGAAG AGCAGCAATG CAGATGAGGA GCACGTTCTA CTGTTCTCAG TCCCAGTGCT AGAGCCGCTT CCGCCAGCAT ATTTTATTCG TGTAATGTCG GATAGGTGGC TTCATTCCAC AGCTGTGCTT CCTGTTTCCT TCAGCAAGAT GATCCTACCT TCAAAGTTTT CTCCTCCAAC CGAATTGTTG GATTTACAGC CTCTTTTACC ATCAGCTCTT GGAGTTTCTG CTTTGTCGGA GATTTTCGCT TACAAAGAGT TCAACCCAAT TCAAACCCAA GTATTTCATG AACTGTTTAA AACCGACAAG AATTGTCTAG TTTGTGCACC CTCTGGTGCG GGAAAGTCGA CTTGTGCTGT GTTCGCTGTT TTGCGTATGC TGACAACTAA CGCTGACGGT GTGTGTGTGT ACATTGCCCC AACTGACGCG ATCGCAGACC GAACGTTCAC GGAATGGAGG CTTCTTTTTG GTCGCATCCT TCCAAGCAGT TCTATTGTCC GACTGAGTGG GGAAACAGGT CCAGATCTGA AGCTCCTCTC TCAAGGAAAG GTTGTTGTAT CGTCGGCAAA GCAGTGGGAT ATGGTCAGTC GCCGCTGGAA GCAACGAAAA GCAGTGCAGA ATGTTGCCCT AATGATCTTC GATGAGCTGC ATTTTCTTGG AGGTATCATT GGACCGACGC TGGAAGTTGT AATATCTCGA ACTCGTTATA TGATCGGGCA GAGCGAAGAT GGCAAGACTG TCGCAAACAT GCGAATTGTT GGTCTCAGTG CCTCGCTTGC AAACGCTCGA GATGTGGGGG AATGGATGGG CGTGTCTGGA AAGAGTTTGT TTAATTTTTC TTCAAAGGCG AGGCCTATGC CACTCGAAAT CTTTTTTCAG TCCTTTGAAC AAGCGAATTA CTCAGCGAGG CTTATGGCGA TGGCCAAGCC TGTTTTCAGT GCCGTCGAGA GGCATATCGG GGAAGGAACG GCAATTGTGT TTACCCCGAG CCGTAGGCAG GCTCAACTTA CTGCGATTGA TCTTATGACA TTTCGAGATG GACAAGGGCT TGGGTCTTAC GTTGGCAAGT CTGTAGATAC GCTTACTCTC GCAGAAATCG CTTCAACACT GCGAGAGCCT GCCTTGCAAC AGGTTGTGAC GAACGGGATT GGCTTCCTTC ACGCAGGAAT GATAGATAGC GATTGGAATA CAGTAGTGGA TCTCTACAAC TCTGGGGCAT TGCGGATTCT GGTTTGTCCT ACAGACGTAT GCTGGAAGAT TCGTTGTGTG GGGCGACTGG TCATTATTAT GGGCACGGAA GTTTACGACG GTAGGGAGGG TCGACACCTT GACTACCCGG TCATGGACAT TCTTCACATG ATTGGACGTC ATGATCCAAG ATCGAGTGGT AAATGCGTTC TTCTATGTCA CGCCCCCAAA AAAGACTACC TAAAGAAGCT GATATACGAC CCAGTGCCGA TCGAAAGCCA TCTTGATTCA TACCTGCATG ATCCACTGAA TGCTGAAGTT GTAACTAAGA CAGTGTCGTC GATGCAGGAT GCAATCGATT ACTTGACCTG GTCTTTTCTC TATCGCCGTC TTCCTCAGAA CCCGACATAT TATGGACTGC GGGGTACTTC AAATGTTTTC CTTAGCGAAT ACTTGAGTGA GATGATTGAA ACAGTCATTG GCGATTTGGA AGAGAGCAAA TGCTGTAGCA TGTCGGAAGA AGGTGATATA TCTCCGTTAA ACTTGGGAAT GATCGCAGCA TACTACTATG TTCAGTACCG AACAATCGAG CTCATTGCCT CGTCCGTAAC GGAGAAAACA AAAATTCGGG GTATCATGGA AATTTTGTCG GCGGCTTGGG AATTCTCAGA GTTTCCTATT CGTTTCGGCG AAGACAGGAC GTTGAAATCT CTCGCACGAA CCCTGCCGTA TACGCCTCCA GACGGTGCGA ATTACGATGC AAACACGAAA GCCTTGATTC TTTTGCAATG TCACTTTAGT CGTAAGGTTA TTGGTGCTGA CCTGCGATCC GATCAGAAGA GCATGCTAAA GGAGGCCGTC AATCTCGTAC AGTCCATAGT AGATGTAATC AGCAGCAACG GATGGCTAAA GCCTGCGCTC GCCGCCATGG AGTTAAGTCA GATGCTCGTG CAAGGACTTT GGAATAAAGA TCATGTCTTG AAGCAAGTGC CCCACTTCAC GGAAGAAATC ATAGGACGTT GCCGAAATCA CGACGAACCT GTTGAAACAG TGTTCGATAT TTTGACGATA GAAGACGACG TCCGTAATCA GCTTTTACAG CTCCCCGATG ACAAGATGGC TGACGTTGCG GTATTTTGCA ATACATACCC AAGTATCGAG GTCTCATTCA AAGTACATGA TGTCGAAGAC GTTGCCGCAG GCAATCCGGT ACAAATCGTG GTCGAATTAG AGCGCGAAGT TGACGAAGAT GACATGGACG AAGCCGAAAT GGAGGCGCTT GGAACAGTTG CGGCACCACT GTTTCCGATT GCAAAGAAGG AAGGATGGTG GGTTGTTGTT GGGGATACAT CCACCAACTC ATTGCTATCC CTGAAACGTG TCAACTTGCG ACACAAGCAA AAATTGTCTC TAGATTTTCT CGCCCCAGAT GAACCAGGCG ATTACGACTT GACCCTGTTT TGCATGAGCG ACAGTTACCT TGGATGTGAC CAAGAGTACA GGATTCCCTT GAGTGTTGCG GCTGCAGAGT CTGACGAAAG TGAGGAGGAC GAGTGA
|
Protein sequence | MAEREELQKR YAYHAMSNKV EQADRSSRRV RGSEGTGEVE TLRGRRDIGR MGDRVEEGQP PPPPTKKAKH VPSGPPSRRV AHGNETILDL GNLTGYQPTT EQAKAAYESI LTIIGSKALL GNQGPQVLRD AAEEVLATLK DPNLRDPERH ETISVLLTGK SPRLSGGLST EHFTALLQYG KQLDDYNKDQ QLKDDDKVDD EMGVAVVFDE SEDEANKHDD SEIDQNVVEG PEAEEDSEAF RDVDEEIIVQ GADDVGAKKQ SRAGDRILSV HEIDAHFLQR HLAKHVDDAD ESAKLAAQVM GIIDFRTNSD MRECENRLLV LLGFDLFDTI KLILRNRVRV WACISMKRAQ SDEERKAIEE ALANEPTGEG KCVWEELHSK GRAENWTRDR MKGLADAFKS EATGDLTKAI DSVGVKQEDD ETAMQVEVKE EANELDLDIL AFPEGSHTMT NKKCDLPDTS WRAMKKGYEE VHVPAVRSVI PKDERLIRID ELPSWTHAAF KGMEKLNRVQ SKLCDVALRS SENLLLCAPT GAGKTNVACL TMMNILGQYR RDRQVDDDPD AKDSFDLSSF KIVYVAPMKA LVQEVVKNFS ERLEDYGVTV RELSGDSSLS RQQISETQLL VTTPEKWDVV TRQGEGRAFT QLVKLVIVDE IHLLHDERGP VLESIVARII RQVETTSEPV RLVGLSATLP NYTDVATFLR VDHNKGLFFF DHSYRPVPLQ MQYIGLTERN AFRRFQLQNE ICYEKAIEQR RNGNQMLIFV HSRAETGKTA KALRDLALER DESTNFVREK GATQEILREE SSAVKNADLK DVLPYGFAIH HAGMAREDRE LVEDLFADRH IAVLVCTATL AWGVNLPAHA VIIKGTQIYD PSKGRWAELS PLDVLQMLGR AGRPQYDSEG EGIILTQHSE LQYYLSLTNL QLPVESQLIK TLPDHLNAEI VLGTIQTISE AVDWLGYTFL FVRMLQNPNL YGISETSFLD DRTLKKRRLD LAHSAASILE KSHLVRYDRK SGALQATPLG RISSQFYISH SSMAVYSRHM RSNMSDIELL RLFSLSGEFH HITVREEEKL ELTKLSGRVP IPVKESPNEA SAKVNILLQA YISRLRLDGF ALVADMAFIQ QSAARIMRAL FEIALRRNWS SLAKLCLDMS NMVSYRIWRS QSPLRQFKNV PEVVARKLER KSDIEWARYN DLTSADLGEL VGVPKMGRVL HKLVQQFPRL ELSAQIQPLT RSMLRIEVTL LPSFNFDVTI HGYVQLFHVI VEDVNGDTIL HHELFSLKSS NADEEHVLLF SVPVLEPLPP AYFIRVMSDR WLHSTAVLPV SFSKMILPSK FSPPTELLDL QPLLPSALGV SALSEIFAYK EFNPIQTQVF HELFKTDKNC LVCAPSGAGK STCAVFAVLR MLTTNADGVC VYIAPTDAIA DRTFTEWRLL FGRILPSSSI VRLSGETGPD LKLLSQGKVV VSSAKQWDMV SRRWKQRKAV QNVALMIFDE LHFLGGIIGP TLEVVISRTR YMIGQSEDGK TVANMRIVGL SASLANARDV GEWMGVSGKS LFNFSSKARP MPLEIFFQSF EQANYSARLM AMAKPVFSAV ERHIGEGTAI VFTPSRRQAQ LTAIDLMTFR DGQGLGSYVG KSVDTLTLAE IASTLREPAL QQVVTNGIGF LHAGMIDSDW NTVVDLYNSG ALRILVCPTD VCWKIRCVGR LVIIMGTEVY DGREGRHLDY PVMDILHMIG RHDPRSSGKC VLLCHAPKKD YLKKLIYDPV PIESHLDSYL HDPLNAEVVT KTVSSMQDAI DYLTWSFLYR RLPQNPTYYG LRGTSNVFLS EYLSEMIETV IGDLEESKCC SMSEEGDISP LNLGMIAAYY YVQYRTIELI ASSVTEKTKI RGIMEILSAA WEFSEFPIRF GEDRTLKSLA RTLPYTPPDG ANYDANTKAL ILLQCHFSRK VIGADLRSDQ KSMLKEAVNL VQSIVDVISS NGWLKPALAA MELSQMLVQG LWNKDHVLKQ VPHFTEEIIG RCRNHDEPVE TVFDILTIED DVRNQLLQLP DDKMADVAVF CNTYPSIEVS FKVHDVEDVA AGNPVQIVVE LEREVDEDDM DEAEMEALGT VAAPLFPIAK KEGWWVVVGD TSTNSLLSLK RVNLRHKQKL SLDFLAPDEP GDYDLTLFCM SDSYLGCDQE YRIPLSVAAA ESDESEEDE
|
| |