Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49808 |
Symbol | |
ID | 7198470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 387101 |
End bp | 392450 |
Gene Length | 5350 bp |
Protein Length | 1718 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184535 |
Protein GI | 219128680 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.365245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTGGC AAAGGCAACG GGCGGGATCC GAAGACGGAG AAATCGATGA GGAAGAAGGA GAAATCACAG ATAATCCTCA GCCACCGGTG GCTTTATCGC TGTCTCCTTC GAAGTCTCGT CCCACCGTCA CCCACTCGCT CCCACACAGT TCCCAAGCAA CGGCCGCATT CCATGGCGAA TCAAACTTTC CTCCGCAGCC GCCTTTGCCG AACCGTCTGA GCAGTACTAG CGGGCCGAAC AGCATCCATC CCTACCCGGG CGCCAGTCAC AGTAGTAGTA ACAGCAACAA CAACATACCG GTCCCTCCCC CGCGTCGTGG AAGCTGGCGG GGCGGACGTG GCGGTACTAG TGTCGGTGGT GGAGCCTTTG AACGACGTCC CGGTCGAGGG TTGGGCCACC GCAATACTTC TTTCGGCAGC GGACCGCCAG CGTTCGACGC GCCCCCACCG GCACGCAGCC AAAGCTTCCA GTCATTTCAC CGCCACAGCA GCGGAAGCAT TCCAGGCCTT CCCGCCAGCA ATGTTCTCCC ACCGGTAGCG GCTACTGATC CGAGACGGGC GACGGATCCA CGGTTTCGGG GAGCACCCGG CGTTGCGAAC ACCCCGGTGT CCGCACCGCA TTTTACCGAA TCGCGGGCAA CCAGCGAGGG CCGGGGCTTT ACTACCTCGT CCAGTAGTGC TCCGGTATCG GTCAGTAGTA CTTTAGCGGA TTCAGTCAGT CTGGCAACAC CCTACAGCAG CTTGGCTGAA GGCAAGCCAC CCCACGTACT AGCTGCTGAA GACGCGAAGG TCGTTCGAGG ACTCCGTGCC AGTAGTGTCG CGAGAAACGA GTCCATCGGC AATGATGTTC CACGGGATAG CAGCGTCAGT GGGCCCTTCC CGCCACTAGG GGGTTCCACG GAAATTGCCT CTTTTCCTGG AGGTTTTCGC GATAGAGGTC CACCGCATCG TCGACATACT GGCGACTTTC GGGGGCACCC TGGCGGCCAC GGCTCGCTCA ATCGACGAGT TTCAAGCGAA TATGGTAGCG GCGACGGACC GAATAGCGAA GTCCTACCCT TTGGAAGGCC CAACGAAATT CATCAATCTG CTAGGCAGGG TTCTCAGCAT GCAGTCCCCC CATCCGGTGA GCCACTGCCG GTGTCGCGTG GTCAATCCAC AGGTAATCAG ACTCCTCCAC CTCCGTTTCA TCGAGGCCCA GCGCCGCCTC CTACTCAAGA GCAACAGCAA CCTCCATTCC ACCGTAGTCA TCCGTTACCG CAGGATCCAC CGCCGGGAGC TTTCCCTCGA GACGGGCCGC CTCAAGGAGA AGTGCCATCC TTTTATAGGG ATGAGCAATC AGTGTTTTCG CGAAATCAAC ACACCCATGG CGGAGATTTC CCACCGTTCA ACCAAAGAGC TTCGGATCAG CCTCCTTTTT CCTCGGGCCC GCCCAACGTA GAGCAACCGC TATTTCGAGG ACCAAGACAG GATTCCTATT ACGGACCCGC TTCACGCGAC GTAAATTCTG GAAGGTTCGG GTCGCCCTCC CAACGTGATC GCCCCATTGT CAATGCCCGA GGGGTTTCGG GAGGTCCTCC ACCTCCCCCA CCACCACTGG GATCGCAAGG GGCGCTCACA TCCACACAAC ATCCCTCTTT AGCTCCTGGT GTGGCCCCGC TTCATCGACG GAACGATCCA CGCCTTCATC GAGACCCCGA TGCGGAAGGA CGAGATTTCG CGGACGCGGC ATCCGCATCC GAGCCGCTTC GACCCAATTT TCCACCCGAG CGCACCGGAT TCCCGATGCA AAGTGAACGC GGTTTCCGAA AGCCGCCTTT TGGACAAATG TTTCCACCGG GAGGTGCTAC AGAAAATAGT ACCGGCTCGG ATTCGTTTGG TCGATCACGC GAACGGAATA CCGCGGCAGC GCGGTCGCCT CAAACGTCAC CACATACGCG TAAACCCGTT TTGAGCTACT TTCAGGAATC GCCGGCGAAG GAAATTCCGC GTCTGCCTGC CATAATTGAT GCCAAATCGG GCAGTCTGTC GAGTCGAATC AAATCTGTAG GTCAACATAC CGAAGCAAGG ACAGAAGAGC CAGAACCTCT TTTGACATCC GTGCTTGGTG AGGATTCAGT GGATAGGGCG GAAAAGGTTG TATTGCTTCT GACTGATCAA AGGGATAAAG CTAGCTTGGA AAGAGATGAT AAGGGATGTA GTGAGCTTCC GAAGAAGCAG ACAATCCTGA TTGCGCTGAA TCGTATGGAC ACCAAAATCA AGCTGCTTCA GAAATCTACC TTAGATAAAG AAGAAGAAGT TGAAGCACAT ATCGAAAAAG AAAAGGAAGA TCAAAAACGG GCTGCTAAAG AGGCGAAATC TGAAGCTGAA CGTTTGGAAA AGGAACACAG GCGACGCCGG GAAGAGGAAC AACAAGCCGA TGAAAAGGCC AAACAAGAGC AGATTGAAAG TATGATAGAA GAAGGGCAGG CTGGTTTCGA TGCAGATCTA ACAATATCTA CAGTGACGTT CGAGACCGAT CTCGAAGCAG CTCGTAAGGT AGAAGAAGCA AGGTTTGAGC TAGAATGTCA AGAACAGATA TCTGCGGCTA CGGAGCGATT CGACAATGAT GTGCAAACTA CACAGCAAGA GTTGGAGAAT TCTATACAAT CTATTTCGAA TACTCAAAAC CTAATTTCGG CACTCGAGGA GGAGTACAAG TGCAAGATGG AGGAAGGAGA TACAGCCGGT GAAGAGAAAA TGGATCAACC TGATCTAGTA AATACAGTTT TGGAAGAAAA TCGAAGGCGC GCTGCCGAGG CCCATGTGTC TCAATGGGCA GGTTTCCCTG TGGTGTCGGA TGATGATGAG TACGGTGTTT TAGAGAACGA AAAGGATCCT AAAGAAGGTA AACGTCATGT ACGGTGGGCA GAGATGGCGC AGAAAGTTAC GGGAGTCGGA GATGCACTCT ACAACGAACC TTCGGAAGCG CCGTATTTTG AGCAAAATGA GAGACTTCAT GCACTGATCG GCCCGCTGGT AACAGAGCAA ATACGCTACA GTCAACGGCA ATTCGACACC CACTGGAGAG AACTTGCCGA AGAATACGAA TACCGAAGAG TAGTTTACGA GGCTCAACAA CTCAAAGATG GCACGGCTCA GAGAAGGCGC ATCAAATCCA CAAGTGTGCC CCATAGGCTC GTTGGGAGCA AACCTAATGT CCCTATCCTC GAGTCCACAT CTGGCCACGG ACGCTCGTCG AACAACCCAT ATCGTCGGGC ACGTAGAGGC AACGAGGTGC GGACAGAATA CGAACAAGAA CAAATTATAG CAGAGCTGGC AGCCAAAGAA GCGCTGGAAA AGAGAATTGC AACTGGGGGG TCAGAGCTTC CGCGTCAGAT AGGTCAGATC GAAAGAAGCT GGACAGCCAC CTACATCCAA ACATTTTCGG CGCAAAGGGT TGACCTTGAG GAACAGGAGG CAGAGTTACG TATTACGGGT GTTTGGACGG ACATGGAAAA GTGCATTTTC TTAGACCGAT TTATGCAGCA TCCCAAGGAT TTCCGCAAGA TTGCTTCTTT TCTCCGAAAT AAGACGACAA CTGATTGTGT CGCCTTTTAT TACGATTCCA AGCAAACGCT GCCTTATAAG GGTGCGTTAA AGGAACACGT AATGCGGCGG AAGAGACGTG GCGGATATCC AATTTGGGAA GCAACTATTC AAGCCGCCCT CTCGGTAGGT GCAGTCGTTG AAGCAGGGGA TAGTGAAGAA AAGCCATTGA TCTTCACACT TCCGTTTGAT GATCACACTT TTTCTACTTT TGGCCTTCAT CCTTTGAAAC GCGAAGTTTT GGATTTAATG GAAATAAAAG AGCAGGCTCT CGCTGAATTT GACGCAGATG AGGATGCAGA CGACGTTTCT AGCAAATCAG GGCAACCCAA AAAACGTCCT CGCGATCGTC TTTTCCTGTT GGATCCGAGA CAAAGAAAAT TCCTGAAACC CTTGCCCCAG GAATCGGCTC ACGCTACCTG CCTTAAGGTG GACAGTGGAA AAGCAAGCAC AGCTGACGAT GATCACAACG ATTCCAAAGA GGGTACAGCA AAAGATGAGT CGGGGCGATT AACTCCTCTA AGAAAAGCAC CCCAAAAATG GACGGCGTCA GAGAAAAAGA TTTTTCACGA TACCTTGGAG AGTCATGGTA GGAATTGGAG CATGCTTTCC CAGGCTGTAG GGACGAAAAC GATTTCTCAG ATTAAGAATT ACTACTACGA CTACAAGAAG CAGAAAGATA AAAATCGGAC GACTGACAAA GACAAAAAGG TCGAAAGCAA AACTGAGAGG ACCGAATCTC ACGAAAACAG TCCTACACCG CCACATATTG CCGCGGATCA AAGACCCGGC GACCAGACTA GTAACGAGCC GATTTCGGAT CTACGCAAAA ATCAGCCGCC TCGCTATGAT CCCCAATTTG AAGCTAAACA TATCGAGCGA CAGATGTTCG AAGTGTTGCA ACAGCAAGGG CAGGGTCCAT ACCCCGAACA AGAAAGGCTT GTCGATCGAC GTCCTGTCGA ATCGTTGAGT GATCAAGAAT TATGGGCCCA ATTACACCGA CAGGGACTTT TGGGTCAACA GCGAGGGCAT TTATCGGACG AGGCGGCACG GCAACTTCTC CAGCATCACT CGCAGTCACA CCATCAGCAA GTCCTCTCAA ATTTGATGCC CTGGGCTTCG GGAGGGCAAC TTCCGCAGCC AGTCAAACGA GCGCAACCAA TCAATGTGCA AGAATGGGAG CAGCTGCAGG CAATTTTGCA GATCCAGCGT CAACAAGAAC AACATCGCCA TCAGCATCAA CCTCACGTAC CGCACAACCC GATGGCCAAC TTGGACCCTC AAATGCTTGC GTTGGCCCGT CTAGCGGGTT TGGATTCCAG CGCATTGGGT ATGAACCCGC AATTATCGCG ACTTGCGCAT CATCCTGCAG TTGGCTCAGC TGGAAGTCAT GATGACGCAC AAATGGCTTT AGCACAACGG CTTCTGAGCT ACAGTCAGAG CGCTGGGGGA GGGGGGAATA GTGCCCAGGG GGCGCTAGAT TTGTTGACAC AGGCCATGAG TCGTGGGGGT GCCGGACGCC ATCCGAATCC AGATCGGGGT TCAGATCGGG GTACAGATCG GTACTAGAAT GGATACCTGA TCGAGAAAAG TGGTTGGCGT TGGGTGTTGG CCGGGTACAC AAGGTTTTTT GTGCATTCAG AAAAGTTCAA CAGCTCAAGT CAATAGTTTT TGTGTTGAAC TGCTCCGCTC TCTGCTATCG AGCGCTTGAT CCGTTGGAAT AGCAAATATC TGCCTCTCTT GATTTTCTAT AGTTTACGGT
|
Protein sequence | MSWQRQRAGS EDGEIDEEEG EITDNPQPPV ALSLSPSKSR PTVTHSLPHS SQATAAFHGE SNFPPQPPLP NRLSSTSGPN SIHPYPGASH SSSNSNNNIP VPPPRRGSWR GGRGGTSVGG GAFERRPGRG LGHRNTSFGS GPPAFDAPPP ARSQSFQSFH RHSSGSIPGL PASNVLPPVA ATDPRRATDP RFRGAPGVAN TPVSAPHFTE SRATSEGRGF TTSSSSAPVS VSSTLADSVS LATPYSSLAE GKPPHVLAAE DAKVVRGLRA SSVARNESIG NDVPRDSSVS GPFPPLGGST EIASFPGGFR DRGPPHRRHT GDFRGHPGGH GSLNRRVSSE YGSGDGPNSE VLPFGRPNEI HQSARQGSQH AVPPSGEPLP VSRGQSTGNQ TPPPPFHRGP APPPTQEQQQ PPFHRSHPLP QDPPPGAFPR DGPPQGEVPS FYRDEQSVFS RNQHTHGGDF PPFNQRASDQ PPFSSGPPNV EQPLFRGPRQ DSYYGPASRD VNSGRFGSPS QRDRPIVNAR GVSGGPPPPP PPLGSQGALT STQHPSLAPG VAPLHRRNDP RLHRDPDAEG RDFADAASAS EPLRPNFPPE RTGFPMQSER GFRKPPFGQM FPPGGATENS TGSDSFGRSR ERNTAAARSP QTSPHTRKPV LSYFQESPAK EIPRLPAIID AKSGSLSSRI KSVGQHTEAR TEEPEPLLTS VLGEDSVDRA EKVVLLLTDQ RDKASLERDD KGCSELPKKQ TILIALNRMD TKIKLLQKST LDKEEEVEAH IEKEKEDQKR AAKEAKSEAE RLEKEHRRRR EEEQQADEKA KQEQIESMIE EGQAGFDADL TISTVTFETD LEAARKVEEA RFELECQEQI SAATERFDND VQTTQQELEN SIQSISNTQN LISALEEEYK CKMEEGDTAG EEKMDQPDLV NTVLEENRRR AAEAHVSQWA GFPVVSDDDE YGVLENEKDP KEGKRHVRWA EMAQKVTGVG DALYNEPSEA PYFEQNERLH ALIGPLVTEQ IRYSQRQFDT HWRELAEEYE YRRVVYEAQQ LKDGTAQRRR IKSTSVPHRL VGSKPNVPIL ESTSGHGRSS NNPYRRARRG NEVRTEYEQE QIIAELAAKE ALEKRIATGG SELPRQIGQI ERSWTATYIQ TFSAQRVDLE EQEAELRITG VWTDMEKCIF LDRFMQHPKD FRKIASFLRN KTTTDCVAFY YDSKQTLPYK GALKEHVMRR KRRGGYPIWE ATIQAALSVG AVVEAGDSEE KPLIFTLPFD DHTFSTFGLH PLKREVLDLM EIKEQALAEF DADEDADDVS SKSGQPKKRP RDRLFLLDPR QRKFLKPLPQ ESAHATCLKV DSGKASTADD DHNDSKEGTA KDESGRLTPL RKAPQKWTAS EKKIFHDTLE SHGRNWSMLS QAVGTKTISQ IKNYYYDYKK QKDKNRTTDK DKKVESKTER TESHENSPTP PHIAADQRPG DQTSNEPISD LRKNQPPRYD PQFEAKHIER QMFEVLQQQG QGPYPEQERL VDRRPVESLS DQELWAQLHR QGLLGQQRGH LSDEAARQLL QHHSQSHHQQ VLSNLMPWAS GGQLPQPVKR AQPINVQEWE QLQAILQIQR QQEQHRHQHQ PHVPHNPMAN LDPQMLALAR LAGLDSSALG MNPQLSRLAH HPAVGSAGSH DDAQMALAQR LLSYSQSAGG GGNSAQGALD LLTQAMSRGG AGRHPNPDRG SDRGTDRY
|
| |