Gene Information |
Locus tag | PHATRDRAFT_15555 |
Symbol | |
ID | 7195330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 605183 |
End bp | 610303 |
Gene Length | 5121 bp |
Protein Length | 1706 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183777 |
Protein GI | 219127092 |
COG category | |
COG ID | |
TIGRFAM ID | |
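The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of cds_bp bases."""
    codons, remainder = divmod(cds_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the stop codon is counted in the CDS but not in the protein.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
gene_len = cds_length(605183, 610303)
print(gene_len)                           # 5121
print(expected_protein_length(gene_len))  # 1706
```

Under these assumptions the computed values agree with the Gene Length (5121 bp) and Protein Length (1706 aa) fields.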
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAACACAGG TCAACGGCCA ACCCGTATAC GGTGGAGCCA ACGACCCGCG TTTGGGCAAT CTACACGACA AGTCCGATCC GGGATACTTT GGACACTTGG ATTTGGCCAA ACCAGTGTAT CACCAGGGAT TTTTCAACAC CACGTTGCGG GCACTCCGGT GCGTCTGTTT TCACTGTTCC CGACTCCGCA TGCTCCCGGA CGAATTCAAG TTCCAAAAGG CCATACAGAT CAAATCGCGC AAACGCCGAC TCGAAGCTCT GCACGAATCA CTCCGCGGGA AGAAAAAATG CGATCACTGT CAAGGTGTAC AACCCAAATA CACCAAGGTG GATCTGCACG TCGAAGCGGA CTTTCCCGAA GACGGAATGC ACGGAAGTAC CGGGGGCGGA GGGGATTCCA AACAATTCTT GTCCGGGGAC ACCGTGGTCA AGATATTCAA GCAAATTCGG GAAGAAGATA TTGTGTTGTT GGGTTTGGAT GTCCAGCACG CGCGACCGGA TTGGTTGCTG GTGCAGGTAT TGCCCGTCCC GCCCCTACAC GTTCGACCCA GCGTTACTGT CGGGGGTGGT ACGCAATCGT CCGAAGACGA TTTGACGCAT CAACTCGTCA ACGTTATTAA ATCGAATCTC TCGTTGCAGC AGGCCGTCTC CAACGGTGAA CCCCAAATTG TGGTGGAACA GTTCGAACTG GCCCTGCAAC ACAACGTCGC CGCCTTTATG GACAATGAAC TACGAGGCAT GCCGCAAGTC ACTCAACGCA GTGGACGACC CCTTAAAACC ATTACGCAAC GTCTCAAAGG CAAGGAAGGG CGAATTCGGG GAAATCTCAT GGGAAAGCGG GTCGACTTTT CCGCGCGTAC CGTGATTACG GCCGATCCCA ATCTCGGTAT TCATCAAGTC GGTGTGCCCC GGAGTGTCGC CATGAACTTG ACTGTCCCGA TTCGCGTGAC GGCCTTCAAT CAAGCCGAAC TTAGCGCCCT CGTGGCCAAC GGCCCCACCA TGCATCCCGG GGCCAAGCAC ATTATCCGAT CGGACGGAAC GCGGATCGAT CTGCGTTACG TCAAAAACAA ATCGGAACTT CTCCTGGCCC ACGGCTGGAT TGTGGAACGG CATTTGCGTG ACGACGATAT CGTGTTGTTC AATCGGCAGC CCAGTCTACA CAAGATGAGT ATCATGGGAC ACAAGGCCAA GGTACTGGAT TGGAGTACTT TTCGATTGAA TTTGTCGTGT ACGAGTCCGT ACAATGCCGA TTTCGACGGC GACGAAATGA ACCTTCACGT GCCTCAGGGA TTGGCGGCTC GTGCTGAAGC GGAACTCATG ATGTTGAGCT CCCGGGTTAT TGTCTCGGGT CAATCGAATC GACCAGTCAT GAGTATTGTT CAGGACAGTT TGTTGGCGAC TCAAAAAATG ACGAAACGGT CGGTTTTCAT CGAAAAGGAT TTATGCTACA ATATGCTCAT GTGGGTGCCG CAGTGGAACG GGCAGATTCC CATTCCTGCC GTGATCAAAC CAAAGGAATT GTGGACCGGT AAGCAATTGC TCAGTACAAT CCTGCCCAAG GTGAATCTCA AGTCCAAGGC AAACAATGGC CCCGGAAAAG ATGCTCGTGG CAAGAACATG CCGAATACGT TCAACATGTA CGATCATTTG GTGACGATTC AGGATGGTGA ACTGTTGGAG GGTACAGTTG ATAAGAAGAC AATCGGCAGC TCCATGGGTG GCTTGATCCA CACGGCTTGG TTAGACGTTG GGTTTGAAGA AACGGCTCGT TTTATGAATC AAATTCAGCA GCTAGTCAAT 
CATTGGATTT TGCAGTACTC GTTTTCCATT GGAGCGATCG ATGCCGTCGC CGATGCAGAT ACTATGCGAC AGATTGAGTC GACCATTGAC AAGGCAAAGC GGCAGGTGCA AGATTTGGTT CGCCAAGGGC AATTGGGAGA ACTTGAGATT CAACCCGGTC GTACCATGAT CGAGTCGTTT GAACAGCTCG TCAACAAGGT GCTGAACACG GCTCGTGATC ACGCCGGAAA ATCTGCACAA TCTTCTTTGG ACGAAACAAA CTCGGTCAAG GCCATGGTGA CGGCTGGTTC CAAAGGTTCA TTTATTAATA TTTCGCAAAT TATTGCCTGC GTGGGGCAGC AGAACGTGGA AGGCAAACGC ATACCGTACG GTTTCAAGAA ACGAACCCTA CCGCACTTCT CCAAGGATGA TATCGGCTCC GAGTCCCGAG GCTTTGTCGA GAATTCGTAT TTGCGTGGTC TGTCTCCTCA GGAATTTTTC TTCCACGCGA TGGGTGGACG GGAAGGTTTG ATCGATACGG CTTGCAAGAC CGCCGAAACC GGATACATTC AACGTCGCCT GGTCAAGGCA ATGGAAACCG TCATGGCGCG TTATGATGGA ACTTTGCGAA CGAGCAGTGG ACAGATTGTT CAATTTTTGT ACGGAGAGGA TGGCATGGAC GCGGTCTGGA TTGAAAAGCA AAATTTTGAC TCTTTGACGC TGGCAAAGCC AGAGTTCAAC AAGCGTTTCT TATTCGACAC ATCCAGCCCA GAGTTCGGAC ACGATGAGCA AGGTATTCCG TTTCTGGAAC CAGACGTAAT TGAGGAGTGT CGTCGCGACC CTGATATCCA GGCTACTTTG GATCAAGAAA TTGAGATTCT TCGGGAAGAT CAAGCAATTC TCCGAATTGT CATGCGCAGC CGAGAAGCTG GGAGAGAGAG CGACGACAGC TCATACGCAC CAGGCAATGT GCGTCGTGTG ATTCACAACG CAATGCGTCA ATTTCGTATC GACAAGAGCA AGCCAACGGA CCTGCATCCC ACAGAAGTGA TACAGATTGT TAACAACCTG TTGGAGCGTC TGATTGTAGT GGTTGGGAAC GACCCGCTAA GTGTTGAAGC GCAGTCAAAC GCAACCACTC TTTACCGCAT TCTGATTCGA ACCATGCTTT CGAGCAAGCG TGTCTTAAAG GACTGGCGTT TGAGCAAGGC TGCTTTGAAC TGGGTGGTAG GCGAAATTGA AACTAGATTC AATATTGCTA TGGTTAATCC TGGTGAAATG GCTGGAGTAT TGGCCGCTCA GAGTATTGGT GAGCCTGCAA CCCAGATGAC GCTCAACACC TTCCATTATG CTGGTGTTTC CGCCAAGAAC GTGACGCTGG GTGTCCCTCG ACTGAAAGAA ATCATCAATG TTGCTAAAAC TCCAAAGACT CCTGGCCTAA CTATTTATCT TCAGGAAGAG GTCAGTGGTG ACGAAAAAGT TGCCGAGCAG GTCGTTGCTA TGCTGGAATT TACTGTTTTG GGCGACGTTG TAAAGAAGAC AGAAATTTAT TACGATCCTG ACGTGAAAAA TACGGTTGTC ACTAAGGATC GGGAGTTCGT CAAGGAATTC TATGACTTTA CGGATAAGAC AGACGATGAT TTGCGTCGCA TGAGTCCTTG GGTTCTTCGT GTTGAGCTTG ACAAACCGCT ACTTTATGTC AAGAAAATTA AGATGGAGGA AATCGCTAAA GAGATTGGGG AAGAATACGG TGCGGATCTG AACGTAGAAG TGACAGACGA CAACGCCGAC GAAATGGTCG TCCGGATTCG AATCGTGAAC GATACGCCGT 
TCAACTCAGG CCAAACAGAT GAAGGCGGAA ATTTGATGGA CGATCAACCG GAAGTTGGCC AAGAAGACGA TATTTTCTTG AAACGTCTAG AAAAAAGCAT GCTTTCGAGT CTGAAGCTTC GCGGGGTAGA CCATGTGAAG AAAGTGTTTA TGCGCGGTGG TGCGAAACGT ACAGTTTGGG ACGATGTAAA AGGTTTCGGC GTTAGAGATG AGTGGGTACT AGAAACGGAT GGGACAAACT TGATGGCAGT TCTTGGCGTG GACTACGTGG ATGGTACGAG ATCTGTCAGT AATGACATCG TCGAGGTGTT CGTAGCGCTT GGCATTGAAG GAGTCCGCGG AGCGTTGTTA AGTGAGCTTC GCAACGTCAT TAGTTTCGAC GGTTCTTACG TAAACTATCG CCATTTGGCT TGTCTGGTGG ATGTCATGAC AATGCAGGGG CACTTAATGG CTATTGATCG CCACGGCATC AATCGAGTCG ACACTGGTCC ATTGCTCCGA GCTTCATTCG AGGAAACGGT TGATATGCTC ATGGATGCAG CTGTGTACGC TGAGGAGGAG ATTCTTAAGG GCGTGACCGA AAACATCATG ATGGGTCAGC TTGCTCGAGT TGGCACCGGT GATGTAGACT TACTACTGGA TGAAGACAAA GTTGTTCGAG AAGCAGTTGA AGTTGTTGTG GACGAGTTTG CTGTCGACAA AGATCTCGGT ATGGCCGGAG TGGGGGGTGT AGGAGGAGCG ACCCCTTATG CCACCACTCC ATTTGCCGCT AGCCCAATGG TGGGGGATGG CGCAGCAGCG TCTCCTTTTG TGGATGGCGG AGCCGCTTTT TCTCCAGCAG TTGGTGCGGC AAGTTTCTCA CCGGCTTATT CTCCAGACAG CGGTAGTTAT GGTTCTGGAT TTGCGAGTGG AAGTTACGGA GCTGGCGACA GCCCAGCGTA CAGTCCGACG TCTCCGCAGT ATTCGCCGAC TTCTCCGGCG TACAGCCCCA CGTCTCCAGC ATATTCGCCC ACAAGCCCAG CATACAGCCC TACCAGTCCA GCGTACAGTC CAACGTCACC TGCATATTCG CCAACAAGTC CGGCCTACAG CCCAACTTCG CCTGCGTATT CACCAACGAG CCCCGCATAT TCGCCAACGT CCCCGGCTTA CAGCCCGACG TCGCCGGCAT ATAGTCCGAC GAGTCCCGCA TATTCGCCAA CGTCGCCTGC GTATTCTCCA ACGAGCCCAG CTTACAGCCC AACTTCACCG GCCTACAGCC CTACATCTCC AGCATACAGT CCGACTTCAC CGGCTTACTC ACCCACCTCG CCAGCTTACA GTCCTACGTC TCCGGCTTAT TCTCCGACGT CTCCCGCGTA CAGTCCTACA TCCCCCGCGT ACTCGCCGAC ATCTCCGGCC TATTCGCCAA CATCCCCCGC ATATTCGCCG ACCTCGCCAG CCTATTCACC GACCTCGCCG GCCTACTCAC CGTCGGGTGG CGATGATAAG AAAGACGAAA TGGAAGACTA A
|
Protein sequence | ETQVNGQPVY GGANDPRLGN LHDKSDPGYF GHLDLAKPVY HQGFFNTTLR ALRCVCFHCS RLRMLPDEFK FQKAIQIKSR KRRLEALHES LRGKKKCDHC QGVQPKYTKV DLHVEADFPE DGMHGSTGGG GDSKQFLSGD TVVKIFKQIR EEDIVLLGLD VQHARPDWLL VQVLPVPPLH VRPSVTVGGG TQSSEDDLTH QLVNVIKSNL SLQQAVSNGE PQIVVEQFEL ALQHNVAAFM DNELRGMPQV TQRSGRPLKT ITQRLKGKEG RIRGNLMGKR VDFSARTVIT ADPNLGIHQV GVPRSVAMNL TVPIRVTAFN QAELSALVAN GPTMHPGAKH IIRSDGTRID LRYVKNKSEL LLAHGWIVER HLRDDDIVLF NRQPSLHKMS IMGHKAKVLD WSTFRLNLSC TSPYNADFDG DEMNLHVPQG LAARAEAELM MLSSRVIVSG QSNRPVMSIV QDSLLATQKM TKRSVFIEKD LCYNMLMWVP QWNGQIPIPA VIKPKELWTG KQLLSTILPK VNLKSKANNG PGKDARGKNM PNTFNMYDHL VTIQDGELLE GTVDKKTIGS SMGGLIHTAW LDVGFEETAR FMNQIQQLVN HWILQYSFSI GAIDAVADAD TMRQIESTID KAKRQVQDLV RQGQLGELEI QPGRTMIESF EQLVNKVLNT ARDHAGKSAQ SSLDETNSVK AMVTAGSKGS FINISQIIAC VGQQNVEGKR IPYGFKKRTL PHFSKDDIGS ESRGFVENSY LRGLSPQEFF FHAMGGREGL IDTACKTAET GYIQRRLVKA METVMARYDG TLRTSSGQIV QFLYGEDGMD AVWIEKQNFD SLTLAKPEFN KRFLFDTSSP EFGHDEQGIP FLEPDVIEEC RRDPDIQATL DQEIEILRED QAILRIVMRS REAGRESDDS SYAPGNVRRV IHNAMRQFRI DKSKPTDLHP TEVIQIVNNL LERLIVVVGN DPLSVEAQSN ATTLYRILIR TMLSSKRVLK DWRLSKAALN WVVGEIETRF NIAMVNPGEM AGVLAAQSIG EPATQMTLNT FHYAGVSAKN VTLGVPRLKE IINVAKTPKT PGLTIYLQEE VSGDEKVAEQ VVAMLEFTVL GDVVKKTEIY YDPDVKNTVV TKDREFVKEF YDFTDKTDDD LRRMSPWVLR VELDKPLLYV KKIKMEEIAK EIGEEYGADL NVEVTDDNAD EMVVRIRIVN DTPFNSGQTD EGGNLMDDQP EVGQEDDIFL KRLEKSMLSS LKLRGVDHVK KVFMRGGAKR TVWDDVKGFG VRDEWVLETD GTNLMAVLGV DYVDGTRSVS NDIVEVFVAL GIEGVRGALL SELRNVISFD GSYVNYRHLA CLVDVMTMQG HLMAIDRHGI NRVDTGPLLR ASFEETVDML MDAAVYAEEE ILKGVTENIM MGQLARVGTG DVDLLLDEDK VVREAVEVVV DEFAVDKDLG MAGVGGVGGA TPYATTPFAA SPMVGDGAAA SPFVDGGAAF SPAVGAASFS PAYSPDSGSY GSGFASGSYG AGDSPAYSPT SPQYSPTSPA YSPTSPAYSP TSPAYSPTSP AYSPTSPAYS PTSPAYSPTS PAYSPTSPAY SPTSPAYSPT SPAYSPTSPA YSPTSPAYSP TSPAYSPTSP AYSPTSPAYS PTSPAYSPTS PAYSPTSPAY SPTSPAYSPT SPAYSPTSPA YSPTSPAYSP TSPAYSPTSP AYSPSGGDDK KDEMED
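The relationship between the two sequence fields can be illustrated with a small translation sketch. The Translation table field of this record is empty, so the code assumes the standard genetic code (NCBI translation table 1); that choice is an assumption, not something the record states. The displayed gene sequence appears to be in coding orientation already, since its first ten codons translate to the start of the listed protein. The `translate` helper is hypothetical and written only for this example.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: the record does not name a translation table, so the standard
# table is used here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing unexpected characters are translated as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GAAACACAGG TCAACGGCCA ACCCGTATAC"))  # ETQVNGQPVY
# Last 21 bases of the gene sequence above, ending in the TAA stop codon.
print(translate("AAAGACGAAATGGAAGACTAA"))             # KDEMED
```

Both fragments match the corresponding ends of the protein sequence listed in the record.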