Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35556 |
Symbol | |
ID | 7200910 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 215786 |
End bp | 221848 |
Gene Length | 6063 bp |
Protein Length | 1902 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179992 |
Protein GI | 219118439 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCGC CGTCGGATGA GACAATAGCG GGACCGCCAT CGTCTCCGAC AACCGAAGAT GCCGGAACGC AGGCTCGGAA AGCCTCGGCC GCCCTGACGT CGAAAGTAAT GCTGGAAGAG TTAGAACGTC AGTACATGCA GATTAAAGGT GAACGCGATG ATTTGCAGCA AGAGTTAACC GCGGCGGAGG AAAAGTTCAA GACGCTCAAC GAGTCGTTGA CTTCCAACCA AGCACTCGAG GTGGAGAATG CCGCGCGGCA ACACGAATTG CGTGTGGCGC AAGAACGCGT AAAAGCATTG GAAGAAGAAC AAACAGCTAC GAAAGAGCGC ACCGATCGGG CTCAAGCCGA ATCGGATCGT CTCCGAGAAG AAATCGCGTA AGTACAACGA AAGCACTCTC GCGTTTGGAT AAAAATGGCA GGGATCGGAT ACGTACCGTA GGCCGCGAAA CGATCACTCA CGATCAACCC GAAAACTCTT TCTCGACAGT CGTTTGGCCG GATCGAACAG GGAACTTTCC GAGAATATTG CAACGTTTGA AGTCCAGAGC AAGCTCAGTG AGTCCCAGGC AATTCCCCTA TTTCACGAAA AGCAGCGTCT CCAGACGGAA CTTGACAGCT TGCAAGCCCA CGCCAATTGG CTGGAACAAG AGCTAACGGC CAAGTCACAG GATTACCAAA AGCTGCAGCG AGAGTCACGT GATCGTTCTA TTCAACTTCA GCTTCAACTG GACCAAACAA TTAACGAAAA AGAAGCTTTC GAAGCCCGTT TGGATGAATT GCACAAAATG GAGCGGCGTT TGCAGGACAA AGTCGAGGAG CTTTCCCACG ACTTGTTGAC GGGGAAGCAA GCCATGACGG ATCTACAGGA ATCTACAGAG ATCGAAATTC GAGAGGAACG GCGTCTTGTA CAGCTCCAAA AGACTCATTT GGATCGCTGG GAACATCGAT ACAATGACGT CGTCCGTGAA AACGAGAGCC TCAAAAAGGC TGCCACGGAA GCCATGGAGA CATCGCGTTC AGAACTTCTC CAGACCAGCG AAGCACTGGA AAACAAGTAC AAAGATTTGC TGAGAGAGCA AGCGGCCGAG TACGAAGAAA AGCTTAAGTC GAACCGTTTG GAAGCCGGTC CAGTTCGCCT GGCTCTGCCT GCACCCCCGG CTGCTGTGGC GACCGACTAC GAAGATGATG TTCCACTCAA TCTTACCGAC TTATACACAC GTTTAGAAGA GACCAAGGCT ACCCTGCGTC GCGAAACAGC TCGGGCCGAT CGTGCCGAGC TCTTGAATGA GCGCATTCAG AAGGACATTG CTGCTAAGGC ACCTTTGTTG AACCGGCAAC GGGAAGAATA CGACTTTGCC CTGGACCAAA TTCAGAATTA TCAGCGCCGC TTGGAACAAG CTTTAAATGA AGTTGATAAT GCACGCGAAG ATAGTAAAGA GACGCGCCGG GACGCAAATC GGCTACAGAA ACTGTTGTCG GAGAGGACTT CCGAGTCCAA GGAGTTGGCA AAACAAGTTC AGGCACTATT GGTAACTCGG GCAGGCGGGC AGGTGGGAGA AGAAATTCCG ACTTCGATTG TCGAGATTCA GAATCAAAAC CAGCGATTAC TCGCGGAGCA CCGTCGACTA ACTGAAACAG TCAGAGAGCT GGAAAGCAAG CTTGAGTCGG ATACTCTGAA GGCAAAACTG GATGCAGTGG AAGCGGAGCT AGCGGATTTG CGCGAAGACC GTCAACGTCA AGAGACTGCC GTGGAGCGTA TAGTTCAACA GAGAGATTTG TATCGAGCTA TTTTGAGCAA ACAGGATGCA AACGTGCTTG GCTCAGAGTC CGAGCAGCTT TCGGCGATGG AGATTGCCAA ACAGCAGTCG GAGCGATACA AGGCTTTGGA TCAGAAAAGC AAGACCTTGG CATCTGATCT GGCTGCTGCC AGGGGTGAAA TCGATCGCAT GGGCCGCGAG AGAGAATCTA TGGTTGAGCG GTTGGCCCGT TACGAAGCCC ATTCCGCCGA AATGAAAGCC GCAGTTGACA CATTGGAACG CGAGCTGTTG AGTGCGCGTG GTGACGCAGC ACGAAGCAAT TCCGAGTCAC TCTATCACAG AGAGAGAGCT GAGCGTGTGG AGGAGTCGCT GCAGCGTGCG CGCGATGAAA TCTCGATGAT TGGAAGTTCG AAAGCTGAAC TTCAACGCAT CAACACTGAT CTACAGCAGA GAGTTGATAT TGTTCGCTCT GAAAGTTCTC GAGTTGCCAC GGAAGTACGA CAGGCTGAGA TGAAGGCTCG GTTAGCGGAG ACACAAGTAG AGACGGCAAA AGCAAGTGAA TCGCGAATGG CGGAAGAGGT CAACCAGCTA CGTGGGGAAG TTTCAAGGCA AGGATCCATA ATAGAGTCAA TCAGAAGAAT CGAAGCTTCT CTTTCTGCAA AGAGTGGTAG CGAACGAGAG GTTCTCAAGT CTGAATTGGA GAAACTTTCT CAGGTCCACA AGTCGGAGCA AACAAGCTTT AATACGAAGA TCGAAAATCT CAACGCACGA ATTCAAGAGA TGGACTCGCG TGTTGTTGCA GCTAATGATT CAAAGGACAA GTTTCAATCA GAGTTATCTA GCGTAAAGGA TGAGCTGAAT GCTGTCACAG CGGAGAGGCA AGAGCTCAAT TTGAAGGCAC GCCGTCTGGA AGCTCAACTT CGTGCCGCAA AGAAGAAGCT TGGGGAAGGA GATGACCTAG ACGACGTTGA GGTAGCTCTA CAAGCCCGCA TTGAACATCT CACCAACCAA CTGGAGGAGA CAAAGAGTGA ACTCGCCAAT TCAAAGAAAC AGGCTGACAC ATACCAGCTA ATATCGAAGA ATGCGGAGTC GGCTCTCGCC GAATTGAGTC AAGCTACGGA GACTATGAAG GCTACGAACG AGTCAGAAGT GTTGGAACTC AACGGAAGAC TAGAAAAACT CCAAAAGGAA AACGCATCAA AACAAGAGAT TGTTCTCGAT TTGACCAAGG ATTTGTTGAG TCAGAGGGGG GAACAAGAGA AGGTCGAATC TATATTGAAA TCTGAAATTG AGAGTCTGAA AAGTGAGATG AAGACAAGGG AACAGGATTC TGAGTCTTCA GCAGCTGGGA CGGCCGCTTT GAAACTCGAC TTAGATGCTA TGCGCACCGA GGTTGCCACG GCCCAAGGCA ACTACGAGCG AGAGCTCCAA CTGCATTCGC AGGCACGCAC TGCTTTGCGT GAAGCACGAG AGCAGGCTCA GGAGGAGACA CGACTACGAC ATATTGCTGA GGAAAAAACG GACGCTTCTG CGAGGGAGTT CGATCAGCAG AAGAACGTTT GGGAACAGGA AAAGTTATCT GCGAATGAAA ATGCAAAGAT GATAGAAGAA AGTCTAAAGG AAGCCCGTGA GCAAAATAGA GTCCTTCACA TGCAACTAGA GAGTTTGGGT GCAATGGTAG AAGAGAGTCA AATGTCACGT GCTGTGGCTG CCAGCGAGAT CCCAGAGCCC GGAAACTCAT CGGAGCAAAT GAATTTGCAG AAAATGTTGT CAGAGCTGCG AGAAATTCTC AAGTTTGTAA GGTCAGAGAA CGAAATTCTT CAGACTCAAC TCGACACGGC AAAACGGGCT GCTGATCGGG AACGAACGAC TTTTCAGGTT GTCAAACGCA GCTTGGACGA GGCAAGAGCT GAACTTAAAT CGCTTCAAAA TCAAGACATC ATGGATAAAG ACCTCCCTGG CAATAATTCA GCCGAGCAGC TGAGGGATGC GGTGGAACAA CTTACTTTGT TAAGGGATAG CAATAAGCTT TTGCGAGATG ATGCAGATAA GCTGCAGTCT AATTTGACCG CAACACAGAA TGATCTCAAC GCGTTGAAGT CATCAAGGAA GCCTGCTGAG AAAGTTCAAC GCGAGCTCGA AGCCCGCATT GCGTCAGCTG AAGCCGAGAA AGAGAGTTTG AATCGCGACT TAGCTGCATG GAAGTCAAGA GTTGAAAGTC TCCTTTCCAA ATTCAATCAG ATTGATCCGG AAGAGTATGA GAAGGTTCTA AGACAAGTCG AAGAATTGAC GAAGGAGAAG GAGTCTCTGA CTGCATGGAA GAAGACAACA GAAGCGGAAA ACACACGAAT CAGAGAGATA TGCCGTAATC TCAAGAAGCA TATCAGCGAA TTGAAGAAAA CGATAGAGGA ACAGAAAAAA GATATAGATA AGCTCACGAC GGAAAAGGCT ACCCTCACAA CCAAGTCTAC AGAAGGAACT TCAGCTGCGA AGGAGCGAGA TGAGCTTAAG GGAAAACTGT CCCAGGCCGA GAAAGACACA GCTTCCACGA AGACGGAGTT GGATGGCGCA AACCATCAAA ATGAGATCTT GAGAGAGAGG ATGCGCCAAA TGGTAAAGAC GAGCAATGAG CTGCGGAAGA AAGAGAGGGA GCTGGTAGGA CAGCTCGCCG AAGCAAAATC TGCAACCCAA ACAGACTCCA TTCAAGACAA ATTAGGGAAA CCGTCTGAAA ATGCTTCGGA AAAACGTACG TTGGCGTCTT CGAAGATAGG CATTTCGACG CCTACCTTCT CTGCAACAAC ACCAGAGTCG ACTCAGAGGA CCGCAGGTTT GGTGGATGTA AGAGCACCTC AGATTCTTCC GACTATTCCG ACTAATGGAT TTAGGTTTGG ACCTAGTACC AAGCCGAAAC CGAAGTCTCC GATGGCTGTG TCCGAAGACG TACTCCAGGC AAAATCGGAG GAAAAGATCG AAACACGGGG AGCTTCCGAA CTAAAAGAGC CGACTTCCGT TTTTAGCGAG AAAGCCCCAG AGTTTGTGCC ACACTCGCAA ATGTCAACAA AAACGGAGAC AAATATTTCA TCTACGAGCG GAAGCTCACC TCTACAAAGC GGTGGTGACG TAGCCGACGC AAAACAACGA AGCTCAGGAG AGCTGGCTAC GACGACGCAT GCTACACCGA GTGAGAGTCA AGAATTATCG ATGAAAAAAG AAAAGCTTTT AGAGAAGAAA CGACTACTGG CAGACGCTAA GAAACGAAAA ATCCAGGCAG AAGCGGAGGC AAGAAAGACC GACACTGACG GACTGCAACA GGTAGCAAAA AAGCCAAAGA CCGAAGACAC TGAGAAAGAA TCGAGACTGA TTGGAGCAGA AGTGGGAGAC CGCACTCTGA CGGCTGGTAC AATGAGCGAG CAAGAGACCA GTAATTCAGC AGAAGTTTTG CTAGAAGCAG ACAAAACCGA GTCGGAACTT GTTGAAGTTG AGAATCTCGA CAACGAAGAA CCCTCGAAAA GCGAAATAGC TGAAAATGAG ACCGAACCAG CATTTAAGCC TTCATGTTTT GGAAGCGGAT CGACTGCTAC GACAAGCCCA TTTACGAAGA GCAATCCTGC CACGCCTTTC GGCCGATCGG ACGCTTTTGG TCAATCGGCA ACCGTCCTAC CCGGTGCCCC AACTACATTC GGGGTCACGT CATCCACCGT TCAGTCTTCA TCTGGTTTTG GGGAGGCATT TTTGAGCCAA ATGAAGCCTC CAGGAGGTTC TGCTACACCG CCCAGCTTTT CCTTTGGATC TTCCGAGATT CCCGGCCAGC AATTCCAGGC GTCAGCTTCT GTATTCGGTT CTTTCGGCGG CAAATCAACG CTTGGGAGTC TTGCCGGTAG CAAAGAACCG TCAATGGCAG CTTTTCCATT CGCTAGCCAA CCAGCTGTAC AAGAAAACAA GGAACAGAAC GATGACGCCA AGGATGCGAC CGAATTAGAG CAAACGGAAG AGGACGTTGT GAGGTAGCAC CTTATTTGCT GTTGCTTCTG CTTCGCCGGG GGCGTTCCTA AGGACGGTAG TACACCTTTT CTCCTTGTGG CAATCCTCCT TCGGAGTACT CACGCCATAG GAGATGTATA TCCTGTCTAG GACTAACAGT AATTTTACGA AGTCAGACCA AAGTACTATA ATGCACGAGT TTTATGTTCT CGTTATTTGG CTGCCTATAT ATAACATTTA CTTCCTTTCC ATTCTGGCAG CATTGGGCTC GCCTGCCTGG CTCTCGAAAG CCCCATTTCC AAAAGCCTCC AGACCTCGAA TAG
|
Protein sequence | MSSPSDETIA GPPSSPTTED AGTQARKASA ALTSKVMLEE LERQYMQIKG ERDDLQQELT AAEEKFKTLN ESLTSNQALE VENAARQHEL RVAQERVKAL EEEQTATKER TDRAQAESDR LREEIARLAG SNRELSENIA TFEVQSKLSE SQAIPLFHEK QRLQTELDSL QAHANWLEQE LTAKSQDYQK LQRESRDRSI QLQLQLDQTI NEKEAFEARL DELHKMERRL QDKVEELSHD LLTGKQAMTD LQESTEIEIR EERRLVQLQK THLDRWEHRY NDVVRENESL KKAATEAMET SRSELLQTSE ALENKYKDLL REQAAEYEEK LKSNRLEAGP VRLALPAPPA AVATDYEDDV PLNLTDLYTR LEETKATLRR ETARADRAEL LNERIQKDIA AKAPLLNRQR EEYDFALDQI QNYQRRLEQA LNEVDNARED SKETRRDANR LQKLLSERTS ESKELAKQVQ ALLVTRAGGQ VGEEIPTSIV EIQNQNQRLL AEHRRLTETV RELESKLESD TLKAKLDAVE AELADLREDR QRQETAVERI VQQRDLYRAI LSKQDANVLG SESEQLSAME IAKQQSERYK ALDQKSKTLA SDLAAARGEI DRMGRERESM VERLARYEAH SAEMKAAVDT LERELLSARG DAARSNSESL YHRERAERVE ESLQRARDEI SMIGSSKAEL QRINTDLQQR VDIVRSESSR VATEVRQAEM KARLAETQVE TAKASESRMA EEVNQLRGEV SRQGSIIESI RRIEASLSAK SGSEREVLKS ELEKLSQVHK SEQTSFNTKI ENLNARIQEM DSRVVAANDS KDKFQSELSS VKDELNAVTA ERQELNLKAR RLEAQLRAAK KKLGEGDDLD DVEVALQARI EHLTNQLEET KSELANSKKQ ADTYQLISKN AESALAELSQ ATETMKATNE SEVLELNGRL EKLQKENASK QEIVLDLTKD LLSQRGEQEK VESILKSEIE SLKSEMKTRE QDSESSAAGT AALKLDLDAM RTEVATAQGN YERELQLHSQ ARTALREARE QAQEETRLRH IAEEKTDASA REFDQQKNVW EQEKLSANEN AKMIEESLKE AREQNRVLHM QLESLGAMVE ESQMSRAVAA SEIPEPGNSS EQMNLQKMLS ELREILKFVR SENEILQTQL DTAKRAADRE RTTFQVVKRS LDEARAELKS LQNQDIMDKD LPGNNSAEQL RDAVEQLTLL RDSNKLLRDD ADKLQSNLTA TQNDLNALKS SRKPAEKVQR ELEARIASAE AEKESLNRDL AAWKSRVESL LSKFNQIDPE EYEKVLRQVE ELTKEKESLT AWKKTTEAEN TRIREICRNL KKHISELKKT IEEQKKDIDK LTTEKATLTT KSTEGTSAAK ERDELKGKLS QAEKDTASTK TELDGANHQN EILRERMRQM VKTSNELRKK ERELVGQLAE AKSATQTDSI QDKLGKPSEN ASEKRTLASS KIGISTPTFS ATTPESTQRT AGLVDVRAPQ ILPTIPTNGF RFGPSTKPKP KSPMAVSEDV LQAKSEEKIE TRGASELKEP TSVFSEKAPE FVPHSQMSTK TETNISSTSG SSPLQSGGDV ADAKQRSSGE LATTTHATPS ESQELSMKKE KLLEKKRLLA DAKKRKIQAE AEARKTDTDG LQQVAKKPKT EDTEKESRLI GAEVGDRTLT AGTMSEQETS NSAEVLLEAD KTESELVEVE NLDNEEPSKS EIAENETEPA FKPSCFGSGS TATTSPFTKS NPATPFGRSD AFGQSATVLP GAPTTFGVTS STVQSSSGFG EAFLSQMKPP GGSATPPSFS FGSSEIPGQQ FQASASVFGS FGGKSTLGSL AGSKEPSMAA FPFASQPAVQ ENKEQNDDAK DATELEQTEE DVHWARLPGS RKPHFQKPPD LE
|
| |