Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43680 |
Symbol | |
ID | 7197450 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1171917 |
End bp | 1177675 |
Gene Length | 5759 bp |
Protein Length | 1458 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178091 |
Protein GI | 219112679 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.16765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGGGT TTGGCAGCAG TTTTGGCAGT CCGGCTCCTG GTGGTCCAGC CTCTGGACCT TCTTTATTTG GCGCGAGCAG TAGCACACCG AACAACAGCG CGACGGCGGG TCACACACAA TCCAACAACA CCGGATTTGG CGCCTCGACG TCACCGCCAG CTTTTGGCGC ACCCGTTTCC TTCGCTCCCA CTGTCGCATT TGCAGCTCCC GCTCCGGCTC TCGCCGCGCC TCTCTTTGGT AGCCCAACTT CTGTTTTCGG AAGTAGCAAT GGACCCGCCC AAGGGTTTGC GATGAGTACT GGCTTTGGAA GCTCGGCCTC ACCAGCACCT TTTGGCAGCT CACTCTCCGC ACTTCCGTTC GGATCAATAC AGCAACCTTC AGCGGCAGCC ACTGTGTTCG CGTCAGGCGG TTTTGGAACT GGTAATAGTT CAGCTGCATT TGGACCTATT TCCTCACCGG CACCATCTGC GTTTGGATCG AACCAAAATA CTGCGTCTAT GTACGGATCT GCAGGAAGTG CTAATTTATC GTTTGGGCCT ACTTCAATCA CGAACTTTGG ATCTGCTAGT AGCAATCCTT TCGGCGGTCC GCCACGGCAA GCCCCTTCTT CAGATGCCTA TAATTCTTTG TCGGCTACGC CTTTCGCGAC GTCATCAATG TCGACACCCT TTGGATCGAC GGCTCCGACC CAAAGTACAG GTACGCCCTT TGGAAGTAGC ACCAATGCTA TGGTGAATCC TTTTACTTCA ATCTCCACTA CTGCACCCAA CGGATCGACG GTTCCGACCC AAAGTACAGT TACGCCCTTT GGAAGTAGCA CCAATGCTAT GGTGAATCCT TTTACTTCAA TCTCCACTAC TGCACCCACC ATATCGATGT CTTTTGGTGC GTCCACCAAT CTTGCTTGCA ACAATGATCT AGAAATGGAA AGCAGTAGTC CGACGCCGCA CGGCATGTCA CCAATCATGG GATCTCCAAT TCATTCAGCA TCTGACAGCG AAGATATGGC CGACGCCGGA GGAGGCAATA ACGCGTCAGC CTTTTCCAGC AGCAACGCAC CCTTTGGAAA GCCCATATTT TCTTCTACAA CAAAAAGTGG ATGGCCCGCT CCGGCGCCAT TCGGCGGAGG AGCGCTTGGG ACTATTCCGG AATCTGCGTC CACGTTCCCG CCCTCAATGA TGACCAGCGC CGGGGAAGTT GCTCCGAGTT CGGTTTACAC GGAATCCGCA GATTCCTTAA CCAGCAATGC AGATGACAAT CGATTGGCGG AACTCAAAGC ACGCATTGAA GAAAAGAAGC GTATGCTGTT GGAAAAGAAG AAGCGGGAAG CCGAAACATC TTCTACAACG ACCGCTTTGA ATGCGGATGC ACCCTCATTT GTTCCGGAGT CGACAAGCAC GAACGTACAA AGCGCTCTAT CCAAAAAGGC AATCTTGTCT CTTAGCGAAG ATCGGTCGCC GGCACCTCGC AGTCGAAGTA CCAGCCCGAT ACCACCGATC AACATTTCGT CTCTCTCAGA TCGCAACGCA CTTCGGTTTA CTGGAAGTCA TAATGGGAGA TCTCACATGC CAGCTGATCT ACAATCCAAG TCAGAAGAAG CTCCAAACTA CGCGGCACTT CGTACGGATC CACAAGGAGG TCGTGCAGAT TTAGGATCGG CTGTTTCTAT GGTTGGCACG TGTCCGTACA TGTGTCCCGA CGAAGAGCTC CTTCGCCGAG AAAGAGAGGG GGACATTCAG CTTCTAGAGA CCCCACAGCC GGGAACGTTG CACCCAGAAT CATGGACGTA TCGTGACACC GTCGTTAAGC GCTTTCGAAG ATCAGCCGCC GATTACAAGC TGGATGTACC TGAATGGGTG CGTCCTCCTG ATGTCCTGGA GCGTGTTTGC TCGTATCTGG AAGAATGGAT TATGGTAAGA TAAAAAGTTG GCAGCGTAAG GCGACATAGG CTACCGTCCG ACACCTTGCT TTCAATGAAA TTTCAGGAAA AAGATCGACA AGGACCAGAC CAGCGCTTTC CTCAGGGAGG TGTGCCCCCG TCGCTCGATG TCTACCAATT TATCTGGGAC CGCACAAGAA TGATACGGAA AGATTTCATC TTGCAAAATT ATGTTGGAAC AGGAGGCGCT TGTGACGCCC GGGCTGTACG ATGCCACGAA CGGATAGCAA GATGGCACGC AATGTGTGAG CACCAATTGA GTCATATCTC AGAGTACGTG TCCCATCAAA GTCAGCAGAA TATTCAGGAG CTCGGTCAGA CGATGAAAAC TCTAAATCAA TACTACGATG ACTCTCTTAA GCGATCCCTC ATCGAAGTAC CTGATGCACA GGGCAACGAG ACAAGGCTTA ACTTACTTGG CCAAGCACAC GGATGTGAAT CCAATTCTGT GCAAGGTCCC AACCCTGTTG ACTATGATGG GACTCCTTTG TCGAATGACG AAGATACGAG TCGGGTATCC CTTCGGGTTA TTGGCGGTGA TGTATCGCGT AGCTCGATGC ATGGTACTGC CGAACCCGAA ATGCGTGGAT TGTATATTCT CCTTACAATC GACAACGACG GAGGAATGGA AGTCCTCAAG TATGCTGCGT GGCTTTTTCG AGAACGTCCT TATATCTATC AATCGCAGCC AGTCCAACTC GCTATGACCA TCTACAAGGT AAAACTTTTA AATACATTTA TTGGAATTTG ACCAATGTCT TACCTCACGA CTTGCGCTTT CTCGCCTCAG GCAAAGCGAG AGTTTAATTA CGCTCGTTTC TTTTCCATTT TGCGATCATC ATCGACTCCA TATCTATTTG CGTGCATCAT GTTTAAACAT GTTCAAGTGA TGCGCAAGAA TGCATTCCAA ATCATGTCAA AAACGTTCGG AGTCCGTAGT TCTGCAACTG AAGCTATCTA CGATGCCTAT CCATTAAGGG ACTTGATGAG ACTGTTGTGT TTTGAGGACA TAGAGGAAGC TAGAAATGCA TGCCAACACT ACAACATTAC CTGCAAAGAA ATGAAGATAA AAACGTCTGC TAGTGGTACT CGTATTGAGG ATATCGTGTT TTGGAGAGCA ACGAAATTTA GAGTACCCAC AGATGCCGAG AAGGGAACAA TTGTTCAGCT ACGACCCAAA AAGATGATCA AGACCATTGA AAGCAAGTTA AATGGAGCCA CCCGACTAGC CGTTTGCAGA GGAGAAGTGA GTGCTTTTGG GGCATCGCCT ACAGACAGAG AATTCCCTCT TGTTTCGGCT CAGAGTCAAA ATGTTGCTTT GAAGGAAGAC ACTGTAAAGA GAAATTCAGG AGAGGAGGTA GCTTCGTTAG CAGCAAGGAA GCAAGAAATT GAACGCCAGC AGCAAGAAGA ACTTGAAAAA CGACAACAAA TCGAAAGACG TAGACTTGAC GAAGAACGGA TACGCCATGA AGCCGAAAGG GAACAACGCA TTTCGTTAGA GCTTAAACGC CAGGACATGG AGTCGAAGCA ACGTGAGATA GAGCTTCGGC AGCAACAAGC GAAGGAGCGG ATAGAATTGG AGCGCCTGCA AAAAGAAGAG GTGGAGCAGC AAAGAAAGCT TGAAGAGGAA GCTGAGCGAA AAGCGCGAGA CGAAGAGGTC CGAAGGCAGC TCGAGTTGGA GAGACTTGAG AAAACTCGGC GCGAGGAGCT CAAGCGCCTC GACGAAGAAG CCCGGAAACG AGAGGAGGTT GAAAAACAGA GGTTACAGCG AGAGGCAGAA GAAGCAGAGG CTCGCAGGCA AGAGGAGGCG GCTCGACAGA AAAAGCTGGC GCTTGAAAAG GAAAAGGAAC GACAGCAGTG CGAGGAAAAA GCGCGTTTAA ATGATGTTCG ATGGCGAGAG CAAACTGAAG AGGCAAAAAA AATTCTTTTA TGGAAACGAT GGAAGCAAAA ACTTCCCAAG TATTTGGAAA ACGCGGAAGA GACCGCGCGA ATGTTGCAAC ATTTGAGCCC CACCTCGTCA GCCTCCCTGA AACTTGTGAA GCTTCTTCAC GGAGCCACTA CATTTCATTC GCATCAACCT GCTTCTCTGC TTCGCCACAA ATTTTCCTAT CCGCCTGATT TGCGAATGGT TCTTGAAAAT CTTCTCAACC AAGACACCAA ACAAATTAAT CTGTCGTCAG GCATTGCTGA ATCATTATCT CGGATATGGC GTTGTTCAAG GTTGGACAAT TCTTGTCAAA AATGTTTTTT ATTCAAGATC GGAGTCTTGT GTCCACAGTC AAAATCAATT CGCCTACAAA GTCTTTCAGA ATTACTGAAA GCTTGGCTCC AAACTGTTCT TAGCTTTCAC AAAATACACG AGACATCAAC AACTTTCGTA AACGTCAGGG TGGTTGTTAC TGACAGCAAC CATTCGAAAA TGGATGATTG CGATTGTGTC TTGGTGGTGG TTCCTGTTAT GTTTTCGATA GGCGAGGAGC TTTTGCGCCT CAAAGGTTTG TCTTCTACGA TTGGCCGAAC CGTGCCGCGA GTTGTCCTCG GTCTTACGGA CAATTTTGAT CGGGCCTGTG TGGACGACAT GAATCGTCGC TTGGATGATA CGTTTTCATC TTTATCCAAC AATATCACTT TAGTCAATAA TGCTGAACTA TCACTAGAAG ATGTACGCGG TGCTTTGCAT TGTGCCTGCG AACAAATTGC ACAACGAATT ACGGAAGAAG CTCCAGTGGT TATTCAAAGG ATGACTCCCG AGCAAGTTGC GTGCAAGTGC ATATCGGACG CTATCTCACA ATCTGGTCCT ACCGAGAAGC GCGATAAAAT CGTCGAACTA GCACAAGCAG CTCTCATCAT AATGATCGAA GAGATGGAAG ACGTGGCGGC AGAAAGAGAA AGGGCTCCTT CTTGGCCGGC ATCTGAGTTT GCGAACGACA GCGGCCGCGT TTTGCATTAC TTTGGCCGAG GCGAACACTT GCCGCTGCTT TGGTCTTCGA CCTTGTCCTT AGGAAACGTA GAATCCCGGT TGACTCCTAT GTCGACTGTT TTGAATGGTT CTTTGCCTGA TGCATTGGAG CAGTTGCTCC GCGGGGCACC GGAACAGATT CTTTACGAAT GCCACACTTT GCTGGACAAG AGGTTGTTTC GGCAATGCCT ACAGCAGTCT CTCTTTTGGT TCAGAGACGC TATACATCCC TGCACCAGCG AGTCTTTTCT GTATTTCGCC CCGGGCGACG TAGATTGTAT TGCTCAAGCA ACCGGTATTC GGCTCCGAAA CCTTGTGGAA CGAGAAAACT CGATTTGCTT CGAAGCGAAC GAAGGAAAAG ATGATGTACG GGAAAATGAC CTGTTTGCGG AAGTTATCGA CTCGCCGTTG GAGTTTGACG GTGAAGCACA ATTAGCGTTG GGGAGCGCCA CCGAGACACC GACTCCACCG GCATCTCCCG AAGTCACAAA GACGACGCCT AAGCGGGCGT TGAGTGATGC AAGCATACAG CCGATCGATT CCGTGGAAGC CGGCTCGTTT GCAAAGCAAA CACCGGGTGC GGTTCCGACT CCACAGTCCC TATCGTACGG CACGTCCCCG AAGCGTGCCC GTCAGACGAG TGTGGCGACG ACATCACTCG TCGCTGTCTC GAGCGATGTT ATGCACAGCC GTAAGTGGAC GGAACGATTG GAAGCGTTGG CGTCAGGTGA CGTGATGGCG GACATGATTG TGGGCCAGTA TATGTTGTCG GCGCTCGTAC AAGATGCTCC GCCTCTGACG AAAGAATAGT AGGACTAGTT GGAGAGTGTA ACGAGTTTAG CAATAAGCAA CGTTACTCT
|
Protein sequence | MSGFGSSFGS PAPGGPASGP SLFGASSSTP NNSATAGHTQ SNNTGFGAST SPPAFGAPVS FAPTVAFAAP APALAAPLFG SPTSVFGNRS PAPRSRSTSP IPPINISSLS DRNALRFTGS HNGRSHMPAD LQSKSEEAPN YAALRTDPQG GRADLGSAVS MVGTCPYMCP DEELLRRERE GDIQLLETPQ PGTLHPESWT YRDTVVKRFR RSAADYKLDV PEWVRPPDVL ERVCSYLEEW IMEKDRQGPD QRFPQGGVPP SLDVYQFIWD RTRMIRKDFI LQNYVGTGGA CDARAVRCHE RIARWHAMCE HQLSHISEYV SHQSQQNIQE LGQTMKTLNQ YYDDSLKRSL IEVPDAQGNE TRLNLLGQAH GCESNSVQGP NPVDYDGTPL SNDEDTSRVS LRVIGGDVSR SSMHGTAEPE MRGLYILLTI DNDGGMEVLK YAAWLFRERP YIYQSQPVQL AMTIYKAKRE FNYARFFSIL RSSSTPYLFA CIMFKHVQVM RKNAFQIMSK TFGVRSSATE AIYDAYPLRD LMRLLCFEDI EEARNACQHY NITCKEMKIK TSASGTRIED IVFWRATKFR VPTDAEKGTI VQLRPKKMIK TIESKLNGAT RLAVCRGEVS AFGASPTDRE FPLVSAQSQN VALKEDTVKR NSGEEVASLA ARKQEIERQQ QEELEKRQQI ERRRLDEERI RHEAEREQRI SLELKRQDME SKQREIELRQ QQAKERIELE RLQKEEVEQQ RKLEEEAERK ARDEEVRRQL ELERLEKTRR EELKRLDEEA RKREEVEKQR LQREAEEAEA RRQEEAARQK KLALEKEKER QQCEEKARLN DVRWREQTEE AKKILLWKRW KQKLPKYLEN AEETARMLQH LSPTSSASLK LVKLLHGATT FHSHQPASLL RHKFSYPPDL RMVLENLLNQ DTKQINLSSG IAESLSRIWR CSRLDNSCQK CFLFKIGVLC PQSKSIRLQS LSELLKAWLQ TVLSFHKIHE TSTTFVNVRV VVTDSNHSKM DDCDCVLVVV PVMFSIGEEL LRLKGLSSTI GRTVPRVVLG LTDNFDRACV DDMNRRLDDT FSSLSNNITL VNNAELSLED VRGALHCACE QIAQRITEEA PVVIQRMTPE QVACKCISDA ISQSGPTEKR DKIVELAQAA LIIMIEEMED VAAERERAPS WPASEFANDS GRVLHYFGRG EHLPLLWSST LSLGNVESRL TPMSTVLNGS LPDALEQLLR GAPEQILYEC HTLLDKRLFR QCLQQSLFWF RDAIHPCTSE SFLYFAPGDV DCIAQATGIR LRNLVERENS ICFEANEGKD DVRENDLFAE VIDSPLEFDG EAQLALGSAT ETPTPPASPE VTKTTPKRAL SDASIQPIDS VEAGSFAKQT PGAVPTPQSL SYGTSPKRAR QTSVATTSLV AVSSDVMHSR KWTERLEALA SGDVMADMIV GQYMLSALVQ DAPPLTKE
|
| |