Gene Information |
Locus tag | PHATRDRAFT_47622 |
Symbol | |
ID | 7202825 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 310766 |
End bp | 315482 |
Gene Length | 4717 bp |
Protein Length | 765 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181882 |
Protein GI | 219123127 |
COG category | |
COG ID | |
TIGRFAM ID | |
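
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length. The function names `span_length` and `min_coding_nt` are hypothetical helpers for illustration and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_coding_nt(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein of the given length (stop codon excluded)."""
    return protein_aa * 3


# Values copied from the record above.
gene_length = span_length(310766, 315482)
coding_nt = min_coding_nt(765)

print(gene_length)              # span covered by the gene on NC_011682
print(coding_nt)                # nucleotides that directly encode the 765 aa product
print(gene_length - coding_nt)  # remainder: non-coding sequence within the span
```

The span is more than twice the coding requirement, which fits the record flagging the gene as spliced, although this arithmetic alone does not say how the remainder divides between introns and other non-coding sequence.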
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACGATCCAG ACGTCTCCTT TACATAACAT TTTGATCCGC GATAAGAAAG CATATATTCA CATTCCATGC GTACTGATGG AGGAGGCGTT GTCAGAAGAC GCGAGCGGGC GGAAGCCAAG CCATTGTTAC TTCCCGTTAG AGAGTGCATT TTACTCGTCG TCTTTTTGGC AATCCAAGCA TCATGTTTTA TGACACAAGC CCCAGCACGA AGGTCAGTAC ATCCTTCAAT AAACACTGGA AGCGTACACC TACGTTTTGC AAGGCGACGA CTCTTCCTAT CTCGGATGCC GATTCCACAG GTTTATCTAC CGACCTTATC TGCCTCTATT GCAAAATCAA CATCACCGGC TGATCGCTCC CCAGACTATG CCTCGAATCG ATCATTGTCA AAAAAACGAA AGCGAAACAC GAAACGCTAC GAAATTCGTC AATTGTTTCA AAAAGCCAAA GATCTCGAAC GCAAAGGCCA CTGGCGCAAG GCAGCCGACG TACTCAATGA AATTTTAGTT TGGGATCCAG CCGACGCTCA CAGCCACCTT GCTCTAGCGC GACTGGAAGG CCGCCGCTTT CCAGATACTA ACAAAGCATG CGAAGCCTTT GGCAAAGGAA CTGAAGCTTG CCCGAATTCG GTGCATCTTT GGCAGGCTTG GGCAGTTCAT GAGGATTCCA GTGGTCACGT TGATCGTGCC AGAGAACTTT TTGAGAAGGC TCTGGCAATA GATCCGCACA ACCCTTATGT GTGCCATGCG TTCGGACTGA TGGAGCGAAA GCTGGGCAAT GTGGAATCAG CGAAGAAATT ATGGGCTCTA GCGCTACAGA AAAAAAGTAC CGCTGCTTTG GTTTGTCAAA TGGGAGAGCT GTTCATTGCT GAGAATCAGC TGGACCAAAC AAGAGAATTG TATTCAAAGC ATCTGCTACG GCTCGAAACA GCCAAAGATC GAACCGAAGT CTACTTGGCG GCAGCCTGGT TAGAAGAACG CTACTTCACT AACTATATCA AAGCGGAAGA ACTCATAAAG AGTGCTCTAG CGCTGAATCC TAGTAGCAGC GTGGCCCACG TGGCATTGGC TCGACTCGAA GGCAGGAATC GGCAGCGCAT ACGAGGAGAA GGGTACGAGA GTGCCACCGC TAGGCGATTG GCTAGTGCCT GCATAAGCCT TGAGACTGAA GGAAACACTG TGCAGCCAGA CGATGGTCGA GTCTTTAATG CCTGGGCTCA TATAGAAATC AAAGGACGCC GATTTTCATC GGCGCGTAAC ATCCTGCGTC GGGGCTTGAG CAGGTACCCG GAAGACTATT CACTTTTGCA GGCGGCAGGA ATATTAGAAG AACGCGTGGG TAACTATACT GGAGCTCGGG CTATATATGG CAAAAGCCTG CGGATACAGC CTGCTGCTCC AACTTTAGTC GCCTACGCTC TTCTGGACTT GCGGCACCCT TCATCCGGAG AGGCCAACTT TACCAGAGTT AAAGCGCTCT TCGAAGAAGC CATACTGTTG GATCCTCGCC ATGGCCCAGC CTACAATTCT TACGGAAATC TTGAACTTCG ACAAGGAAAC ATCCGAACAG CACGTAACAT ATTCGAACGT GGTATCTTAG CCCACTGTTC GGATGTGGCT TCTGTCTATC ACGGCTACGC CCGTCTGGAA TTGTCCATTG GAAATGTGAA GAAAGCTCGA GAAATTTTAG TAGATGGCAT TCGTGAGGCA TGCCAGCAAG ACGCTGGCAT GGATTCGCCA CATCGTGAGC GAGCACTGTT CCTTTCACAC ACGTTAGGAA TGCTTGAGTT AAACAGCAAT 
CGTCCTATAG ACGCTCTGAG TATTTTCATC GACGGCGTCA ATCGGTACGG TAATTCTTCA CAGCTTCTTT TGGGCGCTGC CCTTTGTGAA GTAAAGCTCG GGAATGAAGT CAATGCCCGT ATGTTGTTTG AGCGATCACT ACTGGTCGAC GAGAAGCACC CTCAGGCGTG GCAGGCTTGG GGCGTCATGG AGCTTCGGGC GGGAAACACG CTTACAGCTC AAACTCTCTT TGAATGTGGC ATCAAAGCGG CACCGAAGCA TGGTGCTTTG TGGCTCGCTT ATGCAATTTC TGAAGGAAGA CTTGGAAATC CAGAAACGGC CCGGTCTCTC TTCGCTAACG GTATAAAGCA TTCGCCTAGA CACATACCGC TATATCAAGC TTGGGCATCC TTGGAACTTC GGGAAGCTAA CTACAATGCT GCGAAAGCCT TGATTAGCGA GGCCCTAACC CGAGACAAGC GAAACGGCTC AGGATGGCTG GTTGCTGCCG AGATAGAAAA GAGTCTTGGT AACGCAGGAC TTGTCAATCT TATACTAAGA AGAGGAATTG AGTGCGCACC GACAAATGCT GAGCTTTATA GAGCCCTTGG AGACTCATTG CTTCAGCGTG GAAATGTTCT GGAAGCTCGT GAGATTTTTG AGAAAGGAAT TGATGTCGAT CCTCTCCATG CCCCGCTATA TCACTCGTTG GCAGAGCTCG AAGCCAGAAT ATTCAACGTC GAAGGCCTAT CAAAATTAAA CGCAAGGGCC ACTAAAGTGT TTAAGACAAA TGCTCTGGAG CCTCCAAGCA CTGCGGGTGA GGAAGCATGG GGTACAAAAA TCCGAGCCGG TCGTCTAAGT AGTACGCCCA CAGGGGTGAA GGCTTTAGCA GAGCGAATTG TCGAGGATGA CATCTCGGAC CTTTTTTTGA ATAACACCAA TCTGGACCTA TTCATTGAGA GCATCAGTGA AAGCCTCATG GAGGATGGTC TGGTGGGTCA ACTTTTCAAT ATGGAAATGG ACAGAATAGA AATTGACTCA GATGTTGCTA AAACCTAAAT ACCATGCTAC GCTAACGAAA GTAAAAGGGA ATTAACAGAA AATGAAAACG AAGTTTCAGT CAACTAAAAG GAGGGTCGCG CTCTTTGCGT CCAAGTCTGG CTCGTTTCCT TGCTAACGTC CGTGTCATTT ACTCGATGCA AAGTTTCTTC GCCTCGCCGC CCATAACCAT TCTGACTTTT CGTCTGTGCA GATGAAGCGA TGGCCTCAGC GTTTCTGAGC TCTGCTTCAG ACAACTTTAG GTTCTCAGTG AGTACATCAA TTTTATCAAT CATATATCGC TCTTCTTCGT CTCTCTTTTG CAAAGCCTCA ACAAGCGAGT GATTCTCAAC AAAGGCGTTT TGCAGTCTGG TTTGCGTCAC CTCGTATCTA CTCTTCAACT CCACGATTTC TTCGCGCAAA CGCGATTCGT TCTTTTGTGA GAATCCCGTG TTTCCACTCA AGTGAGACAC ATTCAAGCTT TGAGTTTCTT CGTCGCGCCT CCCGTAGCCA TTCTGATTTT TCGTCTGTGC AGATGATGTG ATGGCCTTGG AGTTCCTGAG CTCTGCTTCA GATAATTTTA GTTTCTCAGT GAGTACATCA ATTTCGTCAA TCATACATCG CTCTTCTTCA TCTCTCTTTT GCAAAGCCTC GACAAGCGAG TGATTCTCAA TAAAGGCGTC TTGCAGTCTG GTTTGCGTCA CCTCGTATCT ACTCTTCAAC TCCACGATTT CTTCGCGCAA ACGCGATTCG TTCTTTTGTG AGAATCCCGT GTTTCCACTC AAATGAGACA 
CATTCAAGCT TTGCGACATA TCTTCCAGAG CATCGTGCAG CTGCGAATTT TCTTTTTCAA GCTGTTGTAC TGCAGATTGC AACTCATGAA GCTTCTTAAC ATTTACATCG TTTGGCGTGG ATCGCGATAG CGTGGACATT TCTTCATCCA TTTTTCTGCG GAGATCATCA ATTTCTTTCT CCAGTGCCCG CTTTTCGCTT TCGGCCTCTT CCAGATGTAG CTGAAGAGGT CGAAGTAAAT TTAAAGCTTC ATTGAGACTT TTGTTTAGGG CTCGTTCCGC CTGTCGTTTC GATTTGAGCT CCTCTTGCGA CTCCCCGAGC TTGTCGTTCA GCTCGTCAAC TCGTTTCTGA GCTTCATCCA GCCGATGCTT GTCTTCTGCA CTCAACGCCT GAGTTTGCGC TTTGGCAAGT TCCAAATCGC CTTGTATACC TCGCAATTCC ACTTCCTTTT TTGCTAATTC CCCTTTGAGT ACTATCATTT CACTTTCCAG AGCCCTTTGT GCATTCTTCA CCCCTTCACC AACATCCCTA GCCGTCGCGG ATGGACCAGG TGATAGATGA TTCTCTGCCG AGCGTAGCCG TCCCTGCAGT TCGGTATTTG TAGTTTCAGT CTCGTTATTT TGTTCGTGAA GAGCGCTGAA TTTCGTGCGA GCATCGCGTA ACAAATTTTT TAGCTTTTCT TCAGAGGCTC GTTTGGCCTG GAGATCCTGC CGGGCTTTGG AAAGCGATTC ACCACCAGCC GGAGTAGAGG GGGATCGATT TGCCGACACG AGAACCGTTG CGCCTGCAAG TTTCGACTTG GAAAGCTGAA TCTGCAGAGA TTGAATTTCT TGCTTCAGGT CTTCAATCTC GTCATCCTTT TCGTCCATTT GCCTGCGCAT TCTCTTCATT TCGTCTTTGT CGATGCCGTT TTTCTTGAGT TGATCGATTT GAAACAGTAA TTGCTGTTTT TCCCTGTCCG CTTTTGCAGC AGCATCTTGC AATGGCTTGA GCAAGCCTAC AGCTTCTTTA AGACTGGTCC GCAGCTCTAC GTTTCGTTCC TCCCGCGCTT GTAAATCGCC TTTGTACACC GACACATGCG ATTCAAGCTC CGACACTCGC TTACGCG
|
Protein sequence | MFYDTSPSTK VYLPTLSASI AKSTSPADRS PDYASNRSLS KKRKRNTKRY EIRQLFQKAK DLERKGHWRK AADVLNEILV WDPADAHSHL ALARLEGRRF PDTNKACEAF GKGTEACPNS VHLWQAWAVH EDSSGHVDRA RELFEKALAI DPHNPYVCHA FGLMERKLGN VESAKKLWAL ALQKKSTAAL VCQMGELFIA ENQLDQTREL YSKHLLRLET AKDRTEVYLA AAWLEERYFT NYIKAEELIK SALALNPSSS VAHVALARLE GRNRQRIRGE GYESATARRL ASACISLETE GNTVQPDDGR VFNAWAHIEI KGRRFSSARN ILRRGLSRYP EDYSLLQAAG ILEERVGNYT GARAIYGKSL RIQPAAPTLV AYALLDLRHP SSGEANFTRV KALFEEAILL DPRHGPAYNS YGNLELRQGN IRTARNIFER GILAHCSDVA SVYHGYARLE LSIGNVKKAR EILVDGIREA CQQDAGMDSP HRERALFLSH TLGMLELNSN RPIDALSIFI DGVNRYGNSS QLLLGAALCE VKLGNEVNAR MLFERSLLVD EKHPQAWQAW GVMELRAGNT LTAQTLFECG IKAAPKHGAL WLAYAISEGR LGNPETARSL FANGIKHSPR HIPLYQAWAS LELREANYNA AKALISEALT RDKRNGSGWL VAAEIEKSLG NAGLVNLILR RGIECAPTNA ELYRALGDSL LQRGNVLEAR EIFEKGIDVD PLHAPLYHSL AELEARIFNV EGLSKLNARA TKHCG