Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46639 |
Symbol | |
ID | 7204479 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 51483 |
End bp | 56233 |
Gene Length | 4751 bp |
Protein Length | 1392 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185616 |
Protein GI | 219120769 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCGCGT TTGTCTCTAT TTGTGTCAAA AGAGCTGACA ATATATTGAC TGCGACAGCA AGCTCCTGCT CGAGACACTT TGGCGTGCCA CAGGCTCTCA ATCGTTGGCG ACCTACACAA TCGACCGTTC AAGGTAACAA GGACGACGCT TTCCCTGCTC AATTTTCGGT GTGTCGCGCT CTTTCGAAGC TCTAAGAATT GCTTGCATCA GCTCAATTAA CACATCCCGC AAGCACTCAC TGTCAACGAT CTCCGTAACC GCTCCGCCGG TGGCAGCGAA TCACAAACCT CGTGGAGCTC CTTTTCATTG CCTCGCTTAT CACAGACACA ATCAATCCAC GTTGGTCGCG AAAGTGGATA GATACCTTTC CTCGTTTCAC TTGTTCATCA TGACAACATT GGAAACTATT GGCTTTACAG TTGCCGGTGG TGCCGTGATG CTTGTCACTT CAATGGGGCT GTTTGGATTG GCTGTACGTG TTCGGCATCC TTTTATTGCG CCGCGCGCCA CAGCAAGGCC CCGCGATGTC ATTCACAATC CCCGCCCCAA ACACAAGAGC CAAAACCGAG GGAACGCTTT CTTTGGATGG ATTGGCTGGA CACTCGGTTT GAGCTACGAT ACCATGCTAC GCGGAGTACC CGGGACTGGT ACAAGAGGAA ATGGCATGTC GGGGAGCATG TTACGTGTCA ATCTAGATGG TATTGTGCTA CTGCGCTTTC ACGCATTGTG TCTGCGACTC ACCGCGTTGG CAACCGTTCT ACTTGTCTTG TTTATACTTC CCTTGTACCT GTCGGCGCAA TGCTATCGAA TAGATCCGGA GGAAGCTTTG GGCCAAGCCG AATGTGCCAG TACGGTGTAC AATCTGACCA ATTATGAACA AACAACAATT GCAAACGTTC CTTCTCTTTC AGGATCTCGC TTTTGGTCAA CCATTGTTCA TCCGGATCAC AGCGGTGTCT TGTTACGACT CTACGCTGTC GTCATTGTCT TCCTCATTAT GACGGGATAT TTGCTACATC TCCTCTATCA GGAGTGGATT CACGTGTTGG CGTTGCGGCG TGTATACTAT CTTGAGTATG ACGTTTGGGG TGAACGCAGG GAAGAACTTA AGCAAACTCT CTTGTTTGAT GAGATTTCCA AGCAAAAACG GAAAAACATG AAGCGCTTTA CGTTTAGCGA GCCCGACGAG TTAACGGAAG AATCTCGCCG CAATCATTTT GAGACTGCTG AAAAGCACTT AACGGTCCGT GATCCGTGGA TTCCGCACCC CGAGCAGCGC GACACAGTTC CGAATGTGTC GCTATATTCC GTGCTAGTAG GTGGGCTCCC TTCGTTGCCT GATTACGCTG CTGATACCAT TGATACGGAA GCAACTATCA GCTTTTCGAG GCGGGAAAGT ATGGATTGGC AGCTTTCGTT GACAAGTACC TTCTTTGACC ATTGCGTCCC GAATCAACCG GGATTTAGTT CCTCTGTGGC GGCCGTCACA ATTATTCCAG GGGCGAAAGA CTTGTCTTTG GCTTGGAAGA AATGGTACCA AGCTGCGGGC AAATTGCGTC GATTGAAATT TATACGAAAT CAAATTGCCG AGCGTCGGCA TTACGACATT CAAGTCGGAG ATGAGGAAGA AGGGCTGGCC AAAAATGGAC GAATTTCATC AGTTCGATTC AACCTGAGCC CTGGCGGACA ATCGGCTGAT GAAGAAGGCG ACGAAACGCC GCGGGAGATA TATGCTGAAG AGGGCAAGAA TCACGATTAT TATCGCGAGG TGCTTGGATC ACTAGTGGAA CAAGAGGTGG AATCGCACAT TTTTGAAGCG ATGCACTTTG GACCGGAACA GACGGCTGTA TATAGTCGAG AGTTTGCTCA ATCGGCGGCT CCGTGTTGCC CAAATGGTTG CTTCGAAGGT CGAATCCGGA ATGCTAGCAT AGATGACTTA CTCGAAATGG AACGATACGC TGCATCTCAA CTGCACAAAG CCAATCTAGA GCTTCGCGAA GCTCGAAGCA GGGCTACTCG TGCCCACGCA GAATTCCCTA TAGAAGACGC TGCGGAACTT GAACCTGTTG GTTCAGTAAA TCCTATTCTT CCAGATAGTC TGCCGCCTTC GGAAAGGAGC AAACTGCGGC GCAATGTTAG TCTGGAGAAA GCTAAAATGC CGAGCGATTT ACATCTGGAA TCTGAGTTGT ATCAAAAGAC GGCGGTATCC TTTGGAAGCG CTGACGGAAA AACTCGACTA CGACCGAAAC CCATCGTGGC TAAGGCGCAA CCTTCAAGTG AACTTAAGGC GGATTATGAA TTGCGCACAA AACGAAAAAC TACGGAAAGG GAAGACGGGT CACCTCGCAA CTCAAAGCCA CCGCCGATGA AGTCGATACA TGAGGTTCCA AATGATTCTA ATGTACATCC TTGCAGAACC GACTGGAAAT CCCGGCCGCA TGATTTGGTG GATCAGATGA CTTTTGGACA CAACGAATTA AAGGGAAGAA CATCGGCCCA AGTGGCCCCA TGGGCAAAGG TTCAATCCAT TGTGTCAGAA ATGAGAAGTA GCTCTAGCGA AGGGAGCGGT CGTAATAGGC ACGTACGGGG CAACCGGATG CCTTCCGGAG TTTGGGAGTG GCCTACGTTT CTCAGCTTAT TTGGTCAGAC AAAGGAAAAG GCCAGTACAA TTACCACTTG GGCAAAGAAG CAGTCTGCCG CCGCCATCGA TGATATATCT CGTGAATCAA CGTATGCAGT AATCACGTTT ACGTCTCGTC AAGCTGCGGT TGCCGCGCGC CAATGTCTGG CCGATGGTCG TGCAGCTGAT CGATGGGTGA GCTTGGCAAG CATTCCAATT CCTCCCTTGG CAGATGCCGC GTCATGTGAT TTGTTGGTGT GCCGAAACTG TTGTCGTCCT GTCACACTAA GCATCAACGA CCGACAAAAG AATGTCCGAA ATTACGCGTA CGTTCTCCAC ATTGCTTTGC TGCCATACAT TTACAAGTAA CTCACATGTG GTTTTTGATT GTTAGAGCAC TGGCAATGTT GGCTGCAATT TATTCGCTCT ACACAATTCC TTTGACCCTT GCTTCGACTT TACTAGATCC AAGTAATCTC CAAGAAGTTT TTCCGATTCT TGCCGAGTGG TCTGACCGTG ACAAATTTGG AGTAACAAAG CTTATTTCCG GTTTGCTGTC TGCTTTGATA TGGACGACAT TTTTCGCCTT ATGTCCGATT ATGTTCAAAT CCATTGCCAA CTTTGGTTCA AAAGCTACAA GTGTAGCTCA GGCGGAATTC AAAGCCCTCC AGTATTTTTG GTGGTTTATG GTATTGACGG CGTTCACTGG GCAACTTTTG GCGCAGATGG TGCTTTACGG TTTCAACGAT GGCCTTCAAT TTGGTACTGA ATTTACTAGC ATTCTACGTG CCGTCGCGTT GTCCATTCCT TCAGGACTTT CAGCAAGTTG GCTGAACTGG ATCATCGTTC GTTGCATGAT CATCCTACCT CTGAATTACT TGCTTCAAGT CAACACTTTC CTTTTTCACT ACTTCGGTAT GCCATGTTGC GCTAGGGTTG TTCGAGGAGG AGGTCCCGGA GGGCCGACGC CCTACAGGAT ATATGTGGAC TCTGGAGTTG TATTGATGTG CACTCTTGCT CTAGCTCCAG CATCACCTCT TGTGGCGCCT GCATGCTTTT TCTATTTTCT ATTTTGTCAG CCGGTGCTTC GCCGCAATGC TCTCTTTATG TATCGTCCGA AATACGACGG CGGCGGATTA CGCTTTCCGT TCATATTCGA CATGGCGATA TCTGCGTTAG TAGTCGGCCA GGTACTTCTG TGTACAAGCA TGGCTCTCAA GCAGGCGCTT GGTCCAGCAA TTCTCGCTGC CGTCCCGATT GTTCCAACGA TTTTGTTTTC GCGTAATACG AAAAAGAAGT TTTTGAGGGC TTATCAGGAT GCTGCCCTGC TTCAAACTTC ATTACTGGAT GGATGGAACA CTGCAGAGGA AACATCAGAG AAGGAGCGAC AGGAATTTCG AAAGTTTCTA GTTGATGCAC ACAAGGCGGC CTATGTTCCA GTTTGTATAG CAGGCGCAGA GGATGAGGAC AGCTTCTTGA CCGCAGAACC GGCCGTAGTA GTACCATTAG AGTCAGAGCT TGCCGACGCT GGGGCTTTGG AATTTGTTGA GGAGGCTTTA AACGCCACAC CTTCACAATC GCACGAATCT GAAAACTCGC CTGCAGTTCA GGTGCAAGAG AATCGACCAA AATTGGATCG CCGCGCGTCG CAACATGGTG CAACTTTGCG CAGAGCTGTC CACACATTGT CTGCTTTGCG ACAACGACGA GATTCTTCCA CGGGCTCTGA GAACGTGAGC TCTTCTATAG CAGATGAAAG GTTGATGACA CATTCAGATG ATAGGATCCC CCCTTCGGAA GAGATGGAAG TGCCTTCATC CATTCGTCGA AACCGACAGC AGAAGTTTCA GGAACAGCAG AATAAAGGAG ATTAAGAACA GTTTTACATA GTGTAACAAA TCCTCTAGCG CTGTACATTA TTCAGAACCT TGCACGACTT CCGAAAAGCT CGATTGCCGG ATTTTGACGA GGTTAAGCTG GCCCCACTCA TCCACAAAAG TCTCGTCTAA GATATCACCC GGTTTAATTT TTTGGAAAAT ATCCATTCCT TCCGTAATGT AGCCAAATGG AGCGTATTCA CCGTCTAGAA GGCTGCGCTT GTCGTCAATC ACACTGTCGG CCTGCAATGC G
|
Protein sequence | MCAFVSICVK RADNILTATA SSCSRHFGVP QALNRWRPTQ STVQGNKDDA FPAQFSHSLS TISVTAPPVA ANHKPRGAPF HCLAYHRHNQ STLVAKVDRY LSSFHLFIMT TLETIGFTVA GGAVMLVTSM GLFGLAVRVR HPFIAPRATA RPRDVIHNPR PKHKSQNRGN AFFGWIGWTL GLSYDTMLRG VPGTGTRGNG MSGSMLRVNL DGIVLLRFHA LCLRLTALAT VLLVLFILPL YLSAQCYRID PEEALGQAEC ASTVYNLTNY EQTTIANVPS LSGSRFWSTI VHPDHSGVLL RLYAVVIVFL IMTGYLLHLL YQEWIHVLAL RRVYYLEYDV WGERREELKQ TLLFDEISKQ KRKNMKRFTF SEPDELTEES RRNHFETAEK HLTVRDPWIP HPEQRDTVPN VSLYSVLVGG LPSLPDYAAD TIDTEATISF SRRESMDWQL SLTSTFFDHC VPNQPGFSSS VAAVTIIPGA KDLSLAWKKW YQAAGKLRRL KFIRNQIAER RHYDIQVGDE EEGLAKNGRI SSVRFNLSPG GQSADEEGDE TPREIYAEEG KNHDYYREVL GSLVEQEVES HIFEAMHFGP EQTAVYSREF AQSAAPCCPN GCFEGRIRNA SIDDLLEMER YAASQLHKAN LELREARSRA TRAHAEFPIE DAAELEPVGS VNPILPDSLP PSERSKLRRN VSLEKAKMPS DLHLESELYQ KTAVSFGSAD GKTRLRPKPI VAKAQPSSEL KADYELRTKR KTTEREDGSP RNSKPPPMKS IHEVPNDSNV HPCRTDWKSR PHDLVDQMTF GHNELKGRTS AQVAPWAKVQ SIVSEMRSSS SEGSGRNRHV RGNRMPSGVW EWPTFLSLFG QTKEKASTIT TWAKKQSAAA IDDISRESTY AVITFTSRQA AVAARQCLAD GRAADRWVSL ASIPIPPLAD AASCDLLVCR NCCRPVTLSI NDRQKNVRNY AALAMLAAIY SLYTIPLTLA STLLDPSNLQ EVFPILAEWS DRDKFGVTKL ISGLLSALIW TTFFALCPIM FKSIANFGSK ATSVAQAEFK ALQYFWWFMV LTAFTGQLLA QMVLYGFNDG LQFGTEFTSI LRAVALSIPS GLSASWLNWI IVRCMIILPL NYLLQVNTFL FHYFGMPCCA RVVRGGGPGG PTPYRIYVDS GVVLMCTLAL APASPLVAPA CFFYFLFCQP VLRRNALFMY RPKYDGGGLR FPFIFDMAIS ALVVGQVLLC TSMALKQALG PAILAAVPIV PTILFSRNTK KKFLRAYQDA ALLQTSLLDG WNTAEETSEK ERQEFRKFLV DAHKAAYVPV CIAGAEDEDS FLTAEPAVVV PLESELADAG ALEFVEEALN ATPSQSHESE NSPAVQVQEN RPKLDRRASQ HGATLRRAQM KG
|
| |