Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43732 |
Symbol | |
ID | 7197258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1307565 |
End bp | 1312176 |
Gene Length | 4612 bp |
Protein Length | 1504 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177803 |
Protein GI | 219112103 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.576008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACTGTACAGT CTGACACGTA CAATTAACAC AAAAAGGTTG GATTGGCAGT TCGGCGCGTC CGATTGATCA TACCGCAGCG GTGATTTTCT TTTCATCATG GCGGAAGACT CGAGCGCCGA CGCCTTAAAT CCTGTCGGAG ATCTGCCGAC CGAGGAGCCA GAGTCGGAAG AAGCTACACC AGAAGCGACA CTCTTGAATA ATCTCATGAT ATTGGCACCT CGTCAACGAA ATCCGTCCGC TGGGACTGCA GCGGCAGGCC TGGCGCTTCC GCCATTGCGA GCCGAAGAAC CGGTGCAGAG CCTCCGCGGA GCACTTACCG AAGTTGTTGG CTACGCGCAT TTGACGAGCT TTCGTTTCGA ACTGGCGCCA GATGGTGCTC CACAATTCGA AGGCGTTCCC CAGCCCTCCC CGCCGAACAC TTTGGTTTCG CCCTTTACCG GACCAAATGC GGTTGTCTCC ATTCCTTCTC AATTGCAATC CTTGAATGAG GATCCTCAAA TCCAAGACCG TACCGAAGAG AAGCATGCCG ATGTGTTGGA CGAGTTTGGC GATCTGACAC CCCTCCTATC ACGGGGGCTT ACGGACGGAT CGGCCTTTGC TATGGTATTG GAACGATACG ATGCAGCAGG AATCAAGGAT CATGTACAGC GACTCAATTT CTTGATGGAT GGTAATGTTC CTTCGGTTGT GAGTTTGGAC AATGAAAAAG TTGCGCAAGT CGACGAGGAG GTGCTAGACG AGAAGTCCAC AGACGGGCCA AAAGTACCTC CACTTCCAGA ACTACCAACG TTTCCAAACG ATAAATCCAT CGTTATAGAT GGCACAAATC TACAAGACTT TTACTATATG GCGTGTGGGG AAGACTTGAA TCTTTACCAC GGGAAAGCGG AAAGCTCTGC CTTAAAGACA AAAAAGAAGA AGAAAAAGGC AGTGGTGAAC GGATCCAGGA AAACACAGAC TCCGGAACCA GTTGTGATCG TTTCTGAAGT GTCCCCTGAG GCACTCGTAA AGGAAAAAAT AACCCGTTTG AATGAATTGG AAGATATATG CCGGGTCCCT TGCACAATTC GATACAGTGG CTTTCATCCG CCACCTACCT CCCGTCGTTG GATTGGTGAT TTGGCTTACT TGGAAGTGAC TCCTCCGGAA GGCAAGACCC TGCATATCAC GGCCATACCG ATGGGGTTTT ATCTCAATCG CTCGAAATCT GAATCCGGTG GACGTGAGAC TTTTGATCCT TCTCCAGCCG TGAATTCGTG TTTTGCTCAC GCTCTATTGG ATTGTGTGTT GCAAGCATCT CCGTCGCTTT CGGAAGCTTG GAGACTTGCG CTGGAAGCAT CGGCAGAACG CTCAGAAATT TTGGCCGCAC TGAACAACAA GGGTACTTTT ACGTCACTGT TCCGTGTCGC TATTCGTGGA GATTTTCAAG GCTTTCAATC GGCGTCGACG GCACACATGG CAACACAGGC TCTGGATTCG ATGCTGCACA AACCGAGTTG GTTGATATCT CTGCCACGTG TGTTCCACAA CGAGACGTCC TGGAATCGAA ACCAATTGCA CTCCTATTCC CCTACGCGAT CCGACCATGA ACTGGCGAAT TCTTTTGGTG TCGATATCCG TAGCGGCACG ATTCGTGATT GGAACGAAGA ATTGCAGCTG GCCCGGGAGT TGCCGACTGA AGACGTAAAT GAACGCATTG AACGTGCAAG ACTCATTCAC AAGGTTATGA CTGAGTTTGG CGAGGCCTCG TTGCTTGGGG TAAAGGCGAT TGCTGACGGA CAGATTTCCC CAATGAATCC AAACGAGCCA ACTCGCTCGC AAGTATACTT GCACAACAAC ATCTTCTTTT CACGAGCTAT TGATGCTGGT CCAGAAACCT TCAAACTGGC CAAAGGCGAC CGTGCGGCCA AAAAGTCGGC GAATCGAGAC ATTCAGTGCG TAGGCACATT CCATCGGATG GAGCAAAATG ATTTATATAC GCTAGCCACG GTTTTAATCG ACTTCCTGGG AACGCGATAT GTCTGCCAAA GTATTCTTCC CGGTATATTG ATTGGCGAAA AGTCGCACAC TCTATTGTAC GGTGCGGTGG AATCGGGCGC ACCGCTCAAG TGGAATGAAG ATTTTCACAA ATTGATGGAA GACAAACTTT GCGATACCAT GATGATCGCC ACGCGTCCAA TCTTAAAAAA TCCCTTGACC GAGGAACGTC AAAACGAAAT TAAAAGCCAG AGAAAGCCCA ATTCATCTTT TGCCGATCCC GAAAACCAGG ACGACGCTGA TACAGACTTG AATGCTGTCA TGCAGTCCTG CATTCCAGTG GAGGCAAAGG GTATTTTGGG CAGCGATCAG CGCAAGTATG TGCTTGATTT TGGTCGTCTC ACACCTCGTG ACGCAAATTG GGTTCCGGAA AAGAAGGGCG GTACGGGAAA GTGGGAAGCT GCCAAGACAG AAAATGGCAA CAAGCGACAC AGTGCTATCC CACTAGACAT TAAAGACGAC GAATGGACGA TGTGTGTACT TCGTCCTGAG CTTGTGTCGC GATATACTCA AGTTAGTATG GGCAAATATC TCCAAGACAA GAAGAATACA GAGGCAGAGA GAAGCGCTGT TGATAATCCC GCTTCGGAAA ATAAGCCGAC GTCGAAGGAT ACGATGCCAA CAGAAGGCGA TGAAATCACT TCTAAGGAGG ACAACATCGA CGCTTCTGAA GGCAGTCAAG TAGACAAAGT AAATTTCCCA GCAGCGAAAG ATGAGGATCA ACTAAATGAG CAAAATTTAT CGGAAGAAGA TTTGAAATAT TTGAAGTCAT TGCGTTTAAA CGTCAACGTA TTTTTGGACG ATGTGAGATC GTTCGCGGAA AATGATAGTG AAGCAGCCGA ACTGATTCAG CAAGATGAAG GGAGGGCTCG TGAAGTTGCT GTTTTTCTTT GGGAGGTGAT TTTACCCAAA ATTACCCTTG CAATCAAAGA ATCGTCAGTT CACCAAGTAC CTCTTGACGG ATTCTCCCTC ACCGAGTTCT TGCATCGTCA TGGAGTCAAT TGTAGATACA TGGGACGACT CGCGGTTCTT GCAAAGAAAC AGGAGGAAAA GGATGAGCAA AGTGATATTG AGCTGAAGGA AGGCCGATTG TCCGTCATTG AAAGACGGAA GATGCCAAAG TGTTGGCTGG AATTGCTCGA GTGCGAAATG GTTGCTCGAG CAGCTAAGCA CGTTCTCGAT CGTTACCTGA CCGAAAATGG TGGAGTAGCT GCTGCTCAAC CTGCTCAGAC GATTGCCTCT TTTTTGTGTG CTCTCGTGTC CGAAAGCGAG GAGACTGCAG CTCAGACTGA GACACGTGCA ATGCAATGTC CGGCGGACCA GCCAGACGAA GACGACTACA CCGAAATGAC CATGAATGAC GTGGGCGGCA ATGGCAATGC TGTGCCATCC CCGGTTCGGG GAAGGCACGA GGTGTGGCAT GATATTGAAA TGGAAATCGG CCGCAGATTC CGTTACTCCC TTTCCCTTTT CAACACCGGA AAGCCGAATG GAAGAGCTTT GCATATACCT CTTTTGAGAC GAGTATGTCA GCGAACTGGT GTACGCCTCG TAGCGAAGAG GTACGATACT GGCGGAAAAT GCCTATGTAG TAGTGGCAAC GTTATAGGCG GACGACTGGC GGTTTCCTAC CCCATCTCTC CTCTGGACAT TGTCCATGTT TTACCCTTGA TGAAACATGC TGCTGCGTAC AACGAAGGCT TCGCCGCATG TTCGATCGCG CAAACCGTAA CAATTCCTGC GCTTCACATC TCCTTGCCCG ATGCGCGAAC CGCCCTTGAA CGTGCTCATG TTCAAATGGG TAGTCGAGCT CTGAGTCGGG CTCTGGAGCT CGCCCAGGAG GCCTCTAACC TATACCAGCG TGTGACAGAT AGTGCAGTTC ACCCGGGAGT CATAGAGAGC ATAGATTTGA TGTCTACAGT GTTTCTTGAG GCCGGTGACC CCCATCTTGC GGCTGAAAGC GCAGCTAAGG CGCTTGGGTT GGCAGTTCAC AACGGCGGGT TTGATACACC GAATGTTTTT AATGCGCACA TGTCCCTATT CCAAATGCTT TACGCCTCTC GGCAGATGGA TCTCGGATTG AAGCATTTAC GTGCTGCGAT TTACTTGTTG GAAGTGATGG CAGGACCAAA TCATATTGAA CTCTTTAGTG CTTATCACAA GCTTGGGACG GTTTACTCGC ACTCTGATTA CGATGGTGCG TACCTTGAGA CGGCCCTCGA GTGCTTTAAA GAAGTGATGG CACGCAATGG TTGTGATCGA TTGATGGATG GAATCACGGC GAAGAACCAT GCCAAGATTC TCGCAGGATT GGGCAACTAT AAGGATGCCA TCGTCGAGGA AAAGCGTGCC CATCGGACAT TGTTTACATT TCTTGGAAAG GATCACGCGT GGACGAAAGA TTGCGACAAG GAGCTTCAAA TGTACACAAA ACTTGCAGTT GAGCACGGAA ACAAAAAGGC TGAGACCGAC AAAAAGAACG AAGAAGCTGC CCGGGCGGAT GCTTTGGCAG CTGACCTCAT CGCGCAAGAA AACCGGAGCA AGAAGAATAA GAAGAAAAAG AGTAAAAAGT AA
|
Protein sequence | MAEDSSADAL NPVGDLPTEE PESEEATPEA TLLNNLMILA PRQRNPSAGT AAAGLALPPL RAEEPVQSLR GALTEVVGYA HLTSFRFELA PDGAPQFEGV PQPSPPNTLV SPFTGPNAVV SIPSQLQSLN EDPQIQDRTE EKHADVLDEF GDLTPLLSRG LTDGSAFAMV LERYDAAGIK DHVQRLNFLM DGNVPSVVSL DNEKVAQVDE EVLDEKSTDG PKVPPLPELP TFPNDKSIVI DGTNLQDFYY MACGEDLNLY HGKAESSALK TKKKKKKAVV NGSRKTQTPE PVVIVSEVSP EALVKEKITR LNELEDICRV PCTIRYSGFH PPPTSRRWIG DLAYLEVTPP EGKTLHITAI PMGFYLNRSK SESGGRETFD PSPAVNSCFA HALLDCVLQA SPSLSEAWRL ALEASAERSE ILAALNNKGT FTSLFRVAIR GDFQGFQSAS TAHMATQALD SMLHKPSWLI SLPRVFHNET SWNRNQLHSY SPTRSDHELA NSFGVDIRSG TIRDWNEELQ LARELPTEDV NERIERARLI HKVMTEFGEA SLLGVKAIAD GQISPMNPNE PTRSQVYLHN NIFFSRAIDA GPETFKLAKG DRAAKKSANR DIQCVGTFHR MEQNDLYTLA TVLIDFLGTR YVCQSILPGI LIGEKSHTLL YGAVESGAPL KWNEDFHKLM EDKLCDTMMI ATRPILKNPL TEERQNEIKS QRKPNSSFAD PENQDDADTD LNAVMQSCIP VEAKGILGSD QRKYVLDFGR LTPRDANWVP EKKGGTGKWE AAKTENGNKR HSAIPLDIKD DEWTMCVLRP ELVSRYTQVS MGKYLQDKKN TEAERSAVDN PASENKPTSK DTMPTEGDEI TSKEDNIDAS EGSQVDKVNF PAAKDEDQLN EQNLSEEDLK YLKSLRLNVN VFLDDVRSFA ENDSEAAELI QQDEGRAREV AVFLWEVILP KITLAIKESS VHQVPLDGFS LTEFLHRHGV NCRYMGRLAV LAKKQEEKDE QSDIELKEGR LSVIERRKMP KCWLELLECE MVARAAKHVL DRYLTENGGV AAAQPAQTIA SFLCALVSES EETAAQTETR AMQCPADQPD EDDYTEMTMN DVGGNGNAVP SPVRGRHEVW HDIEMEIGRR FRYSLSLFNT GKPNGRALHI PLLRRVCQRT GVRLVAKRYD TGGKCLCSSG NVIGGRLAVS YPISPLDIVH VLPLMKHAAA YNEGFAACSI AQTVTIPALH ISLPDARTAL ERAHVQMGSR ALSRALELAQ EASNLYQRVT DSAVHPGVIE SIDLMSTVFL EAGDPHLAAE SAAKALGLAV HNGGFDTPNV FNAHMSLFQM LYASRQMDLG LKHLRAAIYL LEVMAGPNHI ELFSAYHKLG TVYSHSDYDG AYLETALECF KEVMARNGCD RLMDGITAKN HAKILAGLGN YKDAIVEEKR AHRTLFTFLG KDHAWTKDCD KELQMYTKLA VEHGNKKAET DKKNEEAARA DALAADLIAQ ENRSKKNKKK KSKK
|
| |