Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54586 |
Symbol | |
ID | 7201565 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 437144 |
End bp | 441830 |
Gene Length | 4687 bp |
Protein Length | 1525 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | hypothetical protein |
Protein accession | XP_002181010 |
Protein GI | 219120548 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGTC GACCAACCTT GGGATCCTTT CGGAAACGAC TGTCCGGTCG ACGGACCAAG GACAAAGAGG ATGAGGGAGG CAGCGTCCCA CGACGTCCGA GCACCCCGCC GAGATCCCCG AATTCAAGGC GGTCCTTCAG TCAAGATCGT CCCAAAAGTC GCAGCCCGGT GAACAGTCCC AATCGACCTC ACAACTCGCG TAGGTCACAC AGTCTGAACC CTTCCGTTGA AGATGCAGCG ACCAACGGTG GTAGCGAGGA ATCGGTAGTC TCGGGGAGTC CTTCGTCCAC TTCGTCATCC CGCGGGGGAC GTAGTCGGGA CTCTCGACCG AACATACTGC AACGCATGCG CGAACGTTCC CGGTCTCGGT CTCGTTCCCG CAAACGTGAC GTTAACACTA GTAGTCCGAA AGAAATGCTC GTGGCCGTCA CGTCCTGTCG CTCGGACGGC TACTACAATC AAAAGGCTCC GGGCTCGACG TCGAAACTAC CGCGAAAGGC ACCCACCAAT CTCAAACTCT TCCACGAACT GGCGGTCGGC GTCAAGGATG CCTACGCGGC CGTCGGAGCG ACCCCGAGAA AACTAACCGA GGAAGAAGAA GCAAGATTCA AAGATGAAGA AAAGATTGGA CGAACGGTTT TGTGGGATTT CGTTGGCAAT CTCGATTTTG TAAGTGCGAA CCAAAAATGC CCACCGATAC CCGCTCTTGC ACAGGCTAAA ACGGCGTGCT CACGCAATTT GTTTTTTTAT TGTATAGCTG TTGGCTCTCG TGGACGAAGT TGCTGTGGAT ACCATCACTC GTGGAGCACT TAAAGATGAT TCTACTTTTA AAGGACTCCG CGACGTTATC AAAAAGGGCA ACCGTGTATT GGAAGAGATG CTAGTCCGCC GAGAACGCAA ATACACTCTC TTTTTCCGGC TTGTACAACC CCATGATCGG GACCAAATTG ACCGTATGCG CTCTTGGAAC CAGAAAGTTG AAAAGGCCGT TGGTGCCGTC ACGGGCGGAC GCGAATCCTT GGGTGAATCA GAGAACGACA GCGATACGAA CAGTATTTCG TCCAATTCGT CGGCATCCGT GAGTAGCCGC GCCGGGGTCT TCTCACGCGG ACGACAGCTG TTGCCAACTG CTGGACGTGT GCGATCACGA CGAGCGACAC CCACCCCACG GCTACGTAAA CACCGAAGTC AGGGGGACAT CACGGAAGAT GATGCCGGAG CGGCCGAAGA CGGGTTTTCC ACCGTCAGTA TGGCACCCGA CTCGGCACAG TCGCCGAAAA GCTCCTCATC GGGAGGCGAT GGTGGTGGAG GTATGCTGCC ACCAGCAATG CGTTCCGGAG GGCAAGGAAA CTCTCACCAT CCAATGGCTT CTGTCGAACA AGTGCGACCA AAAGATGAAC TGGTGGACGT TATTCGCGGT TTGCGAGTTG ACAAGATTCG TAATCAAGAA GGATCTTCCG ACAAGGATCT AGCGGAACTC AAGCCGAACT GGCGACCCAA AGCGGAGATT CCTCCATCCG TACCCAAGCT TCCAACCGAG TACGTTCACC GACACCGATT GATGAAGCAG GTTGTTAGTT GTCTATTGGA CGAGAACGAG CCCCCCAAAA ACGAAGACGG CTCTCTCCAG AATACGATAA TTACGTGCGT TACATCTCGG CACGGCGACA AAGCCGGGAA TGGAAAATCA ACTTTAGCAA TTGCTGTCAT TCAGACTGTT GAAGTCCGCG AACGGTTTCC GAATGGTATT GCTTGGCTCA AGCTGGGGCG AGGCCCTCTA AGCGAGCGAG ACATTCGGCG CTTATACGAA GATTTATATC GACAGCTTGT TGTAAAACAA GCAGATATTG AGGGTTCCGT CTCGGAAGCC GATACTGTAA ACTTCCACGA GTCATTTGCT TCCACTGGGT CTGCTAGAAC AGAACCGTCA GAAACAAGGA TTGATCGCAC AGTAGACCGG GCCGACAGCG TTCGTCGCTT TGAAGGAGGG GATTTAGAAG GAATCAAGGA AGATCTAGGA CGAATGCTTG TGCGCCAGAA AGTTTTAGTC TGCCTTGACG ATGTATGGAG AGTGGAGGAC GCCAAGTGGT TTATTTTCGA GACACCGTCG TATTCTGCAC CGCCACGAAA CAGCCCATAC AAGATCTTGA TAACGACTCG AACACCATCT TTGCTTGGGG CTGGAGCTGT TCAAGAAGTT TTCGTTCGAA TTTTGTCTGA ACATGAGGCA GTCAAGCTGC TCCTATCGAC TGCAGGGCGG CGTCCATATG GTGGGCGAAA TTCTACAGTA TTCAATCAAT CAAAACTTAT TGTCAAAGGA TGCGGCAACT CACCACTAGG GATTATTCTT GTGGGTAGCA TGCTGCGCGA ATACAACCGA AATTGGAATT TGACCTCACC TGTATGGACG GGCATCTTTA ACCAGTGCAG TTTGAATCTT GAAGAAGCAG CCCAGCTCCG AAGTTTCAGG AACGCATTTA ACCGAGTTGT CGATACTGCC CTTTTTACAA TCGAAGACTC ATTCTTCCGA ATCGCGTTGC GACGATGTTT TGTATACTTC GCCATGGGTT TCCGGGCCAA TGACTGGATG TTGTCAGGTA TTGGGATCCC GCAATCCATT ATCCTTAAGT TTTTTAGGGT TGTTATTGCA GCTGGTCCAC GAAACAATGA CTCTGTTGAG CCAGAAGTTG TTCTGGCAAA GCTGGAGAAT TTAAAATTGA TTCAGCGCGC GAGGCATTCC GTCACGCTGC AAATCACTAA ACCTGGTACA GATAGTCCCT CGGTGTCGTC ACACGGAAAA GAAAACGGTG AGGCATCAGA GAAGTCAGAA TCTGACTGGG ATGATTTCGA TGAGATCGTG AAGCCGCCTG TCCAGCACTG TTATACCATG CACGATTCGC TAAAAGCTGT CGCAGAACTA ATGGCACAGC GCGCATTGCC GTCGTTTACC CCTGCAGAAG ATCAGTTCAC TTACTTTTCC GATCTCATTC GGCACGAGAC TGATATCGGA TCCGGAGAAA ATCGGGCTTG GTCGGCGCCA TTGCGCTTTC TATTTCAACA CCTGTCACCA CAAACTTCAA CCTCGAAGAG TAGTTCTCTC ACAAATGAAC ATGTCCACGA GCTCGTAGTC ATTGCGCTTC TTGGTGGGGG CGATGGGAAA GCGCTATCAA AGTCATCTGT TTCAAGTGTC ATGAAAGCGA ACCAAATTTC TATGCAGTTG ATGGAAGGGG GTACCAAATT CGAAGAGTAT ACGATGTCGT TTATCATTGA GCATCTCATG TTGTCAAGGT CGCTTTCCAG CACATCCGAG CTCATAACCG ATTCAGAGTT CGTTCGGCGT CGTGTATTTG CCCTTGGCAT TATGGAAGCG ACAGGACGTC AAGTGGCCGA CATTTTGGAT CTCCGTGGGT TTGCGGGCAA AGGAGGAGCG AAAAGAACAG CTAATACGCT GCCGTCACCT ACAAAAGATG TGCAGTCTCC AAACATTGAT CGAACCGTGT CCGAAGACAA AAGTGAATCA AACTTGAACT TCGACGTCGA AGAGGTGCTG TGCGCTGCCT CGCGCATCAT CATCGACGAA GTCTACAAAG CAACGAATAC AATGGGTTCG TCAGACTCCC TGGGCATGGC GACGTGTTTA GCAACAGTCG GAGAAACGCT CCTGAAATGC CGCCAGCCGC GTGATGCAAT GCTGCGGTTG GAAGAAGCTG TCAGTATATA TCGTGGGCTT CTCGGGCCGT ACCATGTTTA CACTGCACAT GCACTGCACT CTACTGCAAA GGCTTTGGGC AAACTAGGAG AAACCCGTGT TGCACTGCTC AAATTTGCCG AGGCCGCCCG CATCTACGAA GCCTGCAACG CCACCCTTCA CTATGATTCA ATCACGAACG CCCAGTCCTT AGCCGCACTG CTTGTAGACA TCGGTGACAT ACGGAAAGCC GAATCAATGT TTGAAGAAGT TATTTCAATG AAGCGAGCTG TTTACGGCGA ATTTTCGGTG CCTGTTGCAA AGACCATTAA CAGCTATGCA ATTCTATTGG CCAAGCATGC GCGTATGAAA GATGCTCTTC GCAATTACGA GCTCGCCAAA TCCACTTATC AAAAAGCTCC TCCTGCTCTC ATTGTAGATC CTGAATTCGA CGTCAAATGC AAATACGATG TGACCCTCAT CAATCTCAAT ATTGCATCAA TCTACTCGAA GAAAGGCGAC CTTCAGCGAG CATTGACCTG CTACGAAGAC GGTGTTACTG GTCTAGAGCA ATACGAAGCT TCAATGGACG AGATCAGAGA AAGCGACTGT GTGATTGACG TTACTGCGAA GCATTCTTCA CACAAACATC TAGTCGCAGC TCTGGGTCGC ATTGGCTCGC TTAAGCTAAA ACTTGGTGAC AATGAAGGTG CGCTGGAAGC CTACCTATCT CTAATCGAAC AAGTCGACGA AGACAGTCCT ATTGCTTCGT ACACGGAACA AGCGAAAGCG CACATTAAAT GCGCGACTAT CTTTCGACAG CGCAACGGTG AAAAGAATCG GGATGAGTCG ATATCGCATT TAAAGGAGGC CCTTCGAATG TACAAAGCTC TTTACGGAAA CGAGCACAAA GATACAACTG CAATTTCCAC ATCATTGAAG CAATGGTTAG CAGAGGATAA GCAACCTCCC AATTAATTTC TTAGGCTTGA TTGTGCC
|
Protein sequence | MKGRPTLGSF RKRLSGRRTK DKEDEGGSVP RRPSTPPRSP NSRRSFSQDR PKSRSPVNSP NRPHNSRRSH SLNPSVEDAA TNGGSEESVV SGSPSSTSSS RGGRSRDSRP NILQRMRERS RSRSRSRKRD VNTSSPKEML VAVTSCRSDG YYNQKAPGST SKLPRKAPTN LKLFHELAVG VKDAYAAVGA TPRKLTEEEE ARFKDEEKIG RTVLWDFVGN LDFLLALVDE VAVDTITRGA LKDDSTFKGL RDVIKKGNRV LEEMLVRRER KYTLFFRLVQ PHDRDQIDRM RSWNQKVEKA VGAVTGGRES LGESENDSDT NSISSNSSAS VSSRAGVFSR GRQLLPTAGR VRSRRATPTP RLRKHRSQGD ITEDDAGAAE DGFSTVSMAP DSAQSPKSSS SGGDGGGGML PPAMRSGGQG NSHHPMASVE QVRPKDELVD VIRGLRVDKI RNQEGSSDKD LAELKPNWRP KAEIPPSVPK LPTEYVHRHR LMKQVVSCLL DENEPPKNED GSLQNTIITC VTSRHGDKAG NGKSTLAIAV IQTVEVRERF PNGIAWLKLG RGPLSERDIR RLYEDLYRQL VVKQADIEGS VSEADTVNFH ESFASTGSAR TEPSETRIDR TVDRADSVRR FEGGDLEGIK EDLGRMLVRQ KVLVCLDDVW RVEDAKWFIF ETPSYSAPPR NSPYKILITT RTPSLLGAGA VQEVFVRILS EHEAVKLLLS TAGRRPYGGR NSTVFNQSKL IVKGCGNSPL GIILVGSMLR EYNRNWNLTS PVWTGIFNQC SLNLEEAAQL RSFRNAFNRV VDTALFTIED SFFRIALRRC FVYFAMGFRA NDWMLSGIGI PQSIILKFFR VVIAAGPRNN DSVEPEVVLA KLENLKLIQR ARHSVTLQIT KPGTDSPSVS SHGKENGEAS EKSESDWDDF DEIVKPPVQH CYTMHDSLKA VAELMAQRAL PSFTPAEDQF TYFSDLIRHE TDIGSGENRA WSAPLRFLFQ HLSPQTSTSK SSSLTNEHVH ELVVIALLGG GDGKALSKSS VSSVMKANQI SMQLMEGGTK FEEYTMSFII EHLMLSRSLS STSELITDSE FVRRRVFALG IMEATGRQVA DILDLRGFAG KGGAKRTANT LPSPTKDVQS PNIDRTVSED KSESNLNFDV EEVLCAASRI IIDEVYKATN TMGSSDSLGM ATCLATVGET LLKCRQPRDA MLRLEEAVSI YRGLLGPYHV YTAHALHSTA KALGKLGETR VALLKFAEAA RIYEACNATL HYDSITNAQS LAALLVDIGD IRKAESMFEE VISMKRAVYG EFSVPVAKTI NSYAILLAKH ARMKDALRNY ELAKSTYQKA PPALIVDPEF DVKCKYDVTL INLNIASIYS KKGDLQRALT CYEDGVTGLE QYEASMDEIR ESDCVIDVTA KHSSHKHLVA ALGRIGSLKL KLGDNEGALE AYLSLIEQVD EDSPIASYTE QAKAHIKCAT IFRQRNGEKN RDESISHLKE ALRMYKALYG NEHKDTTAIS TSLKQWLAED KQPPN
|
| |