Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50562 |
Symbol | |
ID | 7199390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 132365 |
End bp | 135484 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185489 |
Protein GI | 219130684 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAAGA AACGGAAAGC TTTCTCGCCT CTGCATGCGA ATACGCTGGG ATCCTCCCCT TCGCAACGGA GCTCCTCGAC ATCCTCATCG TTGCCGCGTT TGGACAAGGA CGACGACTTT CTGTCGCGTC CACCGAAGAA GCCACAGCCT GTGCGGAGTA GTGATACCAA GACGCCGCGC ATCAGCAATT GTACGAACCC CCAGGGATTG GTACGTACCA CAACCGCGGT CCCGGCTCCG TCCACGGCAT CGTTGGGAGC CACATCCCGT ACCAGCCGTC CCTCCAAGGC GCGTTCCAAA GTACCCCGCC AGACAAAGAG AACCTTGAAG AAACCTCCAC CACCAAGGAT TCAGCAGTCA AACCAGCCTG CGGACCAAAG TCGAGCAGAT GTGCCCAAAC GTGAAGCACA GGGACCGGGA GTGATCATTC CTTCGAATCC GCAAACGTCC AAAGATGACA CACAACGACA ATCGCGCCGG TCCACCTCCA CCCTGGAAAC AATATTGGAA GAAGACTCCT TGTCTTCCAA CGGATCTTAC GGTCCTCATG GCATATCCAC TATTAAAGCA GCTATCCAAG CGTGTTTACC GAATCCCTCT TTACCAACAG CAAATCCAAG TCGCCCAATA CCCATCGCAT CTTCGCACCA CGTCATGGAT CCACAGCATT GCCAAACAAA ATCGCCTCAC TTTCAAAAAC TGAACGCATC ATCGTCGGCT CGCAACTGCT CGGATGACGA CATGCAGCTT TCCTCGGACG ATGAGACTCT GTCACTGTCG TCTCCCGAAA CAAGCATACC AAATGTTCCC GACAGGGCTA TCCCCTACCA AAAGGGGACT TCCCACGCTC TTCATTCTTT TGCTCGGGGC TCAATAGATC ACACAACGAG CCGTCTGAGA GACGGTAGAG GTCTTCCAAA TGCCACCTTC CAGTCCAATG TCGGCAAGTC GGTGACCAAT GGTTCCTCAA ACTTTGATGT TGCCACTAAA AATGGGGAGC AACACATTAC GAAAGCATTG AAGGCTCAGG TGACGGACAA GGCAATGCAG CGCTTTTTGG ACACCCCGGT CGACAATCCG GACGTTGTCA AGGCCATGCT GGCTAATAGC GACCCATCGC AAGGGCTCAT GGGACTCTCG CTACAAGTTC AAGCTGGAGA TGATAAATCT TTGGCTCAAA GCGAATTGAC CGATCAGACT GACTTGGACT CTTTCACGAA CACTCGGCAC GACATTCATG AAGGCTTCGA ATCAGCTGAC GGCTGGATGG AGCCGCGGAC TTGGACTCGT TTGGCTGCCC GAAACGCGTT ACGAGATCAA ACTGGGAAGC TATGCGCCGT GCCCGGACGA GCCTTGTTTT CTCCATCTTC CGAGCCGATT AATGCTGCAT GCAACTCGAC GCCGCACGAG TCTTCGGTAC AATCGGCAAA GCTTTGGACA GCCAAGGAGG CGAACGATTT TAATCCAGCC CCAATATTTG TTTTACGTGA TGGAAAGCGG TACCGCCACC CACCCTTGCC ACCGGGATGG ATGATTGGAG TCTCACAGAC CAAGAACCGG CCCTACTATT ACCACACGGA TTTTGGAACA TCCTTCTCTT GTCCTGTCCA CCTACCTGAT GACGATGGAC AGGTTTATGG CGATACTCCT TCACCCGTTC CGTCTGCATG TGCAAAGAGT TGCGCAAGCC TGTATAGTTC TACAGTAGGT CATGACGAAG CTTCAAGAGG AAGGGCGTCG ACCATATCAT CAAACACTGT CTCTTTGCAA GCATCCAGAA AAAGCAAATT CGCATTAAGT CCTGTATCAA AAAGGGATGG GCTTACTGTA GTCAGCTGTA GATTGCAAAG TTCCCCTTCC TTCGAAACAC CGAAAAAGAC AAACGACGCA GTGGAAACCA TTGAATTGGG AGTGTGCGTG CATAACATCA CAAAAGAGCA TTCCCATATG ACCCCAGCAG TCCATGAAAG TGAATACGTG TCCGGAAATC CGCTCACTTC TTCTCTTCGA AACAGCAAGG GGCACGAGGC CCATCTAGAA GAGCATACGG AACAATTGAA ACTAGCCCAG AATCTTATCA GGACGTCCAC AAATAAAATG TTTAGTCCAC TGTCATTGGC AATGAGGAAG AAGTCCTGGC CTCGGGGTCA GGGATGTGTG GCCAATGCTC GTATGCCGGA CAACGAATTT GACGCTTCCA GCAGCGAGGA TTCTCGCGTG CGTTCAGAAA AAGTTTTGTG TTCGACAACG CACAACCCGA CTTCCCATCT TTACTCTCTA TCATCCCCTC ACGGCACTCA CACCATTGAT CGACGCGACT CCTTGAGGAC AACCGAAAAT CTGTCGAATT ATGCCACCTC TGGAGCAAAG TTTCCCGAAA CCTACTTGCC TAAAGAAGGC GTTGACAGAT TCTTCGTCAA CCGGATCTCT GAGGCCCCTG ACAAGACGGT AGACATGAGC AGTTCTGTTC CCTCAAAATC TTCGTTTACC CTACATTCAT CTCGTTTAAG CAGCCCGAGT AACGTTGAAG GTGACTCAAG GCAAGGATTG GCTTATGAGA GGGGGAAGGA TAGTGACGAG ATTCCAGGTG TGGCCAACGA TGACTTCCCA GATGTCGAGT ACGACGAAAG TCCGATCGGA CTACCCACCA TCGGTGACAG AGATATGACA ATCGGCCTCC GACAGAAGCT AGACTTCACG TCCACCGACC AAGAATTGGA GGAGATCTTA CCTATTGAAA TGTCTGCAAG CAGGCGATCG TCAAGACTCC AACCTTTTGT TCTGACATCC ACCGGATCCA AAGACGACGA AATATCCGCG TTGGGAGTAG ACGATCTCTC CCAACAAAGC AGAGTAGAAG ATTTTATTCA AGAGAGGGGT GAGATACGGA GCCATAGGTC TTTAGGAGGA TCTGCGTCGA CGTTTGGTAC AAATTTTAGC CACCGGGTGC GACATCCGCC AATGCCACTG TGCAGTTTAC AGAACGTTGG ACAGCTCGAG CGATATGCCT CACCGAGTCA CAAACTTTCA AGGAAATCAA GAGACAAACG GGGACGTCGT AAGTCGAAAG GCGACCTGAG AGTGATCTCG CCGCGAATTT CGGTGATGGT GTCGAACTGA
|
Protein sequence | MTKKRKAFSP LHANTLGSSP SQRSSSTSSS LPRLDKDDDF LSRPPKKPQP VRSSDTKTPR ISNCTNPQGL VRTTTAVPAP STASLGATSR TSRPSKARSK VPRQTKRTLK KPPPPRIQQS NQPADQSRAD VPKREAQGPG VIIPSNPQTS KDDTQRQSRR STSTLETILE EDSLSSNGSY GPHGISTIKA AIQACLPNPS LPTANPSRPI PIASSHHVMD PQHCQTKSPH FQKLNASSSA RNCSDDDMQL SSDDETLSLS SPETSIPNVP DRAIPYQKGT SHALHSFARG SIDHTTSRLR DGRGLPNATF QSNVGKSVTN GSSNFDVATK NGEQHITKAL KAQVTDKAMQ RFLDTPVDNP DVVKAMLANS DPSQGLMGLS LQVQAGDDKS LAQSELTDQT DLDSFTNTRH DIHEGFESAD GWMEPRTWTR LAARNALRDQ TGKLCAVPGR ALFSPSSEPI NAACNSTPHE SSVQSAKLWT AKEANDFNPA PIFVLRDGKR YRHPPLPPGW MIGVSQTKNR PYYYHTDFGT SFSCPVHLPD DDGQVYGDTP SPVPSACAKS CASLYSSTVG HDEASRGRAS TISSNTVSLQ ASRKSKFALS PVSKRDGLTV VSCRLQSSPS FETPKKTNDA VETIELGVCV HNITKEHSHM TPAVHESEYV SGNPLTSSLR NSKGHEAHLE EHTEQLKLAQ NLIRTSTNKM FSPLSLAMRK KSWPRGQGCV ANARMPDNEF DASSSEDSRV RSEKVLCSTT HNPTSHLYSL SSPHGTHTID RRDSLRTTEN LSNYATSGAK FPETYLPKEG VDRFFVNRIS EAPDKTVDMS SSVPSKSSFT LHSSRLSSPS NVEGDSRQGL AYERGKDSDE IPGVANDDFP DVEYDESPIG LPTIGDRDMT IGLRQKLDFT STDQELEEIL PIEMSASRRS SRLQPFVLTS TGSKDDEISA LGVDDLSQQS RVEDFIQERG EIRSHRSLGG SASTFGTNFS HRVRHPPMPL CSLQNVGQLE RYASPSHKLS RKSRDKRGRR KSKGDLRVIS PRISVMVSN
|
| |