Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54511 |
Symbol | COPbeta |
ID | 7201531 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 84499 |
End bp | 88041 |
Gene Length | 3543 bp |
Protein Length | 978 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180373 |
Protein GI | 219119217 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGCTCATCG TTGCTCGGTC GTTGGCAAAT GACCATGACG CCCGCCAACT CGAACGAAAG CTATTGTACC TTTACACTCG CCTTGGATGT GACCGCAGGA GGACTTCCGT CAGAGGCTGA GATAGCCAAA GATTTGGAGT CGAATGACGC CAATGTACGT TGACTAGCGA TCGAGAACGA CTGAATCCAA TGTAAACAAC TCTACGAGGT AGAACTATAG CGGCAATCGA TGTTACTGTT CGTCCCTTCG AGTGTCACGG TTTTGCTGGC GGATGTCTAT AGACACAGGT CTTTGCATTT ATTTAAAGGA TCATGGAACT TACCCTTTGC CCCTTTTTTT CGTGTAACGC GTAGGTCAAG AAGCTCGCTT TGAAAGCTGC CATTATGGCA ATGCTCGGTG GAGAAGCCAT GCCGCGCATT CTCATGCAAG TCATTCGGTT CTGCATTAAT TCCAACGACA AGCAGCTCAA GAAGCTCTGT ATGCTTTACT GGGAAGTCGT TCCCAAGTAT CAGGAGCCTA CGTCGGAAGA GCTTTTGGCG GCAGCCTCCG GAGGTCCTTC CGTACAGCGA AAGATGCTTC CGGAAATGCT CCTCGTCTGT AACGCTCTCA TGAACGATCT CAATCATCCC AACGAATACG TCCGCGGATC CATGTTGCGA TTCTTGTGCA AAATCAACGA CGCCGAAATC CTCGGTCCGC TCATTCCCTC CGTCAAGTCT TGTCTCGAAC ACCGACATCC CTACGTGCGC AAAAATGCCG CGCTCGCCGT ATTCCACGCC CACAAGTTAC ACGGTGAAAC CTTATTGCCC GATGGACCCG AACTCGTCGC CGCATTTCTG GAACAAGAAA CCGATGTTGC CGCTCGTCGC AACGCATTCT TAATGCTCTT CAACGAAAAC GAAGATCTCG CCATTGACTT TTTGGCTCGC AATATGGATG ATGTGGGCAA ATACGGGGAT GGCTTTGCCT TGCTCGTCCT CGAACTCACG CGTCGAGTGT GTCGCAGAGA CCCCTCGCAA AAATCTCGAT TTGTCCGTGT CCTCTTCCAG ATGCTCTCCA GTACCTCCCC CGCAGTTTCC TACGAAGCCG CCTGGACCTT GGTCACGCTC AGCTCAGCGC CGACGGCGGT TCGTGCCGCT ACCTTGACCT ACATCAACTT GCTGAACGGA CAAAACGATA ATAACGTCAA GTTGATCGTT CTGGAGCGAT TGGAAGGGCT CAAAGACAAG CACTCCAAGA TTTTACAAGA ACTGCTTATG GATGTCCTGC GGGCCCTGGC GAGTCCCAAT CCGGATATTT GTCAAAAGGT CTTGGCCGTG GCCATGGACG TGGTCACGAG TCGCTCGGTC CAGGAAGTCG TCAACGTCCT CAAACGCGAA GTCCAAAAAA CCGTGTCGGA AGAAGCATCC ATCGAAGGAA AGGGTGCGAG TTATCGCAAT ATGCTCATCA AGGCGATTCA CGGTTGTGCC GTCCGCTTCC CACAAGTCGC CGAGTCGGTC GTCCACACAC TCACCGACTT TTTGAGCACG GACAGTGGTA TGCAGGTTAT TATCTTTGTA CGCGCTATTG TGGAACAGTA CCCGGAGCTT CGGGCGCCGC TTCTAGCCAA GCTCGTCTCC ACCCTGGAAG ACGTCACCTC GAATCAAGTG TTCTGCGTGT GCCTCTGGAT TTTGGGTGAG TATTCCGAGA CAGCGGACAG CATCACCGAC GCTTTCAATA CAATTACGGA ACAGGCAGGA GAACCTCCCT TTATTCTCAA GAACGCAGAG AAGGAAGAAG CAGAAAAGGC TGCGGCCGAA GCCGAAAAGG CCGCTCCCAA AATTGTTTCC AAAAACGTTG TTTTGGCTGA CGGCACTTAC GCGACGCAAA CGATCTACAG TGAAGCGAAA ACTCCCGTTC ACGACTCGGT CAATTCCCTG CGTCGCATGT TGATTGGAGG GGATGTTTTT CTGGGAGAGA CACTGGCTTC GTCGCTGACC AAGCTTTGTT TGCGCGGTGG CCTTTGTCCC GAAATGGACC CACTTGCTTT GAAAGCTATG GTGGCCAAGT CCGTTCTCGT CATGTGCGGC GTTGTCAAAA TGGCCGAAGT TACGATCGCT GCCCAGCGCA CATCCCTATC GGACTGTCAA GAACGTATCA CTCTTTGCTG CCGTGCCCTG ATTGATCCCA AAGCGCAAGA ACTTCTCAAA CCAACTCTTC TCGAAGGTGG CCGTGCCCGT TTCGGAGAAT TTCTCAAAAT TCTAAAAGAT AAGGAAGCCA AGGAGAATAA AAAGAAAGAG TCAGACAAGG AGATCACAAC GCAAGCCGAC GACTTGATTC ACTTTCGCCA ACTCAAATCG ATGGCGGTCC AGGCCGGTGA TCTAGATTTG GACGACGGTT CCGATTTGGC TCGGGCTACT GGATACGACA ATGCCGGCTC GTTGTTGAGC TCGGAGTTGA GCCACGTGTA CCAGCTCAGT GGCTTTGCCG ATCCAGTCTA CGCTGAAGCT TTAGTGACGG TCCACGATTA CGATATCGTA CTGGAGATCC TCGTTATCAA CCGAACCCCC AATACTTTGG CTAATTTGAC GGTAGAACTT GCGACGATGG GTGACATGAA GATCGTGGAA CGGCCACAGG CTCACACGAT TGGCCCGCTT GATCAAGTGA CAATTCGAGC GTCCATCAAG GTTAGCTCTA CGGAAACCGG ACATATCTTT GGCACTATTG TGTACGAAGA CGCGGCGACA CGGGAGAAAG CTTACGTCAA CCTGAACGAT ATTCACATGG ATATTATGGA TTACATTCGC CCAGCTTCGT GTACCGACGA GGTTTTTCGA TCTATGTGGG CGGAATTCGA ATGGGAAAAC AAAGTCGCCA TTTCGACCTC CATTGCGTCC TTGTTGGGTT TTCTGAGCCA CATTGTCAAG TCCACAAATA TGACGTGTCT CACGCCGCAC GACAAGACCG AAAAGGGTTC CTTTTTGGCA GCCAATTTAT ACGCTCGAAG GTAAGCTACG CATATGGTAC TTGGTTGTAG GTCGTCTGTC CGAGGTCTTT TTGTTGTTGC TTTGCCCCGA TTTCGTGTTA GATGCCGGAA GCCGTCTTCA TCGGTCGGGA TCTGTTAATA TCGGGGGAAA TTGGTGGAAG GCTGAGAACT TAGCGCCGAC AAGGCGTTTC TCGGTCTGGT CGGCATTATT GCAAACACTG TCTTCACTTT TTTGGAGTCA GGGTATCATA TTGTCGATAC AACTGTATAC TGGTGATCCG TGCGTTGCTC ACATATTCGT CGTTGCTTTT GCACTGACAG TGTTTTCGGA GAAGACGCAT TGGTGAACGT GTCGGTCGAA AAGAAGGACG ACAATGACGG CAAACTGGCT GGCTACATTC GTATTCGCAG CAAGACGCAA GGCATAGCTT TGAGTTTGGG AGATCGCATT ACTTCGGTCC AACGTGGCTT GCCGGAAACG GTCAAGGCAC AGTAAGCATA GAAACCGGCG GTGTAGTACG TACAGTCACG GGCATGGATA CATTTTATTT TTTAAAACGT TCGTTGCGGT TAC
|
Protein sequence | MTMTPANSNE SYCTFTLALD VTAGGLPSEA EIAKDLESND ANVKKLALKA AIMAMLGGEA MPRILMQVIR FCINSNDKQL KKLCMLYWEV VPKYQEPTSE ELLAAASGGP SVQRKMLPEM LLVCNALMND LNHPNEYVRG SMLRFLCKIN DAEILGPLIP SVKSCLEHRH PYVRKNAALA VFHAHKLHGE TLLPDGPELV AAFLEQETDV AARRNAFLML FNENEDLAID FLARNMDDVG KYGDGFALLV LELTRRVCRR DPSQKSRFVR VLFQMLSSTS PAVSYEAAWT LVTLSSAPTA VRAATLTYIN LLNGQNDNNV KLIVLERLEG LKDKHSKILQ ELLMDVLRAL ASPNPDICQK VLAVAMDVVT SRSVQEVVNV LKREVQKTVS EEASIEGKGA SYRNMLIKAI HGCAVRFPQV AESVVHTLTD FLSTDSGMQV IIFVRAIVEQ YPELRAPLLA KLVSTLEDVT SNQVFCVCLW ILGEYSETAD SITDAFNTIT EQAGEPPFIL KNAEKEEAEK AAAEAEKAAP KIVSKNVVLA DGTYATQTIY SEAKTPVHDS VNSLRRMLIG GDVFLGETLA SSLTKLCLRG GLCPEMDPLA LKAMVAKSVL VMCGVVKMAE VTIAAQRTSL SDCQERITLC CRALIDPKAQ ELLKPTLLEG GRARFGEFLK ILKDKEAKEN KKKESDKEIT TQADDLIHFR QLKSMAVQAG DLDLDDGSDL ARATGYDNAG SLLSSELSHV YQLSGFADPV YAEALVTVHD YDIVLEILVI NRTPNTLANL TVELATMGDM KIVERPQAHT IGPLDQVTIR ASIKVSSTET GHIFGTIVYE DAATREKAYV NLNDIHMDIM DYIRPASCTD EVFRSMWAEF EWENKVAIST SIASLLGFLS HIVKSTNMTC LTPHDKTEKG SFLAANLYAR SVFGEDALVN VSVEKKDDND GKLAGYIRIR SKTQGIALSL GDRITSVQRG LPETVKAQ
|
| |