Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34592 |
Symbol | CPF2 |
ID | 7199524 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 983416 |
End bp | 985541 |
Gene Length | 2126 bp |
Protein Length | 610 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | cry-dash from the cryptochrome/photolyase family |
Protein accession | XP_002178889 |
Protein GI | 219116188 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTA GCAGGTCCAA GCAGTCTTTT TGGTTCCTTA TCTTTGTTCA CACCGTCCTT ATCTTGACCT TTGCTGCCGG TGCCGCCACT GCCATGAGAG GAGCATCCGT CAGTACCGTG ACGAAAGCGA CCTCACGCGC CTCGTCGCAA GTTGTCTTGC ACTGGTTCCG ACACGGAGAT TTACGCTTGC TGGACAATCC GGCCTTGATC CATTCCAGGT AAGAGTTACA CGTTATTTTG TCTTTGCGAC AGCTCGAGTA GGAATGGAAC GCACAAAAAC TCGAGTCCGT CCCCACGAAC TTCCAACACA AACCTTTCCT CACTCATCTT TACGTGTGAC CCTTCTTGTT GGCTTGTTCG TTAGCAAAAC TGCCGAATCT TGTGTTCCCG TCTTTTGTTT CGACGACAGT GTCTACGGCA ACGACAATCG GACTCCGGAC ACTCGCGCGC CGCATTCGAA CGATCGCGGT CAACTCAAGT GCGGACCCCG TCGTGCGCAG TTCGTACTGG ATTCGGTCCA GGATCTACGA CGCAGTCTAC AATCCCGAGG TAGCGCCTTG TACGTGGCGC ACGGGAAACC CGCACAAGTC TTCCAAAGAT TGGTGGATGC CTGGCCGGCG GTTCCAGCCG CCGACACTGC CGCTCCGAAC GGCAGTCTAC TCACCATTGT TTGTCAGCGG GAAGTGGTGC GGGAGGAAAA CGACGCGGTC CGAGCGGTAC AATCCGTTCT ACGCCGACGC TTTCCGCAAG CCAAAGTTCA ACAAATTTGG GGCTCCACCA TGTACGAACT GGACGATTTG CCCTTTGCCA CGGACCTCGC CAACATGCCC GATACCTTTA CGCCCTTTCG GAACAAGGTG GAAAAGAATT GTCAAATTGG CACACCCCTG CCGGTACCCA AGCAACTCTC TCTGCCGGAG AATTTTCCTT CCGCCTTGAA ACAAGGCCTG GAGTACCTAC CCACCTTAAA AGAGCTGGGG TACACGGATG CGCAAATTCA GCAAGTCGAA ACCCATGACG AGCGCGGAGT CCTGCACTTT ATGGTGGCGA AACAGCCGGA CTCGCACGTG TTGAGGACTA CATTTGGACA CAGGATTGTT TGAAGGATTA CTTCGAAACG CGCAACGGGA TGCTGGGGCC GAACTATTCG ACCAAATTCA GTCCCTGGTT GGCCCACGGC AACGTTTCAC CCCGGTACGT GGCCGCACAG TGCCGTAAGT ACGAAGAAGA ACGCGTGGAG AACAAATCTA CCTACTGGGT AGTCTTTGAA CTGCTGTGGC GCGATTTTTG CAAATTTTTT GCCACGAAAC ATGGTGACGC TATTTTTTAT CCGTACGGGA CGACCGAACG CACGGATCGG CACAAGCCTT GGTCGACCTT TGGTCGTAAT TTACAGGCGT GGCAGGAGGG TCGCACGGGC TACCCTCTCG TAGACGCCAA CATGCGAGAA TTGGTCGCCA CCGGCTTCAT GTCAAATCGT GGCCGTCAGA ATGTGGCATC CTTTCTCGCC ATTAACTTGA ATCACGATTG GCGATGCGGC GGGGATTTCT TCGAAAGCCA TCTGGTACGT TAAGGAGAAC ATTGTGTGGA CGTTTCCACG TTTTGGAATT GTTACTGACA CTTAATATTC GCGCATTTCT GAAATTCTAT TTTTGTCTTC GTTGCGGTGT TGGTGGTTGC TGCAGTTGGA TTACGACGTG TATAGCAATT GGGTGAATTG GTGTGCGGCC GCGGGTATGA CGGGTGGTCG TCTCAACCGA TTCAACATTT CTAAGCAAAG CAAGGACTAC GACCAACACG GCGACTACGT GCGGCATTGG TTACCGGAAC TGGCCAAGGT GCCCAACGAA TTCGTCCATG AGCCTTGGAA AATGACGTCG TTCCAGCAGA TGGAGTACGA GTGCAAACTC GGCGTGGATT ATCCCAACCC GATTGTTCCG CCGTCCCGGC CCAACCCCCA CACGGACCGG AACAATCGCG GCCGTGGTGG GCACCAGTCA AAAGGCAACA ATCGCCACGG TCCACCCGAC AAATCCCGCA AAGCCAATGC GAGTGGAAGC AACCGACACC AAAAATACGA AATGAAGAGT CTCCAACCCG GCAGTTTTCG AGTCAAGGAA TCGTAA
|
Protein sequence | MSSSRSKQSF WFLIFVHTVL ILTFAAGAAT AMRGASVSTV TKATSRASSQ VVLHWFRHGD LRLLDNPALI HSSKTAESCV PVFCFDDSVY GNDNRTPDTR APHSNDRGQL KCGPRRAQFV LDSVQDLRRS LQSRGSALYV AHGKPAQVFQ RLVDAWPAVP AADTAAPNGS LLTIVCQREV VREENDAVRA VQSVLRRRFP QAKVQQIWGS TMYELDDLPF ATDLANMPDT FTPFRNKVEK NCQIGTPLPV PKQLSLPENF PSALKQGLEY LPTLKELGYT DAQIQQVETH DERGVLHFMV AKQPDSHDCL KDYFETRNGM LGPNYSTKFS PWLAHGNVSP RYVAAQCRKY EEERVENKST YWVVFELLWR DFCKFFATKH GDAIFYPYGT TERTDRHKPW STFGRNLQAW QEGRTGYPLV DANMRELVAT GFMSNRGRQN VASFLAINLN HDWRCGGDFF ESHLLDYDVY SNWVNWCAAA GMTGGRLNRF NISKQSKDYD QHGDYVRHWL PELAKVPNEF VHEPWKMTSF QQMEYECKLG VDYPNPIVPP SRPNPHTDRN NRGRGGHQSK GNNRHGPPDK SRKANASGSN RHQKYEMKSL QPGSFRVKES
|
| |