Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51952 |
Symbol | CPD3 |
ID | 7201113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 553266 |
End bp | 554799 |
Gene Length | 1534 bp |
Protein Length | 511 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | class II CPD photolyase |
Protein accession | XP_002180071 |
Protein GI | 219118604 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGCAA ACCGTACACG AGTCCTTACT TCGGAAGGAA CCGAGCCAAA AGAGGGACAA TCGGTAGTGT ACTGGATGCA ACGTGACGTG CGATCGGTCG ATAATTGGGC TCTCTTATGG GCTCGAGATC TAGCGATGCA GCACGATGTT CCCCTACACG TCGTGTACGC GTTGCCACCA CCGGCTTCGT CGGACGGATC AGACAACGAT AGAGATTTGC CTCCAGCTCT CATTCAATTA CCCATGACGA AGCGACATGG CGCCTTTTTG CTGGGTGGGC TAGAATGCGT GTACAAGGAG TTGAAAGAGA TGAAGATTCC GTTGTACGTC TGCCTTCCCG ACTCTCACGA GAAGGTTGGC GAGACTGTCT GCGAAGCAAT CCTGCATAAA TACAAGGCAA AAATTGTTGT CTCTGATTTT TCTCCGATAC GCGAATACCG TCAATGGATG GAACTCCAGG CCGTACCTAT CTTGGAGGAA GCGAAGGTCC CGTTTTATCA GGTTGATGCC CACAACATTG TGCCGGTGTG GACGGCAACC GACAAACGAC AAGTTGGAGC TCGAACCCTA CGGCCGCGAA TTCATAAAGT GTATAATGAC TACCTACAAG ACTATCCGGA TCTCAAAGGC AACAGCCATT CGGTTGACCA ACCCAAGTTT GACCGGGTCG AATACGAATC GTTTTTACAG ATGGACGAAT CGGTGGAATC CGTCGACTGG GCACAGCCTG GAACAGAAGC AGGTATGAAA CAGTTTGAAT TTTTTAGCAA GAATGGCCTA AAGATTTTTC ATGAGCAACG TAATGATCCC GTGCAAAAGC ACGTCTGCTC TGACATGTCC CCCTGGATCA ATCACGGCCA TATTTCGTTT CAACGGTTGG CCCTAAATGT CAAAGCATTG AACAAACACG CCAACGGAGC TGCAGCCTTT ATTGAAGAAG GCGTTATTCG TCGCGAGCTA TCAGACAACA TGTTGTACTA TTCTCCGAAC GACTACGACT CGCTCGAAAC GGCAGCTGGC TGGGCACGGG AGAGCCTGCA ACTGCACGCG TCCGACGAAC GTGAATTCGT GTACTCACTC TCCGAGCTGG AGGAAGGACG CACTCACGAT GATTTGTGGA ATGCAGCTCA GTTGCAGATG GTACGAGATG GAAAAATGCA CGGCTTTATG CGCATGTACT GGGCCAAAAA AATCCTCGAG TGGTCCGAAT CTCCGGTTGG GGCGTTGCGG ACGGCGCAAT ACCTGAATGA CAAGTACGAA TTGGACGGCC GCGATCCAAA CGGCTTTGTC GGGGTCGGCT GGTCCATAAT GGGAATCCAT GATCAAGGAT GGAAGGAACG AGAAGTGTTT GGTAAAATTC GGTACATGAA CTACAACGGG TGTAAGCGTA AATTCAAAGT GGAGGAGTAT GTTGCTCAGT ACAAAGGTGC CGCCGAGAAT GCTGCCAATG CAGTGGAGGA AACAAATGGG TCGTCCAACA AGCGCAAATC GTTACCAAGC TCTTCAAATT CGAAACAAAA GACTGCCAGA AAGT
|
Protein sequence | MLANRTRVLT SEGTEPKEGQ SVVYWMQRDV RSVDNWALLW ARDLAMQHDV PLHVVYALPP PASSDGSDND RDLPPALIQL PMTKRHGAFL LGGLECVYKE LKEMKIPLYV CLPDSHEKVG ETVCEAILHK YKAKIVVSDF SPIREYRQWM ELQAVPILEE AKVPFYQVDA HNIVPVWTAT DKRQVGARTL RPRIHKVYND YLQDYPDLKG NSHSVDQPKF DRVEYESFLQ MDESVESVDW AQPGTEAGMK QFEFFSKNGL KIFHEQRNDP VQKHVCSDMS PWINHGHISF QRLALNVKAL NKHANGAAAF IEEGVIRREL SDNMLYYSPN DYDSLETAAG WARESLQLHA SDEREFVYSL SELEEGRTHD DLWNAAQLQM VRDGKMHGFM RMYWAKKILE WSESPVGALR TAQYLNDKYE LDGRDPNGFV GVGWSIMGIH DQGWKEREVF GKIRYMNYNG CKRKFKVEEY VAQYKGAAEN AANAVEETNG SSNKRKSLPS SSNSKQKTAR K
|
| |