Gene Information |
Locus tag | PHATRDRAFT_1947 |
Symbol | CPD1 |
ID | 7201298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 693015 |
End bp | 694559 |
Gene Length | 1545 bp |
Protein Length | 515 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | cyclobutane pyrimidine dimer 1 |
Protein accession | XP_002180488 |
Protein GI | 219119456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
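The coordinate and length fields above are related by simple arithmetic. With 1-based, inclusive coordinates, the gene length is End bp - Start bp + 1, and an unspliced CDS encodes one amino acid per three bases. The Python sketch below checks this against the values in this record. The function names are hypothetical, and the inclusive-coordinate convention is an assumption that agrees with the listed values (the listed protein length equals the gene length divided by 3, so the CDS here appears to exclude a stop codon).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of complete codons in an unspliced coding sequence."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(693015, 694559)
print(length)               # 1545
print(codon_count(length))  # 515
```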
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000588882 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTGA TCTGGCTTAA GCGGGACTTG CGCCTTACAG ATCACGGCCC ACTGGCGGCT GTGGCCCAAC GCAAAGACCG AGATGTCTGT ATACTCTACT TGTACGAACC CGATCAGCTG GCAGAACCAA GCGTTCACGG ATCCCACGTA CTATTTGCTA ATGAAGGGCT AGTAGACTTA GATACGAAGT TGTCGAATCT TCGAAGGTTG TCGTCTGGCC AAAACGCTGC GGCCAGCAAA AGCTTTGGTT CGTTGACAGT CTGCCACTGC GAGGTAATCC AGGCTCTACA AGCTATTCAT GCGCAAAAGA AAATAGCGAG GCTACTGGCA CATATGGAGA CCGGACACAT GCGGTCCTAT GCTCGCGATA AACGAGTTCG AAAATGGTGT AGGGATAGAA AAATACCTTT CGTGGAGTTG CCTCAAACGG GTGTGTCACG ATGCCTTACG AATCGCGACG ACTTTCATCG CAATCTGCAA ATGTTTTTGA AGAAGAAACA GTATCGCACA CCAACAGCCC TCGAATGCAA CATAGTCATC GATTTAGAAT TACCAGGAAG AAGCATGGAG CCCTTGTTCG CCGAGTTGAT CGAGATTCCT TTAGAGCAGA GAGTCGACCG TACAGAACGC CAGCAAGGAG GGGAAACAAC AGCATTGGAA ATCCTCCGGT CTTTTCTTTA CCATCGAGGA GTAGGATTTT CAAAAGGAAT TTCGTCACCC AATTCTTCGT GGACGTCATG CAGTCGGCTT TCCCCGTACC TTACATGGGG CCAAATTTCC TTAAGACACG TAGTACAAGC ACTCCAGGAA CGTCAAGCTC AGCTGAAGAC GCAGAAATGT CGATCTGATG ATCGCTGGTT GCGCTCCTTT ACTGCGTTCT CATCTCGCGT GCACTGGCGA TCGCACTTTA TTCAAAAGCT CGAGTCGGAA CCGGAAATGG AACAACGCGA CGTAAATGCA GCCTTTCAAC CACTCCGTCG ACAACCCGGC GATTGGAATG AATGCTACTA TCAGGCTTGG TCAACTGGAA ACACAGGCTA TCCAATGATG GACGCATGTA TGCGCTGTTT GCACCGACAT GGTTGGGTCA ACTTTCGAAT GCGGGCCATG CTGGTTTCAT TCGCAAGCTA CAATTTGTGG CTGGATTGGC ATCGGTTCGC TCCCCACTTG GCTCGCGTTT TTCTAGACTA TGAACCGGGA ATTCATTATC CGCAAATTCA AATGCAGTCG GGTACAACAG GTATTAACGC CTTGCGCTGT TATTCTGTAA CAAAACAAGG AAAAGAGCAC GATCCTCGAG GAATTTTCGT TCGCAAGTAC ATTCCTGAAC TCCAGTCCGT ACCAAATGAC TACATTCACG AGCCTTGGAA GATGTCTAAA TCTATGCAGG CCAAGTGCGG CGTTCACATT GGCGAACACT ATCCCGCACC TATTGTGAAT GAACAGGAAA CAGCGAAAAG CGCCAAAGAA CGCATCGCTG CCGTCCGTCG AAGAAACGAA ACTCAGGAGG CCTCACGAAA GGTTTACGAA AAGCATGGGA GCCGT
|
Protein sequence | MDVIWLKRDL RLTDHGPLAA VAQRKDRDVC ILYLYEPDQL AEPSVHGSHV LFANEGLVDL DTKLSNLRRL SSGQNAAASK SFGSLTVCHC EVIQALQAIH AQKKIARLLA HMETGHMRSY ARDKRVRKWC RDRKIPFVEL PQTGVSRCLT NRDDFHRNLQ MFLKKKQYRT PTALECNIVI DLELPGRSME PLFAELIEIP LEQRVDRTER QQGGETTALE ILRSFLYHRG VGFSKGISSP NSSWTSCSRL SPYLTWGQIS LRHVVQALQE RQAQLKTQKC RSDDRWLRSF TAFSSRVHWR SHFIQKLESE PEMEQRDVNA AFQPLRRQPG DWNECYYQAW STGNTGYPMM DACMRCLHRH GWVNFRMRAM LVSFASYNLW LDWHRFAPHL ARVFLDYEPG IHYPQIQMQS GTTGINALRC YSVTKQGKEH DPRGIFVRKY IPELQSVPND YIHEPWKMSK SMQAKCGVHI GEHYPAPIVN EQETAKSAKE RIAAVRRRNE TQEASRKVYE KHGSR
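The sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the demo uses only the first three blocks of the gene sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), an assumption: the record leaves
# the "Translation table" field blank. Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence listed in this record.
prefix = "ATGGATGTGA TCTGGCTTAA GCGGGACTTG"
print(translate(prefix))    # MDVIWLKRDL
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence string to the same functions would allow a comparison against the listed protein sequence and GC content.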