Gene Information |
Locus tag | PHATRDRAFT_54072 |
Symbol | ERCC2 |
ID | 7197063 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 24858 |
End bp | 28911 |
Gene Length | 4054 bp |
Protein Length | 782 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | xeroderma pigmentosum group D complementing protein |
Protein accession | XP_002177848 |
Protein GI | 219112193 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | AGACTTCATT ACCGTAAACA GAAACAAGTC TTGTCAGTCA TCAATGAGGA ATTGACCGTT CTCTTCATAG GCAATTCCTG AAAGCTTTGC ATAAGCGGAA GGTGTGAAAA AGGCGTCAAA ATGAGATTCG ATTTGGACGG CCTGGATGTG TTCTTTCCGT ACGATCGCAT CTACCTTGAG CAGCATCAGT ACATGCGAGC ACTGAAACAG TCGCTGGACG CAGGTGGGCA CTGTCTCCTG GAGATGCCTA CGGGAACGGG AAAAACCGTA TGTTTGCTAT CGCTGATTAC TTCCTACCAG TTTGCGAATC CGTCGGCGGG TAAGCTCGTG TATTGCACTC GAACCGTTCC GGAAATGAAT CACGTTATGG AAGAACTCGC GACAGTTTTA GCGTACCGGT CGCAAGAACT ACAACGGCAC CAGGAAGAAA ACATCTTGCC AATGGACACT GACGGGGATA TTGAAAACGT TCCGGTTGCG GGCAGTAGCG GGGATGTAAA CGTCACTGCC ATCACCCACA TTACCAGCAA TGGCTCGCCC AATCGCAAGA AGCCTCGAAA GGTCTATACC GCCAATTTAC AAAAACCTCC CATGGGACCG ATCTCGGACG GTCGCGGAGC TGGTGGAAGC GGGGCTCTAG CATTGTGTTT AAGCAGTCGA CGGAATATGT GTGTTCACGA GCGAGTCATG GCCGAAAGCG ATCGGGAAGC CGTCGATGCG GCTTGCCGAA GTCTAACGGC GAGCTGGGTT TTGGAAGAAG CACAGAAGCG TCCGGGCTCG ATCGAAACCT GCAGCTATTA TGATAATTTT CATGCTGCAG GCGAGTCCAC TTCAATGCCC AGTGGCGTGT ACGATTTGGA AGAGCTGCAG AAGTGGGGCA AGCGGCGTGG ATGGTGTCCA TATTATCTGA CTCGGCAAGC CATTAATCAC GCCAATATAC TCGTATTCAA CTATCAGTAC ATGCTTGATC CAAAAGTTGG TACGTCGCTT GAGGCTGCGT TTTGTACAAT CCTTGAGGTT CACGCGCTTA CACTCTGCTG TTCCTTTTCT CGTTATGTAG CCAAAATGGT TTCGAAAGAA CTCGAAGCCG AAAGTATCGT GGTGTTTGAC GAAGCCCACA ACATTGATAG TGTATGCATT GAAGCGTTAT CAGTGACCAT CAACGACCGA GGATTGGAAC AAGCTACACG TTCTCTTGGA CGCTTGTCAT CAGAAGTGTC TCGCCTGAAA GCATCTGACA ATCAAAGGTT GCAGACCGAG TACCAAAATC TCGTCAACGG CTTGATTGAC CAAGGATTAC TGGATGCGCC GGCGAATGAT GCTGGACTCA CAAGCGTACG TTTTTTTCTT GGTCCGACAC TGGTTATCCT TCCCATAGTA TAAATATTTT TGTCTCACAT TCACCTGTGC TCTGGAATAC AGAATGTGCT CAATCCTGAT GTTTTGAATG AAGCCGTACC CGGCAACATT CGTCGAGCCG AACACTTTAT AGGATTTATG AAAAAAGTCG TGGAGCATTT GAAAGCTAGG CTGCGTTCCG TTGCAGGGCC GAATGGAGGT GTGCGGAGTG AAACGCCGCT AGCATTCTTA CATCGCATGA CAAATGGTAC CAGTCTGGAA GCCAAGCCTT TGCGGTTTGC CTATTCCCGG CTATCGTCCT TGCTACGCAC CCTGCAAGTT TCGAATCTGG ACGATTTTAA TTCACTGACA GACGTTGCGG ACTTTTCAAG TCTACTTGCT GCTTACAGTG AAGGAGTCGC CAAGTTTGCT ATTATCATGG AGCCGAACGG GTCCTCCATT 
CCAGGGGCTA CGGACCCTGT AATTCAACTT GTTTGTTTGG ATTCGTCACT GGCTATTGCA CCTCTTTTCA AGCGATTCGG GAGCGTGATC ATCACATCGG GAACACTGTC ACCCATTGAT CTCTATCCCA AACTGTTGCA GTTTGAACCT TGTGTCTCCG AGTCGCTGTC CATGTCTACG TTCCGTCCAT GTATTCGCCC ACTGGTGATC ACGAAAGGAT CGGATCAGCT GGCGGTATCT ACAAAGTACG AAGACCGAGG TGACATGGGT GTCATTCGAA ATTATGGAAC CATGTTGGTC GAACTGTGCA GTACCATCCC TGATGGCGTG GTTGCTTTCT TCACATCGTA CAGTTATATG GAATCCCTGA TCAGTGAGTG GGATGCGATC GGAATTCTAC GAGAGTTGAC CAAATCGAAG CTTGTTTTTA TAGAAACCAA AGATGTGGTC GAAACCACCT TAGCTTTGGA CAATTACCGC AGAGCCTGTG ATTGTGGGCG GGGCGCCGTT TTTCTATCAG TAGCACGGGG AAAAGTATCG GAAGGTATCA ACTTTGATCG ACATTACGGA CGAGCTGTGA TTATGTTTGG TGTTCCCTTT CAGTACACTC TTTCGCATAC CCTCAGAGCT AGACTGGAGT ATCTTCAGAC ACACTACCAG ATTCGTGAAC AAGACTTTCT AAATTTTGAC GCCATTCGCC AAGCTTCACA GTGTGTTGGT CGTGTCATTC GAAGTAAGAC CGACTATGGG CTGATGATCT TTGCAGACAG TCGATATAAT CGCCATGACA AACGATCGAA ATTGCCTAAA TGGATCCAAC AATTCATGTC CGATCATTTT TTGAATTTGA GCACAGACAT GGCGCTTTCG CATGTCAAGC AATTTCTGCG CCTCATGGGA CAACCTATCG ACCAGGAGGC GCTGCAAAGC GTTCTACTTA CGCTAGACGA AGTCAATCGA ATGAATCCGC CCACAATTCG GACAAACATA GCTGCAGGGG AAGGCGGTGC AATGATTACC GAGGTATCGT CCGCTCAGCC AATTCCAGCA TTTTCGGTAA TGTAGATTTG GAAGGTGTGT CTTGTTGTTC TCTATCTATC AAGATGGAAA TTTAATACAG AGCAATAGGT GCAGTTTTGG CATAGGCTGT TTACCATTCG CTAGTTCATC TTTCGCCCCC ACTCTCGGAG AGCAGTCGAA CAACCCATAA TGTTCTGAGC TATCGCTCCT TCCAATTCGC GATACACGTC AACCGTTAGC TGCAAAGGAA CAGCTTGCAA AAGAGACGGC TCCAAATCCA AGACGGAACG ATCATGTTGG TACTGCAGGG GTGCCCCACC GCTATGATCT GAAGCGGGAC GTAGCAGGCC AACACTAAGC AAGTCAACAA AAGCCTTGGT CAGAACTTGT TTGGAATACC GATTTGAACT ACCTTTGTAC GAGCCAAGGT ACTCTTCCAA CATGCGTTCA AGCGTCAAAA CGGCGTTGCT TTGCTCCCGG CGATTGTCCC GGGATAGAAT TCGCCTCGTA GCCAATACCA GCGCTGCTTG CGGGCCCGAC AAATCTCGCA AGGCTTGCAT ACGGGGATCG ACAGCCACAT CCTCTACTAT CACCAGGTCG CGTTCTCCCG ATTCAGATGC CAAGGATCCT CCCATATCGG CCAATGCTTC CAAGAGGTAT TCGGCATCAA ATGTTGGAAT CGGAGATGCC GATGAGCTAG CATCACCTTC CATGGCGGCC AAGCAATCCA TCCGATAGTT TGTAATCGCG AATGTCAAGA CGCGGCTGAA CCAGCGAACA TCTTGACCCA 
ACATGCAATC CCGCACGAAA GCATTATAAA CTCGTCTCCT TGTGGGATGA TTGTCGTCTT CTGCAGGTTT GGTCAGGAGT GCGCGAATTT CGTGGTAAAG CGATGGCTGA TAAGCCGCAT CATAGTGGTC TTTTCGTGTC GATTCCAACT TGCACGACAC AATATCGCGA AGTTTGCCGA TGGACGTGCA CGGTCCAAAT TGAAGAAATT GAGATGTACC CTCGGCTCGA CTACGAATTC GTTTTTCCAG TAAACCGATG AGGCCTAGAT GGCAGGTCAT ACCGACAAAG CACACAGATG AACCGTCGGA AGCTACGCGA TCGAGAAGGT GATAAAGTAA CAGTTGTCGA TCTTTATGCT CCGTCAAACC ACTATCCAAA GAATTTGCCG TCGCTTTCTG TTGTCCCAGA AAAAGATCGA GCTCATCCAG AACAATTACG ATAG
Protein sequence | MRFDLDGLDV FFPYDRIYLE QHQYMRALKQ SLDAGGHCLL EMPTGTGKTV CLLSLITSYQ FANPSAGKLV YCTRTVPEMN HVMEELATVL AYRSQELQRH QEENILPMDT DGDIENVPVA GTLCLSSRRN MCVHERVMAE SDREAVDAAC RSLTASWVLE EAQKRPGSIE TCSYYDNFHA AGESTSMPSG VYDLEELQKW GKRRGWCPYY LTRQAINHAN ILVFNYQYML DPKVAKMVSK ELEAESIVVF DEAHNIDSVC IEALSVTIND RGLEQATRSL GRLSSEVSRL KASDNQRLQT EYQNLVNGLI DQGLLDAPAN DAGLTSNVLN PDVLNEAVPG NIRRAEHFIG FMKKVVEHLK ARLRSVAGPN GGVRSETPLA FLHRMTNGTS LEAKPLRFAY SRLSSLLRTL QVSNLDDFNS LTDVADFSSL LAAYSEGVAK FAIIMEPNGS SIPGATDPVI QLVCLDSSLA IAPLFKRFGS VIITSGTLSP IDLYPKLLQF EPCVSESLSM STFRPCIRPL VITKGSDQLA VSTKYEDRGD MGVIRNYGTM LVELCSTIPD GVVAFFTSYS YMESLISEWD AIGILRELTK SKLVFIETKD VVETTLALDN YRRACDCGRG AVFLSVARGK VSEGINFDRH YGRAVIMFGV PFQYTLSHTL RARLEYLQTH YQIREQDFLN FDAIRQASQC VGRVIRSKTD YGLMIFADSR YNRHDKRSKL PKWIQQFMSD HFLNLSTDMA LSHVKQFLRL MGQPIDQEAL QSNLPSLSVV PEKDRAHPEQ LR
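The length and composition fields in this record can be cross-checked against the coordinates and sequences it lists. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates (consistent with 28911 - 24858 + 1 = 4054, the listed Gene Length) and that the spaces in the sequence fields are display grouping in blocks of ten. The helper names `span_length`, `strip_blocks`, and `gc_percent` are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Drop the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    bases = strip_blocks(dna).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates as listed in the Gene Information table above.
print(span_length(24858, 28911))  # 4054
```

Passing the full Protein sequence field through `strip_blocks` and taking `len` of the result should give the listed Protein Length, and `gc_percent` applied to the Gene sequence field gives a value to compare with the listed GC content. The record does not say how that figure was rounded.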