Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29502 |
Symbol | DDB1 |
ID | 7203571 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 229215 |
End bp | 233824 |
Gene Length | 4610 bp |
Protein Length | 1284 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | damage-specific DNA binding protein 1 |
Protein accession | XP_002182922 |
Protein GI | 219125301 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCATCTTTGG CGATTGTCTC TCTTCGACTG TCTGTCCATG AGTGCTTCTA CCAAGAAGGA AGCGGCGCAC TACGTGGTGA CCGCACATCC GCCGGGAGGG GTTTTGTTGA CGGCGAAATG CAACTTTACG TCACCCTTTT CGCTAGTGAG TATGATGCTG CTCGTGCTCG TGTTACTTTC CAGCAATCGT GCGCTATTGT TTCCTGGTCC AGTAACAATC AAAACCTACG CCATGTATTG TATGCTCTGT ACCTAATCGG AAACAATTTC TTTTTTTTCT CGTTCGGCAG GACGTATTGG TCGCCAAATC GCGCCGTCTC GAAGTCCGTC AGCTGCGCAC GACGACGGAA GGGCTTTCAC CCTTTCCTAT CCTCGCCAGT GTCCCCATTA ACGGTCGCAT TGTGGGTCTC GTCCCCTTCA AGGTTCACGG TAGTGACACA TCCTACGTAT TCGTGCTGAC GGCGCGGCAG CAATACGCGG TCCTCGCCTA CGATCGAACG AACAGCGGGT CTGCGGCCTA TCCCCTCGTG ACCCTGGCTT CGGGAACCTT GCAGAGTCAG GAACACGCAG TTTTGGGACA AGAAGCGGAA TCGGGACCAA TCGTAGCGAT TGATCATTTT CACCGTTGTA TTGCGCTACA CGTCTATGAT GGTCTCCTCA CGATCATTCC CGTGAATTTG GAATACATGC GTCCGAAACC TTCCAAGCAA GACGAGACGT CGACGTCGAC CCAGAGTACC CGTACACTCG CGTCGCAACA GCTTTTGGGG ACGCCATTTC ATTGCCGTAT CGAAGAACGT ACCATTCTGC ACCTGGCCTT TTTGCAAATA CCCTTCGAAG CCTTGCCCCA ACTAGCAGTC TTGCACCAGG ATGCTCGCGG GGCACAGCAC ATCACGTCGC ACGTCATCAA TTGGAAACGC AAAAATATAT TTCTGTACGG ATCCTCCTCG GCACCGGCTG CTACCGAATG GTTACAAAAA TCCAACGTCG ATGGCGGATC GTCGTTGATT ATCCCCGTCC CGTTGCCGCC CAAGACGGTG GCGGCCGCGG CCGCTCCCGC AGCGAGCGCG GAAGCCCCAC CGGATTTCGC TCCCGCCAAA CATCGTTCTG GCGGAGTGCT GGTGGTAGGT CAACGCCAAC TTACCTTTAT CAATAACAAT GTAACCAAGG TCGTGCCTAT TCCACAGGCA CTCCATCTGT GTGTGGAAGA ATTGCCCGCC GATCCCAACG GTGGCTTGCC CCGATACTTA CTGGCCGATG AGTTTGGCAA TCTGCACATG GTCACCATTG TGCTCGTGGT CGATAAGGTC GTGGCACTCC AAATAGACAC GTTGGGTTCC TGTACATTGG CAACGTCTAT CGCCTATTTA CGGGAAGGTC TCGTCTTTGT TGGGTCCACC TTGGGCGATC CGCAGCTTAT TCAAATACAC GACGAGCCCA TTGTCGACGT CGAAGACGAA GAAGACATGG TTGGAGCCGA ATCATCCTAC CTCAGTGTAG TGGAAGAGTA TACGCATCTC GGACCCATTT TAGATTTTGA TCTCGTTCCG ACTGCTCCGG GCGGTGGTGG GCTTGGCCAG ACTGAGGGAA TTCACGGTCC GTCTCAATCC CAAGTCGTGA CAGCGTCGGG TTCATCCAAA TCGGGTTCAC TCCGTTTAAT TCGTAACGGA ATTGGCATGA ATGAATCAGC GGCGGTCGAA ATTCCCGGCA TACAAAACGT ATGGAGCTTG CGACGATCGT TTGCCGATGT TGACGATACG TATCTGGTAC AATCGTTCGT GCACGAAACA CGCGTACTAG GAGTGACGAC AATGGATGAC ATGTCTCAAG ACGAGAAAGA AGGCGATGTT GCTCCGGGAG GCACTCTGGA AGAAGTTTTT CTTATCGGCC TGAAGTCATC CTGTGCAACG CTATACGTCG GTAACGTCCA AGCGCACCAG AATGGTCTGC TTCAAATCAC AGAAGGGGAA GTGCGATTCG CCACAATGGA GGCCGTGTTG GACACGTGGC TCGTCCCATC CGGGGCGGCC ATTACTGTGG GCACAGCCAA CGAAGCTGGA CAAATAGCAG TTGCCCTCAA TGGCGGAAAG GTGCTTTATT TGAAGATAGA AGAAGGGAAG ATCCGGGAAT GTTCGGGGCA GCAAATGGAA CGCGAAGTCA GTTGTTTGAA CTTGAATCCC TTTGCTGCGT CGGACGCCAT GGATGTGGAC AATGACGTAA GAAAAAGCAC ATCACACACG AGCAGTTTTC TAGCCGTCGG CTTGTGGGAC GATTTCACGG TCCGTCTCTT GTCGCTAGAC GATGGATTGG AAGAGCTCCT CAAGATACAT CTAAGTACGG ACGAAGACGA GGATTTGGTC AGCGACACCA CAAGCTCCTT GGCGAGTCCG TCTCATAGAA ATCGGAACAA CATGATGGCA CGTAGCCTCT GTTTGATAAC GTTGGACTTT TCCAGCGGCA CATCCGGCAA TACCACATCA ACATCAACTT CACTTTCGTC CACTGGTTCT GGTGTGAATA TGTTGTTTGT TGGTCTCGGA GACGGTACAC TGATTTCGTT TGCGGTTGTC GAACGGGGTG CATCAATTTT CGTGCAGTCA AAGAAAGAAG TCTGTTTAGG AACGCAGCGA ATCGATCTCG TTCCATTATG TACTGAGCAG GGCGGAACGT GCGTCCTAGC GACTGGAGAT CGTCCTACAG TTATCTACTT GGCAGGTGTC GGTGGAATTT CTGCAAACCA GTTCAATCCA AAGCTATGCT ATTCCAACGT CAACCTCTCA GCCGGTGACG ACGAGGAGGA AGACGATGTC AGCCGACCTC CTTCGCAGCA AAGTATTGTG GTCAATGTTG CGACACCATT CTCGTCATCA CTATTGTTCG ATTCGGCAAC TGGCGGAAGT CAACGCTATT CGTTGTGCGT GGCGGACGAT TCGTTTCTGC GTATGGGGAT CATTGACGAC ATTCAAAAGC TTCACGTCAC AACTTGTCGA CTAGGAATGG CTCCCTGTCG AATCGTTCAT TGTGCCGACG GTCGATTGTT CGCCGTAGGC TGTATCGAAA GCGGTATCAA GCAATTCAGT TTGGGTGGGG ACGAGGCGAA CATGGGCAAC TGCATTCGTT TTATGGATGA CGCCAATTTC GATGATATCC ATCGAGTGGA TCTCGAACCA TTTGAAATGA TATTGTCAAT GGTGTACGCC ACGCTACGGA TTCCGTCTGA CGGAGATCAA TCGGACCAAC CCGTACATAG GCCGTTTTTG CTGGTTGGTA CCGCATACGC AATGCCAGAC GAAGATGAGC CAAGTCGTGG TCGCATTCTT GTCTATTCTT GCCAAGCGGA CGAGGCTTCC GGGACGCCAA CCAGCACACG TGCAGTGCGA CAGATTACGG AAATGTCGAC GCAAGGCGGT GTCTACAGTA TTTGCCAGTT CTACGACGGC AATTTTTTGT GTACTGTCAA TTCCAAAACA CATGTTGTGC AGATTGTTGC GGATTGTGGT GTCTTGCGGC TAGAGTACGT GGGAATCGGG CACCATGGAC ACATAGTGAG CTTGTTTGTG AAAAGCCGAG CGAAACCTGT GACGGACAAT AGTCCATCTT TGGCAAACCC AATGACTATG GGTGAGGGCA CCAAAGACAA TATTCCTTCT GGAGCAAACA TCAAACAAGA TCCAGAAGAA AAGCTGGCAA TTGTGGGAGA CTTGATGAGG TCAGTTAGCT TAATGCAGTA CTACCCTCAA CATGAAACTC TTGAGGAAGT TGCGCGAGAC TTCAACCCAA ACTGGACGAC CGCGGTGGAA ATGCTTACGG ACGACGTGTA CATTGGTGCC GAAAACTGGA ACAATCTTTT CTGCCTTCGA CGCAACAAGG CTGCTACCAG TGAAGAGATT CGCTGTCGAT TGGATAACAT TGGAGAGTTT CACCTAGGAG AAATGTGCAA CAAATTTATG AGTGGCAGTC TTGTCATGCC GGTCTCTTCC AACTCCACCA CATCAAGCCG GAGGGCCGTG CGAAGGACAA CCACTCCCCA GAAGAAGAAA GTCTCTGACT CTTCTGCTAA AGCTAGCTCT CCGGCTAGAG TTTGTCGCCC TGTTGTGATA ACTGGAAGTC AGACTTTGTT CGGAACAGTA GAAGGCTCTC TAGGGGTTAT CCTGGGCTTG GACGGGAGAA CTGCTGCTTT CTTTATCACG CTGGAAAGAG CAATTGCCAA AACGATTCAG CCTGTGGGCG GTTTCTCTCA TCAGCTATAC AGATCTTGCC AAGCTGAGCT CCGTGTTCAT CCCGCGCATG GTTTTGTTGA TGGCGATCTA GTTGAGACAT TTTTGGATTT GGATCGAAGG ACGATGGAGG CAGTTGTGGC TGAAATGAAT CGGGACGGCG GGTGGGAAGT GGATGATTTC GCGAACTCAA GGTCTGACGA GAACAATGAT AGTTCCAAGG ACACCGACAG GATCAACCTC GAAGAGCGAT CCGAGCTGTC TATTGACGAC GTTTTAGCGA TGGTAGAGGA GATGACAATG CTTCATTGAG TACTACAACA AGTTATGCGA TCTATACCTA CCTATACATT ATAGCGTGTT GTGCGGCCCT TGAGATGCAA AAGAAGGGTC
|
Protein sequence | MSASTKKEAA HYVVTAHPPG GVLLTAKCNF TSPFSLDVLV AKSRRLEVRQ LRTTTEGLSP FPILASVPIN GRIVGLVPFK VHGSDTSYVF VLTARQQYAV LAYDRTNSGS AAYPLVTLAS GTLQSQEHAV LGQEAESGPI VAIDHFHRCI ALHVYDGLLT IIPSTRTLAS QQLLGTPFHC RIEERTILHL AFLQIPFEAL PQLAVLHQDA RGAQHITSHV INWKRKNIFL YGSSSAPAAT EWLQKSNVDG GSSLIIPVPA EAPPDFAPAK HRSGGVLVVG QRQLTFINNN VTKVVPIPQA LHLCVEELPA DPNGGLPRYL LADEFGNLHM VTIVLVVDKV VALQIDTLGS CTLATSIAYL REGLVFVGST LGDPQLIQIH DEPIVDVEDE EDMVGAESSY LSVVEEYTHL GPILDFDLVP TAPGGGGLGQ TEGIHGPSQS QVVTASGSSK SGSLRLIRNG IGMNESAAVE IPGIQNVWSL RRSFADVDDT YLVQSFVHET RVLGVTTMDD MSQDEKEGDV APGGTLEEVF LIGLKSSCAT LYVGNVQAHQ NGLLQITEGE VRFATMEAVL DTWLVPSGAA ITVGTANEAG QIAVALNGGK VLYLKIEEGK IRECSGQQME REVSCLNLNP FAATSHTSSF LAVGLWDDFT VRLFLCLITL DFSSGTSGNT TSTSTSLSST GSGVNMLFVG LGDGTLISFA VVERGASIFV QSKKEVCLGT QRIDLVPLCT EQGGTCVLAT GDRPTVIYLA GVGGISANQF NPKLCYSNVN LSAGDDEEED DVSRPPSQQS IVVNVATPFS SSLLFDSATG GSQRYSLCVA DDSFLRMGII DDIQKLHVTT CRLGMAPCRI VHCADGRLFA VGCIESGIKQ FSLGGDEANM GNCIRFMDDA NFDDIHRVDL EPFEMILSMV YATLRIPSDG DQSDQPVHRP FLLVGTAYAM PDEDEPSRGR ILVYSCQADE ASGTPTSTRA VRQITEMSTQ GGVYSICQFY DGNFLCTVNS KTHVVQIVAD CGVLRLEYVG IGHHGHIVSL FVKSRAKPLA IVGDLMRSVS LMQYYPQHET LEEVARDFNP NWTTAVEMLT DDVYIGAENW NNLFCLRRNK AATSEEIRCR LDNIGEFHLG EMCNKFMSGS LVMPVSSNST TSSRRATLFG TVEGSLGVIL GLDGRTAAFF ITLERAIAKT IQPVGGFSHQ LYRSCQAELR VHPAHGFVDG DLVETFLDLD RRTMEAVVAE MNRDGGWEVD DFANSRSDEN NDSSKDTDRI NLEERSELSI DDVLAMVEEM TMLH
|
| |