Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48136 |
Symbol | CAF1-subunit-A |
ID | 7203288 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 299631 |
End bp | 303974 |
Gene Length | 4344 bp |
Protein Length | 1435 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | chromatin assembly factor subunit |
Protein accession | XP_002182657 |
Protein GI | 219124745 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.846168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTTTCG GAACACGTTT ATGGAGCATG ACTCAAAATA CTCCGTATGT CGTTGTGTTC GGAAATGTGC CACTGGGAGA CGGGGACGTC TTCCGGAACT TATGGCCAGT CCGAAGAACT AGTCCCTCCA CGACTAGGCG GAAAACCCGG AAGTCGAAAT CAAGAACGGG AGGAGGTCGC GGCGCGGACG GTTCGAATGC GCGCCGACCC ATCCCTGTTC CGTACGCTCG CAATGCGGAC GGACGGATCC GATGCGTTCC CTCTCCTGCA ACACTCTCGT ACACGGCACC CACGGTGTGG GTCGCCTATG CGGATCCGAC TCTTCGCGTC TTTGGTGTGT ACCTATTCCT TCCTCGGACA GGTTCGTTAC CTACCCACTG CTACGACGAC TGCACCCGGT GGTTTCGACA ACCACTATTG CACTCTTCTA CCAAGTATCC CCACACGCAT CGGTTCCGAA ACGCTGTGTA GACAAGCACT CACGAAAAAA CAACGAGACA GTGCGACGCC GTATCGGCTC CAGTTTCGTT CACCCTTCCC CAGAGTCGCG CGCGATCGTT GGTGAACAAC ATGCCTTCCC TTGCCTCCAT CGAAGTTCCT TTGGACATCG TCGAGGATGA CAATGACCAA ACACACCAGG ACGACACCAC CGGACTAGGC AAACGTCTAC AAAAGTCAGT TGCAACATCG AAGGCTTCCG TGGTCACCGT CACTTCCGCT GCGAATGCGT CGCCCTCGTC GAACGCTGCG GCGGACGACG CGACGCCGTG CGGGAGCGTT CCGGAAACCC GCAAAATAAC TCCAACAGAT ACCAAGCCAT CCCGTAGTGC GTTGCCTCGA CAACAGCAAA AGACACTCGC CGGATTCTTT GCGGTTGGGA AACGATCACC ATCGTCCGTC CGCCCGACGA ACCCGCGGAA GGCGTGTGGG AGCCGTACGA CGCTCCGCAC GGATTCCACG ACTCCGACGT CCCTGCCCCG CACCGCCGTC CCCGTGACGA CGACGACGAC GACAAACAAG GCCAATACCC GCAGTGCGCG GTCCCAGTCC CAGGCTACCA CTCCCGCCGC TCGTCCCGTC GTGCCATCCT CCGTCGGGAC CCGTGCCGGA CTCGGACCCC AGCAGGCGGC CCTCATGGAC ATTTGCCTCG GCCGTGTGAA TCTCCTCCCT TGCTTCGATC ACAACACACC GAACGATCGG CGGGATCCGA CGACGCCACC ACCACCAACG GTCGCTTCCG CAACAAAAGA CGCGCCGACG ATTCCACCCG AGGAGGCCTC CACAAAATCA ACGACGGATC GGGGCTCCAG GGGGAAGACG CTGACGACCG ACACGCACGA ACCAGCCGTC GTGGATCTTA CAATTGATAC CGTGGGAACA AACAAGAGTT GTACCAAGAC GATCGTGCCC ACGGGTACCA CCCAGAGCTT ACGGTCAGTA CACGATGCCG CGATGGTGCA ACCAGGCCAC GGTTCCGCCC GTCTCGATAC CTCGAGTCCT CCTCGTGCCA AAGTAGACAC AGAGGCCTCT TCCGACCACC GTACGGTCCA TTCCACCAAC GAAACACAGA CGCGTCCGTC GGCGCAATAC GACTCACTCC GCCTTCAGGC TCAGGCCCGC GCCCAATCTG TGTTGCAACG CTGCCGAACT ATTGCGGAAG AAGACTTTAC CGTGGCCTTG CCCAAAATAT CCCCGTTGTC GAAAGATGAG ATTTTAGCCA CGGAATCGAG TGACTTTCCG GAACCAGCCG TAGAATGCCT AGCGGCACTC GTGGAAGGCA GTGCTTTGCC GCTGGCAGCG CTGGCGGCAT ACGTTGCGAA CGAGCTCAAT GGAATCTACA ACACCAAGGT ATTTACCCAC ACTCTCGTCA CGGCCAAGAT CCCGCTCGTG GCCAACCGTA AACAATACGT AAAACACCCG TCCGCGGCCT CGGTCACCGG CGACGGGGAT TCGGTAGCGG CACCGTCGCC GCCACCCGTC CGTGCACTGG AAGACGATCG GCCGGATCAC GTATGGCGAT GGGAATTGAC CGTTCCGGAG TTGCTGGAAC CCGTCTCGCG CAAGCTCGTC CTCAAAGCGC GTTCCGCGCG TCGCAAGCTC GCGGCCGAGT TTCAAAGCTG TGCCAAAGTC CTCCAAGTCT TGACCGAAAT GGACGCCTGG TGGTTGCGGG AGCCAGCGCC ACCCCAACCA GCCTCGACCA AAAAATTTGA AAGGCTCACC ACTCGTCTCG TGCTCGAGCA AACACGCTTG CTCAAGTACG CCCGGGACGA AGAAGCGGCC AAACTCGCGG AACAGGCCCA GCGGAAAAAG CTACGGGAGG CGTCCGTCGC CAAGGCTACG CAACAAGCCG AAGCGGCCGC CGCTAAACAG CGAATCAAAG AACAAGCTGC CGCGGAAAAA CAACGGAAAA AAGACGAAGC CGAGGCGGAA AAGCAGCGTA AAAAGGACGA AGCCGATCGC AAGCTTCAGG AAAAGGAAGA TGCCGCACGT GAGGCTACGG AAGCGAAACA AGCCAAATTG CGCAAGCAAA AATCCTGCTT GATGAGTTTC TTATCTGCAA CGAAGAAAGC TTCGGAAGAG GCAACGCATG AGCATTTGAC ATCCTTTGTC GAAGCGATGG AAGCGGTAGA CTGCGAGATG GAGCCGACAC TGGGCTCTGC TCCCACTTCT CCTCTCAAGC CAACGAAAAG TCATTTCGAT GTCGTCGCCT TTCGTGCTGC TTTGGAACGA GGGGTCGTTC CTTCCAAGTC TCAAGCGTGT AGCAGACATG GACGCTATTG GAAAGCAAGT CGACACCGCC GGACAAAAAT GGTAAATATG GAGGTTTTTG TGACTGTAGT GCCGGAAAAT GGAGCCTTCG GGGCCCAGCC CTTTGCAGAG CAGCAAACGA TAACCGTTCC AAACAAATAT AAATTCCTCC GGTTTCACGA AGACGTTCGG CCACCCTATT TCGGCACGTG GAGTAAAAGA GGTTCCATTG TGACGGGAAA GACACCGTTT CGGAAAGAGA CCACTCTATT GGAATACGAC TACGATAGTG AGGCGGAATG GGAAGAAGGT GACGACGAAA TCGGAGAAGA TTTGGAGAAC GGCGAGGGAG ACGATGATGA AGAAGACAAA GAGGAAGAAG AAGCCGCAGG AGACGACGAA GACGGCTGGT TGGCTGCCGA CGATGAAATC GATGACGAAT TAGACGATGA AACGAGAAGG CTTCGTATAA AAGCCTTGGC TGCTGCCGAT AGTCCGAAAC AGAAGGAACA AATCGTACAT GTCATCGCGC CGAGAGATGG CAAACCTATC GTTGACGCCC ATGTCTCCTG CGCTGCTAAA TGCGTTCAGG GCTTGGATGT TCGGCAGGCC TCTCGTATTT TAGCATCTCA TAAGGCATTG GTTCTATATG ATTGCGATTT ATTTTTGGAC GCGTTTCCCC CCGAGTTGAT CGACGAAAGC TTCTCCGATG CTTCTCCTGC CGAAGCTAGT AACAAGGCAC CGGGAAGTCA GGAAATGTCC GAGGACGATT TCAAGACCGT TGCCAAGTTT GTCCACAATT GCACCCTTGC ATCGAAAGAC AAGGTCGTCG ACGAGCTTCG AAAGGCGCAT GAAAGCGTCA CTAGCAGCCG TGCTCATGCA TTGCGAGTGC TCGAGTCCAT GGCCGACAAA AAGAAGCACC CTGTAAAGGG CATCTATTGG GAGGTAAAAG GTGAAGTACT CGATAAGCTT GGACTAGAGG ATCTCAAGTC TGTTGAAAAT GATTCACAAG ATGTACTACG CACTATTGCG AAATTTGTAC ACAATAGTAC ACTCAATTCA AAAGAACGTG TAGTCGACGA GCTCTTGATA GCTCACGAGT CGATTGCATC GAGCCGCGCG GAAGCCATGC GCATTCTCGA GTCTGTTGCG GAGAAACGGA AACATCCTGT CAGTGGGGCT TACTGGCAAG TGAAGGAGCC GGCCAAGTCG GAGTTGGGCT TGGCAGACTT GTCGTCGAAC CCGCCAATCC TGCCTGGTAC AGAAGCACTA GCGACTGCGT TAACTATGCC CACTGAAAAA GAGAGGAAGG CTTTACAGGA CACCCCATCG ATCAAGGCGA CATCAACGTC AGGTAAGAAG CGAAAGACTG GGGTACAATT AACTGCCGTA TCTTCTACAA AGAAAGCTAC TTTGTCGGTA GCCGTTGACA ACGAGCCAAG AGCCGTGAAG CCTCGCGAAC CTTTGTTCGA AAAGGTCTCG AAAAGCCCAT CTAAAAAGCG AAAAGAACCG CCTGCTTCAT CCAAGATTTT GACATCGTTC TTATCAAAAA AGCCAGCTGT CGAGTCGACG TCTGCTACGC AGCCAATCGG CTAG
|
Protein sequence | MCFGTRLWSM TQNTPYVVVF GNVPLGDGDV FRNLWPVRRT SPSTTRRKTR KSKSRTGGGR GADGSNARRP IPVPYARNAD GRIRCVPSPA TLSYTAPTVW VAYADPTLRV FGVYLFLPRT GSLPTHCYDD CTRWFRQPLL HSSTKYPHTH RFRNACDAVS APVSFTLPQS RARSLVNNMP SLASIEVPLD IVEDDNDQTH QDDTTGLGKR LQKSVATSKA SVVTVTSAAN ASPSSNAAAD DATPCGSVPE TRKITPTDTK PSRSALPRQQ QKTLAGFFAV GKRSPSSVRP TNPRKACGSR TTLRTDSTTP TSLPRTAVPV TTTTTTNKAN TRSARSQSQA TTPAARPVVP SSVGTRAGLG PQQAALMDIC LGRVNLLPCF DHNTPNDRRD PTTPPPPTVA SATKDAPTIP PEEASTKSTT DRGSRGKTLT TDTHEPAVVD LTIDTVGTNK SCTKTIVPTG TTQSLRSVHD AAMVQPGHGS ARLDTSSPPR AKVDTEASSD HRTVHSTNET QTRPSAQYDS LRLQAQARAQ SVLQRCRTIA EEDFTVALPK ISPLSKDEIL ATESSDFPEP AVECLAALVE GSALPLAALA AYVANELNGI YNTKVFTHTL VTAKIPLVAN RKQYVKHPSA ASVTGDGDSV AAPSPPPVRA LEDDRPDHVW RWELTVPELL EPVSRKLVLK ARSARRKLAA EFQSCAKVLQ VLTEMDAWWL REPAPPQPAS TKKFERLTTR LVLEQTRLLK YARDEEAAKL AEQAQRKKLR EASVAKATQQ AEAAAAKQRI KEQAAAEKQR KKDEAEAEKQ RKKDEADRKL QEKEDAAREA TEAKQAKLRK QKSCLMSFLS ATKKASEEAT HEHLTSFVEA MEAVDCEMEP TLGSAPTSPL KPTKSHFDVV AFRAALERGV VPSKSQACSR HGRYWKASRH RRTKMVNMEV FVTVVPENGA FGAQPFAEQQ TITVPNKYKF LRFHEDVRPP YFGTWSKRGS IVTGKTPFRK ETTLLEYDYD SEAEWEEGDD EIGEDLENGE GDDDEEDKEE EEAAGDDEDG WLAADDEIDD ELDDETRRLR IKALAAADSP KQKEQIVHVI APRDGKPIVD AHVSCAAKCV QGLDVRQASR ILASHKALVL YDCDLFLDAF PPELIDESFS DASPAEASNK APGSQEMSED DFKTVAKFVH NCTLASKDKV VDELRKAHES VTSSRAHALR VLESMADKKK HPVKGIYWEV KGEVLDKLGL EDLKSVENDS QDVLRTIAKF VHNSTLNSKE RVVDELLIAH ESIASSRAEA MRILESVAEK RKHPVSGAYW QVKEPAKSEL GLADLSSNPP ILPGTEALAT ALTMPTEKER KALQDTPSIK ATSTSGKKRK TGVQLTAVSS TKKATLSVAV DNEPRAVKPR EPLFEKVSKS PSKKRKEPPA SSKILTSFLS KKPAVESTSA TQPIG
|
| |