Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_21083 |
Symbol | hCdc48 |
ID | 7204652 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 415174 |
End bp | 418152 |
Gene Length | 2979 bp |
Protein Length | 806 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185883 |
Protein GI | 219121314 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCAGGGATC AATCCCATCT CACTCGTTGG CTTGTCACAC GCACACACAA CAAAGGCATA CACACGAAGG CAGATGGCGT AAGTGAATCG GAGCAATGGA GAGAGAGGAG TTGTAGCGAC GATGCTGCTG GGGATCGTTT GTCTGTGAAT CAGGAAATTC CGAGATTTGT TTCTTGGTGC CTGCCGTGTT CTTTGCATCC CTTACACTAG GTAGACACCG GCAATGAGTT CCGCAAATAC CGACTTGGTA GATGACTAGT CACTGTTTGA TAGTGGGTAT GATGTAATCG AAGTTGTCGA TCCGTCCGTA CATGTTTCTT ATACTTTCCA AGCCGGGTCA CATGAATCCC TCATTCTGTC GCTTTCCTTT GCCATTTCAA TTGCAGCAAG GACGAAGAAA TGGCGGACGC CATACTCAGC TCGGGAAGCA AGAAGCGCAG TCCCAACCGC CTCATAGTCG ATGACGCCAC CAACGACGAT AATTCGGTCA TCTCCCTCTC TCCCGCAAAG ATGGAACAGC TGGAGCTATT CCGTGGAGAC ACTGTCCTCA TCAAGGGAAA GAAAGGTCGA GATACGGTCT GCATCGTGCT TGCCGACGAA ACCTGCGACG ACACAAACGT GCGCATGAAT AAGGTGGTGC GTAAGAATCT ACGCGTGCGC CTCGCGGATG TTGTTACCGT CACGAGCTGT GGTGACGTGC CCTACGGCAA GCGCATCCAC ATTCTGCCCC TGGACGACAC AATCGAAGGC GTTTCGGGAA ACCTGTTCGA TGTCTATCTC AAGCCCTACT TTTTGGAAGC CTACCGTCCC GTCAAAAAGG GGGATCTCTT CCTGGTTCGC TCCGCCATGC ACCCGGTAGA ATTCAAGGTT GTCGAAACGG ACCCGGCACC CTATTGTATT GTCGCACCCG ACACCGTCAT CCATTGTGAG GGTGACCCGG TCAAACGTGA AGACGAAGAA AAGATGGATG ACGTGGGTTA TGACGATGTG GGTGGTTGCC GCAAGCAAAT GGCGCAGATT CGGGAAATGA TCGAGTTGCC CTTGCGTCAT CCGACTCTCT TCAAGACACT GGGTGTGAAG CCACCTCGCG GTGTCTTGCT GTACGGTCCT CCCGGCTCCG GAAAGACTCT CATTGCTCGG GCTGTTGCCA ACGAAACCGG AGCTTTTTTC TTTTTGATCA ACGGGCCCGA AATCATGTCC AAGATGGCTG GTGAATCCGA ATCGAACTTG CGCAAGGCTT TTGAGGAAGC AGAAAAGAAT GCTCCTGCCA TTATCTTTAT CGACGAGATT GACTCCATTG CGCCCAAGCG TGAAAAAACC AATGGCGAAG TCGAGCGTCG TATCGTCAGT CAAATGCTGA CGCTCATGGA CGGCCTCAAA CAGCGCGCCA GTGTTGTTGT CATTGGGGCA ACCAACCGCC CCAACGCCAT TGACCCGGCC TTGCGCCGTT TCGGGCGTTT CGATCGCGAA ATTGATATCG GCGTGCCGGA TGAGAATGGT CGTCTGGAAG TCTTCCGCAT TCATACGCGA AACATGAAAT TGGACGAAGA TGTGGAACCG GAGGCGATTG CGCGGGAAAC GCACGGCTTT GTTGGGGCCG ATATCGCCGC ACTCTGTACC GAAGCTGCCA TGCAGTGCAT TCGTGAAAAG ATGGATTTGA TCGATATCGA AGATGAACAG ATTGATGCGG AAATATTGGA CAGTATGGCC GTCAGTCAGG ATCATTTTCG ACATGCATTG GCGCAGTCGA ATCCGTCTAG TTTGCGTGAG ACGGTGGTCG AAGTCCCTAA CATTTCTTGG GAGGATATTG GTGGTCTCGA GCAAGTCAAG GTATGTTTAC CGGATCAAAC GGAGGAAACT ACTGCCTTTT GTGTGAGTTT CATCGTTTTC TAACTCCGCT TCTCACTTTT GTCACGACAG CGCGATCTCA AGGAACTTGT TCAGTACCCT GTCGAGCATC CCGAAAAGTT CGAAAAATTT GGAATGTCAC CTAGTAAAGG TGTTCTCTTT TATGGTCCTC CTGGTTGTGG TAAAACTTTA ATGGCCAAAG CTGTCGCCAA CGAGTGTCAG GCCAATTTCA TTTCCATCAA GGGACCTGAG CTGCTTACCA TGTGGTTTGG AGAAAGTGAA GCAAACGTTC GCGATGTGTT TGAGAAGGCC CGTCAAGCCG CTCCATGCGT GCTCTTCTTC GACGAACTCG ACTCTATTGC CCAGCAGCGT GGAGGGAGTC AAGGAGACGG TGGTGGTGCC GCCGATCGCG TCATGAACCA GCTTTTGACC GAGATGGACG GTGTTGGTTC GAAGAAGAAC GTGTTCATCA TTGGAGCGAC TAATCGTCCC GATATCATCG ATACGGCTTT GATGCGTCCC GGACGTTTGG ACCAGCTTAT TTATATTCCG ATGCCCGACT TTGAGTCGCG CTTGTCGATT CTTCGCGCGA CGCTTCGCAA GAGTCCAGTA TCGAAGGATG TTGACCTGAA CTACCTTGCC TCGCAAACCG ATAAGTTCAC CGGTGCCGAT CTTACGGAGA TTTGTCAAAG TGCGTGTAAA ATTGCCATTC GAGAAGAGAT CGAGCGGGAC ATTGAACGTC AGCGCATGAA GCAAGAAGCC GGCGAGGACA TGGACGACGA AGATGACGAG GTTGAAGATC TCATGCCGGA GATATTGCCA AAGCACTTTG AAGTCTCCGT TCGCAATGCG CGTCGATCTG TCTCGGACCG CGACCTGGCC CAGTACGCTT CCTTTGCGCA GACCTTGCAA CAATCACGGG CAGCCGTTTC GGGATCGACC GGTGGCAGTC TCGCAACTTT TGCTTTTCCG GACGCTAACG CGGCTGTTGG CGTTGGAGCG GCGGCGGAAG ACGACGATGA TGAGGAAGAC CTCTATAGTT AGATGAGACA GGTCAAGCCC GGAACGGCGA AGACAGGCAC GTCCATAATA ATCTAACCTT AAAAGTTGAA AAAGAGATGT ATGTATGCA
|
Protein sequence | MAKDEEMADA ILSSGSKKRS PNRLIVDDAT NDDNSVISLS PAKMEQLELF RGDTVLIKGK KGRDTVCIVL ADETCDDTNV RMNKVVRKNL RVRLADVVTV TSCGDVPYGK RIHILPLDDT IEGVSGNLFD VYLKPYFLEA YRPVKKGDLF LVRSAMHPVE FKVVETDPAP YCIVAPDTVI HCEGDPVKRE DEEKMDDVGY DDVGGCRKQM AQIREMIELP LRHPTLFKTL GVKPPRGVLL YGPPGSGKTL IARAVANETG AFFFLINGPE IMSKMAGESE SNLRKAFEEA EKNAPAIIFI DEIDSIAPKR EKTNGEVERR IVSQMLTLMD GLKQRASVVV IGATNRPNAI DPALRRFGRF DREIDIGVPD ENGRLEVFRI HTRNMKLDED VEPEAIARET HGFVGADIAA LCTEAAMQCI REKMDLIDIE DEQIDAEILD SMAVSQDHFR HALAQSNPSS LRETVVEVPN ISWEDIGGLE QVKRDLKELV QYPVEHPEKF EKFGMSPSKG VLFYGPPGCG KTLMAKAVAN ECQANFISIK GPELLTMWFG ESEANVRDVF EKARQAAPCV LFFDELDSIA QQRGGSQGDG GGAADRVMNQ LLTEMDGVGS KKNVFIIGAT NRPDIIDTAL MRPGRLDQLI YIPMPDFESR LSILRATLRK SPVSKDVDLN YLASQTDKFT GADLTEICQS ACKIAIREEI ERDIERQRMK QEAGEDMDDE DDEVEDLMPE ILPKHFEVSV RNARRSVSDR DLAQYASFAQ TLQQSRAAVS GSTGGSLATF AFPDANAAVG VGAAAEDDDD EEDLYS
|
| |