Gene PHATR_21083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_21083 
SymbolhCdc48 
ID7204652 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp415174 
End bp418152 
Gene Length2979 bp 
Protein Length806 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185883 
Protein GI219121314 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCAGGGATC AATCCCATCT CACTCGTTGG CTTGTCACAC GCACACACAA CAAAGGCATA 
CACACGAAGG CAGATGGCGT AAGTGAATCG GAGCAATGGA GAGAGAGGAG TTGTAGCGAC
GATGCTGCTG GGGATCGTTT GTCTGTGAAT CAGGAAATTC CGAGATTTGT TTCTTGGTGC
CTGCCGTGTT CTTTGCATCC CTTACACTAG GTAGACACCG GCAATGAGTT CCGCAAATAC
CGACTTGGTA GATGACTAGT CACTGTTTGA TAGTGGGTAT GATGTAATCG AAGTTGTCGA
TCCGTCCGTA CATGTTTCTT ATACTTTCCA AGCCGGGTCA CATGAATCCC TCATTCTGTC
GCTTTCCTTT GCCATTTCAA TTGCAGCAAG GACGAAGAAA TGGCGGACGC CATACTCAGC
TCGGGAAGCA AGAAGCGCAG TCCCAACCGC CTCATAGTCG ATGACGCCAC CAACGACGAT
AATTCGGTCA TCTCCCTCTC TCCCGCAAAG ATGGAACAGC TGGAGCTATT CCGTGGAGAC
ACTGTCCTCA TCAAGGGAAA GAAAGGTCGA GATACGGTCT GCATCGTGCT TGCCGACGAA
ACCTGCGACG ACACAAACGT GCGCATGAAT AAGGTGGTGC GTAAGAATCT ACGCGTGCGC
CTCGCGGATG TTGTTACCGT CACGAGCTGT GGTGACGTGC CCTACGGCAA GCGCATCCAC
ATTCTGCCCC TGGACGACAC AATCGAAGGC GTTTCGGGAA ACCTGTTCGA TGTCTATCTC
AAGCCCTACT TTTTGGAAGC CTACCGTCCC GTCAAAAAGG GGGATCTCTT CCTGGTTCGC
TCCGCCATGC ACCCGGTAGA ATTCAAGGTT GTCGAAACGG ACCCGGCACC CTATTGTATT
GTCGCACCCG ACACCGTCAT CCATTGTGAG GGTGACCCGG TCAAACGTGA AGACGAAGAA
AAGATGGATG ACGTGGGTTA TGACGATGTG GGTGGTTGCC GCAAGCAAAT GGCGCAGATT
CGGGAAATGA TCGAGTTGCC CTTGCGTCAT CCGACTCTCT TCAAGACACT GGGTGTGAAG
CCACCTCGCG GTGTCTTGCT GTACGGTCCT CCCGGCTCCG GAAAGACTCT CATTGCTCGG
GCTGTTGCCA ACGAAACCGG AGCTTTTTTC TTTTTGATCA ACGGGCCCGA AATCATGTCC
AAGATGGCTG GTGAATCCGA ATCGAACTTG CGCAAGGCTT TTGAGGAAGC AGAAAAGAAT
GCTCCTGCCA TTATCTTTAT CGACGAGATT GACTCCATTG CGCCCAAGCG TGAAAAAACC
AATGGCGAAG TCGAGCGTCG TATCGTCAGT CAAATGCTGA CGCTCATGGA CGGCCTCAAA
CAGCGCGCCA GTGTTGTTGT CATTGGGGCA ACCAACCGCC CCAACGCCAT TGACCCGGCC
TTGCGCCGTT TCGGGCGTTT CGATCGCGAA ATTGATATCG GCGTGCCGGA TGAGAATGGT
CGTCTGGAAG TCTTCCGCAT TCATACGCGA AACATGAAAT TGGACGAAGA TGTGGAACCG
GAGGCGATTG CGCGGGAAAC GCACGGCTTT GTTGGGGCCG ATATCGCCGC ACTCTGTACC
GAAGCTGCCA TGCAGTGCAT TCGTGAAAAG ATGGATTTGA TCGATATCGA AGATGAACAG
ATTGATGCGG AAATATTGGA CAGTATGGCC GTCAGTCAGG ATCATTTTCG ACATGCATTG
GCGCAGTCGA ATCCGTCTAG TTTGCGTGAG ACGGTGGTCG AAGTCCCTAA CATTTCTTGG
GAGGATATTG GTGGTCTCGA GCAAGTCAAG GTATGTTTAC CGGATCAAAC GGAGGAAACT
ACTGCCTTTT GTGTGAGTTT CATCGTTTTC TAACTCCGCT TCTCACTTTT GTCACGACAG
CGCGATCTCA AGGAACTTGT TCAGTACCCT GTCGAGCATC CCGAAAAGTT CGAAAAATTT
GGAATGTCAC CTAGTAAAGG TGTTCTCTTT TATGGTCCTC CTGGTTGTGG TAAAACTTTA
ATGGCCAAAG CTGTCGCCAA CGAGTGTCAG GCCAATTTCA TTTCCATCAA GGGACCTGAG
CTGCTTACCA TGTGGTTTGG AGAAAGTGAA GCAAACGTTC GCGATGTGTT TGAGAAGGCC
CGTCAAGCCG CTCCATGCGT GCTCTTCTTC GACGAACTCG ACTCTATTGC CCAGCAGCGT
GGAGGGAGTC AAGGAGACGG TGGTGGTGCC GCCGATCGCG TCATGAACCA GCTTTTGACC
GAGATGGACG GTGTTGGTTC GAAGAAGAAC GTGTTCATCA TTGGAGCGAC TAATCGTCCC
GATATCATCG ATACGGCTTT GATGCGTCCC GGACGTTTGG ACCAGCTTAT TTATATTCCG
ATGCCCGACT TTGAGTCGCG CTTGTCGATT CTTCGCGCGA CGCTTCGCAA GAGTCCAGTA
TCGAAGGATG TTGACCTGAA CTACCTTGCC TCGCAAACCG ATAAGTTCAC CGGTGCCGAT
CTTACGGAGA TTTGTCAAAG TGCGTGTAAA ATTGCCATTC GAGAAGAGAT CGAGCGGGAC
ATTGAACGTC AGCGCATGAA GCAAGAAGCC GGCGAGGACA TGGACGACGA AGATGACGAG
GTTGAAGATC TCATGCCGGA GATATTGCCA AAGCACTTTG AAGTCTCCGT TCGCAATGCG
CGTCGATCTG TCTCGGACCG CGACCTGGCC CAGTACGCTT CCTTTGCGCA GACCTTGCAA
CAATCACGGG CAGCCGTTTC GGGATCGACC GGTGGCAGTC TCGCAACTTT TGCTTTTCCG
GACGCTAACG CGGCTGTTGG CGTTGGAGCG GCGGCGGAAG ACGACGATGA TGAGGAAGAC
CTCTATAGTT AGATGAGACA GGTCAAGCCC GGAACGGCGA AGACAGGCAC GTCCATAATA
ATCTAACCTT AAAAGTTGAA AAAGAGATGT ATGTATGCA
 
Protein sequence
MAKDEEMADA ILSSGSKKRS PNRLIVDDAT NDDNSVISLS PAKMEQLELF RGDTVLIKGK 
KGRDTVCIVL ADETCDDTNV RMNKVVRKNL RVRLADVVTV TSCGDVPYGK RIHILPLDDT
IEGVSGNLFD VYLKPYFLEA YRPVKKGDLF LVRSAMHPVE FKVVETDPAP YCIVAPDTVI
HCEGDPVKRE DEEKMDDVGY DDVGGCRKQM AQIREMIELP LRHPTLFKTL GVKPPRGVLL
YGPPGSGKTL IARAVANETG AFFFLINGPE IMSKMAGESE SNLRKAFEEA EKNAPAIIFI
DEIDSIAPKR EKTNGEVERR IVSQMLTLMD GLKQRASVVV IGATNRPNAI DPALRRFGRF
DREIDIGVPD ENGRLEVFRI HTRNMKLDED VEPEAIARET HGFVGADIAA LCTEAAMQCI
REKMDLIDIE DEQIDAEILD SMAVSQDHFR HALAQSNPSS LRETVVEVPN ISWEDIGGLE
QVKRDLKELV QYPVEHPEKF EKFGMSPSKG VLFYGPPGCG KTLMAKAVAN ECQANFISIK
GPELLTMWFG ESEANVRDVF EKARQAAPCV LFFDELDSIA QQRGGSQGDG GGAADRVMNQ
LLTEMDGVGS KKNVFIIGAT NRPDIIDTAL MRPGRLDQLI YIPMPDFESR LSILRATLRK
SPVSKDVDLN YLASQTDKFT GADLTEICQS ACKIAIREEI ERDIERQRMK QEAGEDMDDE
DDEVEDLMPE ILPKHFEVSV RNARRSVSDR DLAQYASFAQ TLQQSRAAVS GSTGGSLATF
AFPDANAAVG VGAAAEDDDD EEDLYS