Gene PHATRDRAFT_50403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50403 
SymbolDHK1 
ID7199212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp259978 
End bp262736 
Gene Length2759 bp 
Protein Length883 aa 
Translation table 
GC content47% 
IMG OID 
Productdiatom histidine kinase 1 
Protein accessionXP_002185350 
Protein GI219130391 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCGGCGCTA CTGTGATAGA AAGAATTGCG CGCTCGTGAG TACATTCCCA TCGTTGCGTT 
TTTCGTCGTT TCGCAGAAAT CGATGATGGC TGCCTCGACG GCATCCCAAC AAATTAAATT
CGAAAGCAGT GCCGCCAGAG ATGCCTTTTG CTGTTCTCGG AATGCTGCTA TTTCTCAGAG
AACTCAGCCT CTCGGCTGCT ACGCACAAAT GGATCGAATA CAGCTTACCG CACCTTCTAC
AGACTTGTTG AGCGACGACG ACGAAGAAAC CCTTAAAATT TGGATATGGA AGCATCACGG
CGAAAAAATA CGCAAAGCCT TAAAAGTTGG ATACGGAGGG CACGAAGACC TAGCTGGTTC
CTGTTGTCAC CTTGTTCACG AAGAATCTCC GCATTCTACA CCCATACAAC TAACTGAAGA
AAAATCGGAT GAAAGTATAT ATTCTTCACG GAGATTGAAG CTCCTCGATC GCACAGCCAA
GGTACAACGA CTATACTTAC AGAAAGAACC CCCTGGAGTT GTCTTTGGAA GTTTGCTCGA
AGGTCTTTTG GAACTCACCG AAAGCGCTTA TGGATTTATT GGCGAGGTCA AGCATGATGC
CCAGAAGGGT GTTTATCTGG AAACCCACGC CGTTCTAGAA AATTCGCTCG GCTCGCACAA
TTCTGCTTTC GAAGCGAACC AAGAGGGTAT GCAGGTTTTT AATATGGAAA CCCTGCTGGA
AAAAGTCGTA ACATCCCAGC AACAGCTTAT ATCCAATAAT TATCAGGAGC AAGGGCGCAA
CGACCCAACG GGTCCCCCAC CCATCGAAAC ATTTTTGGGT ATTCCATTTT TTGAAAACAA
TGGAAAGCTG ATCGGTCTCG TGGGAATCGC CAACAAACCG AATGGATACG TACAGGAAGA
TGCTGATTTT CTAGAGCCCT TCATGGTGAC CTGCAGCAAT CTGTTACAAG CCTTTCAACA
GGTACAGGAA AACGAGTCAC TCATCAATAC CCTGGAACAA AAGGTCCGGG ACCGCACCCG
GGAATTGCAA GTCTCTAATG AACGTTTGAA ACAGGCCAAT CGGCAAGTGA TGCAAACTTC
GGCACAGCAG CTTCAACATT TTGCCTGTAT GAGTCATGAA ATTCGCACTC CGCTTAATTG
TATTGTCGGT CTTTCCAGCT TGTTGCAAGA ATCCAAATTG AGTCCGATGC AGGAAGACTC
CATGCGTATG ATTGTCATGA GTGGTGATTT ACTCTTAACA GTGGTCAATG ATGTATTGGA
TTATTCCAGA TTGGAATCGG GCAATGTTGA TATCGAAATC CAGCGGAGCA GTCTGCAAGA
AACATTGATC TCTATGGTGC ATTCAATCGA GATGAAGGCT CAATCCAAAC GTATTTTGGT
TAAAACATAC TACGATCCCG CTGTTCCAGA ATACGTTCAC ACAGATAGTC GACGATTGCA
ACAGATTCTT TACAATTTGC TGGGCAACGC CATTAAATTT AGTCGAGATG ACAGTATCGT
GGAACTCCGC GTGTCGCTTG CTGAGAAAGC AGCAACGAAC TCTCTATTTG AGGGCATCGA
CATTCAAAGA GAATGCACGG ACAGATCATG GTCGCCGCTA GTTTTTCCAG AAGGAAATTC
GTCTGATACT GTGGAACCTC CAAATTGCGT GTTACGATTT ATCATCAAAG ATTATGGTCA
AGGGATCCGA CATACCGATT TTTCACGGAT CTTTCAACCG TTTCTGCAAG CCAGTTCCGA
AACAGAACGC GTTTATGGGG GCACTGGTCT CGGGCTCGCC ATTACGGCCA AACTTGTGGC
AGGTCTAGGC GGGCATGTCT TTGTTGACAG CGAAGTGGGG CGTTGGTCTA CGTTTACTGT
CGATCTTCCC TTCCATCAAG AGCCGGCCCC GATTGCTTCT ATTACTTCTC ACTTACAGAA
CGCCACGATC TTGTTTGTGT GTAACGACGC TGGGACTCTT GCTCAGATTT CGCCAATTTT
TCAGCGTTAC AGTGTGACTT TCCATCAATT CGACGATATG GAAGAGATGG ATGGTAGTAT
TACGACCCAG GGTTTTTTAA AACGAGGCCG TCACTACTTT TGTTTGGTAC ATGAAGATTT
GTACGATTCA GAGGCTTTCG ATTTACTGTC CAATTTGGCT ACTTCTGTAC TATTGACGTT
CGGTCCCAAA TTTTGCATTC CAGAAACACA GGACCATTAC CGTTCCTTGG TCCAAATCTT
ACCTTCGGTC TTGATGGAAT CCATTGCCGC ATTTGTTCAC CGTACACGTA ATCGTCCGGA
AGGTCCAGTG AAGACTGCAT CCTTCAGACG AGCTAATCGC ATACCATACG CGGCGTTTCG
GGTACTCGTT GCCGAAGACA ATATAATCAA TCAAAAGGTT TTGCTCCGCA TCCTGGATCG
ACTCGGTATG AAAGACGTGG TGATGGTGGA CAACGGAAAG AAGGCCGTCG ATCGTGAAGC
CGAGGAGCCT TTTGATGTCG TACTCATGGA TATGCAAATG CCAGTTATGA ACGGTGTGGA
GGCCTGTAAA CGCATTGTGG GTCGGCATGC TACCAGGCAT CCCCAAGCAT TGGTTATCTT
TGTGACGGCA AACGTGTCTC ATGAGTTTGA GGCAGAATGC CAAAAGGCGG GAGCTGTTGG
GTTTATGCCG AAACCTTTCA ATATTGGGGA GATTGAAAAG ACCTTCCAAA AGGTCCACGC
TATCATTGGA GCGCGAGAAA GCTTGGCATT GTGAATGGAT GCGACTTTCA GTGTAAATT
 
Protein sequence
MMAASTASQQ IKFESSAARD AFCCSRNAAI SQRTQPLGCY AQMDRIQLTA PSTDLLSDDD 
EETLKIWIWK HHGEKIRKAL KVGYGGHEDL AGSCCHLVHE ESPHSTPIQL TEEKSDESIY
SSRRLKLLDR TAKVQRLYLQ KEPPGVVFGS LLEGLLELTE SAYGFIGEVK HDAQKGVYLE
THAVLENSLG SHNSAFEANQ EGMQVFNMET LLEKVVTSQQ QLISNNYQEQ GRNDPTGPPP
IETFLGIPFF ENNGKLIGLV GIANKPNGYV QEDADFLEPF MVTCSNLLQA FQQVQENESL
INTLEQKVRD RTRELQVSNE RLKQANRQVM QTSAQQLQHF ACMSHEIRTP LNCIVGLSSL
LQESKLSPMQ EDSMRMIVMS GDLLLTVVND VLDYSRLESG NVDIEIQRSS LQETLISMVH
SIEMKAQSKR ILVKTYYDPA VPEYVHTDSR RLQQILYNLL GNAIKFSRDD SIVELRVSLA
EKAATNSLFE GIDIQRECTD RSWSPLVFPE GNSSDTVEPP NCVLRFIIKD YGQGIRHTDF
SRIFQPFLQA SSETERVYGG TGLGLAITAK LVAGLGGHVF VDSEVGRWST FTVDLPFHQE
PAPIASITSH LQNATILFVC NDAGTLAQIS PIFQRYSVTF HQFDDMEEMD GSITTQGFLK
RGRHYFCLVH EDLYDSEAFD LLSNLATSVL LTFGPKFCIP ETQDHYRSLV QILPSVLMES
IAAFVHRTRN RPEGPVKTAS FRRANRIPYA AFRVLVAEDN IINQKVLLRI LDRLGMKDVV
MVDNGKKAVD REAEEPFDVV LMDMQMPVMN GVEACKRIVG RHATRHPQAL VIFVTANVSH
EFEAECQKAG AVGFMPKPFN IGEIEKTFQK VHAIIGARES LAL