Gene PHATR_37014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_37014 
SymbolDHK2 
ID7204469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp885982 
End bp888357 
Gene Length2376 bp 
Protein Length791 aa 
Translation table 
GC content48% 
IMG OID 
Productdiatom histidine kinase 2 
Protein accessionXP_002185972 
Protein GI219121499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.612589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACTACG GAGATCTAAT CAACGGTATT TGCTACACAG CTTTTCTTCC CGTGTACTTT 
TACCTTTGGC TCCAAGAGAA GGACTTGGGT CGACGCAGCG CCGCTTCAAG TGTTGTGGCT
GCGTTCGCGT GGACAGCTGC CTTTTCTTGG TTGTCCTATT TTCTCAAAAA GCCCAATCTT
TCAGTAATGG CGTTAATGGG AATGTCCTCT TCCATTTATT ACATGAACAT CCACCAATGG
CCACACGCGG ACGAGCTTGT GGTGCTTCCA CAATTTCTTT GGTCGTTGCT CATTTTGAGT
GCAAACCGGC CTTTAACGTT TCCGTTTCAG TTCGCGTTGG CAATGGTAGC CTTGCTAACA
GCCGTTTATG TTGCTGCGAA GGAATCCTCA CAGGCAGTAC TGGATACCTT GCCTGTGCTA
GGAGGTGCCA CATCTATGCA CATATGGTTT CACTACTACG CCACATACCT ACGCAAAAAG
TCTCACGGTT TTGTCAGTGC ACATGTAGCC TCGCTTGTCT TGGCCGGTTC CTTTGCATAT
CACGCTTCCA GCGAAATTGT CACCATGTAT CGTTCTCCCG GATACGGCTC TCAAGGGGGA
TATAGCATCT TACGGGCAGG ATTTTTTGCT TTGGTTGGTT TGGCAGCTGC TGGTTCCTTC
CAACTCGAAA TGGACTCGAA AGAAGCTTTA GAACGGCTTG TCGATGAACG CGCAAGAGAA
CTTACGAAAC AAGCAGCTCA CTTGCGAGTA CTTGAGCACG CTTTGCAAGC CTCTGAGACA
GCCATAGCTA TTGTTGATGT ATCGCAAAAG GTTTTGTGGT CCAACAGCTC GCTTCAAGAG
CTGGTGGCTG TCTCCCCAGT CAAGCTACAA GACTCAAATC TGTTCAAGGC ATTGCAATAT
CCAGATGACA AAGTCGTGCA AAGCTTTCCA CCCGTCCATG CTATTACTGA AGAAATTGTT
CTTCGCAAAA GACACATGTC AGTCGAAATG ACGCCGTTTC CGGCGGATGC GAAGAAAAAC
AACGAAAACC GGTTCCTCGT CGCTTTAAAG GATATGACAT CCCAAAGAGC TCGGGAACGC
GCCGAGAAAG CGGCGGAAAG GGAGGCGCTG ATAGCGCAAA CCATGAATGA AAGTATGCAG
AATCTTAGTC ACGAGCTCCG CACGCCACTA CAGGGTATTA TGGGTATGGC GAGTCTCGTT
CTGGATGAAC CAGGCTTACC TCCGGATGTG ACGGAAAGTA TGTCGATGGT CATAGCGTCA
GCTCGTCTTT TATTGACTCT CATAAACAAC ATGTTGGATG TGCGTAAATG TGATGCTTCT
ATGCTAGACG AGTTCCAGTT GACACCGTTT CGATTGGTAT CATCGCTAGA AGACGCCATC
ACCTTCTGCA AGCCTTTTGC GATCACATCA GAAGTCAAGC TTGACCTGGA AATCAACGAC
GACGAAATGG AAGTGGAGTC AAATGATTTG CGCTTTCCGC AAATCATGAT CAATTTGCTG
TCAAACGCAA TCAAACATAG CTGTAGCGGC GACCAAGTAA TCATTCGAGC CAGCTCTATG
GAATTGTCTG AGGCTGAGCA ACTGGTGGAT CGAGCCTTGG TTATAGCCCC CGGAGAACCT
AGCTCTAAAG CTTTTCGCAA AGAGGACGGC CCCGTTGCCG TAGTAACTGT TACCGACCAA
GGACCAGGTA TCCCGTTCGA CCAACGGATG CGTGTCTTTG GACGCTTTTC ACAGCTGACT
GAGAATGTTC AAAATGCTAT TATCGGTAGC AAGGTGGGGC AGCCCTCAGG AACTGGCTTG
GGATTGAATT TGTGTATGAA ATTCGTGCAT CGAATGAACG GCAGGATTTG GGTCACCAAC
AATCCTGAAA AAGGGTGCTC CTTCTCATTC TACGTTCATA GAGTCTTACA ATACGATCGG
ATGCGTGAGG TATCCTTACC TCGAACCTGG AGTCAGAGCA TACACCAAGT CTTAAGACCA
GTCCCGAGAA TTCCTCTCAC CAAAGAATTT CGGATTGTTT TGGTGGACGA TACACTTATA
AACCTCAAGG TGCTTTCAAG AATGCTCTCC AGGCTAGGAG TTCAAAAAAT GGCCACCGCA
AATAATGGGA AGGAGGCCTT GGATCTCCTG ACGACAGAGG ATGACTTCAA TCTTGTCTTG
ACCGACATAC AAATGCCAGA AATGACTGGG ATAGAGTTGA GCCTTGCAAT ACGTAAGCTA
CCCTTGAAGA GACAGCCGTT GATTGTTGGC TTAACCGCTG ACACGAGCGA TGCAGGAGAT
GCGCGCTGTA CTCAAAGTGG TATGGCGACT ATTTTAAGGA AGCCTATCAC AACAAATCAG
CTCCACCACT TCTTACAAGA GGTAGAAGTC GAGTAA
 
Protein sequence
MYYGDLINGI CYTAFLPVYF YLWLQEKDLG RRSAASSVVA AFAWTAAFSW LSYFLKKPNL 
SVMALMGMSS SIYYMNIHQW PHADELVVLP QFLWSLLILS ANRPLTFPFQ FALAMVALLT
AVYVAAKESS QAVLDTLPVL GGATSMHIWF HYYATYLRKK SHGFVSAHVA SLVLAGSFAY
HASSEIVTMY RSPGYGSQGG YSILRAGFFA LVGLAAAGSF QLEMDSKEAL ERLVDERARE
LTKQAAHLRV LEHALQASET AIAIVDVSQK VLWSNSSLQE LVAVSPVKLQ DSNLFKALQY
PDDKVVQSFP PVHAITEEIV LRKRHMSVEM TPFPADAKKN NENRFLVALK DMTSQRARER
AEKAAEREAL IAQTMNESMQ NLSHELRTPL QGIMGMASLV LDEPGLPPDV TESMSMVIAS
ARLLLTLINN MLDVRKCDAS MLDEFQLTPF RLVSSLEDAI TFCKPFAITS EVKLDLEIND
DEMEVESNDL RFPQIMINLL SNAIKHSCSG DQVIIRASSM ELSEAEQLVD RALVIAPGEP
SSKAFRKEDG PVAVVTVTDQ GPGIPFDQRM RVFGRFSQLT ENVQNAIIGS KVGQPSGTGL
GLNLCMKFVH RMNGRIWVTN NPEKGCSFSF YVHRVLQYDR MREVSLPRTW SQSIHQVLRP
VPRIPLTKEF RIVLVDDTLI NLKVLSRMLS RLGVQKMATA NNGKEALDLL TTEDDFNLVL
TDIQMPEMTG IELSLAIRKL PLKRQPLIVG LTADTSDAGD ARCTQSGMAT ILRKPITTNQ
LHHFLQEVEV E