Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_37014 |
Symbol | DHK2 |
ID | 7204469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 885982 |
End bp | 888357 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | diatom histidine kinase 2 |
Protein accession | XP_002185972 |
Protein GI | 219121499 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.612589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTACG GAGATCTAAT CAACGGTATT TGCTACACAG CTTTTCTTCC CGTGTACTTT TACCTTTGGC TCCAAGAGAA GGACTTGGGT CGACGCAGCG CCGCTTCAAG TGTTGTGGCT GCGTTCGCGT GGACAGCTGC CTTTTCTTGG TTGTCCTATT TTCTCAAAAA GCCCAATCTT TCAGTAATGG CGTTAATGGG AATGTCCTCT TCCATTTATT ACATGAACAT CCACCAATGG CCACACGCGG ACGAGCTTGT GGTGCTTCCA CAATTTCTTT GGTCGTTGCT CATTTTGAGT GCAAACCGGC CTTTAACGTT TCCGTTTCAG TTCGCGTTGG CAATGGTAGC CTTGCTAACA GCCGTTTATG TTGCTGCGAA GGAATCCTCA CAGGCAGTAC TGGATACCTT GCCTGTGCTA GGAGGTGCCA CATCTATGCA CATATGGTTT CACTACTACG CCACATACCT ACGCAAAAAG TCTCACGGTT TTGTCAGTGC ACATGTAGCC TCGCTTGTCT TGGCCGGTTC CTTTGCATAT CACGCTTCCA GCGAAATTGT CACCATGTAT CGTTCTCCCG GATACGGCTC TCAAGGGGGA TATAGCATCT TACGGGCAGG ATTTTTTGCT TTGGTTGGTT TGGCAGCTGC TGGTTCCTTC CAACTCGAAA TGGACTCGAA AGAAGCTTTA GAACGGCTTG TCGATGAACG CGCAAGAGAA CTTACGAAAC AAGCAGCTCA CTTGCGAGTA CTTGAGCACG CTTTGCAAGC CTCTGAGACA GCCATAGCTA TTGTTGATGT ATCGCAAAAG GTTTTGTGGT CCAACAGCTC GCTTCAAGAG CTGGTGGCTG TCTCCCCAGT CAAGCTACAA GACTCAAATC TGTTCAAGGC ATTGCAATAT CCAGATGACA AAGTCGTGCA AAGCTTTCCA CCCGTCCATG CTATTACTGA AGAAATTGTT CTTCGCAAAA GACACATGTC AGTCGAAATG ACGCCGTTTC CGGCGGATGC GAAGAAAAAC AACGAAAACC GGTTCCTCGT CGCTTTAAAG GATATGACAT CCCAAAGAGC TCGGGAACGC GCCGAGAAAG CGGCGGAAAG GGAGGCGCTG ATAGCGCAAA CCATGAATGA AAGTATGCAG AATCTTAGTC ACGAGCTCCG CACGCCACTA CAGGGTATTA TGGGTATGGC GAGTCTCGTT CTGGATGAAC CAGGCTTACC TCCGGATGTG ACGGAAAGTA TGTCGATGGT CATAGCGTCA GCTCGTCTTT TATTGACTCT CATAAACAAC ATGTTGGATG TGCGTAAATG TGATGCTTCT ATGCTAGACG AGTTCCAGTT GACACCGTTT CGATTGGTAT CATCGCTAGA AGACGCCATC ACCTTCTGCA AGCCTTTTGC GATCACATCA GAAGTCAAGC TTGACCTGGA AATCAACGAC GACGAAATGG AAGTGGAGTC AAATGATTTG CGCTTTCCGC AAATCATGAT CAATTTGCTG TCAAACGCAA TCAAACATAG CTGTAGCGGC GACCAAGTAA TCATTCGAGC CAGCTCTATG GAATTGTCTG AGGCTGAGCA ACTGGTGGAT CGAGCCTTGG TTATAGCCCC CGGAGAACCT AGCTCTAAAG CTTTTCGCAA AGAGGACGGC CCCGTTGCCG TAGTAACTGT TACCGACCAA GGACCAGGTA TCCCGTTCGA CCAACGGATG CGTGTCTTTG GACGCTTTTC ACAGCTGACT GAGAATGTTC AAAATGCTAT TATCGGTAGC AAGGTGGGGC AGCCCTCAGG AACTGGCTTG GGATTGAATT TGTGTATGAA ATTCGTGCAT CGAATGAACG GCAGGATTTG GGTCACCAAC AATCCTGAAA AAGGGTGCTC CTTCTCATTC TACGTTCATA GAGTCTTACA ATACGATCGG ATGCGTGAGG TATCCTTACC TCGAACCTGG AGTCAGAGCA TACACCAAGT CTTAAGACCA GTCCCGAGAA TTCCTCTCAC CAAAGAATTT CGGATTGTTT TGGTGGACGA TACACTTATA AACCTCAAGG TGCTTTCAAG AATGCTCTCC AGGCTAGGAG TTCAAAAAAT GGCCACCGCA AATAATGGGA AGGAGGCCTT GGATCTCCTG ACGACAGAGG ATGACTTCAA TCTTGTCTTG ACCGACATAC AAATGCCAGA AATGACTGGG ATAGAGTTGA GCCTTGCAAT ACGTAAGCTA CCCTTGAAGA GACAGCCGTT GATTGTTGGC TTAACCGCTG ACACGAGCGA TGCAGGAGAT GCGCGCTGTA CTCAAAGTGG TATGGCGACT ATTTTAAGGA AGCCTATCAC AACAAATCAG CTCCACCACT TCTTACAAGA GGTAGAAGTC GAGTAA
|
Protein sequence | MYYGDLINGI CYTAFLPVYF YLWLQEKDLG RRSAASSVVA AFAWTAAFSW LSYFLKKPNL SVMALMGMSS SIYYMNIHQW PHADELVVLP QFLWSLLILS ANRPLTFPFQ FALAMVALLT AVYVAAKESS QAVLDTLPVL GGATSMHIWF HYYATYLRKK SHGFVSAHVA SLVLAGSFAY HASSEIVTMY RSPGYGSQGG YSILRAGFFA LVGLAAAGSF QLEMDSKEAL ERLVDERARE LTKQAAHLRV LEHALQASET AIAIVDVSQK VLWSNSSLQE LVAVSPVKLQ DSNLFKALQY PDDKVVQSFP PVHAITEEIV LRKRHMSVEM TPFPADAKKN NENRFLVALK DMTSQRARER AEKAAEREAL IAQTMNESMQ NLSHELRTPL QGIMGMASLV LDEPGLPPDV TESMSMVIAS ARLLLTLINN MLDVRKCDAS MLDEFQLTPF RLVSSLEDAI TFCKPFAITS EVKLDLEIND DEMEVESNDL RFPQIMINLL SNAIKHSCSG DQVIIRASSM ELSEAEQLVD RALVIAPGEP SSKAFRKEDG PVAVVTVTDQ GPGIPFDQRM RVFGRFSQLT ENVQNAIIGS KVGQPSGTGL GLNLCMKFVH RMNGRIWVTN NPEKGCSFSF YVHRVLQYDR MREVSLPRTW SQSIHQVLRP VPRIPLTKEF RIVLVDDTLI NLKVLSRMLS RLGVQKMATA NNGKEALDLL TTEDDFNLVL TDIQMPEMTG IELSLAIRKL PLKRQPLIVG LTADTSDAGD ARCTQSGMAT ILRKPITTNQ LHHFLQEVEV E
|
| |