Gene Information |
Locus tag | PHATRDRAFT_44514 |
Symbol | LHK |
ID | 7197731 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 780709 |
End bp | 783698 |
Gene Length | 2990 bp |
Protein Length | 850 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | light histidine kinase |
Protein accession | XP_002178303 |
Protein GI | 219115015 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CTATGAACGA GTAACATTGC AATTTTATCT AGCAAATGTG TCGCTAGGAA GAGCGGCAGA GCATCAAGAC TTCTTCAAGA GGAACAACAG TTTACAGCAT CAACAGCAAA TTTGCCAGAT TTGACAGGTC CTCCGCAGCA ATCGCGAGCA TTGACTTCAC ATCTCAATTC AGTTTACCTT GCTGCCTGTT AACGTCGCTT AGCAGTGGAC ATGTCTCATT CGCAGACAAC GCCGGCGTCA AATAAGGGTG TGTCGGTGAT GGAGTTTCGG ACAGAGGAAG CTCTGTTAAA GCATTTGATT GACGACATCG AGAGCATACC GAGCGATCCA TCTTTGGAGC AGCTTTGCTC GAAAGCTAAA ACATCTATCA GTAAGACTCC TTTTTGGATC TTGTCTGCTC CTGACGAGGA CCATACAAAG TCGAAATCCG TTGAGGACGA ATTCCGCCGA CTTCAGACGT TGAAAAGTTA CAATGTTTTG GATGAAATGT TTGACGAAGA CTTGGATAGG CTAACAGCAA TGGCGAGCCG TATGTTTGAC GTCCCTATTG CTCTTGTAAG CCTGGTTGAC CTAGGAAGAA TCTGGTGCAT TTCAAATCGC GGCCTCGGAG AAGTACGCCA GATACCCAGG AAAGATTCAT TCTGCGCGCA TGCGGTGCTT AGCAACTCCA AGTCCTTTAT TGTTCCTGAT GCTAGCAAAG ACAGCCGCTT TTCGACGAAC CCGCTGGTTA CTGAAGGCGG AATTCGTTTC TACGGCGGAG CGCCGTTGGA GACACCAGAA AATTATAGGA TAGGTAACTT TTGTTTGCTG GGTACCAAGC CTCGCCCCGA AGGCCTTTCC GAAGCTGAGT TAATGACGTT GGTAGACTTT GCTGCCATGG CAGTCAAGGT ACTTGTGGAT CGTCGGTATC GGAGGGATAC GGAAATCAAA CAGGATTTGA TTGTAGCACA TACGGCACAC GATCTCATCA CTCCGCTTAC AGGACTTCAA ATGTCCTTGA ATCTTCTGAA CGAAGATGTA TCCTTGACCA GTAAAATGGA CGACGAACAA CAAGAACTCC TTTCAACTGC TATTGTCTGC ACTGATATTA TGAGCCGGTT CTGCCACGTT ACCTTGGCTG ATCTAAAAAA AGTGGACCCA CCCGCCGTAA CAATGACAAG CGAATGCTCA CTATCTCATT CCAATTGTCA TGTCAAAGAC CTCTTCCGCT GCCTGCATCA GGTAATCAAT ATCATGGCTA GTGTGCTAGC GATCCGACAT ATTTTAGCTA ACTGTTTTGA CCTTCTTTTT CTACTCAGAC CATTGCTCCT CTTTCGAAGA AAGTTCCGAT CGTCTTCTCC ATGGAAACAT CCTTAATGAA TCTCGTGGCG TGTGATGGAC TAGCTCTTTT CCGTTTTGCT TTAAGTCTTC TCTCTGCAGC ATGTGACCGA ACTAAAAGAG GAAGTATATC CTTGCGCCTT TTTACTCTCA AAGAAGAAAG CACCATAGCT ATTGAGTGTG AGGACACTGC TACCTCTTTG CTCGCCAAAG ACGTTAATTT GTTGTTCGAC AACAGTCCTG ATGTGTGTGG ACTCACAGAT CCTGCACTCG GTGGGATACA GTTTGCGATT TCACTTGCGC ACTCAATCGG TAGTTCCTGC GGGGTACGTG GTCGCTCGCT TGTCGACGAT CAACAGGATA GCGACTCCGT TGATGGATCA GTGTTTTGGT TCCATCTTCC GGCATCGTTT GATGGTGACT ATTGGAATGA AAATAGTGAG ATTACTGGGG AAGTAGCGTC GAGGTCGGCT TGCCATGAGG 
TCGCTACTGG AACACCTAAT TCGGACTGCT TCTTTCAAAG CGACTTGACC CAAACAATGA ACTTGCTAGA GCCAAAGGAA AATACTTTAC CTTTGACGGG TACAGAGCTC CCATGTTCGC CCAAAGAAGA CTCTTTGGAT TTTTTTTGGG ACTGAAGCTG CTTCGAAGGG CTCTAGTTAT CGATGATTCC TTGGTTGTAA GAAAAACTTT AGTGCGAGGT TTGTCGAGTC TAGGATACGA AGTTGAAACA GCAAGCGACG GGATGGAGGG GCTCAATGCA TTGAAAAAAC GTATTTACTC GGTTTGTTTA TGTGACTTTT TGATGCCTAT CATGGATGGA TTGGATTGTG TTCAGCAGTA TCGAGTGTGG GAAGCAGAAT ATCGTTTTAG CCATCGACAA ATTATTGTTG GTATTTCAGC ACACGCGACC AAAAAAGACT CAAACATCGC AAGACAGATT GGCATGGATG ATTACATGGC AAAGCCCATC ACCATCAAAA TGCTCAAAGA TCTGGACGAG AGTGATCTCG TTCGGCATGC TACGACGCAA AATCAAAACC TTACGAGTCT TAAAGAAGAT ACAGCCGGGA TCGAATTGCC TAGAAAACGA GCCCTGTCAG TTGGATCAAA GTACGTGCCG GTTCCTCGTC TAATGAAATC CAAGTTTACA CTTCCGAAAG TTAAACGAAA TGAGCTAATT GACCACCCTC GTTGCTTAAT AGCAAAACTT GGCTCCATCG ATGGTTCATC AGACCTCACA CAACTGTTTA CAAGGAGTGG TTGGCAGTGT GAAGTTGCGA GTAATGAGGC TGAGCTGTCG TTGATGCTCC GCAATCGAAA CTGGGATGCT GTACTTATCG ACGACTGCAT GTTTGAACGC AATCAGACTT GCATCTTGGA ATTTCGTGTT TGGGAAACAA AGAACCGTAT CAATCGGCAA AGAAATCTAT TTCTTCATTT TGCAATAGCC CAACAGACGA CAAAGCTTGC CATGGAAACT ATGTTCTTGC TAGCGCCTGA AGGATTCGAC GGAGCTATCC AGATACCGCT TCTGTGGGAC GACTTTGTGT CAATTTTTAA ATCAATGAAC TCAGATACCA GAACTATGTC CATAATAACA CGATAGAAAA GATTTGTTTT GCCAATAATG GTAACTGTAA GTGTTAATAA AGAACGTCTT CAACTGGTTC
Protein sequence | MSHSQTTPAS NKGVSVMEFR TEEALLKHLI DDIESIPSDP SLEQLCSKAK TSISKTPFWI LSAPDEDHTK SKSVEDEFRR LQTLKSYNVL DEMFDEDLDR LTAMASRMFD VPIALVSLVD LGRIWCISNR GLGEVRQIPR KDSFCAHAVL SNSKSFIVPD ASKDSRFSTN PLVTEGGIRF YGGAPLETPE NYRIGNFCLL GTKPRPEGLS EAELMTLVDF AAMAVKVLVD RRYRRDTEIK QDLIVAHTAH DLITPLTGLQ MSLNLLNEDV SLTSKMDDEQ QELLSTAIVC TDIMSRFCHV TLADLKKVDP PAVTMTSECS LSHSNCHVKD LFRCLHQTIA PLSKKVPIVF SMETSLMNLV ACDGLALFRF ALSLLSAACD RTKRGSISLR LFTLKEESTI AIECEDTATS LLAKDVNLLF DNSPDVCGLT DPALGGIQFA ISLAHSIGSS CGVRGRSLVD DQQDSDSVDG SVFWFHLPAS FDGDYWNENS EITGEVASRA KGKYFTFDGY RAPMFAQRRL FGFFLGLKLL RRALVIDDSL VVRKTLVRGL SSLGYEVETA SDGMEGLNAL KKRIYSVCLC DFLMPIMDGL DCVQQYRVWE AEYRFSHRQI IVGISAHATK KDSNIARQIG MDDYMAKPIT IKMLKDLDES DLVRHATTQN QNLTSLKEDT AGIELPRKRA LSVGSKYVPV PRLMKSKFTL PKVKRNELID HPRCLIAKLG SIDGSSDLTQ LFTRSGWQCE VASNEAELSL MLRNRNWDAV LIDDCMFERN QTCILEFRVW ETKNRINRQR NLFLHFAIAQ QTTKLAMETM FLLAPEGFDG AIQIPLLWDD FVSIFKSMNS DTRTMSIITR