Gene PHATRDRAFT_44514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44514 
SymbolLHK 
ID7197731 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp780709 
End bp783698 
Gene Length2990 bp 
Protein Length850 aa 
Translation table 
GC content45% 
IMG OID 
Productlight histidine kinase 
Protein accessionXP_002178303 
Protein GI219115015 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTATGAACGA GTAACATTGC AATTTTATCT AGCAAATGTG TCGCTAGGAA GAGCGGCAGA 
GCATCAAGAC TTCTTCAAGA GGAACAACAG TTTACAGCAT CAACAGCAAA TTTGCCAGAT
TTGACAGGTC CTCCGCAGCA ATCGCGAGCA TTGACTTCAC ATCTCAATTC AGTTTACCTT
GCTGCCTGTT AACGTCGCTT AGCAGTGGAC ATGTCTCATT CGCAGACAAC GCCGGCGTCA
AATAAGGGTG TGTCGGTGAT GGAGTTTCGG ACAGAGGAAG CTCTGTTAAA GCATTTGATT
GACGACATCG AGAGCATACC GAGCGATCCA TCTTTGGAGC AGCTTTGCTC GAAAGCTAAA
ACATCTATCA GTAAGACTCC TTTTTGGATC TTGTCTGCTC CTGACGAGGA CCATACAAAG
TCGAAATCCG TTGAGGACGA ATTCCGCCGA CTTCAGACGT TGAAAAGTTA CAATGTTTTG
GATGAAATGT TTGACGAAGA CTTGGATAGG CTAACAGCAA TGGCGAGCCG TATGTTTGAC
GTCCCTATTG CTCTTGTAAG CCTGGTTGAC CTAGGAAGAA TCTGGTGCAT TTCAAATCGC
GGCCTCGGAG AAGTACGCCA GATACCCAGG AAAGATTCAT TCTGCGCGCA TGCGGTGCTT
AGCAACTCCA AGTCCTTTAT TGTTCCTGAT GCTAGCAAAG ACAGCCGCTT TTCGACGAAC
CCGCTGGTTA CTGAAGGCGG AATTCGTTTC TACGGCGGAG CGCCGTTGGA GACACCAGAA
AATTATAGGA TAGGTAACTT TTGTTTGCTG GGTACCAAGC CTCGCCCCGA AGGCCTTTCC
GAAGCTGAGT TAATGACGTT GGTAGACTTT GCTGCCATGG CAGTCAAGGT ACTTGTGGAT
CGTCGGTATC GGAGGGATAC GGAAATCAAA CAGGATTTGA TTGTAGCACA TACGGCACAC
GATCTCATCA CTCCGCTTAC AGGACTTCAA ATGTCCTTGA ATCTTCTGAA CGAAGATGTA
TCCTTGACCA GTAAAATGGA CGACGAACAA CAAGAACTCC TTTCAACTGC TATTGTCTGC
ACTGATATTA TGAGCCGGTT CTGCCACGTT ACCTTGGCTG ATCTAAAAAA AGTGGACCCA
CCCGCCGTAA CAATGACAAG CGAATGCTCA CTATCTCATT CCAATTGTCA TGTCAAAGAC
CTCTTCCGCT GCCTGCATCA GGTAATCAAT ATCATGGCTA GTGTGCTAGC GATCCGACAT
ATTTTAGCTA ACTGTTTTGA CCTTCTTTTT CTACTCAGAC CATTGCTCCT CTTTCGAAGA
AAGTTCCGAT CGTCTTCTCC ATGGAAACAT CCTTAATGAA TCTCGTGGCG TGTGATGGAC
TAGCTCTTTT CCGTTTTGCT TTAAGTCTTC TCTCTGCAGC ATGTGACCGA ACTAAAAGAG
GAAGTATATC CTTGCGCCTT TTTACTCTCA AAGAAGAAAG CACCATAGCT ATTGAGTGTG
AGGACACTGC TACCTCTTTG CTCGCCAAAG ACGTTAATTT GTTGTTCGAC AACAGTCCTG
ATGTGTGTGG ACTCACAGAT CCTGCACTCG GTGGGATACA GTTTGCGATT TCACTTGCGC
ACTCAATCGG TAGTTCCTGC GGGGTACGTG GTCGCTCGCT TGTCGACGAT CAACAGGATA
GCGACTCCGT TGATGGATCA GTGTTTTGGT TCCATCTTCC GGCATCGTTT GATGGTGACT
ATTGGAATGA AAATAGTGAG ATTACTGGGG AAGTAGCGTC GAGGTCGGCT TGCCATGAGG
TCGCTACTGG AACACCTAAT TCGGACTGCT TCTTTCAAAG CGACTTGACC CAAACAATGA
ACTTGCTAGA GCCAAAGGAA AATACTTTAC CTTTGACGGG TACAGAGCTC CCATGTTCGC
CCAAAGAAGA CTCTTTGGAT TTTTTTTGGG ACTGAAGCTG CTTCGAAGGG CTCTAGTTAT
CGATGATTCC TTGGTTGTAA GAAAAACTTT AGTGCGAGGT TTGTCGAGTC TAGGATACGA
AGTTGAAACA GCAAGCGACG GGATGGAGGG GCTCAATGCA TTGAAAAAAC GTATTTACTC
GGTTTGTTTA TGTGACTTTT TGATGCCTAT CATGGATGGA TTGGATTGTG TTCAGCAGTA
TCGAGTGTGG GAAGCAGAAT ATCGTTTTAG CCATCGACAA ATTATTGTTG GTATTTCAGC
ACACGCGACC AAAAAAGACT CAAACATCGC AAGACAGATT GGCATGGATG ATTACATGGC
AAAGCCCATC ACCATCAAAA TGCTCAAAGA TCTGGACGAG AGTGATCTCG TTCGGCATGC
TACGACGCAA AATCAAAACC TTACGAGTCT TAAAGAAGAT ACAGCCGGGA TCGAATTGCC
TAGAAAACGA GCCCTGTCAG TTGGATCAAA GTACGTGCCG GTTCCTCGTC TAATGAAATC
CAAGTTTACA CTTCCGAAAG TTAAACGAAA TGAGCTAATT GACCACCCTC GTTGCTTAAT
AGCAAAACTT GGCTCCATCG ATGGTTCATC AGACCTCACA CAACTGTTTA CAAGGAGTGG
TTGGCAGTGT GAAGTTGCGA GTAATGAGGC TGAGCTGTCG TTGATGCTCC GCAATCGAAA
CTGGGATGCT GTACTTATCG ACGACTGCAT GTTTGAACGC AATCAGACTT GCATCTTGGA
ATTTCGTGTT TGGGAAACAA AGAACCGTAT CAATCGGCAA AGAAATCTAT TTCTTCATTT
TGCAATAGCC CAACAGACGA CAAAGCTTGC CATGGAAACT ATGTTCTTGC TAGCGCCTGA
AGGATTCGAC GGAGCTATCC AGATACCGCT TCTGTGGGAC GACTTTGTGT CAATTTTTAA
ATCAATGAAC TCAGATACCA GAACTATGTC CATAATAACA CGATAGAAAA GATTTGTTTT
GCCAATAATG GTAACTGTAA GTGTTAATAA AGAACGTCTT CAACTGGTTC
 
Protein sequence
MSHSQTTPAS NKGVSVMEFR TEEALLKHLI DDIESIPSDP SLEQLCSKAK TSISKTPFWI 
LSAPDEDHTK SKSVEDEFRR LQTLKSYNVL DEMFDEDLDR LTAMASRMFD VPIALVSLVD
LGRIWCISNR GLGEVRQIPR KDSFCAHAVL SNSKSFIVPD ASKDSRFSTN PLVTEGGIRF
YGGAPLETPE NYRIGNFCLL GTKPRPEGLS EAELMTLVDF AAMAVKVLVD RRYRRDTEIK
QDLIVAHTAH DLITPLTGLQ MSLNLLNEDV SLTSKMDDEQ QELLSTAIVC TDIMSRFCHV
TLADLKKVDP PAVTMTSECS LSHSNCHVKD LFRCLHQTIA PLSKKVPIVF SMETSLMNLV
ACDGLALFRF ALSLLSAACD RTKRGSISLR LFTLKEESTI AIECEDTATS LLAKDVNLLF
DNSPDVCGLT DPALGGIQFA ISLAHSIGSS CGVRGRSLVD DQQDSDSVDG SVFWFHLPAS
FDGDYWNENS EITGEVASRA KGKYFTFDGY RAPMFAQRRL FGFFLGLKLL RRALVIDDSL
VVRKTLVRGL SSLGYEVETA SDGMEGLNAL KKRIYSVCLC DFLMPIMDGL DCVQQYRVWE
AEYRFSHRQI IVGISAHATK KDSNIARQIG MDDYMAKPIT IKMLKDLDES DLVRHATTQN
QNLTSLKEDT AGIELPRKRA LSVGSKYVPV PRLMKSKFTL PKVKRNELID HPRCLIAKLG
SIDGSSDLTQ LFTRSGWQCE VASNEAELSL MLRNRNWDAV LIDDCMFERN QTCILEFRVW
ETKNRINRQR NLFLHFAIAQ QTTKLAMETM FLLAPEGFDG AIQIPLLWDD FVSIFKSMNS
DTRTMSIITR