Gene Hhal_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1964 
Symbol 
ID4710347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2160677 
End bp2162104 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content71% 
IMG OID639856437 
Producthistidine kinase 
Protein accessionYP_001003530 
Protein GI121998743 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGCTG GGAGAGATAC GGACCCGACC GGCCCACGCC GGCAGCGGGG GGCAGGAGCT 
GTCGAGCCCC TCACCGTACG GGAGCCCCGT GGGCACGGCC CCCGGGACGG TGGTGGCAGT
GCCGGCCGTT GGCACCCGGT GCGCCGCCTC CGGGCCCCCC CCACCCCCAC GCCCTACCCC
GACACCCCGA ATGTGCCTGC GTCATCGGCC AGCGAACAGG AGGCCCTGCA CCTGCTCAGC
GCCCTCGGCG TGGTGGCGGC CCAGGCGCGG GATCTCGATG AACTGCTTCA GGGCAGCCTG
GAACGGCTGA TCGAACAGAC CGGTGCCACG GCGGCTGCCG TACGACTCTT CGACGAACAG
GGCAGCCTGC GCCTGGTCGA GGCAAGCGGG CTCAACGCCG GCTTTATCGA TGCCGAGCGC
CGCCAGCCGG CGGCGGGCTG CTCCTGCGCC ATCGCCGGGG AACGCGGCAC CGTGCAGTTC
CGGGGCGACC TGCGCCAGTG CATCCGCCGC AGCGGCTGCA ACCCGCTGCC CAACCGACCC
CAGCTGGCCA TGCTGGCGGT GCCGATCCTC GATCCCGCGG GCGAGAGGGT GGGCATCTAC
AACATCTATC TGGAACCGAG CGAGGCCCAG CGCTGGATGC ATCCGCCACG CATGCTGGAG
TGGATCGGGC ACCAGCTGGG CGCGGCGATC GCCCGGGTCC GCGACGAGTA CCGCAGCCAC
CAGGGGGCCC TGCAGGAGGA GCGCAACCTC CTCGCCCACG AGCTGCACGA CACGGTTGCC
CAGGAGGTGG CCACGCTGCG CCTGCGGGTC CGCCAACTGG AGGAGCGGGC ACGCGGCGAT
GCCGACACCG CGGCCCTGCT CGCCCCGCTG GAGGATCTGC GCACCCGACT CGACCACACC
AACGACCAGG TCCGCACGGT GATGCAGCAA TTCCGTACCC AGGCCCTGGG TACACCGCTG
GAGACGGCGC TGAGCCGCCT GGCCAACCGC TTCCGGCGCG ACAGCGGCAT CGAGGTCCGC
CTGATCCATC GTTGGCCGGA GCTCGCCCTG GGCGAGCGCG AGCAGCTGCA CATCCACCGC
ATCGTCGAGG AGGCCCTGTC CAACGCCTGG CACCACGGCG GCGCCCGCAA CGTGCGCCTG
CAGCTCGAGA CCCCCGGCGG GGATCTCTGC CTGCTGATCG AGGACGACGG ATGCGGCTTC
GTGGTGGACG ATGTACCGGA CTCCGACCCG GCCGAGAGCC GGGGACACGG CCTGCGGGGG
ATGCGCGAGC GCGCCCGCCA CCTGGGCGCC ATCCTCACCG TGGAGAGCGA TCCCGGCCAG
GGCACCACCA TCCACTTACG CTTGCCCCAG CCGCAGCGCC TGACCTGGAC CACCAAGAGC
CTTCACCAGG CGGGGTTCAA CGACCATGCG TATCCTGCTT GTCGATGA
 
Protein sequence
MYAGRDTDPT GPRRQRGAGA VEPLTVREPR GHGPRDGGGS AGRWHPVRRL RAPPTPTPYP 
DTPNVPASSA SEQEALHLLS ALGVVAAQAR DLDELLQGSL ERLIEQTGAT AAAVRLFDEQ
GSLRLVEASG LNAGFIDAER RQPAAGCSCA IAGERGTVQF RGDLRQCIRR SGCNPLPNRP
QLAMLAVPIL DPAGERVGIY NIYLEPSEAQ RWMHPPRMLE WIGHQLGAAI ARVRDEYRSH
QGALQEERNL LAHELHDTVA QEVATLRLRV RQLEERARGD ADTAALLAPL EDLRTRLDHT
NDQVRTVMQQ FRTQALGTPL ETALSRLANR FRRDSGIEVR LIHRWPELAL GEREQLHIHR
IVEEALSNAW HHGGARNVRL QLETPGGDLC LLIEDDGCGF VVDDVPDSDP AESRGHGLRG
MRERARHLGA ILTVESDPGQ GTTIHLRLPQ PQRLTWTTKS LHQAGFNDHA YPACR