Gene Hhal_0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0097 
Symbol 
ID4711382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp112981 
End bp114321 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content69% 
IMG OID639854555 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_001001694 
Protein GI121996907 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR02966] phosphate regulon sensor kinase PhoR 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGCA ATCCCTGGCC AGGGTATCTG CTGACGCTGC TCGGTGTGGC CACGCTCTGG 
GCGGGCGTGG GGCTGGCCAC CGGGTGGTGG GTGGAGTCCC TGGCGGCCCT GCTGGCGATG
GGCTACCTGT GGAACGAGTA CCAGCTTTGG CGGCTCGAGC GGTGGCTGCG CGTCGGCCGC
AAGCTCGACC CGCCGATCTC CCGCGGGGTC TGGGGGGAGG TGTTCCACGC CCTGCATCGC
CGACAGCGCC GACAGCGTGA GCGCCGGCAC CGACTGCGCC GGGTGATCCG GGAGTATCGT
GACTCTGCCA AGGCGATGCC GGATGCCACG CTGGTGCTCT ATGCCGACGA TCGGGTCCTC
TGGTGGAACA ATGCCGCGCG CCACCTGTTG GGGCTGACCT GGCCGGGGGA TGAGGGGCAG
CGGATCAGCA ATCTGCTCCG GCAGCCGGAG TTCAAGGAGT TCCTGGATAC CGCCACCGAG
CACACAGCCC CGCTGACCAT GCCGTCGCCG GTCAGTCCGC GCAGGACCCT GGAGTTGCGG
CTGGTCCCCT ACGGGCGCAA TCAGCGGTTG CTCATTGCCC GGGACATCAG CCGCACACAG
CGGCTGGAGA CGATCCGGCG GGATTTCGTC GCCAATGTCT CCCACGAACT GAAGACCCCG
CTGACCGTGA TCTACGGCGT TGCCGAGGAG ATGGGCGAGG AGCTAGCCCC GGAGCAGCCC
GCCTGGGCCG GGTCGATCCG CTTGCTCCAG GATCAGTCAG CGCGGATGCA GCGCCTGGTG
CAGGATCTGC TCACCCTGTC CCGCCTGGAG ACCGGATCGC TGGCGGTGGA AGAGGATCCG
GTGGATATGA AGGTGCTGCT CGAGGAGGTC TGCGCCGAGG CGCAGACTCT ATCCGGGGAG
CACCAACACC GGATCGAACT CGACGCCGAG CCGGGTGTGT TCGTCCGGGG CTCCTACGGG
GAGTTGCGCA GCGCGGTGTC GAATCTGGTC TCCAACGCCG TGCGCTACAC GCCGGCCGGC
GGCACCATCC GCGTGCGGTG GCGCGCCGAT GCCCACTCGG CCCGTTGCGG GGTGGCGGAC
AGCGGGATCG GCATCCCCCG CGAGCACATC CCGCGGATCA CCGAGCGCTT CTACCGCGTC
GACAAGGGCC GGTCCAACGC CACCGGCGGC ACCGGTCTGG GGCTGGCCAT CGTCAAGCAC
GTGATGCACC GCCACGGTGG CTGGCTGAAC GTCGACAGCG AGCCCGATCA GGGATCGACC
TTCACGCTGG TCTTCCCGGC GCGCATCCTG GAACACCGGG GCGGTTCATC CAATCGTCAC
CTCGCCTTCA CCGGCCGGTA A
 
Protein sequence
MTRNPWPGYL LTLLGVATLW AGVGLATGWW VESLAALLAM GYLWNEYQLW RLERWLRVGR 
KLDPPISRGV WGEVFHALHR RQRRQRERRH RLRRVIREYR DSAKAMPDAT LVLYADDRVL
WWNNAARHLL GLTWPGDEGQ RISNLLRQPE FKEFLDTATE HTAPLTMPSP VSPRRTLELR
LVPYGRNQRL LIARDISRTQ RLETIRRDFV ANVSHELKTP LTVIYGVAEE MGEELAPEQP
AWAGSIRLLQ DQSARMQRLV QDLLTLSRLE TGSLAVEEDP VDMKVLLEEV CAEAQTLSGE
HQHRIELDAE PGVFVRGSYG ELRSAVSNLV SNAVRYTPAG GTIRVRWRAD AHSARCGVAD
SGIGIPREHI PRITERFYRV DKGRSNATGG TGLGLAIVKH VMHRHGGWLN VDSEPDQGST
FTLVFPARIL EHRGGSSNRH LAFTGR