Gene Hhal_1216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1216 
Symbol 
ID4710406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1320166 
End bp1321239 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content68% 
IMG OID639855689 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_001002793 
Protein GI121998006 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGATC ACGAGCGCGA CCCATCGCAG ATCCTCGAGG AGCTGACGAC CGCCGTCCTG 
CTGGTGGATG ACCGGGTGCG TATCCTCCAC GTCAACCACG CCGCCGAGAC GCTGTTCCGG
GTCAGTAGCC GACAGGTGGT GGGCCAGACC CTGGGGACGG CACTGCGCGG GGCGGAACTG
CTCGAAGAGC TCATCCGCCA GACCCAGCGT ACCGGCGGGG CCTACACCCA GCGCGAACGT
CGCCTTCCGG TCCGCGGCGA TCGCCCGGTG ACCGTCGACT GCACCATCAC CCCGGTCTCC
GCCAAACGGG TGCTGATCGA GATCGCCGAG GTGGACCGTC ACGCCCGTAT TACCCGCGAA
CAGCACCTGC TGTCACAGAA TCGTGCCGTA CAGGAGCTGA TCCGCGGACT GGCCCACGAG
ATCAAGAACC CGCTCGGCGG CCTGCGCGGC GCGGCGCAGT TGCTGGAGGC CGAGCTCCCG
GAGCGCGACC AGCGCGAGTA CACCCAGGTC ATTATCCGCG AGGCGGACCG CCTGCAGCAG
CTGGTGGACG CCCTGCTCGG TCCCAATGCG CCGGCCCGTG AGGAGCCGGT CAACATCCAT
GAGGTCCTCG AGCGCGTCCG TTCGCTGGTC ATCGCCGAGG ATGCCGAAGG CCGCGCAGAG
GCGCCAGCGG TGGCGCTCCA GCGCGACTAC GATCCGAGCA TCCCGCCGGT CACCGCCGAG
CACAATCACC TGGTCCAGGC GGTGCTCAAC CTGGTGCGCA ACGCCCGTCA GGCCACCGGG
CCAGGGGGGA CCATCACCCT GCGTACGCGC ACCCAGCGTC AGTTCACCAT CGCCGACAAG
CCCCACCGCC TGGTGGCCCG CATCGACATC ATCGACGACG GGCCGGGGAT CCCCCTCGAT
CAGCAGGAGC AGATCTTCTA TCCGATGGTC ACCTCGCGCC CCGAGGGTAC CGGTTTGGGA
CTGCCCATTG CGCAGAGCCT GGTCAGCCGC CTCGGTGGCC TGATTGAGTG CGTCAGCGAA
CCGGGGCGCA CGGTGTTCAC CATCTGGTTA CCCATGGAGA CGGAAAATGA CTGA
 
Protein sequence
MEDHERDPSQ ILEELTTAVL LVDDRVRILH VNHAAETLFR VSSRQVVGQT LGTALRGAEL 
LEELIRQTQR TGGAYTQRER RLPVRGDRPV TVDCTITPVS AKRVLIEIAE VDRHARITRE
QHLLSQNRAV QELIRGLAHE IKNPLGGLRG AAQLLEAELP ERDQREYTQV IIREADRLQQ
LVDALLGPNA PAREEPVNIH EVLERVRSLV IAEDAEGRAE APAVALQRDY DPSIPPVTAE
HNHLVQAVLN LVRNARQATG PGGTITLRTR TQRQFTIADK PHRLVARIDI IDDGPGIPLD
QQEQIFYPMV TSRPEGTGLG LPIAQSLVSR LGGLIECVSE PGRTVFTIWL PMETEND