Gene Haur_1161 details


Gene Information

Locus tag: Haur_1161
Symbol:
ID: 5733054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1333384
End bp: 1334664
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 49%
IMG OID: 641278301
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001543937
Protein GI: 159897690
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
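
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1333384, 1334664)
print(length, protein_length_aa(length))  # prints: 1281 426
```

Under these assumptions the result agrees with the listed Gene Length (1281 bp) and Protein Length (426 aa).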


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCAAAA CGATCATTAA TCGAGCCAAA AATCATGCCG ATCCTGCTAT TGTAGTTCAT 
TATCAACACC TGCTGGAGCA CAACAAACGC CGCTTGATCG AGTGGATTAT GCTGTTAGCT
GGTGGTTTGG CTTTGCCATT TACGCTTGTA TTAATCGTAG CGGTGGCCAA TCATCAGCAG
CCAAGTAGTG TTTTGGTTTT ACACCTTACC CGTAGCCTGC TCAATCCTTT ACTAGTTTGG
TGGTTGCTAC AGCGTAAACA AATTAATTGG GCTTGGCATT CTACGATGGT GTTTGCAATG
GCACATAATA CGGTTTTAGC CTATGTGATG CACTTGCCAA ATGTAATTAT TGTAGAGCTG
TTTGCGTTGG CCGGTTTTGC GGTGGTGATG CCATTTTGGC AGGTGTTGGC GTACATTGGC
GGGCTGATTG GGCTGAATTA TTGCTTTGCA GGGCAATTTA TTGTGCTCAA TGAATGGGCT
TTGGTGATGA TTGTCGTGCT GAGTATTGTG TTGATGTGCA GTACGATTGG CTTTGTTTCG
CGCCAAACCT TGTGGCATGC CAGCCAACAA CATAGCCAAA CCGCTGAGTT GGTGCAACAA
CAGAGCAGCA TGCAACAGCA ACTTCACGAT TTACAAACCC ATGTGCAACA ACTGAGTTTG
CTGAAACACG ATTTGCGCCA GCCCTTGAAA AGCGTTCAAG GCTTGTTGCA AGGCTTGGCT
TTTGAACAAC CAAGCACGCA TAGCACGATT CAGCCAGCGC TAGCCGCAAC CCAACGAGTC
GAACGTCAAC TTAATAATTT GCTCGATCAA GCCCGTCAGC AGCTTGGTCG CCAACGGGCA
AGCCTCGAAA TCATCGATGT ACAGCACTGT TTGGGGCAAT TGCAGCCAGC AATCAGCGGT
TTGGCGGCCT ACTACAGCGA GCCAATTGTC AAAGTGCAAC TTGAAATTGA CGCGGGCAGC
GTGATTGTGG CCGATCGTGA GCAATTTGAG CGAGCCTTAT TCAACTTGCT CGATAATAGT
TTGAGCCGTT GCCATCACGA GGTACGTATC AGTAGCTATC GATCGGAGCA AAACGTGATA
ATCGAAGTTC GTGATGATGG GGCGGGCATG CATCAAGCCT TACGCACGGC GCTGAATCAG
GCTGATTTTA GTGCGATTAA GCAAGGCTTG GGTTTGAAGC AAGTGCAACA GATGCTTACG
CAGGCCCAAG CGTGGCTGCA TGTGCCCGAC GTTGCGATAG GCTGCACGCT TCAACTGCAT
TTTCCACAGG CTACCCAATG A
 
Protein sequence
MLKTIINRAK NHADPAIVVH YQHLLEHNKR RLIEWIMLLA GGLALPFTLV LIVAVANHQQ 
PSSVLVLHLT RSLLNPLLVW WLLQRKQINW AWHSTMVFAM AHNTVLAYVM HLPNVIIVEL
FALAGFAVVM PFWQVLAYIG GLIGLNYCFA GQFIVLNEWA LVMIVVLSIV LMCSTIGFVS
RQTLWHASQQ HSQTAELVQQ QSSMQQQLHD LQTHVQQLSL LKHDLRQPLK SVQGLLQGLA
FEQPSTHSTI QPALAATQRV ERQLNNLLDQ ARQQLGRQRA SLEIIDVQHC LGQLQPAISG
LAAYYSEPIV KVQLEIDAGS VIVADREQFE RALFNLLDNS LSRCHHEVRI SSYRSEQNVI
IEVRDDGAGM HQALRTALNQ ADFSAIKQGL GLKQVQQMLT QAQAWLHVPD VAIGCTLQLH
FPQATQ
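
The relation between the gene sequence and the protein sequence can be checked with a short script. This is a sketch only: the helper names are hypothetical, the codon table is built from the standard codon assignments (which translation table 11 shares for amino acids and stops, differing in the permitted start codons), and alternative start codons are not handled. Bases other than A, C, G and T raise a KeyError in translate.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-letter block formatting."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGCTCAAAA CGATCATTAA TCGAGCCAAA"))  # MLKTIINRAK
print(translate("TTTCCACAGG CTACCCAATG A"))           # FPQATQ
```

The final line can be translated on its own because it starts at position 1261 of the gene, which is in frame. Passing the full gene sequence to gc_content gives the value that the GC content field reports as a rounded percentage.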