Gene Haur_0852 details


Gene Information

Locus tag: Haur_0852
Symbol:
ID: 5732753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 962161
End bp: 965328
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 11
GC content: 51%
IMG OID: 641277984
Product: GAF sensor hybrid histidine kinase
Protein accession: YP_001543628
Protein GI: 159897381
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
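
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; neither convention is stated in this record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(962161, 965328)
print(length)                  # 3168
print(protein_length(length))  # 1055
```

Under these assumptions both results agree with the Gene Length (3168 bp) and Protein Length (1055 aa) fields.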


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACGATG GTGATCGCGC GGTGTCAACC ATTTGGCATG AGCAACTTGA GTTGTTAACA 
AAAACCGCTC AAGAACATTT TTGCCTACAC AATGCCGAAT TGTTGATTCA ACGCATTTGC
GCACTGATCA GCCACTATTT GCAAACGCCG CGGGTCACCA TCGCCCTCAT TCAACACGAT
AGTTTTGTGG CAGTTGCCAG CGCCCTTGGC CCTTTAGAAG ACCCAAGCTA TCAGCAAGCA
CCACGTTTGG TCTACAGCGC TGGAGCGGCT TGGTCGGCGG TTTGGCAACA GCAACGCAGC
ATAATCACTC CTGCGCCAAC CCCTGATGGA ATGTGCCAAC TCAGCGTGCC AATTCTGCGC
AATGCTCAAT TACTGGGATT AATCGATATT CAAAGCCCAC AGCCTGAGCA TAAACTAAGC
CTTTTACAGC CAATTATCGA GATGATCGCC CAGCAATTGG CGCTAGCCTT GGGCACACTT
CCTAGCCATC GTCAACATTA TCGCGAAGAT CAATGCACCC AAATCATCGA AAAAATCAAT
CTGCATATTT TGAAACATCT TGATGTATTA GCTGAGCCAC AACAGATTAT TGAATTAATT
TGGCGTTTTT TGCCGTTGCA AGGCGCTGCA TTGTATTTAT ACGATGAACA AGGTTTGGGC
TTATTAACCC ATATCAATGC AGCCCATGTG CCAACTCAGC GTGGAATTCA GCCGCTTAGC
CCCAGCTATT ACGAGCAAAT CGGCCAGCAA TCAAGCCATA ACTCGGCCAA AATGTTGCTA
GCAGGGCAGC ATGGCATCGA TATTCATGTA CCGGGCAATC AAGCCAATCT TGGTTTGTTA
CACTTAATTC CTGCCACACC GTTTGAAGCT AATGATTTGC TGGCCTTGCA AGAGTTTGGC
GCACGCTTAG GCTACATTTT AGAGCATAAT CGGCTGTTTC GCTCGATGTA TGTGGCCAAT
GAACGCAATG TGCTGTTTGC CCGCATCGTC AGCCATATTC GCCAAACCAT GAATTTGCGC
GAGGTCTGGC AGGATGTATT AGCAACCCTC GGTTTGGGTT TACGGGCCGA TTTCTGCACC
GTGGCTTTGT ATGAGCAAGA GCAACGCTTG CGCTTCCATG GAACCTACAC TACCTTGCTG
CTACCCGATC ATCTGACGGC TGCCGAGGCT TTGTTGCATA GCGAAATGCT GCAAGCTTTT
CAGCAAAAGC GCTCGCTGAC GATCGACGAA TATCACGATA GCAATGACCC TGAATTGCGC
GAACGGCTGC TGGCCTTGAA TATTCATGCC TTGGCTTGGG TTCCCTTGGT CGCCAACGGC
CAATGGCTGG GCTTTGTTTG CGTCTACAAA GTGCAGCGCC CCTTCCTCTG GACGCTTGAC
GATTTGCGCT TAATCAACGA TGTAGCCGAT CAATTGGCCT TAGCTTTGCG CCAAATGCAG
CTCTACGAGG CCGAACGCCA ACGCCGCCGC GAATTAGAAG CCTTACAAGA AATTATTCGA
GCGATCAGCG GCGAACTCAA TTTACATGCT TTATGTGGCA ATGTGGTCGA AAAAGTAATC
GATGTGTTTA AAGTTGCCGC CGCTGCGGTG CTGATGTGGC ACAACGATGG CAGCGCCATG
CAAGTCGTGG CCCACGCAGG TTTTTCCGAG CGCTATCTCG ATTCGCTGGA GATGAACACC
GATACAGTGA ATTATTGGAT TTCGCGCTTC AAGCCACCAA CACCCTTGTA TATTAGCGAT
ATTCGGCGTG TTTCGCTGAT TGGCGGCGAT CCAGCTAGCG CCGAGGGTTT GTCATCACTC
TTTGCTCAAC CACTGATGGT TGATGGGCGG TTTAGTGGTT GGTTGCAAAT GTACAGCCGT
GGCAATGTGC GGGTTTGGAA TCCTGAGGAA AGTCATCTAG CGGCCTCGAT TGCCCAACAA
ATTTCTCAAG CAATTCATAC AGCCCGGCGT TATGAGCAAG AGCATTTGCT ACGCACCGAT
GCCGAGCAAT CGTATTACCA GTTGCGCAGC GTGCTTGACG AGCTAGAAAA CACTCGCGAA
ACCTTGATCA ATTCGGAAAA GCTGCGGGCT TTGGGTCAAC TTGCCAGTGG CGTTGCCCAC
GATTTCAACA ATCTCTTAGC TTCAATTTTA GGTAATGCCC AATTTTTGCT GATTCACGAA
GATGATGCCG ATCGTCGCGA TGCCCTGCAA GTAATCGAAC GTGCCGCCAA AGATGGGGCG
GTCACGGTGC GCCGGATTCA AGAGTTTGCG CGGGCCTCCG AAACAATCTA CGACGATATT
GTTGATTTGC GTGATGTGGT TAATATTGCG CTCGAATTTA CCCGCCCTTC GTGGCGTGAT
AAAACCCAGC AACGTGGGAT TAAGCTCGAT ATTAGCACCC ATTTGGCTGC CGCCCATGTT
CAGGGTAGCG CCGCCGAATT ACGCGAAGTC TGCGTTAATT TAGTGGTCAA TGCAATTGAT
GTCTTGCCCC AAGGCGGCAC AATTACGATC AGCACTGGCA CAACTGGCGA GTGGTCGTAC
TTTACGATTG CTGATAACGG CCCAGGCATC GCCCCCGAAG ATCGCACCCG TATTTTCGAG
CCATTTTTCT CGACCAAGCC AATTGGCGAG GGCACAGGCA TGGGCTTGGC AGTGGCATTA
AGCATTGTGC AACGCCATCG TGGCAAATTA TTGACTGAAG ATGTGTATCC GCATGGCGCT
CGCTTCGTGG TGTTACTGCC AATTCATCAT GCACCCCAAC CAAAACCCCG CCCCGTTGCG
ATTGTGCCCA AAACTGCGGC CCAGCGTATT TTGGTGGTTG ATGATGAGCC AGCAGTACGC
AATATCGTGG CCAAAGTATT GCGCCACGAT CAGCATGAAG TGACCTTGGC TGGCTCTGGT
GAAGAAGCTT TGCGCTGGAT CGACGAACAA GCCTTTGATC TAATTATTTC CGATTTGGGA
ATGCCTGGGA TGAACGGCTG GGATGTGCTT GAACAAGCGC GACAACGCCG CCCAAATATC
ATCGCGATTT TAATTACTGG CTGGGGCTAC CAACATGATG CTGATTACGC TGCGGCGCGA
GGTGTTGATA GCGTGCTCGG CAAGCCCTTC GAGATGCAAA CCCTACGCAG CACCGTCGCC
GATTTGATTC AAGCTCGCAA CACACAAGGC CCAAACCGTG TATCATAG
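
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGTACGATG GTGATCGCGC GGTGTCAACC ATTTGGCATG AGCAACTTGA GTTGTTAACA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 2))
```

Running the same helpers over all lines of the gene sequence should give a value that can be compared with the GC content field of this record.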
 
Protein sequence
MYDGDRAVST IWHEQLELLT KTAQEHFCLH NAELLIQRIC ALISHYLQTP RVTIALIQHD 
SFVAVASALG PLEDPSYQQA PRLVYSAGAA WSAVWQQQRS IITPAPTPDG MCQLSVPILR
NAQLLGLIDI QSPQPEHKLS LLQPIIEMIA QQLALALGTL PSHRQHYRED QCTQIIEKIN
LHILKHLDVL AEPQQIIELI WRFLPLQGAA LYLYDEQGLG LLTHINAAHV PTQRGIQPLS
PSYYEQIGQQ SSHNSAKMLL AGQHGIDIHV PGNQANLGLL HLIPATPFEA NDLLALQEFG
ARLGYILEHN RLFRSMYVAN ERNVLFARIV SHIRQTMNLR EVWQDVLATL GLGLRADFCT
VALYEQEQRL RFHGTYTTLL LPDHLTAAEA LLHSEMLQAF QQKRSLTIDE YHDSNDPELR
ERLLALNIHA LAWVPLVANG QWLGFVCVYK VQRPFLWTLD DLRLINDVAD QLALALRQMQ
LYEAERQRRR ELEALQEIIR AISGELNLHA LCGNVVEKVI DVFKVAAAAV LMWHNDGSAM
QVVAHAGFSE RYLDSLEMNT DTVNYWISRF KPPTPLYISD IRRVSLIGGD PASAEGLSSL
FAQPLMVDGR FSGWLQMYSR GNVRVWNPEE SHLAASIAQQ ISQAIHTARR YEQEHLLRTD
AEQSYYQLRS VLDELENTRE TLINSEKLRA LGQLASGVAH DFNNLLASIL GNAQFLLIHE
DDADRRDALQ VIERAAKDGA VTVRRIQEFA RASETIYDDI VDLRDVVNIA LEFTRPSWRD
KTQQRGIKLD ISTHLAAAHV QGSAAELREV CVNLVVNAID VLPQGGTITI STGTTGEWSY
FTIADNGPGI APEDRTRIFE PFFSTKPIGE GTGMGLAVAL SIVQRHRGKL LTEDVYPHGA
RFVVLLPIHH APQPKPRPVA IVPKTAAQRI LVVDDEPAVR NIVAKVLRHD QHEVTLAGSG
EEALRWIDEQ AFDLIISDLG MPGMNGWDVL EQARQRRPNI IAILITGWGY QHDADYAAAR
GVDSVLGKPF EMQTLRSTVA DLIQARNTQG PNRVS
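
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship, assuming the codon-to-amino-acid assignments of table 11 match the standard code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). `CODON_TABLE` and `translate` are hypothetical names, not part of any database tool.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest;
# the amino-acid string lists the matching residues ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGTACGATG GTGATCGCGC GGTGTCAACC"))  # MYDGDRAVST
```

The output matches the first ten residues of the protein sequence, and the last nine bases of the gene (GTATCATAG) translate to VS followed by a stop codon, matching its final two residues.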