Gene Haur_3446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3446 
Symbol 
ID5735307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4333613 
End bp4335988 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content51% 
IMG OID641280593 
Productintegral membrane sensor hybrid histidine kinase 
Protein accessionYP_001546210 
Protein GI159899963 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.285515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCG GGTCGGTACG CCGTACAATG ATCACAATTG CAGCGATCAT TTTGAGCTTA 
ATGACCGGAT TAAATATCAT CGGGGTTGTT GAGCGTAGCA TGCTCAATAA TGATATTGAA
TATGTGATCG ATTTAGAAGA TGCCGAACGT TTAGCTCGCG AAATTAGCCT CTACACTCAA
TATCAAGCCC ATGCGCTCGA TGCCTATGCC TTAGGTGAAA TCGAGGAGCG TGAACACTAT
ACCCGTTATC GCCAAGCCTT CGATGACAAA CGCCTAGAGC TTGAACAATT CTTCAAGAAC
ATGCAGCCGA ACCCAGAAAC CAAAACAGCC TTTGAGAATG TACAAAAACT TAGCGCCGAC
TATGAAGATG CCGGAGTTGC CTACCTCGCC CAAATCGACC TACGCATCCA AGAATCAGCG
CCAACCCGCT CAACCGCAGA ACTGGCTGCT TGGAAAATCC TCGACGAACA GGCCGATCAA
CTCGACGAGG CCACCCAAGT GCTCTCTGAT ATCATTGATG ATCAATCTGA AGCACTTGAA
GCAACAATTA CCAAGCAAAA TGGCCGCATG ATTGTGGCAC TGACTGGGCG CAGTTTGGCG
ATACTAGTAT TGCTAAGTCT GTTCGTCTAC TATCTGCTGG GGAGGGTCGG CAATCAATTC
AAGTTGGTAC GCGACGGAGC GCAACGCTTT GCCGATGGCG ATTTTACCAC TGATATTCCA
ATTCGCCGCT ATGATGAAGT AGGCCGCCTC GCTGCGATGT TCAATACCAT GGCCCAAACG
ATTCGTGGCC AAATCGAGCG ACTTGAGCAA GCCAAAGATC ATGCCCAACG CTTGCAATTT
GTGGCCGAAG AAGCCAATCG CGCCAAAAGT AATTTCTTGG CCAATATGAG CCACGAATTG
CGCACCCCAC TCAATGCGAT CATCGGCTAT AGCGAAATTC TCCAAGAAGA ATGTGAAGAC
CTCGGCCAAA CCGCAATGAT CGAAGATCTC GATCGGATTC GCCTCTCAGG GCGGCATCTG
CTGACCTTGA TCAACGATAT TTTGGATTTG GCCAAGATTG AATCGGGCAA GGTTGAAATT
TTGCCTGAGG AAATTTCGCT GCCCCAACTG CTGCACGATG TGCGCTCAAC CGTCGATCCG
ATGATCATCA AAAATGAAAA TCGCTTGGTG ATCGAATCAG CAGCTGGCTT GCTGACGATG
ATTAGCGACG AGACCCGTTT ACGCCAGATT TTGGTCAATT TGCTGAGCAA CGCAGCGAAA
TTCACCGAAC ATGGCCGCAT TACCTTGCGC GTCCAACCAA GCGAAGAAGA GGGCTGGATT
GATTTCAGCG TGCATGATAA TGGCATTGGC ATGAGCAACG CCCAATTATC ACGCTTGTTT
CAGCCATTTA CCCAAGCCGA TGCCTCGACC ACCCGCAAAT ATGGCGGCAC TGGGCTGGGT
TTGGCCTTAA GTCGGCGCTT GGCTCAATTG CTCGGCGGCG ATATTCGGGT ACAAAGTGAA
TTAGGCGTTG GCTCAACCTT CAGCGTGCAC CTACCGCAAT CAGTCATCGA TATGGCTCCA
GTTTCGTTAC TTGATGAAGC GCCAGTTATT ATCAGCGATG CCAACAACAA CCAACCCAAA
GTGCTGATTA TCGACGATGA TCGCAATGTC CATCATCTGC TTTCGCGCAC GCTCAAGCGC
GAGGGCTGGA GCGTCCTTAG CGCATTTGAT GGCGAAAATG GCTTAGCAAT GGTGCGCAAT
CATCATCCAA CGGCAATTTT GCTCGATGTG TTGTTGCCAG GCCATGTCAA TGGTTGGGAG
ATTTTGGCCG AAATCAAGGC CGACCCCAAA ATTGCCACAA TTCCGGTAAT TATGCATACG
ATTGTGGCCG AGCCAAACCA AGGGGTTTCG TTTGGGGTGT ACGATTATTT GATTAAGCCC
GTTGATCGCG GCCAATTGCT GCGCACGCTA CGGAGTTGTA TCGACCCGCA AAATGCCAAA
ACCCAATTGG TTTTGGTGGT CGATGACGAT CATGATAGTC GGGCGATGCT GCGACGCATG
CTCGAAGGCG CTGGCTGGAA AGTCTATGAG GCTGCCAACG GGCGCGAAGC CTTGGGCGCA
TTGCATAGCC GCCCATTTGG CGCGATGATT CTCGATCTAA TGATGCCCGA AATGGATGGC
TTCGAAACGA TCGCTGCCTT ACAAGAGCTT GAGCAATTCC GCGATTTGCC GATTATTGTG
GTTTCGGCCA AAGAACTCAC TGAAGAAGAG CGGCAACAGC TTGAAGAGAC CGTTGAACGG
GTGGTCAGTA AGGGGAATGT GCGGCGTGAA GAGATTTTGG CGTTGGTGCG CGAGCAAGTT
CGGCGGCGCG TTGAGCAACC GCCTACAACC ACGTAA
 
Protein sequence
MKRGSVRRTM ITIAAIILSL MTGLNIIGVV ERSMLNNDIE YVIDLEDAER LAREISLYTQ 
YQAHALDAYA LGEIEEREHY TRYRQAFDDK RLELEQFFKN MQPNPETKTA FENVQKLSAD
YEDAGVAYLA QIDLRIQESA PTRSTAELAA WKILDEQADQ LDEATQVLSD IIDDQSEALE
ATITKQNGRM IVALTGRSLA ILVLLSLFVY YLLGRVGNQF KLVRDGAQRF ADGDFTTDIP
IRRYDEVGRL AAMFNTMAQT IRGQIERLEQ AKDHAQRLQF VAEEANRAKS NFLANMSHEL
RTPLNAIIGY SEILQEECED LGQTAMIEDL DRIRLSGRHL LTLINDILDL AKIESGKVEI
LPEEISLPQL LHDVRSTVDP MIIKNENRLV IESAAGLLTM ISDETRLRQI LVNLLSNAAK
FTEHGRITLR VQPSEEEGWI DFSVHDNGIG MSNAQLSRLF QPFTQADAST TRKYGGTGLG
LALSRRLAQL LGGDIRVQSE LGVGSTFSVH LPQSVIDMAP VSLLDEAPVI ISDANNNQPK
VLIIDDDRNV HHLLSRTLKR EGWSVLSAFD GENGLAMVRN HHPTAILLDV LLPGHVNGWE
ILAEIKADPK IATIPVIMHT IVAEPNQGVS FGVYDYLIKP VDRGQLLRTL RSCIDPQNAK
TQLVLVVDDD HDSRAMLRRM LEGAGWKVYE AANGREALGA LHSRPFGAMI LDLMMPEMDG
FETIAALQEL EQFRDLPIIV VSAKELTEEE RQQLEETVER VVSKGNVRRE EILALVREQV
RRRVEQPPTT T