Gene Haur_2137 details


Gene Information

Locus tag: Haur_2137
Symbol:
ID: 5734039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2687754
End bp: 2690585
Gene Length: 2832 bp
Protein Length: 943 aa
Translation table: 11
GC content: 51%
IMG OID: 641279278
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001544905
Protein GI: 159898658
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
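
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2687754, 2690585)
print(length, expected_protein_length(length))  # 2832 943
```

Under these assumptions the computed values agree with the listed 2832 bp and 943 aa.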


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAATT CGCTACGACC CGCCGAGCCA CGCATCGATC AAGCTACGAT AACCAAGCTG 
CTGCATCAGT TTGATTGGAG CACAACGCCG CTTGGGGTGA TCGATACTTG GCCAACTGCC
ATGCGCAGTA TCGTTGATAT GATGTTGGCG GTTCCGGTTG CCATGACAAC CATGTGGGGT
AAGCAGGGCA TCATGATCTA TAACGCTGGC TATGCGGTCA TTGCTGGCTC ACGTCACCCC
GAAGTATTTG GGCTGCCGGT AACCGCCGCT TGGCCTGAAG CCGCCGATTT CAATCAATCG
ATCCTCGATC AGGTCTTTGC GGGCAAAAGC CTTACTTACC AAAAACGCAA TTTTATGTTG
CACCGTAATG GCCGTAACGA ACAAGTCTGG TTTGATCTTA GTTATAGCCC GATTTTTGAC
CAAAACGGCG AAATTATTGG GGTGATGGCG CTGGTGGTCG AAATAACTGC CCAAGTTTTA
GCTGAACAAC AACGCCATCG CGCCGAAGAA CGCTTTCAAT TAGCCTTGGA TGCTGGCTTG
CTGATCGGCA CATGGGATTG GGATATCGTA GCCGATCGCT GTATCGTCGA TCCACGTTTT
GCCGAATATT TTGCGCTTGA CCCGACCCTT GCCAGCCAAG GTGTGAGCGT CGAAACCATG
CTTGAGGCGA TTCATCCCGA TGATCAAGCA ACCATTGCCC AATTAATTCA ATCGGCAATC
CATAGCGGCC AATCATATCG CGCTGAATAT CGCGTGCTCC ACCGCGACGG CGATTATCGC
TGGGTTGAGG CCAATGGCTT TTGCATTTTT GAGCACAATC AGCCGCAACG CTTTCCTGGC
GTATTAATCG ATATTACCGA GCGCAAGCGC CGCGAAGATG CCTTGGAGCA TAGCGAGGCT
CGCTTACGCG CAATTTTCGA TACCTTGCCC GTCGGCTTGG TTTTTGCCGA AACCCCAACT
GGACGGATCA CCGATGGCAA TGCCCATGTC GAGCAGATTT TGCGCCATCC GGTACTGCCA
TCGCCTGCGA CCGAGGCCTA TGGCGAGTGG ATCGCTTACG ACGAACACAA TCAACTTGTA
CCAATTGAAG AATATCCATT GGCGATTGCC GTCAAAACTG GCCAAGTTTC CGAGCGCGAT
TTCCACTACC AGCGGGGCGA TGGCACACGG GCTTGGATCA AGGTTATCGG CGGCCCAGTC
CGCGATCTTG ATAACACGAT CACTGGCGGT TTGATCACAA TTATCGATAT TGACCGTGAA
AAACGCACCG AAACGCAACT TCAAGCGCTC AACAACGATT TAGAAGGCTT GGTTGCCCAA
CGCACCCGTG AGCGCGACCG AATTTGGCTG GTCAGTCAAG ATCTGTTAGG CATTGCCGAT
CCAGCGGGCA ATTGGCTCGC GATCAATCCT GCTTGGCAAC GCACGCTCGG CTGGGATGAT
AGCGATATTC TTGGGCGCAC CAGCGAATGG CTTGAACACC CCGACGACCA TGAACGCACG
CGCCAAGAGG TTGCCAAGCT AGCCAATGGC ATTCCCACCA GCTATTTTGA AAATCGTTTT
CGCAGTCGCG ATGGCGAGTA TCATTGGCTT TCGTGGACGG CAGTGCTGGA TAATGGCTAT
CTCTATTGCG TTGCCCGTGA TATTAGCAAC GAAAAACAGC GCCAAGCTGA AATCGAACGC
ATGCAAACCC AATTGCGGCA ATCACAAAAA ATGGAGGCGA TTGGACAACT CACAGGCGGC
ATCGCCCACG ATTTCAATAA TATTTTGACC AGCATTTTGG GTGGGCTCGA TTTGCTGCAA
CGGCGGATCA ACGTAGGCCG TTTCGATACA ATTGAGCGCT ACATCAACAG TGCAATTAAA
TCGGCCAAAC GTGCTGCCGC ATTAACTCAA CGTTTATTGG CATTTTCGCG CCAGCAAGCG
CTGGATGTGC AACCAATCAA TATCAATAGC CTGATCCAAT CGCTCGACGA TTTACTCCAA
CGCAGCCTTG GCGAGCAAAT TCAGGTGGCA ACCAACCTCA GCGATGACGT TTGGCGGGTG
CGCACCGATG CCAATCAGCT TGAAAACGCC TTGCTCAACT TGGCGATCAA CGCCCGTGAT
GCCATGCCCT ACGGCGGAAT TCTGACGATT AGCACGAGCA ACATCGACGC GCAACAAGCA
AATCAACTGC AACTTGATCC GCATGAATAT GTATTAATCG AAGTCATCGA TACCGGCACT
GGCATGAGTA GTGAGGTCAT CGAACGCGCT TTCGATCCAT TTTTTACCAC CAAACCTTTG
GGGCAAGGCA CAGGTTTAGG TTTATCGATG ATTTATGGCT TTATCAAACA AATTGGCGGG
CATATTCAAA TTGAGAGCCA ACTCGAACAA GGCACCAGCG TCAAGCTCTA TTTGCCGCGT
GACCAAAGTG GGCTTGAGCA TGGCTTTGCC GCCGAATCGG CCCAAGCTCG CAGCATCGAG
GGAGCCACAA TTTTGGTGGT TGAAGATGAT GAAGCGGTAC GCATGGTGTT AATCGATGTA
CTCGACGAGT TAGGTTACCA TACGCTTGAG GCTGAGGATG CCAGCAGTGC CTTGACCTTC
TTCGAGCAAC CAACAACAAT TGATTTGGTG ATTAGCGATA TTGGATTGCC TAAAACCGAT
GGCTATGATC TAGCCCTCCA GATCCGCCAG CGCTACCCAA CCTTGCCAAT TTTGCTGGTC
AGTGGCTACA CCGATCGGGC GGCAGTGCGC AGCGGCGAGC TTGAACCACA GATGGAATTG
CTCAGCAAAC CATTTGAAAT TACCGACCTG GCCAATAAAA TTCACGATTT GCTACAACAA
GGCCACACGT AA
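
The gene sequence is printed in space-separated 10-base blocks, so it needs to be cleaned before any computation. The following is a minimal sketch of one way to strip the layout and compute a GC percentage that can be compared against the reported GC content field. The helper names are hypothetical, and the database may round or count differently.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the final line of the gene sequence only (12 bases, 7 of them G/C).
print(f"{gc_content('GGCCACACGT AA'):.1f}")  # 58.3
```

To check the whole gene, paste all sequence lines into one string and pass it to `gc_content`.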
 
Protein sequence
MPNSLRPAEP RIDQATITKL LHQFDWSTTP LGVIDTWPTA MRSIVDMMLA VPVAMTTMWG 
KQGIMIYNAG YAVIAGSRHP EVFGLPVTAA WPEAADFNQS ILDQVFAGKS LTYQKRNFML
HRNGRNEQVW FDLSYSPIFD QNGEIIGVMA LVVEITAQVL AEQQRHRAEE RFQLALDAGL
LIGTWDWDIV ADRCIVDPRF AEYFALDPTL ASQGVSVETM LEAIHPDDQA TIAQLIQSAI
HSGQSYRAEY RVLHRDGDYR WVEANGFCIF EHNQPQRFPG VLIDITERKR REDALEHSEA
RLRAIFDTLP VGLVFAETPT GRITDGNAHV EQILRHPVLP SPATEAYGEW IAYDEHNQLV
PIEEYPLAIA VKTGQVSERD FHYQRGDGTR AWIKVIGGPV RDLDNTITGG LITIIDIDRE
KRTETQLQAL NNDLEGLVAQ RTRERDRIWL VSQDLLGIAD PAGNWLAINP AWQRTLGWDD
SDILGRTSEW LEHPDDHERT RQEVAKLANG IPTSYFENRF RSRDGEYHWL SWTAVLDNGY
LYCVARDISN EKQRQAEIER MQTQLRQSQK MEAIGQLTGG IAHDFNNILT SILGGLDLLQ
RRINVGRFDT IERYINSAIK SAKRAAALTQ RLLAFSRQQA LDVQPININS LIQSLDDLLQ
RSLGEQIQVA TNLSDDVWRV RTDANQLENA LLNLAINARD AMPYGGILTI STSNIDAQQA
NQLQLDPHEY VLIEVIDTGT GMSSEVIERA FDPFFTTKPL GQGTGLGLSM IYGFIKQIGG
HIQIESQLEQ GTSVKLYLPR DQSGLEHGFA AESAQARSIE GATILVVEDD EAVRMVLIDV
LDELGYHTLE AEDASSALTF FEQPTTIDLV ISDIGLPKTD GYDLALQIRQ RYPTLPILLV
SGYTDRAAVR SGELEPQMEL LSKPFEITDL ANKIHDLLQQ GHT
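
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The `translate` helper is hypothetical, stops at the first stop codon, and raises `KeyError` on ambiguous bases such as `N`.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCCGAATT CGCTACGACC CGCCGAGCCA"))  # MPNSLRPAEP
print(translate("GGCCACACGT AA"))                     # GHT
```

Both fragments match the corresponding ends of the listed protein sequence.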