Gene Haur_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0444 
Symbol 
ID5732343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp520012 
End bp522153 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content44% 
IMG OID641277570 
Productintegral membrane sensor hybrid histidine kinase 
Protein accessionYP_001543223 
Protein GI159896976 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00394135 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGTT TTGCTTGGTT GCGGCGCAAA GAATTACGCC CATTAATCGC TTTATTGAGT 
GTGGCTTTAG TTTCGATTGC CCTGTTGCAA TGGAATTATT ACCAAATGCG CTCGATGATT
CAATCGCAAA TCACGGCGTT TAATGGGATG ACTCAAGCCC GTTTGAATAT GAAACGTGGC
TATTTGATTG CTGAGCAGGT ATTAAATGGC AACCAAACCC TGCTGCCCCA AGATGCAATT
GCCTATCTTG ATCAGGCGAT TTTGGCGGTT GATGATTGGC AGCAAGGTAA AAATCCCGTT
GTAGTCTTTG ATCAATTGTT GCCGCCAGAT CCACCAACGC AGCAAGCGCT CACTGACTAT
CGCCAGCATT TGGTCAGCTT TCGAGCCTTA TTATTGCAAC ATCTCAATCG CCCACAAGAG
CAAGCCCAAA CCACGATTGA TCGCCGAATT GCCTTCTCAG AGTTGGAATC TGAAGCTGAG
CAGTTGGAAA ATCGTTTATA CCTTGATTTG GAATTATCAA TTAAACGGCA GGCCGAGCGC
TATTCGTTGA TGTTGGTTGG CTGGCTGATT TTCTTGGTGA TTATTGCGGG AGTTGTCTAT
CGTTTGTTGG GTTTGCAGCG CTTGGCGGTG GCTCAATTGG CCGCCAGCGA AAAACAATTT
CGAGCGCAAT ATCGTGGCAT TCCAATTCCA ACCTATACAT GGCGTTGTAT TGACGACGAT
TTTATTCTCG AAGGTTTTAA CGATGCAGCG ATGATTATTA CCCAAGGAGC AATTACCAAA
TTTGTCAATC AATCAGCTAG AATTGTCTAT CGCAATGAGC CAGAGATTTT ACACCATTTT
GAGCGTTGTC AACGTGAAAA AATAACACTT CATCAAGAAA TTCGCTATGA GTTTAAATTA
ATCAATCAAG TTAAAGATTT GTATGTGAGC TATGTGTTTA TCGAACCAGA TTTTATTATG
GTGCATACCG AAGATCGAAC TGATCGCAAT CAGTTGCAAA TGCAATTAAT TCGTGCCCAG
AAAATGGAAA GTATTGGGCG CTTGGCTGGT GGGGTAGCTC ACGATTTCAA TAATTTGTTA
ACTGCAATTA GCGGTTATAC AACCTTGGCG ATTGATAGTG CTCAGCAAGG CTTTCCGGTG
GTCGAAGAAT TAACTGAAAT TCAACATGCC ACTGATCGAG CAGCAATGCT TACCAGTCAA
TTATTAACCT TTGCACGTAA GCAGCAAATC CAACCAAAAA TTGTCAATAT CAACGATTTA
ATTATGTCGA TGGAGAAATT GATTAGACGA GTTTTACCAG AATCAATTCA ATGGCGAACT
GTACTTAATG TTAATATTGA TTTAGTTTTG GCTGATATCG GCCAGATTGA GCAAGTATTA
TTGAATTTAG TAGTCAATGC GCGTGATGCT ATGCCTCAGG GTGGAAATTT GCTGGTTGAA
ACAAATAATG TTGCAGTTGA TGCAAATTAT GCACGAACTC ATATTGATGT ACCAATTGGC
AATTATGTTT TATTCGCGGT GAGCGATACC GGCATCGGCA TGACCCAAGA AGTCCAGAGT
CGGGCATTTG AGCCATTTTT TACCACTAAA AGCGATCACG AAGGTACTGG TTTGGGTTTA
GCCACCTGTT ATGGCATTAT CAAGCAACAT GGCGGCCATA TTGGTTTGTA TAGCGAAGTT
AATCATGGCA CGACGATTAA AATTTATTTG CCACGTTCAG AGCAAGCTCA ACTCGATGAT
GCTCCAAGCC TCATCCAAAA CTTACCTCGT GGCACGGAAA CGATTTTGGT GAGCGAGGAT
GAGCCGCAGG TACGGGCGGT TTTGGTGCGG ATGCTCCAAG GCTTGGGCTA TAGCGTGCTC
GAAGCACTGC ATGGCTCCGA GGCCTTGGCC TTGCTCCAAG CCCAGGCGCT TGGCACAATT
CAGCTCTTAA TTACTGATAT GGTTATGCCA CAAATGGGTG GTTTTGAGCT GAGCCAACGG
GTTGCTGAGC TTGATCCTCA GCTCAAAATT TTGTTTATTT CGGGCTACTC CGAGCATAGT
TTGGCGCAAC ATCAGCAATT AGCCCAACAG CCTTTGTTGC TGAGCAAGCC ATTTTCACTG
GCTACTTTGG CGCAAACAGT CCGCAAGGTG CTGGATGATT AG
 
Protein sequence
MKRFAWLRRK ELRPLIALLS VALVSIALLQ WNYYQMRSMI QSQITAFNGM TQARLNMKRG 
YLIAEQVLNG NQTLLPQDAI AYLDQAILAV DDWQQGKNPV VVFDQLLPPD PPTQQALTDY
RQHLVSFRAL LLQHLNRPQE QAQTTIDRRI AFSELESEAE QLENRLYLDL ELSIKRQAER
YSLMLVGWLI FLVIIAGVVY RLLGLQRLAV AQLAASEKQF RAQYRGIPIP TYTWRCIDDD
FILEGFNDAA MIITQGAITK FVNQSARIVY RNEPEILHHF ERCQREKITL HQEIRYEFKL
INQVKDLYVS YVFIEPDFIM VHTEDRTDRN QLQMQLIRAQ KMESIGRLAG GVAHDFNNLL
TAISGYTTLA IDSAQQGFPV VEELTEIQHA TDRAAMLTSQ LLTFARKQQI QPKIVNINDL
IMSMEKLIRR VLPESIQWRT VLNVNIDLVL ADIGQIEQVL LNLVVNARDA MPQGGNLLVE
TNNVAVDANY ARTHIDVPIG NYVLFAVSDT GIGMTQEVQS RAFEPFFTTK SDHEGTGLGL
ATCYGIIKQH GGHIGLYSEV NHGTTIKIYL PRSEQAQLDD APSLIQNLPR GTETILVSED
EPQVRAVLVR MLQGLGYSVL EALHGSEALA LLQAQALGTI QLLITDMVMP QMGGFELSQR
VAELDPQLKI LFISGYSEHS LAQHQQLAQQ PLLLSKPFSL ATLAQTVRKV LDD