Gene Haur_4259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4259 
Symbol 
ID5736113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5433048 
End bp5435462 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content51% 
IMG OID641281414 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001547019 
Protein GI159900772 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAA TCCTCATCAT CGACGATAGC CGTCAGATAG CCGATTTTTT GGCTGAGACG 
GTGCTGCCAT ACTATGGCTA TCGTTCGCGC GTCGTACCCA CAGGCTGGGA AGGTCTGGCT
TTTTTGAAAC AACGCCAGCC CGACCTAATT TTGCTTGATC TCCAACTCCC CGACACCTCG
GGGCTTGATG TGCTTCGCCA AATGGGCGAA CTTGGCTACG ATATTCCGGT GATTTTGATG
ACCGCCCATG GTACCGAGCA AACCGCTGTC GAGGCCTTTC GACTTGGTGC CAAAAATTAT
TTGATCAAGC CTTTTGATGC TTCGGAAGCG GGCGCAGCAA TTGAACGTGC CCTGCGCGAA
CGCCGTTTGC AGCGCGAAAA AGAGGTGCTT ACCCGCACCT TGCAGCAACG CTTGCAGGAG
TTGACGATTC TCTCCAGCGT TGGTAAATCG GTGACCTCGC TGCTTGATCT CGAAGAATTG
CTCGAACGGA TCGTCGATGC TGGGGTGTAT CTGACCCACG CCGAAGAAGG CTATTTGCTG
TTGCGCGATG GTGATGAATT ATATTTGCGG GCGGCTAAAA ATCTCGGCGA GGAGCGTGTG
CAGCGTTTTC GGGTCAAGAT GGATGACAAT GTGGCAGGCC AAGTTATTCG CACCGTCAAG
CCAATTCGGC TTGATCGCTC GCAGCACCAA GATTTAAAAC TGCGCACTGG GCTTTTGGTG
CAGGCAATGT TGCAAGTTCC CTTGATTGTT GGTCGCCATG CGATTGGTGT ATTGGCGGTT
GATAATCGGG TGCAGCAACG TACCTTCAGC GAAAACGACC AATATTTGCT CTCAGCGATT
GCCGATTATG CAGCGATTGC GATTGAAAAT TCGCGCTTGT TTAAATCGAC CCGCGATTCT
GAGCAACGCT ACCGCGAATT ATTCGATAAC GCCAACGATA TGATCTTCAC GCTTGATCCA
CAGCTACGAA TTGCCTCGAT CAATCGTCAA GGGGTCAAAT TGTTGGGCAT GACCGTGGCT
GAAATGATGC AGCGCACCTT GATTTCGCTG TGTGTGCCCG ACGATCAAGC AGCAATCGAA
CATCAATTGC AACGCCAACT AGCCAAAGCT GCTGGTGATG GCGGTGCGTT TCCGTTGACT
TTGCGTCGTG CCAATGGCGA AGATTTGCAT ATTGAAGTCA GCGCCCAATT GATGCAGCGT
GGCGATCAGG TAATTGGTCT GCACTGTATT GCGCGTGATG TGACCGACCG CCGCCGCTTG
GAATTGCAAC TATTGCAGGC CGAAAAACTC TCAGCGATTG GCCAATTGGT GGCTGGGGTT
GCTCACGAGT TGAACAATCC GATGACCAGC ATCAAAGGCT TTGCTGAGCT ATTGCTGCGC
CGCAAAGATT TAGATGATGA TGCCCGTACC GATCTGAATT ACATTAATAA CCAAGCTGAA
CGAGCAGCGC GAATTGTGAC CAACCTGCTG ACGTTTGCCC GTGAGCATCA GCCGCAGCGG
GTTTTAGTCG ATGTCAATAA AGTTATCGAC GATACGCTGA GCTTGCATAG CTATCACTTG
CGCGTCGATA ACATCAAAGT TCAACGCCAA TTCGAGCCAG AGCTGCCAAC CACCGTCGCT
GATCCGTATC AATTACAACA GGTTTTTCTG AACTTGATTG GCAATGCCCA TCAAGCTATG
GCCGAAAAAG GCGGCGGCGG TTTCTTGACC GTCAAAACCG AGCGGGTTGA TGATGAAATT
CGGATTAACA TCGGCGATAC TGGCCCCGGT ATTCCCCAAC ACCTCGTCGG GCGGATCTTC
GATCCATTTT TTACCACCAA GCCTGTTGGC AAGGGTACAG GGCTAGGTTT ATCGATCTGC
TACAACATTC TGCATGATCA TGGTGGCAAT ATTTGGGTTG ATAGCGTGGC CAACGAAGGC
ACAACCTTCC ATTTGGCCTT GCCGGTGGTT CAGGGCGATA ATCCTGAGTT GATCAACGAC
GATGATCGTG AAACCACGAT TAAGCCCGAC CAAGCCTATA AAATTTTGGT GGTTGACGAT
GAAGAAGGCG TGGCCCAAGT GATTCAGCGC TTGGTGCGCG ATTTGGGTCA TCAGCCAGTG
GTGGTGGCTA GTGGCGAGGC CGCCTTGCAA GCCGTTGATC AAGCGCCCTT CGATTTGATT
CTGAGCGATG TTAAAATGCC AGGCATGAAC GGCTTCCAGC TGTATCGAGC TTTGCAACAA
AAAGCCCCTG AATTGGCCAA ACATTTTATC TTTATCACTG GCGATACGAT GAGTCCAGCC
ACGATCACCG CGATGCGCCA AATTGGCACA CCAATGATTG CCAAGCCCTT CTCAGCCAAA
AAACTCGAAC GTACAATCAA CGAGTTTATG GCTCGCGAAG AAGCATTGCA ACGCGAGGCC
GAAATTCAGC AATAG
 
Protein sequence
MEEILIIDDS RQIADFLAET VLPYYGYRSR VVPTGWEGLA FLKQRQPDLI LLDLQLPDTS 
GLDVLRQMGE LGYDIPVILM TAHGTEQTAV EAFRLGAKNY LIKPFDASEA GAAIERALRE
RRLQREKEVL TRTLQQRLQE LTILSSVGKS VTSLLDLEEL LERIVDAGVY LTHAEEGYLL
LRDGDELYLR AAKNLGEERV QRFRVKMDDN VAGQVIRTVK PIRLDRSQHQ DLKLRTGLLV
QAMLQVPLIV GRHAIGVLAV DNRVQQRTFS ENDQYLLSAI ADYAAIAIEN SRLFKSTRDS
EQRYRELFDN ANDMIFTLDP QLRIASINRQ GVKLLGMTVA EMMQRTLISL CVPDDQAAIE
HQLQRQLAKA AGDGGAFPLT LRRANGEDLH IEVSAQLMQR GDQVIGLHCI ARDVTDRRRL
ELQLLQAEKL SAIGQLVAGV AHELNNPMTS IKGFAELLLR RKDLDDDART DLNYINNQAE
RAARIVTNLL TFAREHQPQR VLVDVNKVID DTLSLHSYHL RVDNIKVQRQ FEPELPTTVA
DPYQLQQVFL NLIGNAHQAM AEKGGGGFLT VKTERVDDEI RINIGDTGPG IPQHLVGRIF
DPFFTTKPVG KGTGLGLSIC YNILHDHGGN IWVDSVANEG TTFHLALPVV QGDNPELIND
DDRETTIKPD QAYKILVVDD EEGVAQVIQR LVRDLGHQPV VVASGEAALQ AVDQAPFDLI
LSDVKMPGMN GFQLYRALQQ KAPELAKHFI FITGDTMSPA TITAMRQIGT PMIAKPFSAK
KLERTINEFM AREEALQREA EIQQ