Gene Haur_4708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4708 
Symbol 
ID5736944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6014131 
End bp6015702 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content54% 
IMG OID641281872 
Producthistidine kinase 
Protein accessionYP_001547467 
Protein GI159901220 
COG category[G] Carbohydrate transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG1129] ABC-type sugar transport system, ATPase component
[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTACCT GCCGCCATAT TTCCAAGCAA TTTGGTACGC TGCCCGTGAT CGATGAGGTC 
AGCTTTGATC TTGCGCCTGG CGAGGTGGTG GGCTTGACTG GCCAAAGTGG TGCTGGTAAA
TCGGTATTGG TGCGTTTATT GGCGGGCTTG GAAAAACCCG ATACTGGGGT GATTTCGACC
CGTGGCCAAT TAATTAACTC GACCCAAAGT GCCTTTCGGG CTGGGTTGGC GGTCATCCAT
CAGCAGCCAG TGCTAGTTGA GCATTTGGAT GTGGCGAGTG CGATTTTTTT GGGTCACGAG
GTTGGGCGCG GTTGGCTGGG TTGGTTATCG TTGCCCAATC AGCGCCATCA CGACACCATG
GCGCGGCAAA TTCTAGCCCA ACTTGGTTTA GAGTTGCCAT CGTTGCGCAC CTTAGTTCGT
AATTTATCGA GCGAACAACG CCAGATGTTG GCGATTGCGC AGGTGCTGAT CCGCAAGCCG
CAAGTTGTGA TTATCGATGA GCCAACGCCG CTACTGCGCT ACGAATATCA ACAAACGCTG
CTCGAATTGA TTCGCGAATG GCAAGCGCAA GGTGTGGCGG TGCTGTTTAG CAGCCAAAAT
CTTGATCATC TATTTGCAGT GAGCAATCGG ATTTTGGTGT TGCGGCGCGG GCGCTTTGTG
TTCGAGGCGG CCACCGAAAA AACCTCGCGT GAGGAAGTGG TGCGGGCACA AATTGGAGCG
CGAGATCAGC AACATCTCAC ACCAATTATT TGGGCCTTGG AAAATTACCA TCGTGCCAGC
CAACAAGCCG AGGCCTTGCG CCAAAGCCAA TCCAGCCTAG AGCACGATTT AGCTAGCCAG
AATCAGCTTA ATCGCCAATT GATTGGCCAA CTCGATCAAC AGGTCAGTAA TCTTGATCGC
GCCAATGCGG CCTTGCAAGA AGCCCAACGT CGTTTGCTTT CCGAGCGCGA AGGTGAGCGC
AAAGTGCTTG CCCGCGAGCT GCACGACCAA GTTATTCAAG ATTTGGTGAG CCTCAATTAT
GATATTGATA ATTTGCGCAG CCAAATTGAC GACCCTGAGC AAGCCAGCCT TGGGCTTGAC
GATTTGCGCG ATAACATTCG CCAATTGGTG AGCACAGTAC GGGCAATTTG TGGTAACTTG
CGCCCGCCAA CCATCGATAG CCTTGGGGTC AATGCGGCAA TTCAATCGTT TGTGCGCGAT
TGGAGCAGCC GCAGCGGCAT TGAAGTGCAG CTTGATCTCG ACGACGATTT AGAGCGCTTG
CCCGAAATGC TCGAAATTTC GGCCTTTCGC ATGATTCAAG AGGGCTTGAG CAATGTGCGT
AAACATGCCC AAGCCACCAA AGTTGGCATT AGCCTGCGTA CCACCGCCCG CCGCACTCTG
CTCCTGACGA TTGCCGACAA CGGGCGCGGC TTGCAAGCTG AGATTAATTT GGCGGCGCTG
GCGAATGCAG GCCACTATGG GTTGCTGGGC ATGAGCGAAC GGGTGGCGCT GGTTGGCGGG
CGTTTCCGCG TGCACAATCG GGCTGGCGGC GGTCTCATCC TCGAAATCGA AATTCCCTAC
CAACCACTTT AA
 
Protein sequence
MFTCRHISKQ FGTLPVIDEV SFDLAPGEVV GLTGQSGAGK SVLVRLLAGL EKPDTGVIST 
RGQLINSTQS AFRAGLAVIH QQPVLVEHLD VASAIFLGHE VGRGWLGWLS LPNQRHHDTM
ARQILAQLGL ELPSLRTLVR NLSSEQRQML AIAQVLIRKP QVVIIDEPTP LLRYEYQQTL
LELIREWQAQ GVAVLFSSQN LDHLFAVSNR ILVLRRGRFV FEAATEKTSR EEVVRAQIGA
RDQQHLTPII WALENYHRAS QQAEALRQSQ SSLEHDLASQ NQLNRQLIGQ LDQQVSNLDR
ANAALQEAQR RLLSEREGER KVLARELHDQ VIQDLVSLNY DIDNLRSQID DPEQASLGLD
DLRDNIRQLV STVRAICGNL RPPTIDSLGV NAAIQSFVRD WSSRSGIEVQ LDLDDDLERL
PEMLEISAFR MIQEGLSNVR KHAQATKVGI SLRTTARRTL LLTIADNGRG LQAEINLAAL
ANAGHYGLLG MSERVALVGG RFRVHNRAGG GLILEIEIPY QPL