Gene Haur_4061 details

Gene Information

Locus tag: Haur_4061
Symbol:
ID: 5735919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5186106
End bp: 5188763
Gene Length: 2658 bp
Protein Length: 885 aa
Translation table: 11
GC content: 54%
IMG OID: 641281212
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001546821
Protein GI: 159900574
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
TIGRFAM ID: [TIGR00229] PAS domain S-box
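
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive (the record does not say so explicitly) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5186106
end_bp = 5188763
gene_length_bp = 2658
protein_length_aa = 885

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the final codon is assumed to be the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```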


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.193403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCCGCA ACTCGTCTTT TCGCATTTGG CTAACTCGTA TTCGTCCTCT AGCTTGGGCC 
TTAGCTTGCT TTGCGCTCGT GCTCTTCGGC TACACCTTTA CCACCACGAT TCCGACGGAT
CGCCTGACGG TGATTGCGAG TATTTTGGGG GTAGCGGCGC TAGCCTTGCC ATGGTTGTTT
AATCCTGAGG GCGATTGGCG CGAGTTGCGC ACGCTGGGAA TTTTGGCCTT GCCCTTGAGC
GTGGCGATTC TCAGCGAGGG CTATAATACG GCGCTCTGGT CGGTGCTCTT GATTCCAGCG
ATTGCGATTC CGCAAATGTT GCCGCCGCGT TGGGCTTTTA GCGCAATTGC CTTGATGGTG
ATTGCTTGGG CGGTCAGCAA TGTGTTAGTT CCAGCTGTCG CGGTCGAAAC TGTGGCGATT
GAAGTTGGCT TGCGAAGTTT GGGCTTTGTG CTGGTGGCGA TTGCGGTGTG GTTGGCGGCT
CGACCACGTT TTGATTATCC GGCCATGCTG CCCGAAGCGC CAATTCGTCG TGCTACACGC
GCTGCTGAGC GCTTGCGTGG CTCGCTTAGC CCCGAAGAAA CTCTCGAAGA ATTAGCCAGC
GCCGCCAAAG CGTGCGGCCC GTTTATTTTC GCCAGCGCCT CAACCGTCGA TTGGCGGGCA
CGAGTGCTGC GTATGGCTGT GGCGATTGGG GGTAGTGGCC GCACCCTCGG CGCAACCGAA
ATGCTCTCAA TTCCATGGGA TGAAATTACG GTGTTGCTGC GCGATGATCG GCGCATTGGC
GATAATGCCT ATCTTGCCGA TTCGCTGCCC TTCCGCGATA TTGCTGGCGA ACACTATATG
CTGGTGCCAG TCCGCACTGC GACTGGCGAG ATTTGTGGTT TGCTGACGGT TGGCGATGAT
GACCCCAAAG CCCGCAAGCG CCTGACTGAA ACTGCGCCAT TGCTTGAATT ATTGGCTTCG
CAAGCCGCCG CTGTGCTCGA AAACGCGGCG CTCCAAAACA CGCTTGCCCA ACGAATCGAA
GCCACAACCG CCGAAATGGG CCGCACCGCC GAAGATGCAA TGCGGGCACG CACTCGGGCC
GAAAGCATGT ATCAGATTGT GCGGGCACTC AGCGGCACGC TCGAACCGCA GCCATTGCTT
GATCAAGCCC TGTTGTTGAT TGCCCAAGCT ACCCAAGCCG AGCGCGGCGG GATTATGTTG
ATCGATCATA AAAATGGGCG TTTGGCTTTC AGCACCAACC TTGATCGTAA TATCACCCGT
ACCGAGGCGA TTTCCTTGGA GCGTGGCCAA GGCTTGGCAG GCTGGGTTGT TGAGCATCGT
GCACCCGTGA TTATTCCCAA TACTGCCGAA GATAGCCGTT GGATGGTGCG CACCGATTAC
GACAAAAAAG GTCGTTCAGC GCTGGCCGTG CCGATGGAGC AAGATGGGCG AGTCGCTGGG
GTGATTGTGC TGATCAACAG CCGCATCAAT CACTTTACCC AAGAGCATAT TCAATTTGTG
CAGGTCATTG GCGATCAAGT GATGACAATG CTCAGCAATG TGCAGCTGTA TCGTGCCACG
ACCGAGCAAG CTCGCCGTTT GAGCCAAGCT CTTGAACAAC GTGAAGAAGA AGTTAGCCGT
AGTTTGGCAA TTGTACGTTC GATTGGCGAT GGTGTAGTGG TTGGCGATCG GGTTGGTCGG
ATTCGCTTGA TTAATCCGGC TGCCGAGCAA TTGCTGAATA TCGAAGCTGC TGAATGGTTG
GGCAAGCCCT TGATGAGCCT GCCTGGTGCG CCCGAGAGTG AGCCACGCCT GACCGAAAAG
CAAACCTACC AGCAATTTGA GCTAAGCGGG CGCATGATTC GCGCTTCGAG CACGCCAGTC
TTTACTTCGC AAAGCGAATG GCTGGGCAGT GTGGTGGTCT ATCACGATAT TACAGCGTCA
GAATTGGCTG ATCGCATGAA AACTGAGTTT GTGGCGACGG CCTCGCACGA ATTGCGTACC
CCATTGACCT CAATTAGCGG CTACATTGAT TTGCTGATGT TAAACACGCT TGGTCCCTTG
ACCGAGCAAC AACGCCAATT TTTGAGCGTG GTCAAGAACA ACATCGAACG TTTGAATGCG
ATTCTCAACG ATTTGCTCGA TGTTTCACGC ATCGAATCGG GCAAAGTGCG GTTGCAACGC
AAGCCAATTA ACCTTGATGA ACTGATTCAA TCAACAGTGA TGTCAATTCA TCAACAATGG
AGTGGCAAGC AAATTTCCTT GGCGCTCGAT GTGCCCGATG ATTTGCCGCC AATGATTGCC
GACCCCGAAC GCATGCGCCA GATCGTCACC AATTTGATCT CAAATGCCTA CAAATATACC
CGCGACGGCG GCAGAATTGA TGTTGTAGTC AGCAATGGCG GCGATTCGGT GACCTTAGCG
GTCAAAGATA GCGGCGTGGG CATCGCTGCT GATGATCAAA AGCATATTTT TACGCGCTTC
TTCCGCTCGG AAAACCCGCT CAAGGAGCAG GCTGGTGGCA CGGGCTTGGG CTTGAACATC
ACCAAATCGC TGGTTGAGCT GCACGGTGGC AAAATCTGGT TTGATAGCGA AGAAGGTCGC
GGCACAACCT TTAATGTCCA ACTGCCGGTC GGCGGCGATT CCGACTGGAC TCCCGCTTCA
TGGCTTGAAG GAGTGTAA
 
Protein sequence
MGRNSSFRIW LTRIRPLAWA LACFALVLFG YTFTTTIPTD RLTVIASILG VAALALPWLF 
NPEGDWRELR TLGILALPLS VAILSEGYNT ALWSVLLIPA IAIPQMLPPR WAFSAIALMV
IAWAVSNVLV PAVAVETVAI EVGLRSLGFV LVAIAVWLAA RPRFDYPAML PEAPIRRATR
AAERLRGSLS PEETLEELAS AAKACGPFIF ASASTVDWRA RVLRMAVAIG GSGRTLGATE
MLSIPWDEIT VLLRDDRRIG DNAYLADSLP FRDIAGEHYM LVPVRTATGE ICGLLTVGDD
DPKARKRLTE TAPLLELLAS QAAAVLENAA LQNTLAQRIE ATTAEMGRTA EDAMRARTRA
ESMYQIVRAL SGTLEPQPLL DQALLLIAQA TQAERGGIML IDHKNGRLAF STNLDRNITR
TEAISLERGQ GLAGWVVEHR APVIIPNTAE DSRWMVRTDY DKKGRSALAV PMEQDGRVAG
VIVLINSRIN HFTQEHIQFV QVIGDQVMTM LSNVQLYRAT TEQARRLSQA LEQREEEVSR
SLAIVRSIGD GVVVGDRVGR IRLINPAAEQ LLNIEAAEWL GKPLMSLPGA PESEPRLTEK
QTYQQFELSG RMIRASSTPV FTSQSEWLGS VVVYHDITAS ELADRMKTEF VATASHELRT
PLTSISGYID LLMLNTLGPL TEQQRQFLSV VKNNIERLNA ILNDLLDVSR IESGKVRLQR
KPINLDELIQ STVMSIHQQW SGKQISLALD VPDDLPPMIA DPERMRQIVT NLISNAYKYT
RDGGRIDVVV SNGGDSVTLA VKDSGVGIAA DDQKHIFTRF FRSENPLKEQ AGGTGLGLNI
TKSLVELHGG KIWFDSEEGR GTTFNVQLPV GGDSDWTPAS WLEGV
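
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds a codon table with the amino acid assignments used by translation table 11 (for sense and stop codons these match the standard code; table 11 differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The `translate` helper and the variable names are hypothetical, and only the first three and last two 10-base blocks of the gene sequence are used.

```python
# Codon table in the conventional T, C, A, G ordering of all three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in this record.
first_blocks = "ATGGGCCGCA ACTCGTCTTT TCGCATTTGG"
last_blocks = "TGGCTTGAAG GAGTGTAA"

print(translate(first_blocks))  # MGRNSSFRIW
print(translate(last_blocks))   # WLEGV
```

The two outputs match the first block and the final five residues of the protein sequence listed above. The last block is in frame because it starts at base 2641, and 2640 is a multiple of 3.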