Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4061 |
Symbol | |
ID | 5735919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5186106 |
End bp | 5188763 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281212 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001546821 |
Protein GI | 159900574 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.193403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCGCA ACTCGTCTTT TCGCATTTGG CTAACTCGTA TTCGTCCTCT AGCTTGGGCC TTAGCTTGCT TTGCGCTCGT GCTCTTCGGC TACACCTTTA CCACCACGAT TCCGACGGAT CGCCTGACGG TGATTGCGAG TATTTTGGGG GTAGCGGCGC TAGCCTTGCC ATGGTTGTTT AATCCTGAGG GCGATTGGCG CGAGTTGCGC ACGCTGGGAA TTTTGGCCTT GCCCTTGAGC GTGGCGATTC TCAGCGAGGG CTATAATACG GCGCTCTGGT CGGTGCTCTT GATTCCAGCG ATTGCGATTC CGCAAATGTT GCCGCCGCGT TGGGCTTTTA GCGCAATTGC CTTGATGGTG ATTGCTTGGG CGGTCAGCAA TGTGTTAGTT CCAGCTGTCG CGGTCGAAAC TGTGGCGATT GAAGTTGGCT TGCGAAGTTT GGGCTTTGTG CTGGTGGCGA TTGCGGTGTG GTTGGCGGCT CGACCACGTT TTGATTATCC GGCCATGCTG CCCGAAGCGC CAATTCGTCG TGCTACACGC GCTGCTGAGC GCTTGCGTGG CTCGCTTAGC CCCGAAGAAA CTCTCGAAGA ATTAGCCAGC GCCGCCAAAG CGTGCGGCCC GTTTATTTTC GCCAGCGCCT CAACCGTCGA TTGGCGGGCA CGAGTGCTGC GTATGGCTGT GGCGATTGGG GGTAGTGGCC GCACCCTCGG CGCAACCGAA ATGCTCTCAA TTCCATGGGA TGAAATTACG GTGTTGCTGC GCGATGATCG GCGCATTGGC GATAATGCCT ATCTTGCCGA TTCGCTGCCC TTCCGCGATA TTGCTGGCGA ACACTATATG CTGGTGCCAG TCCGCACTGC GACTGGCGAG ATTTGTGGTT TGCTGACGGT TGGCGATGAT GACCCCAAAG CCCGCAAGCG CCTGACTGAA ACTGCGCCAT TGCTTGAATT ATTGGCTTCG CAAGCCGCCG CTGTGCTCGA AAACGCGGCG CTCCAAAACA CGCTTGCCCA ACGAATCGAA GCCACAACCG CCGAAATGGG CCGCACCGCC GAAGATGCAA TGCGGGCACG CACTCGGGCC GAAAGCATGT ATCAGATTGT GCGGGCACTC AGCGGCACGC TCGAACCGCA GCCATTGCTT GATCAAGCCC TGTTGTTGAT TGCCCAAGCT ACCCAAGCCG AGCGCGGCGG GATTATGTTG ATCGATCATA AAAATGGGCG TTTGGCTTTC AGCACCAACC TTGATCGTAA TATCACCCGT ACCGAGGCGA TTTCCTTGGA GCGTGGCCAA GGCTTGGCAG GCTGGGTTGT TGAGCATCGT GCACCCGTGA TTATTCCCAA TACTGCCGAA GATAGCCGTT GGATGGTGCG CACCGATTAC GACAAAAAAG GTCGTTCAGC GCTGGCCGTG CCGATGGAGC AAGATGGGCG AGTCGCTGGG GTGATTGTGC TGATCAACAG CCGCATCAAT CACTTTACCC AAGAGCATAT TCAATTTGTG CAGGTCATTG GCGATCAAGT GATGACAATG CTCAGCAATG TGCAGCTGTA TCGTGCCACG ACCGAGCAAG CTCGCCGTTT GAGCCAAGCT CTTGAACAAC GTGAAGAAGA AGTTAGCCGT AGTTTGGCAA TTGTACGTTC GATTGGCGAT GGTGTAGTGG TTGGCGATCG GGTTGGTCGG ATTCGCTTGA TTAATCCGGC TGCCGAGCAA TTGCTGAATA TCGAAGCTGC TGAATGGTTG GGCAAGCCCT TGATGAGCCT GCCTGGTGCG CCCGAGAGTG AGCCACGCCT GACCGAAAAG CAAACCTACC AGCAATTTGA GCTAAGCGGG CGCATGATTC GCGCTTCGAG CACGCCAGTC TTTACTTCGC AAAGCGAATG GCTGGGCAGT GTGGTGGTCT ATCACGATAT TACAGCGTCA GAATTGGCTG ATCGCATGAA AACTGAGTTT GTGGCGACGG CCTCGCACGA ATTGCGTACC CCATTGACCT CAATTAGCGG CTACATTGAT TTGCTGATGT TAAACACGCT TGGTCCCTTG ACCGAGCAAC AACGCCAATT TTTGAGCGTG GTCAAGAACA ACATCGAACG TTTGAATGCG ATTCTCAACG ATTTGCTCGA TGTTTCACGC ATCGAATCGG GCAAAGTGCG GTTGCAACGC AAGCCAATTA ACCTTGATGA ACTGATTCAA TCAACAGTGA TGTCAATTCA TCAACAATGG AGTGGCAAGC AAATTTCCTT GGCGCTCGAT GTGCCCGATG ATTTGCCGCC AATGATTGCC GACCCCGAAC GCATGCGCCA GATCGTCACC AATTTGATCT CAAATGCCTA CAAATATACC CGCGACGGCG GCAGAATTGA TGTTGTAGTC AGCAATGGCG GCGATTCGGT GACCTTAGCG GTCAAAGATA GCGGCGTGGG CATCGCTGCT GATGATCAAA AGCATATTTT TACGCGCTTC TTCCGCTCGG AAAACCCGCT CAAGGAGCAG GCTGGTGGCA CGGGCTTGGG CTTGAACATC ACCAAATCGC TGGTTGAGCT GCACGGTGGC AAAATCTGGT TTGATAGCGA AGAAGGTCGC GGCACAACCT TTAATGTCCA ACTGCCGGTC GGCGGCGATT CCGACTGGAC TCCCGCTTCA TGGCTTGAAG GAGTGTAA
|
Protein sequence | MGRNSSFRIW LTRIRPLAWA LACFALVLFG YTFTTTIPTD RLTVIASILG VAALALPWLF NPEGDWRELR TLGILALPLS VAILSEGYNT ALWSVLLIPA IAIPQMLPPR WAFSAIALMV IAWAVSNVLV PAVAVETVAI EVGLRSLGFV LVAIAVWLAA RPRFDYPAML PEAPIRRATR AAERLRGSLS PEETLEELAS AAKACGPFIF ASASTVDWRA RVLRMAVAIG GSGRTLGATE MLSIPWDEIT VLLRDDRRIG DNAYLADSLP FRDIAGEHYM LVPVRTATGE ICGLLTVGDD DPKARKRLTE TAPLLELLAS QAAAVLENAA LQNTLAQRIE ATTAEMGRTA EDAMRARTRA ESMYQIVRAL SGTLEPQPLL DQALLLIAQA TQAERGGIML IDHKNGRLAF STNLDRNITR TEAISLERGQ GLAGWVVEHR APVIIPNTAE DSRWMVRTDY DKKGRSALAV PMEQDGRVAG VIVLINSRIN HFTQEHIQFV QVIGDQVMTM LSNVQLYRAT TEQARRLSQA LEQREEEVSR SLAIVRSIGD GVVVGDRVGR IRLINPAAEQ LLNIEAAEWL GKPLMSLPGA PESEPRLTEK QTYQQFELSG RMIRASSTPV FTSQSEWLGS VVVYHDITAS ELADRMKTEF VATASHELRT PLTSISGYID LLMLNTLGPL TEQQRQFLSV VKNNIERLNA ILNDLLDVSR IESGKVRLQR KPINLDELIQ STVMSIHQQW SGKQISLALD VPDDLPPMIA DPERMRQIVT NLISNAYKYT RDGGRIDVVV SNGGDSVTLA VKDSGVGIAA DDQKHIFTRF FRSENPLKEQ AGGTGLGLNI TKSLVELHGG KIWFDSEEGR GTTFNVQLPV GGDSDWTPAS WLEGV
|
| |