Gene Information |
Locus tag | EcHS_A3403 |
Symbol | arcB |
ID | 5593772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3401616 |
End bp | 3403952 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640922524 |
Product | aerobic respiration control sensor protein ArcB |
Protein accession | YP_001460012 |
Protein GI | 157162694 |
COG category | [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
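The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length counts the stop codon when has_stop is True.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of this record.
    length = gene_length(3401616, 3403952)
    print(length, protein_length(length))
```

Under these assumptions the coordinates give 2337 bp and 778 aa, which agrees with the Gene Length and Protein Length fields.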
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAA TTCGTCTGCT GGCGCAGTAT TATGTTGACC TGATGATGAA GTTAGGTCTG GTGCGCTTCT CAATGTTGCT GGCGCTGGCC CTCGTCGTTC TTGCCATTGT GGTACAAATG GCGGTAACCA TGGTGCTGCA TGGTCAGGTC GAAAGCATTG ATGTTATTCG TTCTATCTTC TTTGGTTTGC TGATTACGCC GTGGGCGGTC TACTTTCTAT CGGTGGTCGT CGAGCAACTG GAGGAGTCAC GACAACGTCT GTCACGGCTG GTGCAAAAAC TGGAGGAGAT GCGCGAGCGC GATTTGAGCC TCAACGTTCA GTTAAAAGAT AATATTGCCC AGCTAAATCA GGAAATTGCC GTTCGTGAAA AAGCGGAAGC AGAACTGCAG GAAACCTTCG GCCAACTGAA AATTGAAATC AAAGAGCGCG AAGAGACACA AATTCAGCTC GAGCAGCAAT CCTCATTCTT ACGTTCCTTC CTTGATGCTT CACCCGACCT GGTTTTTTAT CGTAACGAAG ATAAAGAGTT TTCCGGCTGT AACCGCGCGA TGGAGCTGCT GACCGGAAAA AGCGAAAAAC AACTGGTTCA CCTGAAACCT GCTGATGTTT ACTCACCGGA AGCCGCCGCA AAAGTCATTG AAACCGATGA AAAAGTGTTC CGTCATAATG TGTCACTGAC CTATGAACAG TGGCTGGATT ACCCGGACGG GCGCAAAGCC TGCTTTGAAA TCCGTAAAGT GCCGTACTAC GACCGCGTGG GTAAACGTCA CGGTTTGATG GGCTTTGGTC GCGACATTAC CGAGCGTAAG CGGTATCAGG ATGCGCTTGA ACGGGCCAGC CGCGACAAAA CGACGTTTAT CTCCACCATC AGTCACGAAT TGCGTACACC GCTGAACGGT ATCGTCGGTC TGAGCCGCAT TCTGCTGGAT ACCGAACTCA CCGCCGAGCA GGAAAAATAT CTCAAGACCA TCCATGTTTC GGCCGTCACG CTGGGGAATA TCTTTAACGA TATTATCGAC ATGGATAAGA TGGAACGGCG CAAGGTCCAG CTTGATAATC AACCGGTTGA TTTCACCAGC TTCCTTGCCG ATCTGGAAAA TCTCTCCGCA TTGCAGGCGC AACAAAAAGG ATTGCGCTTT AACCTGGAGC CGACGCTGCC ATTACCGCAT CAGGTCATTA CCGACGGGAC GCGTTTACGG CAGATCCTGT GGAACCTCAT CAGTAACGCC GTCAAATTCA CCCAGCAAGG CCAGGTTACC GTGCGCGTGC GCTACGATGA AGGCGATATG CTGCATTTTG AAGTGGAAGA CTCTGGTATC GGCATTCCGC AGGATGAGCT GGATAAAATT TTCGCCATGT ATTACCAGGT GAAAGACAGT CATGGCGGTA AACCTGCCAC CGGCACCGGT ATTGGTCTGG CCGTTTCTCG TCGTCTGGCG AAAAATATGG GCGGCGATAT TAAGGTTACC AGCGAACAGG GCAAAGGTTC AACCTTTACG TTGACGATCC ACGCACCGTC GGTAGCAGAA GAGGTCGATG ATGCGTTTGA TGAAGACGAT ATGCCTTTAC CGGCGCTGAA TGTGCTGCTG GTGGAAGACA TTGAACTGAA CGTGATTGTT GCGCGTTCTG TGCTGGAAAA ATTAGGTAAC AGCGTTGATG TCGCCATGAC CGGCAAGGCG GCGCTGGAGA TGTTTAAACC GGGCGAATAC GACCTGGTGT TGCTGGATAT TCAGTTGCCA GATATGACCG GGCTGGATAT CTCTCGTGAA CTGACGAAAC GTTATCCGCG CGAGGATTTA 
CCGCCGCTGG TGGCCTTAAC CGCTAACGTG CTGAAAGACA AACAAGAGTA CCTCAATGCT GGAATGGATG ATGTGCTGAG TAAGCCGCTT TCTGTTCCGG CGCTAACCGC GATGATCAAG AAATTCTGGG ATACCCAGGA TGATGAGGAG AGTACGGTGA CGACAGAAGA GAACAGTAAA TCAGAAGCAT TGCTCGATAT TCCCATGCTG GAACAGTATC TCGAACTTGT AGGACCGAAG CTGATCACCG ACGGGTTAGC GGTGTTTGAG AAGATGATGC CGGGCTATGT CAGCGTGCTG GAGTCGAATC TGACGGCGCA GGATAAAAAA GGCATTGTTG AGGAAGGACA TAAAATTAAA GGTGCGGCGG GGTCAGTGGG GTTACGCCAT CTGCAACAGC TGGGTCAGCA AATTCAGTCT CCTGACCTTC CGGCCTGGGA AGATAACGTC GGTGAATGGA TTGAAGAGAT GAAAGAAGAG TGGCGTCACG ACGTAGAAGT GCTGAAAGCG TGGGTGGCAA AAGCCACTAA AAAATGA
|
Protein sequence | MKQIRLLAQY YVDLMMKLGL VRFSMLLALA LVVLAIVVQM AVTMVLHGQV ESIDVIRSIF FGLLITPWAV YFLSVVVEQL EESRQRLSRL VQKLEEMRER DLSLNVQLKD NIAQLNQEIA VREKAEAELQ ETFGQLKIEI KEREETQIQL EQQSSFLRSF LDASPDLVFY RNEDKEFSGC NRAMELLTGK SEKQLVHLKP ADVYSPEAAA KVIETDEKVF RHNVSLTYEQ WLDYPDGRKA CFEIRKVPYY DRVGKRHGLM GFGRDITERK RYQDALERAS RDKTTFISTI SHELRTPLNG IVGLSRILLD TELTAEQEKY LKTIHVSAVT LGNIFNDIID MDKMERRKVQ LDNQPVDFTS FLADLENLSA LQAQQKGLRF NLEPTLPLPH QVITDGTRLR QILWNLISNA VKFTQQGQVT VRVRYDEGDM LHFEVEDSGI GIPQDELDKI FAMYYQVKDS HGGKPATGTG IGLAVSRRLA KNMGGDIKVT SEQGKGSTFT LTIHAPSVAE EVDDAFDEDD MPLPALNVLL VEDIELNVIV ARSVLEKLGN SVDVAMTGKA ALEMFKPGEY DLVLLDIQLP DMTGLDISRE LTKRYPREDL PPLVALTANV LKDKQEYLNA GMDDVLSKPL SVPALTAMIK KFWDTQDDEE STVTTEENSK SEALLDIPML EQYLELVGPK LITDGLAVFE KMMPGYVSVL ESNLTAQDKK GIVEEGHKIK GAAGSVGLRH LQQLGQQIQS PDLPAWEDNV GEWIEEMKEE WRHDVEVLKA WVAKATKK
|
| |
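The Gene sequence and Protein sequence fields can be compared with a short script. This is a rough sketch rather than the method the database uses: it assumes the displayed gene sequence is already in coding orientation (it begins with ATG even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence up to the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the Gene sequence field above.
    fragment = "ATGAAGCAAA TTCGTCTGCT GGCGCAGTAT"
    print(translate(fragment))
```

To check the whole record, pass the full text of the Gene sequence field to `translate` and `gc_content`, then compare the results with the Protein sequence field (after removing its spaces) and the GC content field.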