Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3506 |
Symbol | arcB |
ID | 6143115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3577302 |
End bp | 3579638 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641618335 |
Product | aerobic respiration control sensor protein ArcB |
Protein accession | YP_001745482 |
Protein GI | 170683438 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.648174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAA TTCGTCTGCT GGCGCAGTAT TATGTTGACC TGATGATGAA GTTAGGTCTG GTGCGCTTCT CAATGTTGCT GGCGCTGGCC CTCGTCGTTC TTGCCATTGT GGTACAAATG GCGGTAACCA TGGTGCTGCA TGGTCAGGTC GAAAGCATTG ATGTTATTCG TTCTATCTTC TTTGGTTTGC TGATTACGCC GTGGGCGGTC TACTTTCTAT CGGTGGTCGT CGAGCAACTG GAGGAGTCAC GGCAACGTCT GTCACGACTG GTGCAAAAAC TGGAGGAGAT GCGCGAGCGC GATTTGAGCC TCAACGTTCA GTTAAAAGAT AATATTGCCC AGCTAAATCA GGAAATCGCC GTTCGTGAAA AAGCGGAAGC AGAACTGCAG GAAACCTTCG GCCAACTGAA AATTGAAATC AAAGAGCGCG AAGAGACACA AATTCAGCTC GAGCAGCAAT CCTCATTCTT ACGTTCCTTC CTTGATGCTT CACCCGACCT GGTTTTTTAT CGTAACGAAG ATAAAGAGTT TTCCGGCTGT AACCGCGCGA TGGAGCTGCT GACCGGAAAA AGCGAAAAAC AACTGGTTCA CCTGAAACCT GCTGATGTTT ACTCACCGGA AGCCGCCGCA AAAGTCATTG AAACCGATGA AAAAGTGTTC CGTCATAATG TGTCACTGAC CTATGAACAG TGGCTGGATT ACCCGGACGG GCGCAAAGCC TGCTTTGAAA TCCGTAAAGT GCCGTACTAC GACCGCGTGG GTAAACGTCA CGGTTTGATG GGCTTTGGTC GCGACATTAC CGAGCGTAAG CGGTATCAGG ATGCGCTTGA ACGGGCCAGC CGCGACAAAA CGACGTTTAT CTCCACCATC AGTCACGAAT TGCGTACGCC GCTGAATGGT ATCGTCGGCC TGAGCCGCAT TCTGCTGGAT ACCGAACTCA CCGCCGAGCA GGAAAAATAT CTCAAAACCA TCCATGTTTC GGCCGTCACG CTGGGGAATA TCTTCAACGA TATTATCGAC ATGGATAAGA TGGAACGGCG CAAGGTCCAG CTTGATAATC AGCCGGTTGA TTTCACCAGC TTCCTTGCCG ATCTGGAAAA TCTCTCCGCC TTGCAGGCGC AACAAAAAGG ATTGCGCTTT AACCTGGAGC CTACGCTGCC ATTACCGCAT CAGGTCATTA CCGACGGGAC GCGTTTACGG CAGATCCTGT GGAACCTCAT CAGTAACGCC GTCAAATTCA CCCAGCAAGG CCAGGTTACC GTGCGCGTGC GCTACGATGA AGGCGATATG CTGCATTTTG AAGTGGAAGA TTCCGGCATT GGCATTCCGC AGGATGAGCT GGATAAAATT TTCGCCATGT ATTACCAGGT GAAAGACAGT CATGGCGGTA AACCTGCCAC CGGCACCGGT ATTGGTCTGG CCGTTTCTCG TCGTCTGGCG AAAAATATGG GCGGCGATAT TACGGTTACC AGCGAACAGG GCAAAGGTTC AACCTTTACG TTGACGATCC ACGCACCGTC GGTGGCAGAA GAGGTCGATG ATGCGTTTGA TGAAGACGAT ATGCCTTTAC CGGCGCTGAA TGTACTGCTG GTGGAAGACA TTGAACTGAA CGTGATTGTC GCGCGTTCTG TGCTGGAAAA ATTAGGTAAC AGCGTTGATG TCGCCATGAC CGGCAAGGCG GCGCTGGAGA TGTTTAAACC GGGCGAATAC GACCTGGTAT TGCTGGATAT TCAGTTGCCA GATATGACCG GGCTGGATAT CTCTCGTGAA CTGACGAAGC GTTATCCGCG CGAGGATTTA CCACCGCTGG TGGCCTTAAC CGCTAACGTG CTGAAAGACA AACAAGAGTA CCTCAATGCT GGAATGGATG ATGTGCTGAG TAAGCCGCTT TCTGTTCCGG CGCTAACCGC GATGATCAAG AAATTCTGGG ATACCCAGGA TGATGAGGAG AGTACGGTGA CGACAGAAGA GAACAGTAAA TCAGAAGCAT TGCTCGATAT TCCCATGCTG GAACAGTATC TCGAACTTGT AGGACCGAAG CTGATCACCG ACGGGTTAGC GGTATTTGAG AAGATGATGC CGGGATATGT TAGCGTGCTG GAGTCGAATC TGACGGCGCA GGATAAAAAA GGCATTGTTG AGGAAGGACA TAAAATTAAA GGTGCGGCGG GGTCAGTGGG GTTACGCCAT CTGCAACAGT TGGGTCAGCA AATTCAGTCT CCTGACCTTC CCGCCTGGGA AGATAACGTC GGTGAATGGA TTGAAGAGAT GAAAGAAGAG TGGCGTCACG ACGTAGAAGT ACTGAAAGCG TGGGTGGCAA AAGCTACTAA AAAATGA
|
Protein sequence | MKQIRLLAQY YVDLMMKLGL VRFSMLLALA LVVLAIVVQM AVTMVLHGQV ESIDVIRSIF FGLLITPWAV YFLSVVVEQL EESRQRLSRL VQKLEEMRER DLSLNVQLKD NIAQLNQEIA VREKAEAELQ ETFGQLKIEI KEREETQIQL EQQSSFLRSF LDASPDLVFY RNEDKEFSGC NRAMELLTGK SEKQLVHLKP ADVYSPEAAA KVIETDEKVF RHNVSLTYEQ WLDYPDGRKA CFEIRKVPYY DRVGKRHGLM GFGRDITERK RYQDALERAS RDKTTFISTI SHELRTPLNG IVGLSRILLD TELTAEQEKY LKTIHVSAVT LGNIFNDIID MDKMERRKVQ LDNQPVDFTS FLADLENLSA LQAQQKGLRF NLEPTLPLPH QVITDGTRLR QILWNLISNA VKFTQQGQVT VRVRYDEGDM LHFEVEDSGI GIPQDELDKI FAMYYQVKDS HGGKPATGTG IGLAVSRRLA KNMGGDITVT SEQGKGSTFT LTIHAPSVAE EVDDAFDEDD MPLPALNVLL VEDIELNVIV ARSVLEKLGN SVDVAMTGKA ALEMFKPGEY DLVLLDIQLP DMTGLDISRE LTKRYPREDL PPLVALTANV LKDKQEYLNA GMDDVLSKPL SVPALTAMIK KFWDTQDDEE STVTTEENSK SEALLDIPML EQYLELVGPK LITDGLAVFE KMMPGYVSVL ESNLTAQDKK GIVEEGHKIK GAAGSVGLRH LQQLGQQIQS PDLPAWEDNV GEWIEEMKEE WRHDVEVLKA WVAKATKK
|
| |