Gene EcHS_A3403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3403 
SymbolarcB 
ID5593772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3401616 
End bp3403952 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content51% 
IMG OID640922524 
Productaerobic respiration control sensor protein ArcB 
Protein accessionYP_001460012 
Protein GI157162694 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAA TTCGTCTGCT GGCGCAGTAT TATGTTGACC TGATGATGAA GTTAGGTCTG 
GTGCGCTTCT CAATGTTGCT GGCGCTGGCC CTCGTCGTTC TTGCCATTGT GGTACAAATG
GCGGTAACCA TGGTGCTGCA TGGTCAGGTC GAAAGCATTG ATGTTATTCG TTCTATCTTC
TTTGGTTTGC TGATTACGCC GTGGGCGGTC TACTTTCTAT CGGTGGTCGT CGAGCAACTG
GAGGAGTCAC GACAACGTCT GTCACGGCTG GTGCAAAAAC TGGAGGAGAT GCGCGAGCGC
GATTTGAGCC TCAACGTTCA GTTAAAAGAT AATATTGCCC AGCTAAATCA GGAAATTGCC
GTTCGTGAAA AAGCGGAAGC AGAACTGCAG GAAACCTTCG GCCAACTGAA AATTGAAATC
AAAGAGCGCG AAGAGACACA AATTCAGCTC GAGCAGCAAT CCTCATTCTT ACGTTCCTTC
CTTGATGCTT CACCCGACCT GGTTTTTTAT CGTAACGAAG ATAAAGAGTT TTCCGGCTGT
AACCGCGCGA TGGAGCTGCT GACCGGAAAA AGCGAAAAAC AACTGGTTCA CCTGAAACCT
GCTGATGTTT ACTCACCGGA AGCCGCCGCA AAAGTCATTG AAACCGATGA AAAAGTGTTC
CGTCATAATG TGTCACTGAC CTATGAACAG TGGCTGGATT ACCCGGACGG GCGCAAAGCC
TGCTTTGAAA TCCGTAAAGT GCCGTACTAC GACCGCGTGG GTAAACGTCA CGGTTTGATG
GGCTTTGGTC GCGACATTAC CGAGCGTAAG CGGTATCAGG ATGCGCTTGA ACGGGCCAGC
CGCGACAAAA CGACGTTTAT CTCCACCATC AGTCACGAAT TGCGTACACC GCTGAACGGT
ATCGTCGGTC TGAGCCGCAT TCTGCTGGAT ACCGAACTCA CCGCCGAGCA GGAAAAATAT
CTCAAGACCA TCCATGTTTC GGCCGTCACG CTGGGGAATA TCTTTAACGA TATTATCGAC
ATGGATAAGA TGGAACGGCG CAAGGTCCAG CTTGATAATC AACCGGTTGA TTTCACCAGC
TTCCTTGCCG ATCTGGAAAA TCTCTCCGCA TTGCAGGCGC AACAAAAAGG ATTGCGCTTT
AACCTGGAGC CGACGCTGCC ATTACCGCAT CAGGTCATTA CCGACGGGAC GCGTTTACGG
CAGATCCTGT GGAACCTCAT CAGTAACGCC GTCAAATTCA CCCAGCAAGG CCAGGTTACC
GTGCGCGTGC GCTACGATGA AGGCGATATG CTGCATTTTG AAGTGGAAGA CTCTGGTATC
GGCATTCCGC AGGATGAGCT GGATAAAATT TTCGCCATGT ATTACCAGGT GAAAGACAGT
CATGGCGGTA AACCTGCCAC CGGCACCGGT ATTGGTCTGG CCGTTTCTCG TCGTCTGGCG
AAAAATATGG GCGGCGATAT TAAGGTTACC AGCGAACAGG GCAAAGGTTC AACCTTTACG
TTGACGATCC ACGCACCGTC GGTAGCAGAA GAGGTCGATG ATGCGTTTGA TGAAGACGAT
ATGCCTTTAC CGGCGCTGAA TGTGCTGCTG GTGGAAGACA TTGAACTGAA CGTGATTGTT
GCGCGTTCTG TGCTGGAAAA ATTAGGTAAC AGCGTTGATG TCGCCATGAC CGGCAAGGCG
GCGCTGGAGA TGTTTAAACC GGGCGAATAC GACCTGGTGT TGCTGGATAT TCAGTTGCCA
GATATGACCG GGCTGGATAT CTCTCGTGAA CTGACGAAAC GTTATCCGCG CGAGGATTTA
CCGCCGCTGG TGGCCTTAAC CGCTAACGTG CTGAAAGACA AACAAGAGTA CCTCAATGCT
GGAATGGATG ATGTGCTGAG TAAGCCGCTT TCTGTTCCGG CGCTAACCGC GATGATCAAG
AAATTCTGGG ATACCCAGGA TGATGAGGAG AGTACGGTGA CGACAGAAGA GAACAGTAAA
TCAGAAGCAT TGCTCGATAT TCCCATGCTG GAACAGTATC TCGAACTTGT AGGACCGAAG
CTGATCACCG ACGGGTTAGC GGTGTTTGAG AAGATGATGC CGGGCTATGT CAGCGTGCTG
GAGTCGAATC TGACGGCGCA GGATAAAAAA GGCATTGTTG AGGAAGGACA TAAAATTAAA
GGTGCGGCGG GGTCAGTGGG GTTACGCCAT CTGCAACAGC TGGGTCAGCA AATTCAGTCT
CCTGACCTTC CGGCCTGGGA AGATAACGTC GGTGAATGGA TTGAAGAGAT GAAAGAAGAG
TGGCGTCACG ACGTAGAAGT GCTGAAAGCG TGGGTGGCAA AAGCCACTAA AAAATGA
 
Protein sequence
MKQIRLLAQY YVDLMMKLGL VRFSMLLALA LVVLAIVVQM AVTMVLHGQV ESIDVIRSIF 
FGLLITPWAV YFLSVVVEQL EESRQRLSRL VQKLEEMRER DLSLNVQLKD NIAQLNQEIA
VREKAEAELQ ETFGQLKIEI KEREETQIQL EQQSSFLRSF LDASPDLVFY RNEDKEFSGC
NRAMELLTGK SEKQLVHLKP ADVYSPEAAA KVIETDEKVF RHNVSLTYEQ WLDYPDGRKA
CFEIRKVPYY DRVGKRHGLM GFGRDITERK RYQDALERAS RDKTTFISTI SHELRTPLNG
IVGLSRILLD TELTAEQEKY LKTIHVSAVT LGNIFNDIID MDKMERRKVQ LDNQPVDFTS
FLADLENLSA LQAQQKGLRF NLEPTLPLPH QVITDGTRLR QILWNLISNA VKFTQQGQVT
VRVRYDEGDM LHFEVEDSGI GIPQDELDKI FAMYYQVKDS HGGKPATGTG IGLAVSRRLA
KNMGGDIKVT SEQGKGSTFT LTIHAPSVAE EVDDAFDEDD MPLPALNVLL VEDIELNVIV
ARSVLEKLGN SVDVAMTGKA ALEMFKPGEY DLVLLDIQLP DMTGLDISRE LTKRYPREDL
PPLVALTANV LKDKQEYLNA GMDDVLSKPL SVPALTAMIK KFWDTQDDEE STVTTEENSK
SEALLDIPML EQYLELVGPK LITDGLAVFE KMMPGYVSVL ESNLTAQDKK GIVEEGHKIK
GAAGSVGLRH LQQLGQQIQS PDLPAWEDNV GEWIEEMKEE WRHDVEVLKA WVAKATKK