Gene EcSMS35_3506 details


Gene Information

Locus tag: EcSMS35_3506
Symbol: arcB
ID: 6143115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3577302
End bp: 3579638
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 50%
IMG OID: 641618335
Product: aerobic respiration control sensor protein ArcB
Protein accession: YP_001745482
Protein GI: 170683438
COG category: [K] Transcription
              [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
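
The coordinate and length fields above can be cross-checked against each other with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3577302
end_bp = 3579638
gene_length_bp = 2337
protein_length_aa = 778

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: the span is 2337 bp, which is 779 codons, giving 778 residues once the stop codon is excluded.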


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.648174
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAAA TTCGTCTGCT GGCGCAGTAT TATGTTGACC TGATGATGAA GTTAGGTCTG 
GTGCGCTTCT CAATGTTGCT GGCGCTGGCC CTCGTCGTTC TTGCCATTGT GGTACAAATG
GCGGTAACCA TGGTGCTGCA TGGTCAGGTC GAAAGCATTG ATGTTATTCG TTCTATCTTC
TTTGGTTTGC TGATTACGCC GTGGGCGGTC TACTTTCTAT CGGTGGTCGT CGAGCAACTG
GAGGAGTCAC GGCAACGTCT GTCACGACTG GTGCAAAAAC TGGAGGAGAT GCGCGAGCGC
GATTTGAGCC TCAACGTTCA GTTAAAAGAT AATATTGCCC AGCTAAATCA GGAAATCGCC
GTTCGTGAAA AAGCGGAAGC AGAACTGCAG GAAACCTTCG GCCAACTGAA AATTGAAATC
AAAGAGCGCG AAGAGACACA AATTCAGCTC GAGCAGCAAT CCTCATTCTT ACGTTCCTTC
CTTGATGCTT CACCCGACCT GGTTTTTTAT CGTAACGAAG ATAAAGAGTT TTCCGGCTGT
AACCGCGCGA TGGAGCTGCT GACCGGAAAA AGCGAAAAAC AACTGGTTCA CCTGAAACCT
GCTGATGTTT ACTCACCGGA AGCCGCCGCA AAAGTCATTG AAACCGATGA AAAAGTGTTC
CGTCATAATG TGTCACTGAC CTATGAACAG TGGCTGGATT ACCCGGACGG GCGCAAAGCC
TGCTTTGAAA TCCGTAAAGT GCCGTACTAC GACCGCGTGG GTAAACGTCA CGGTTTGATG
GGCTTTGGTC GCGACATTAC CGAGCGTAAG CGGTATCAGG ATGCGCTTGA ACGGGCCAGC
CGCGACAAAA CGACGTTTAT CTCCACCATC AGTCACGAAT TGCGTACGCC GCTGAATGGT
ATCGTCGGCC TGAGCCGCAT TCTGCTGGAT ACCGAACTCA CCGCCGAGCA GGAAAAATAT
CTCAAAACCA TCCATGTTTC GGCCGTCACG CTGGGGAATA TCTTCAACGA TATTATCGAC
ATGGATAAGA TGGAACGGCG CAAGGTCCAG CTTGATAATC AGCCGGTTGA TTTCACCAGC
TTCCTTGCCG ATCTGGAAAA TCTCTCCGCC TTGCAGGCGC AACAAAAAGG ATTGCGCTTT
AACCTGGAGC CTACGCTGCC ATTACCGCAT CAGGTCATTA CCGACGGGAC GCGTTTACGG
CAGATCCTGT GGAACCTCAT CAGTAACGCC GTCAAATTCA CCCAGCAAGG CCAGGTTACC
GTGCGCGTGC GCTACGATGA AGGCGATATG CTGCATTTTG AAGTGGAAGA TTCCGGCATT
GGCATTCCGC AGGATGAGCT GGATAAAATT TTCGCCATGT ATTACCAGGT GAAAGACAGT
CATGGCGGTA AACCTGCCAC CGGCACCGGT ATTGGTCTGG CCGTTTCTCG TCGTCTGGCG
AAAAATATGG GCGGCGATAT TACGGTTACC AGCGAACAGG GCAAAGGTTC AACCTTTACG
TTGACGATCC ACGCACCGTC GGTGGCAGAA GAGGTCGATG ATGCGTTTGA TGAAGACGAT
ATGCCTTTAC CGGCGCTGAA TGTACTGCTG GTGGAAGACA TTGAACTGAA CGTGATTGTC
GCGCGTTCTG TGCTGGAAAA ATTAGGTAAC AGCGTTGATG TCGCCATGAC CGGCAAGGCG
GCGCTGGAGA TGTTTAAACC GGGCGAATAC GACCTGGTAT TGCTGGATAT TCAGTTGCCA
GATATGACCG GGCTGGATAT CTCTCGTGAA CTGACGAAGC GTTATCCGCG CGAGGATTTA
CCACCGCTGG TGGCCTTAAC CGCTAACGTG CTGAAAGACA AACAAGAGTA CCTCAATGCT
GGAATGGATG ATGTGCTGAG TAAGCCGCTT TCTGTTCCGG CGCTAACCGC GATGATCAAG
AAATTCTGGG ATACCCAGGA TGATGAGGAG AGTACGGTGA CGACAGAAGA GAACAGTAAA
TCAGAAGCAT TGCTCGATAT TCCCATGCTG GAACAGTATC TCGAACTTGT AGGACCGAAG
CTGATCACCG ACGGGTTAGC GGTATTTGAG AAGATGATGC CGGGATATGT TAGCGTGCTG
GAGTCGAATC TGACGGCGCA GGATAAAAAA GGCATTGTTG AGGAAGGACA TAAAATTAAA
GGTGCGGCGG GGTCAGTGGG GTTACGCCAT CTGCAACAGT TGGGTCAGCA AATTCAGTCT
CCTGACCTTC CCGCCTGGGA AGATAACGTC GGTGAATGGA TTGAAGAGAT GAAAGAAGAG
TGGCGTCACG ACGTAGAAGT ACTGAAAGCG TGGGTGGCAA AAGCTACTAA AAAATGA
 
Protein sequence
MKQIRLLAQY YVDLMMKLGL VRFSMLLALA LVVLAIVVQM AVTMVLHGQV ESIDVIRSIF 
FGLLITPWAV YFLSVVVEQL EESRQRLSRL VQKLEEMRER DLSLNVQLKD NIAQLNQEIA
VREKAEAELQ ETFGQLKIEI KEREETQIQL EQQSSFLRSF LDASPDLVFY RNEDKEFSGC
NRAMELLTGK SEKQLVHLKP ADVYSPEAAA KVIETDEKVF RHNVSLTYEQ WLDYPDGRKA
CFEIRKVPYY DRVGKRHGLM GFGRDITERK RYQDALERAS RDKTTFISTI SHELRTPLNG
IVGLSRILLD TELTAEQEKY LKTIHVSAVT LGNIFNDIID MDKMERRKVQ LDNQPVDFTS
FLADLENLSA LQAQQKGLRF NLEPTLPLPH QVITDGTRLR QILWNLISNA VKFTQQGQVT
VRVRYDEGDM LHFEVEDSGI GIPQDELDKI FAMYYQVKDS HGGKPATGTG IGLAVSRRLA
KNMGGDITVT SEQGKGSTFT LTIHAPSVAE EVDDAFDEDD MPLPALNVLL VEDIELNVIV
ARSVLEKLGN SVDVAMTGKA ALEMFKPGEY DLVLLDIQLP DMTGLDISRE LTKRYPREDL
PPLVALTANV LKDKQEYLNA GMDDVLSKPL SVPALTAMIK KFWDTQDDEE STVTTEENSK
SEALLDIPML EQYLELVGPK LITDGLAVFE KMMPGYVSVL ESNLTAQDKK GIVEEGHKIK
GAAGSVGLRH LQQLGQQIQS PDLPAWEDNV GEWIEEMKEE WRHDVEVLKA WVAKATKK
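
The relationship between the two sequences above can be illustrated with a short translation sketch. It is a minimal, self-contained example rather than the method the database used: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is written out by hand in the usual TCAG order. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation; it differs in the set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, copied from the record.
first_block = "ATGAAGCAAA TTCGTCTGCT GGCGCAGTAT TATGTTGACC TGATGATGAA GTTAGGTCTG"
print(translate(first_block))
```

Applied to the first 60 bases, this yields MKQIRLLAQYYVDLMMKLGL, which matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a figure that can be compared with the GC content field.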