Gene ECH74115_4046 details


Gene Information

Locus tag: ECH74115_4046
Symbol: barA
ID: 6970811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3741853
End bp: 3744609
Gene Length: 2757 bp
Protein Length: 918 aa
Translation table: 11
GC content: 50%
IMG OID: 643387808
Product: hybrid sensory histidine kinase BarA
Protein accession: YP_002272251
Protein GI: 209398921
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
        [COG4999] Uncharacterized domain of BarA-like signal transduction histidine kinases
TIGRFAM ID:
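
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3741853, 3744609)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the 2757 bp and 918 aa listed in the record.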


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0239535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.782421
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACT ACAGCCTGCG CGCACGCATG ATGATTCTGA TCCTGGCACC GACCGTCCTT 
ATTGGTTTAT TGCTGAGTAT CTTTTTCGTC GTGCATCGCT ATAACGACTT GCAGCGTCAA
CTGGAAGATG CCGGTGCCAG CATTATTGAG CCGCTTGCAG TTTCTACTGA ATATGGCATG
AGCCTGCAAA ATCGCGAATC TATCGGTCAG TTAATAAGCG TACTGCATCG TCGCCATTCC
GATATTGTTC GCGCGATTTC GGTTTATGAT GAAAATAACC GACTCTTTGT CACCTCCAAT
TTTCATCTTG ATCCCTCATC AATGCAGCTC GGCAGCAACG TGCCGTTTCC TCGCCAGCTC
ACTGTCACTC GTGACGGCGA TATTATGATC CTCCGCACGC CGATTATTTC TGAGAGTTAC
TCCCCCGACG AATCGCCCAG TAGCGATGCC AAAAATAGTC AAAATATGTT GGGATATATT
GCGCTTGAGC TGGATCTTAA ATCGGTTCGC TTGCAGCAAT ATAAAGAGAT CTTTATTTCC
AGCGTGATGA TGCTGTTTTG TATCGGTATT GCGCTTATTT TTGGCTGGCG CTTAATGCGC
GATGTAACCG GTCCGATTCG CAACATGGTG AATACTGTCG ACCGCATCCG TCGCGGGCAA
CTCGACAGCC GAGTGGAAGG ATTTATGCTC GGCGAGCTGG ATATGCTGAA AAACGGTATC
AACTCGATGG CAATGTCGCT GGCTGCTTAT CACGAAGAGA TGCAGCACAA TATCGACCAG
GCGACGTCCG ATCTGCGTGA AACGCTGGAG CAGATGGAAA TTCAGAACGT TGAGTTGGAT
CTGGCGAAAA AGCGCGCCCA GGAAGCGGCG CGTATTAAAT CCGAGTTTCT GGCAAATATG
TCACACGAGC TGCGTACACC ACTGAATGGT GTTATTGGTT TTACCCGCCT GACGCTGAAA
ACAGAATTAA CACCAACGCA GCGCGATCAC CTGAATACGA TTGAACGTTC GGCAAATAAT
TTGCTGGCAA TTATTAATGA TGTTCTCGAC TTCTCGAAAC TGGAAGCAGG TAAGCTGATT
CTGGAAAGTA TTCCATTCCC ACTACGCAGC ACGCTGGATG AAGTCGTTAC TCTGCTGGCA
CATTCTTCTC ACGATAAAGG GTTGGAACTG ACGCTCAATA TTAAAAGCGA CGTGCCTGAT
AACGTGATCG GCGACCCACT GCGATTACAG CAAATCATCA CTAACCTGGT GGGGAATGCA
ATTAAATTCA CCGAGAATGG CAACATTGAT ATTCTGGTAG AAAAACGTGC GCTGAGTAAT
ACCAAAGTGC AGATTGAAGT GCAGATTCGG GATACCGGCA TTGGTATTCC TGAACGCGAT
CAATCGCGCT TATTCCAGGC CTTCCGACAG GCTGATGCCA GTATTTCCCG CCGTCATGGT
GGCACCGGGC TGGGGCTGGT GATTACACAA AAACTGGTTA ATGAAATGGG CGGCGATATT
TCGTTCCATA GCCAGCCGAA TCGCGGTTCA ACTTTCTGGT TCCACATTAA TCTCGATCTG
AACCCGAACA TTATTATTGA AGGGCCATCC ACCCAGTGCC TCGCCGGTAA ACGCCTGGCC
TATGTCGAAC CAAACTCCGC AGCAGCGCAA TGCACGCTGG ATATTTTAAG TGAAACGCCG
CTGGAAGTGG TTTATAGCCC AACGTTCTCC GCGCTGCCTC CCGCGCATTA CGACATGATG
TTGTTAGGCA TCGCGGTGAC CTTCCGCGAG CCGCTAACAA TGCAACATGA GCGATTAGCG
AAAGCGGTAT CGATGACCGA TTTCCTGATG CTGGCACTTC CTTGCCATGC ACAAGTCAAT
GCTGAAAAAC TCAAGCAAGA TGGTATCGGC GCGTGTCTGC TGAAACCATT AACACCTACG
CGCCTGTTGC CTGCCCTGAC GGAATTTTGT CATCACAAAC AAAACACGCT TTTGCCTGTA
ACCGATGAAA GTAAGCTGGC AATGACAGTC ATGGCGGTTG ATGACAACCC CGCTAACCTG
AAACTTATCG GCGCATTGCT GGAAGATATG GTGCAACATG TGGAACTTTG CGATAGCGGG
CATCAGGCGG TTGAACGGGC GAAACAGATG CCGTTCGATT TGATCTTAAT GGATATTCAA
ATGCCTGACA TGGATGGCAT TCGGGCCTGC GAGCTCATCC ACCAGCTCCC CCATCAGCAA
CAAACGCCGG TTATCGCGGT AACGGCGCAT GCAATGGCCG GGCAAAAAGA GAAGCTGCTT
GGCGCAGGGA TGAGCGATTA TCTGGCGAAA CCGATTGAAG AAGAGCGATT GCATAATTTG
TTGTTGCGCT ACAAGCCTGG CAGCGGTATT TCGTCTCGCG TCGTGACGCC CGAAGTCAAC
GAAATTGTGG TGAACCCGAA TGCGACCCTC GACTGGCAAC TGGCACTACG CCAGGCAGCA
GGAAAAACCG ATTTAGCGCG CGATATGCTG CAAATGTTAC TCGATTTCCT GCCTGAAGTT
CGCAACAAAG TTGAGGAACA GCTGGTTGGA GAAAACCCGG AAGGCCTGGT TGATTTGATT
CATAAACTGC ATGGCAGTTG CGGCTATAGC GGTGTGCCGC GTATGAAGAA TCTCTGCCAA
CTTATCGAAC AACAGCTACG TAGCGGTACT AAAGAAGAAG ATTTGGAACC GGAGCTGCTG
GAACTGTTGG ACGAGATGGA TAATGTCGCG CGCGAAGCCA GCAAAATTCT CGGGTAA
 
Protein sequence
MTNYSLRARM MILILAPTVL IGLLLSIFFV VHRYNDLQRQ LEDAGASIIE PLAVSTEYGM 
SLQNRESIGQ LISVLHRRHS DIVRAISVYD ENNRLFVTSN FHLDPSSMQL GSNVPFPRQL
TVTRDGDIMI LRTPIISESY SPDESPSSDA KNSQNMLGYI ALELDLKSVR LQQYKEIFIS
SVMMLFCIGI ALIFGWRLMR DVTGPIRNMV NTVDRIRRGQ LDSRVEGFML GELDMLKNGI
NSMAMSLAAY HEEMQHNIDQ ATSDLRETLE QMEIQNVELD LAKKRAQEAA RIKSEFLANM
SHELRTPLNG VIGFTRLTLK TELTPTQRDH LNTIERSANN LLAIINDVLD FSKLEAGKLI
LESIPFPLRS TLDEVVTLLA HSSHDKGLEL TLNIKSDVPD NVIGDPLRLQ QIITNLVGNA
IKFTENGNID ILVEKRALSN TKVQIEVQIR DTGIGIPERD QSRLFQAFRQ ADASISRRHG
GTGLGLVITQ KLVNEMGGDI SFHSQPNRGS TFWFHINLDL NPNIIIEGPS TQCLAGKRLA
YVEPNSAAAQ CTLDILSETP LEVVYSPTFS ALPPAHYDMM LLGIAVTFRE PLTMQHERLA
KAVSMTDFLM LALPCHAQVN AEKLKQDGIG ACLLKPLTPT RLLPALTEFC HHKQNTLLPV
TDESKLAMTV MAVDDNPANL KLIGALLEDM VQHVELCDSG HQAVERAKQM PFDLILMDIQ
MPDMDGIRAC ELIHQLPHQQ QTPVIAVTAH AMAGQKEKLL GAGMSDYLAK PIEEERLHNL
LLRYKPGSGI SSRVVTPEVN EIVVNPNATL DWQLALRQAA GKTDLARDML QMLLDFLPEV
RNKVEEQLVG ENPEGLVDLI HKLHGSCGYS GVPRMKNLCQ LIEQQLRSGT KEEDLEPELL
ELLDEMDNVA REASKILG
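
The sequences above are printed in blocks of 10 characters, so they need to be ungrouped before use. The sketch below shows one way to do that and to translate the gene sequence into the protein sequence. It is an illustration, not the database's own tooling: the helper names are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `gc_percent` helper reports an unrounded value, and how the record rounds its GC content field is not stated.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard table; table 11
# uses the same assignments for elongation codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def ungroup(block: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the ungrouped sequence."""
    dna = ungroup(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGACCAACT ACAGCCTGCG CGCACGCATG ATGATTCTGA TCCTGGCACC GACCGTCCTT"
print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields the first 20 residues of the protein sequence (MTNYSLRARM MILILAPTVL). Applied to the last line, it yields the final 18 residues and stops at the TAA codon.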