Gene ECH74115_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3018 
SymbolbaeS 
ID6971062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2803144 
End bp2804547 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID643386853 
Productsignal transduction histidine-protein kinase BaeS 
Protein accessionYP_002271321 
Protein GI209397988 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.0391914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCT GGCGACCTGG TATTACCGGC AAACTGTTTC TGGCGATTTT CGCCACCTGC 
ATTGTCTTGC TGATCAGTAT GCACTGGGCG GTGCGTATCA GTTTTGAGCG TGGCTTTATT
GATTACATCA AGCATGGTAA TGAACAGCGA TTACAACTGT TAAGTGATGC GCTTGGCGAG
CAGTATGCGC AGCATGGCAA CTGGCGCTTC CTGCGCAACA ATGATCGCTT TGTCTTTCAG
ATCCTGCGTT CATTTGAACA CGATAATTCG GAAGATAAAC CCGGCCCGGG TATGCCACCA
CACGGCTGGC GTACTCAGTT CTGGGTGGTT GATCAAAACA ACAAGGTGCT GGTTGGTCCG
CGAGCGCCGA TTCCACCTGA CGGTACACGG CGACCCATTC TGGTCAACGG TGCGGAAGTG
GGCGCGGTGA TTGCCTCCCC TGTTGAGCGG CTGACCCGCA ATACTGATAT CAATTTCGAT
AAACAACAGC GGCAAACCAG CTGGCTGATT GTCGCCCTGG CAACGTTACT CGCGGCACTC
GCTACTTTTC TGCTGGCGCG CGGTTTGCTG GCACCGGTAA AACGACTTGT CGATGGCACG
CACAAACTGG CGGCGGGCGA TTTCACTACC CGCGTAACGC CCACCAGTGA AGATGAACTG
GGCAAACTGG CGCAAGACTT CAACCAGCTC GCCAGCACGC TGGAGAAAAA CCAGCAAATG
CGGCGCGATT TTATGGCCGA TATTTCTCAC GAACTGCGTA CGCCATTAGC GGTGCTGCGC
GGTGAACTGG AAGCCATTCA GGATGGCGTG CGCAAATTCA CGCCGGAGAC GGTGGCGTCT
TTACAGGCGG AGGTCGGTAC ACTGACCAAA CTGGTTGACG ATCTTCATCA GTTGTCGATG
TCTGATGAAG GCGCTCTCGC CTATCAAAAA GCACCGGTAG ATTTGATCCC ACTGCTGGAA
GTGGCAGGCG GCGCATTTCG CGAACGATTT GCCAGCCGTG GCCTGAAACT GCAATTTTCC
CTGCCAGACA GTATCACCGT ATTTGGCGAT CGCGACCGTT TAATGCAGTT ATTCAATAAC
TTACTGGAAA ACAGCCTGCG CTACACTGAC AGCGGCGGCA GCCTGAAAAT CTCTGCCGAG
CAGCACGACA AAACGGTGCG CCTGACCTTT GCCGACAGTG CGCCAGGTGT CAGTGACGAT
CAGCTACAAA AATTGTTTGA ACGTTTTTAT CGCACCGAAG GTTCCCGCAA CCGCGCCAGC
GGCGGTTCCG GGCTGGGGCT GGCGATTTGC CTGAACATTG TTGAAGCACA TAATGGCCGC
ATTATTGCCG CCCATTCGCC TTTTGGCGGG GTAAGCATTA CAGTAGAGTT ACCGCTGGAA
CGGGATTTAC AGAGAGAAGT ATGA
 
Protein sequence
MKFWRPGITG KLFLAIFATC IVLLISMHWA VRISFERGFI DYIKHGNEQR LQLLSDALGE 
QYAQHGNWRF LRNNDRFVFQ ILRSFEHDNS EDKPGPGMPP HGWRTQFWVV DQNNKVLVGP
RAPIPPDGTR RPILVNGAEV GAVIASPVER LTRNTDINFD KQQRQTSWLI VALATLLAAL
ATFLLARGLL APVKRLVDGT HKLAAGDFTT RVTPTSEDEL GKLAQDFNQL ASTLEKNQQM
RRDFMADISH ELRTPLAVLR GELEAIQDGV RKFTPETVAS LQAEVGTLTK LVDDLHQLSM
SDEGALAYQK APVDLIPLLE VAGGAFRERF ASRGLKLQFS LPDSITVFGD RDRLMQLFNN
LLENSLRYTD SGGSLKISAE QHDKTVRLTF ADSAPGVSDD QLQKLFERFY RTEGSRNRAS
GGSGLGLAIC LNIVEAHNGR IIAAHSPFGG VSITVELPLE RDLQREV