Gene EcSMS35_0982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0982 
SymbolbaeS 
ID6144504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp990667 
End bp992070 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID641615869 
Productsignal transduction histidine-protein kinase BaeS 
Protein accessionYP_001743061 
Protein GI170680059 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.363235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.539382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCT GGCGACCCGG TATTACCGGC AAACTGTTTC TGGCGATTTT CGCCACCTGC 
ATTGTCTTAT TGATCACGAT GCACTGGGCG GTACGTATCA GTTTTGAGCG CGGCTTTATC
GATTACATCA AGCATGGTAA TGAACAACGG CTGCAAATGC TCGGCGATGC GCTTGGTGAG
CAGTACGCCC AGCACGGGAA CTGGCGCTTC CTGCGTAATA ACGATCGCTT TGTATTTCAG
ATCCTACGTT CGCTGGAGCA TGATAACAAC GAAGATAAGC CCGGCCCCGG TATGCCGCCA
CACGGTTGGC GCACGCAATT CTGGGTGGTT GATCAAAACA ACAAAGTGCT GGTTGGCCCG
CGAGCACCGA TTCCACCCGA CGGCACACGG CGGCCCATTA TGGTCAATGG GGCGGAAGTG
GGTGCGGTGA TCGCCTCCCC TGTTGAACGA CTGACCCGCA ATACCGATAT CAATTTTGAC
AGACAACAGA GGCAAACCAG TTGGCTGATT GTCGCTTTAT CTACCTTGTT AGCGGCGCTG
GCGACATTCC CACTGGCGCG CGGTTTGCTG GCTCCGGTCA AACGACTGGT GGACGGTACA
CACAAACTGG CAGCGGGCGA TTTCACTACT CGCGTGACGC CCACCAGTGA AGATGAATTG
GGCAAACTGG CGCAAGACTT CAACCAGCTC GCCAGCACGC TGGAGAAAAA CCAACAGATG
CGCCGCGATT TTATGGCCGA TATCTCCCAC GAGCTGCGCA CGCCTTTAGC GGTACTGCGC
GGCGAACTGG AAGCTATTCA GGATGGCGTG CGTAAATTCA CGCCGGAGAC GGTGGCTTCT
TTACAGGCAG AGGTCGGTAC ACTGACCAAA CTGGTGGATG ATCTTCATCA ATTGTCGATG
TCTGATGAAG GCGCTCTCGC CTACCAGAAA TCGTCGGTGG ATCTGATCCC GCTACTGGAA
GTCGCGGGTG GCGCATTTCG TGAGCGTTTC GCCAGCCGCG GGCTGAAACT GCAATTTTCC
CTGCCAGACA GTATTACCGT ATTTGGCGAT CGCGACCGTT TAATGCAGTT ATTCAATAAC
TTACTGGAAA ACAGCCTGCG CTACACTGAC AGCGGCGGTA GCCTGAAAAT CTCTGCCGAG
CAGCACGACA AAACGGTGCG CCTGACCTTT GCCGACAGCG CGCCGGGCGT CAGTGACGAT
CAGCTACAAA AATTGTTTGA ACGTTTTTAT CGCACCGAAG GCTCCCGCAA CCGAGCCAGC
GGCGGTTCCG GGCTGGGGCT GGCGATTTGC CTGAACATTG TTGAAGCACA TAATGGTCGC
ATTATTGCTG CCCATTCGCC TTTTGGCGGG GTAAGCATTA CAGTAGAGTT ACCGCTGGAA
CGGGATTTAC AGAGAGAAGT ATGA
 
Protein sequence
MKFWRPGITG KLFLAIFATC IVLLITMHWA VRISFERGFI DYIKHGNEQR LQMLGDALGE 
QYAQHGNWRF LRNNDRFVFQ ILRSLEHDNN EDKPGPGMPP HGWRTQFWVV DQNNKVLVGP
RAPIPPDGTR RPIMVNGAEV GAVIASPVER LTRNTDINFD RQQRQTSWLI VALSTLLAAL
ATFPLARGLL APVKRLVDGT HKLAAGDFTT RVTPTSEDEL GKLAQDFNQL ASTLEKNQQM
RRDFMADISH ELRTPLAVLR GELEAIQDGV RKFTPETVAS LQAEVGTLTK LVDDLHQLSM
SDEGALAYQK SSVDLIPLLE VAGGAFRERF ASRGLKLQFS LPDSITVFGD RDRLMQLFNN
LLENSLRYTD SGGSLKISAE QHDKTVRLTF ADSAPGVSDD QLQKLFERFY RTEGSRNRAS
GGSGLGLAIC LNIVEAHNGR IIAAHSPFGG VSITVELPLE RDLQREV