Gene EcHS_A2220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2220 
SymbolbaeS 
ID5592989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2207117 
End bp2208520 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content54% 
IMG OID640921350 
Productsignal transduction histidine-protein kinase BaeS 
Protein accessionYP_001458887 
Protein GI157161569 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTCT GGCGACCCGG TATTACCGGC AAACTGTTTC TGGCGATTTT CGCCACCTGC 
ATTGTCTTGC TGATCAGTAT GCACTGGGCG GTGCGTATCA GTTTTGAGCG TGGCTTTATT
GATTACATCA AGCATGGTAA TGAACAGCGA TTACAACTGT TAAGTGATGC GCTTGGCGAG
CAGTATGCGC AGCATGGCAA CTGGCGCTTC CTGCGCAACA ATGATCGCTT TGTCTTTCAG
ATCCTGCGTT CATTTGAACA CGATAATTCG GAAGATAAAC CCGGCCCGGG TATGCCACCG
CACGGCTGGC GTACCCAGTT CTGGGTGGTT GATCAAAACA ACAAAGTGCT GGTTGGTCCG
CGAGCGCCGA TTCCACCTGA CGGTACACGG CGACCCATTC TGGTCAACGG TGCGGAAGTT
GGCGCGGTGA TCGCCTCCCC CGTTGAGCGG TTAACGCGCA ATACTGATAT CAATTTCGAT
AAACAACAGC GGCAAACCAG CTGGTTGATT GTCGCCCTGG CAACGTTACT CGCGGCACTT
GCCACTTTTC TGCTGGCGCG CGGTTTACTG GCACCGGTAA AACGACTTGT CGATGGCACG
CACAAACTGG CGGCGGGCGA TTTCACTACC CGCGTAACGC CCACCAGTGA AGATGAACTG
GGCAAACTGG CGCAAGACTT CAACCAGCTT GCCAGCACAC TGGAGAAAAA CCAGCAAATG
CGGCGCGATT TTATGGCCGA TATTTCTCAC GAACTGCGTA CGCCATTAGC GGTGCTGCGC
GGTGAACTGG AAGCCATTCA GGATGGCGTG CGTAAATTCA CGCCGGAGAC GGTGGCGTCT
TTACAGGCGG AGGTCGGTAC ACTGACCAAA CTGGTTGACG ATCTCCATCA GTTGTCGATG
TCTGATGAAG GCGCTCTCGC CTATCAAAAA GCACCGGTAG ATTTGATCCC ACTGCTGGAA
GTGGCGGGCG GCGCATTTCG CGAACGATTC GCCAGTCGTG GCCTGAAACT GCAATTTTCC
CTGCCAGACA GTATTACCGT ATTTGGCGAT CGCGACCGTT TAATGCAGTT ATTCAATAAC
TTACTGGAAA ACAGCCTGCG CTACACTGAC AGCGGCGGCA GCCTGCAAAT CTCTGCCGGG
CAGCGCGACA AAACGGTGCG CCTGACCTTT GCCGACAGTG CGCCAGGTGT CAGTGACGAT
CAGCTACAAA AATTGTTTGA ACGTTTTTAT CGCACCGAAG GTTCCCGCAA CCGTGCCAGC
GGCGGTTCCG GGCTGGGGCT GGCGATTTGC CTGAACATTG TTGAAGCACA TAATGGTCGC
ATTATTGCTG CCCATTCGCC TTTTGGCGGG GTAAGCATTA CAGTAGAGTT ACCGCTGGAA
CGGGATTTAC AGAGAGAAGT ATGA
 
Protein sequence
MKFWRPGITG KLFLAIFATC IVLLISMHWA VRISFERGFI DYIKHGNEQR LQLLSDALGE 
QYAQHGNWRF LRNNDRFVFQ ILRSFEHDNS EDKPGPGMPP HGWRTQFWVV DQNNKVLVGP
RAPIPPDGTR RPILVNGAEV GAVIASPVER LTRNTDINFD KQQRQTSWLI VALATLLAAL
ATFLLARGLL APVKRLVDGT HKLAAGDFTT RVTPTSEDEL GKLAQDFNQL ASTLEKNQQM
RRDFMADISH ELRTPLAVLR GELEAIQDGV RKFTPETVAS LQAEVGTLTK LVDDLHQLSM
SDEGALAYQK APVDLIPLLE VAGGAFRERF ASRGLKLQFS LPDSITVFGD RDRLMQLFNN
LLENSLRYTD SGGSLQISAG QRDKTVRLTF ADSAPGVSDD QLQKLFERFY RTEGSRNRAS
GGSGLGLAIC LNIVEAHNGR IIAAHSPFGG VSITVELPLE RDLQREV