Gene EcolC_1563 details


Gene Information

Locus tag: EcolC_1563
Symbol:
ID: 6065479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1728826
End bp: 1730229
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 54%
IMG OID: 641600979
Product: signal transduction histidine-protein kinase BaeS
Protein accession: YP_001724549
Protein GI: 170019595
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
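
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive (an assumption that the listed numbers are consistent with, not something the record states) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1728826
end_bp = 1730229
gene_length_bp = 1404
protein_length_aa = 467

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.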


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.32078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.70665
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCT GGCGACCCGG TATTACCGGC AAACTGTTTC TGGCGATTTT CGCCACCTGC 
ATTGTCTTGC TGATCAGTAT GCACTGGGCG GTGCGTATCA GTTTTGAGCG TGGCTTTATT
GATTACATCA AGCATGGTAA TGAACAGCGA TTACAACTGT TAAGTGATGC GCTTGGCGAG
CAGTATGCGC AGCATGGCAA CTGGCGCTTC CTGCGCAACA ATGATCGCTT TGTCTTTCAG
ATCCTGCGTT CATTTGAACA CGATAATTCG GAAGATAAAC CCGGCCCGGG TATGCCACCG
CACGGCTGGC GTACCCAGTT CTGGGTGGTT GATCAAAACA ACAAAGTGCT GGTTGGTCCG
CGAGCGCCGA TTCCACCTGA CGGTACACGG CGACCCATTC TGGTCAACGG TGCGGAAGTT
GGCGCGGTGA TCGCCTCCCC CGTTGAGCGG TTAACGCGCA ATACTGATAT CAATTTCGAT
AAACAACAGC GGCAAACCAG CTGGTTGATT GTCGCCCTGG CAACGTTACT CGCGGCACTT
GCCACTTTTC TGCTGGCGCG CGGTTTACTG GCACCGGTAA AACGACTTGT CGATGGCACG
CACAAACTGG CGGCGGGCGA TTTCACTACC CGCGTAACGC CCACCAGTGA AGATGAACTG
GGCAAACTGG CGCAAGACTT CAACCAGCTT GCCAGCACAC TGGAGAAAAA CCAGCAAATG
CGGCGCGATT TTATGGCCGA TATTTCTCAC GAACTGCGTA CGCCATTAGC GGTGCTGCGC
GGTGAACTGG AAGCCATTCA GGATGGCGTG CGTAAATTCA CGCCGGAGAC GGTGGCGTCT
TTACAGGCGG AGGTCGGTAC ACTGACCAAA CTGGTTGACG ATCTCCATCA GTTGTCGATG
TCTGATGAAG GCGCTCTCGC CTATCAAAAA GCACCGGTAG ATTTGATCCC ACTGCTGGAA
GTGGCGGGCG GCGCATTTCG CGAACGATTC GCCAGTCGTG GCCTGAAACT GCAATTTTCC
CTGCCAGACA GTATTACCGT ATTTGGCGAT CGCGACCGTT TAATGCAGTT ATTCAATAAC
TTACTGGAAA ACAGCCTGCG CTACACTGAC AGCGGCGGCA GCCTGCAAAT CTCTGCCGGG
CAGCGCGACA AAACGGTGCG CCTGACCTTT GCCGACAGTG CGCCAGGTGT CAGTGACGAT
CAGCTACAAA AATTGTTTGA ACGTTTTTAT CGCACCGAAG GTTCCCGCAA CCGTGCCAGC
GGCGGTTCCG GGCTGGGGCT GGCGATTTGC CTGAACATTG TTGAAGCACA TAATGGTCGC
ATTATTGCTG CCCATTCGCC TTTTGGCGGG GTAAGCATTA CAGTAGAGTT ACCGCTGGAA
CGGGATTTAC AGAGAGAAGT ATGA
 
Protein sequence
MKFWRPGITG KLFLAIFATC IVLLISMHWA VRISFERGFI DYIKHGNEQR LQLLSDALGE 
QYAQHGNWRF LRNNDRFVFQ ILRSFEHDNS EDKPGPGMPP HGWRTQFWVV DQNNKVLVGP
RAPIPPDGTR RPILVNGAEV GAVIASPVER LTRNTDINFD KQQRQTSWLI VALATLLAAL
ATFLLARGLL APVKRLVDGT HKLAAGDFTT RVTPTSEDEL GKLAQDFNQL ASTLEKNQQM
RRDFMADISH ELRTPLAVLR GELEAIQDGV RKFTPETVAS LQAEVGTLTK LVDDLHQLSM
SDEGALAYQK APVDLIPLLE VAGGAFRERF ASRGLKLQFS LPDSITVFGD RDRLMQLFNN
LLENSLRYTD SGGSLQISAG QRDKTVRLTF ADSAPGVSDD QLQKLFERFY RTEGSRNRAS
GGSGLGLAIC LNIVEAHNGR IIAAHSPFGG VSITVELPLE RDLQREV
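
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds a codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts, which this sketch does not model. The function name `translate` and the two sample fragments (the first 30 bases and the final 24 bases of the gene sequence) are choices made for illustration, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence, copied verbatim.
first_bases = "ATGAAGTTCT GGCGACCCGG TATTACCGGC"
last_bases = "CGGGATTTAC AGAGAGAAGT ATGA"

print(translate(first_bases))  # first ten residues of the protein
print(translate(last_bases))   # final residues, ending before the stop codon
```

Both fragments start on a codon boundary because each full line of the gene sequence holds 60 bases, a multiple of three.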