Gene ECH74115_2747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2747 
Symbol 
ID6968824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2568944 
End bp2570302 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content40% 
IMG OID643386602 
Productheavy metal sensor histidine kinase 
Protein accessionYP_002271081 
Protein GI209398520 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01386] heavy metal sensor kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000765675 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.226821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAT TATCTATAAC CGTCCGTTTA ACCTTGCTTT TTATATTGCT GCTGTCTGTT 
GCTGGCGCCG GAATTGTCTG GACTCTCTAT AATGGCCTCG CAAGTGAGTT GAAATGGCGC
GATGATACAA CACTCATTAA CCGGACAGCG CAGATCAAGC AGTTATTAAT TGATGGGGTA
AATCCAGATA CGTTACCTGT ATACTTTAAC CGGATGATGG ATGTTAGTCA GGATATCTTG
ATCATTCATG GTGATGGCAT CAATAAAATT GTTAACCGGA CAAATGTCAG TGATGACATG
TTAAATAACA TACCTGCTAG TGAGACAATC AGCGCAGCTG GCATTTACAG AAGCATTATT
AATGATACAG AGATAGATGC TTTACGAATT AATATTGATG AAGTTTCGCC ATCATTAACG
GTTACTGTGG CTAAATTGGC TTCAGCCAGA CATAACATGC TTGAACAGTA TAAAATCAAT
AGCATTATAA TTTGCATTGT CGCCATTATA CTGTGCTCAG TATTAAGTCC GCTGTTAATC
CGAACGGGGT TACGAGAGAT CAAAAAGTTG AGTGGTGTAA CGGAAGCGCT GAATTATAAC
GATAGCCGGG AGCCTGTTGA GGTTAGCGCA TTACCGAGAG AACTAAAACC TCTTGGGCAG
GCGTTGAATA AAATGCATCA TGCTTTAGTC AAAGATTTTG AGCGTCTAAG TCAGTTTGCT
GACGATCTCG CTCATGAACT TAGAACGCCA ATTAATGCAT TACTGGGTCA GAATCAGGTT
ACGCTCAGTC AAACCAGAAG TATCGCTGAA TATCAAAAAA CAATTGCCGG TAACATTGAA
GAGCTGGAAA ATATTTCGCG GTTAACAGAG AACATACTGT TTCTTGCCAG GGCAGATAAA
AACAATGTTT TGGTGAAACT GGACTCGCTT TCTCTCAATA AGGAAGTCGA AAATTTGTTG
GACTATCTTG AATATCTTTC AGACGAGAAA GAGATTTGCT TTAAGGTCGA GTGCAATCAG
CAAATCTTTG CGGATAAAAT TTTACTGCAA CGAATGTTAT CGAATCTTAT TGTTAACGCT
ATTCGATATT CGCCAGAAAA ATCGCGTATT CATATAACCA GTTTTCTTGA TACCAACAGC
TATCTTAATA TTGATATCGC CAGCCCTGGA GCGAAAATTA ATGAGCCTGA AAAACTCTTC
CGTAGATTTT GGCGGGGAGA TAATTCGCGT CATTCCGTAG GTCAGGGACT TGGTCTTTCT
TTAGTCAAAG CGATTGCCGA ATTGCATGGG GGAAGTGCTA CGTATCACTA TCTCAATAAG
CATAATGTGT TCCGGATTAC GTTGCCGCAA AGAAATTAA
 
Protein sequence
MKRLSITVRL TLLFILLLSV AGAGIVWTLY NGLASELKWR DDTTLINRTA QIKQLLIDGV 
NPDTLPVYFN RMMDVSQDIL IIHGDGINKI VNRTNVSDDM LNNIPASETI SAAGIYRSII
NDTEIDALRI NIDEVSPSLT VTVAKLASAR HNMLEQYKIN SIIICIVAII LCSVLSPLLI
RTGLREIKKL SGVTEALNYN DSREPVEVSA LPRELKPLGQ ALNKMHHALV KDFERLSQFA
DDLAHELRTP INALLGQNQV TLSQTRSIAE YQKTIAGNIE ELENISRLTE NILFLARADK
NNVLVKLDSL SLNKEVENLL DYLEYLSDEK EICFKVECNQ QIFADKILLQ RMLSNLIVNA
IRYSPEKSRI HITSFLDTNS YLNIDIASPG AKINEPEKLF RRFWRGDNSR HSVGQGLGLS
LVKAIAELHG GSATYHYLNK HNVFRITLPQ RN