Gene ECH74115_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3601 
SymbolevgS 
ID6968179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3316837 
End bp3320220 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content41% 
IMG OID643387398 
Producthybrid sensory histidine kinase in two-component regulatory system with EvgA 
Protein accessionYP_002271857 
Protein GI209399540 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0784] FOG: CheY-like receiver 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.317303 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATACCG ATTCGCAGCA ACGGGTTCGT GGTATTAATG CTGATTATTT AAATCTTTTA 
AAAAGAGCGT TAAATATCAA ATTAACACTC CGGGAATACG CAGATCATCA AAAAGCAATG
GACGCGCTGG AAGAAGGTGA AGTCGATATA GTGTTATCAC ATTTAGTTGC TTCGCCGCCT
CTTAATGATG ACATTGCTGC AACCAAACCA CTGATAATTA CCTTTCCGGC GCTGGTCACC
ACCCTTCACG ATTCAATGCG ACCGCTTACC TCATCAAAAC CAGTAAATAT TGCTCGAGTA
GCAAATTATC CCCCTGACGA GGTAATTCAT CAATCATTTC CAAAAGCAAC AATTATCTCT
TTTACAAATT TATATCAGGC ATTAGCATCC GTCTCAGCCG GACAGAATGA TTACTTTATT
GGTAGTAACA TCATTACCAG CAGTATGATT TCCCGCTATT TCACTCACTC CTTAAATGTA
GTGAAATATT ATAACTCACC GCGTCAATAT AACTTCTTCT TAACCAGAAA AGAATCTGTC
ATTCTTAATG AAGTACTCAA TCGATTTGTT GATGCTTTAA CAAATGAAGT TCGCTATGAA
GTATCACAAA ATTGGCTTGA TACAGGAAAC CTGGCCTTTC TGAACAAGCC ATTAGAACTC
ACTGAACATG AAAAACAGTG GATTAAGCAG CATCCCGATT TAAAGGTGCT GGAAAATCCT
TACTCGCCCC CCTATTCTAT GACGGATGAA AATGGCTCGG TTCGGGGCGT TATGGGGGAC
ATTCTTAATA TTATTACCTT GCAAACAGGT TTAAATTTTT CTCCGATCAC CGTTTCACAC
AATATCCATG CTGGAACACA GCTGAATCCC GGCGGATGGG ATATATTACC CGGTGCTATT
TATAGTGAAG ATAGAGAAAA TAATGTTTTA TTTGCTGAAG CCTTCATAAC AACGCCTTAC
GTTTTTGTCA TGCAAAAAGC GCCTGACAGT GAACAAACAT TAAAAAAAGG AATGAAAGTT
GCCATTCCAT ATTATTATGA GCTTCATTCG CAATTAAAAG AGATGTATCC GGAGGTAGAG
TGGATAAAAG TCGATAACGC CAGCGCTGCA TTTCACAAGG TCAAGGAAGG CGAACTTGAT
GCTCTGGTCG CGACACAGTT AAATTCACGT TACATGATCG ACCATTACTA TCCTAATGAA
CTGTATCATT TTCTTATTCC CGGCGTTCCG AATGCATCGC TTTCGTTCGC TTTTCCTCGC
GGAGAACCGG AACTTAAGGA TATTATTAAT AAAGCACTGA ATGCAATCCC CCCAAGCGAA
GTTCTGCGCC TGACCGAAAA ATGGATTAAA ATGCCCAATG TGACCATTGA CACATGGGAC
CTCTATAGCG AGCAATTTTA TATTGTTACG ACATTATCCG TTTTATTAGT TGGCAGTAGC
CTTTTATGGG GATTCTACCT GTTACGCTCA GTTCGTCGTC GTAAAGTCAT TCAGGGTGAT
TTAGAAAACC AAATATCATT CCGGAAAGCG CTCTCGGACT CCTTACCGAA TCCAACTTAT
GTTGTAAACT GGCAAGGTAA TGTCATTAGT CACAATAGTG CATTTGAACA TTATTTCACT
GCTGATTACT ACAAAAATGC AATGTTGCCA TTAGAAAACA GTGAATCACC CTTTAAAGAT
GTTTTTTCTA ATACGCATGA AGTCACAGCA GAGACGAAAG AAAACCGAAC AATATACACA
CAGGTATTTG AAATTGATAA TGGCATCGAG AAAAGATGCA TTAATCACTG GCATACATTA
TGTAATCTGC CAGCCAGCGA ACATGCTGTT TATATTTGTG GTTGGCAAGA TATTACCGAG
ACGCGTGATT TAATTCATGC ACTCGAAGTA GAGAGAAATA AAGCGATCAA TGCAACTGTC
GCAAAAAGTC AGTTTCTGGC AACAATGAGT CACGAAATAA GAACACCAAT AAGCTCCATT
ATGGGCTTCC TGGAACTTCT GTCGGGTTCT GGTCTTAGCA AGGAGCAACG GGTGGAGGCG
ATTTCACTAG CCTACGCCAC CGGACAATCA CTCCTCGGCT TAATTGGTGA AATCCTTGAT
GTCGACAAAA TTGAATCGGG TAACTATCAA CTTCAACCAC AATGGGTCGA TATCCCTACT
TTAGTCCAGA ACACTTGTCA CTCTTTCGGT GCGATTGCTG CAAGCAAATC GATCGCATTA
AGTTGCAGCA GTACATTTCC TGATCATTAC CTGGTTAAGA TCGACCCTCA GGCGTTTAAG
CAGGTCTTAT CAAATTTACT GAGTAATGCT CTCAAATTTA CCACCGAGGG GGCAGTAAAA
ATTACGACCT CCCTGGTTCA CATTGATGAC AACCACGCTG TAATCAAAAT GACGATTATG
GATTCTGGAA GTGGATTATC ACAGGAAGAA CAACAACAAC TGTTTAAACG TTATAGCCAA
ACAAGTGCAG GTCGTCAGCA AACAGGTTCT GGTTTAGGCT TAATGATCTG CAAAGAATTA
ATTAAAAACA TGCAGGGTGA TTTGTCATTA GAAAGTCATC CAGGCATAGG AACAACATTT
ACGATCACAA TCCCGGTAGA AATTATCCAA CAAGTGGCGG CTGTCGAGGC AAAAGCAGAA
CAACCCATCA CACTACCTGA AAAGTTGAGC ATATTAATCG CGGATGATCA TCCGACCAAC
AGGCTATTAC TCAAACGCCA GCTAAATCTA TTAGGATATG ATGTTGATGA AGCCACTGAT
GGTGTGCAAG CGCTACACAA AGTCAGTATG CAACATTATG ATCTGCTTAT TACTGACGTT
AATATGCCGA ATGTGGATGG TTTTGAGTTG ACTCGCAAAC TCCGTGAGCA AAATTCGTCC
TTACCCATCT GGGGGCTTAC AGCCAACGCA CAGGCTAACG AACGTGAAAA AGGGTTAAAT
TGCGGCATGA ACTTATGTTT GTTCAAACCG TTGACCCTGG ATGTACTGAA AACACATTTA
AGTCAGTTAC ACCAGGTTGC GCATATTGTA CCTCAGTATC GCCACCTTGA TATCGAGGCC
CTGAAAAATA ATACGGCGAA CGATCTACAA CTGATGCAGG AGATTCTCAT GACTTTCCAG
CATGAAACAC ATAAAGATTT ACCCGCTGCG TTTCATGCAC TAGAAGCTGG CGATAATAGA
ACTTTCCATC AGTGTATTCA TCGCATCCAC GGTGCGGCTA ACATCCTGAA TTTGCAAAAG
TTGATTAATA TTAGCCATCA GTTAGAAATA ACACCTGTTT CAGATGACAG TAAGCCTGAA
ATTCTTCAGT TGCTAAACTC TGTAAAAGAA CACATTGCAG AGCTAGACCA GGAAATTGCT
GTTTTCTGTC AACAAAATAA CTAA
 
Protein sequence
MHTDSQQRVR GINADYLNLL KRALNIKLTL REYADHQKAM DALEEGEVDI VLSHLVASPP 
LNDDIAATKP LIITFPALVT TLHDSMRPLT SSKPVNIARV ANYPPDEVIH QSFPKATIIS
FTNLYQALAS VSAGQNDYFI GSNIITSSMI SRYFTHSLNV VKYYNSPRQY NFFLTRKESV
ILNEVLNRFV DALTNEVRYE VSQNWLDTGN LAFLNKPLEL TEHEKQWIKQ HPDLKVLENP
YSPPYSMTDE NGSVRGVMGD ILNIITLQTG LNFSPITVSH NIHAGTQLNP GGWDILPGAI
YSEDRENNVL FAEAFITTPY VFVMQKAPDS EQTLKKGMKV AIPYYYELHS QLKEMYPEVE
WIKVDNASAA FHKVKEGELD ALVATQLNSR YMIDHYYPNE LYHFLIPGVP NASLSFAFPR
GEPELKDIIN KALNAIPPSE VLRLTEKWIK MPNVTIDTWD LYSEQFYIVT TLSVLLVGSS
LLWGFYLLRS VRRRKVIQGD LENQISFRKA LSDSLPNPTY VVNWQGNVIS HNSAFEHYFT
ADYYKNAMLP LENSESPFKD VFSNTHEVTA ETKENRTIYT QVFEIDNGIE KRCINHWHTL
CNLPASEHAV YICGWQDITE TRDLIHALEV ERNKAINATV AKSQFLATMS HEIRTPISSI
MGFLELLSGS GLSKEQRVEA ISLAYATGQS LLGLIGEILD VDKIESGNYQ LQPQWVDIPT
LVQNTCHSFG AIAASKSIAL SCSSTFPDHY LVKIDPQAFK QVLSNLLSNA LKFTTEGAVK
ITTSLVHIDD NHAVIKMTIM DSGSGLSQEE QQQLFKRYSQ TSAGRQQTGS GLGLMICKEL
IKNMQGDLSL ESHPGIGTTF TITIPVEIIQ QVAAVEAKAE QPITLPEKLS ILIADDHPTN
RLLLKRQLNL LGYDVDEATD GVQALHKVSM QHYDLLITDV NMPNVDGFEL TRKLREQNSS
LPIWGLTANA QANEREKGLN CGMNLCLFKP LTLDVLKTHL SQLHQVAHIV PQYRHLDIEA
LKNNTANDLQ LMQEILMTFQ HETHKDLPAA FHALEAGDNR TFHQCIHRIH GAANILNLQK
LINISHQLEI TPVSDDSKPE ILQLLNSVKE HIAELDQEIA VFCQQNN