Gene ECH74115_5603 details

Gene Information

Locus tag: ECH74115_5603
Symbol:
ID: 6967248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5241654
End bp: 5243924
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 58%
IMG OID: 643389239
Product: hypothetical protein
Protein accession: YP_002273636
Protein GI: 209400726
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
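
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon is not translated into an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(5241654, 5243924)
    print(length)                     # 2271
    print(protein_length_aa(length))  # 756
```

With the values in this record, 5243924 - 5241654 + 1 gives 2271 bp, and 2271 / 3 = 757 codons, one of which is the stop codon, leaving 756 aa.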


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCACC CCGGACGTCA CTTTTTTGCC AGCGCGCGCG GGCGATTGCT GCTTTTGAAT 
TTGCTGGTGG TGGCGGTGAC GTTGATGGTC AGCGGCGTGG CGGTAATGGG CTTTCGTCAT
GCCAGCCAGA TGCAGGAGCT GGTGCAGCAG CAAACGGTGG ATGATATGAC CGGCAGCCTG
AATCTGGCGC GCGACACGGC AAACGTGGCA ACGGCGGCGG TGCGATTGTC GCAGGTGGTG
GGAGCGCTGG AATATAAAGG CGAAGCCGAG CGGTTGCAGG AGACACAGCG GGCATTAAAA
AGCTCGCTTG CACAACTGGC GAACGCGCCT TTAGCGCAGC AGGAAGCCGG GCTGGTGACG
CGAATCATCA CGCGTAGCAA CGAACTGCAA ACCAGCGTCG GCGGTATGCT GGAGCGCGGG
CAAAGGCGGC ATCTGGAGCG CAACGCGCTG TTAAGTTCGC TGTATCAAAA CTTAAGTTAT
CTGCGGCATC TGCAAAAAGT CACTCACGCG CAAGATGATA TTTTGCTTAA CGAGATGAAC
CGACTGATTG TTGCCGCTAT TGCCACCCCC GCGCCGCAGG CGATTATTCA TCAACTGGTC
GGGGTAATGT CTGCGTTGCC GACCCACAGC GATACGCCAT TGGTTAATAC GTTACTCAAT
GATTTCAATG ATTTCAATGA TGAAATGCGT AAACTCGCGC CGCTTTCTGC CGCTCTGGAG
CAAAGCGATC TGGCAATCAG CTGGTATATG TTTCACATCA AAGCGCTGGT GGCGATCCTC
AATGGCGACA TCAACCGTTA CGCCGAACAG GTGGCGACGC TCTCCGGTCA GCGTGTGGCG
CAAAGTTACC AGGATTTGCG CTCCGGCGAG AGGGTGATCA TGATTTTCGC CCTGTTAGCG
GTGGTGATCA CCGCGTTTGC CGGCTGGTAT ATCTATCGCA ACCTCGGCAC TAACCTGACG
GCGATTTCCC GCGCCATGAG CCGCCTGGCG CACGGCGAAG CGGACGTCAC TTTTCCGGCG
TTACAGCGGC GTGATGAGTT GGGTGAACTG GCGCGCGCCT TTAACGTTTT TGCCCGCAAT
ACCGCGTCAC TCGAACGAAC CTCACGGCTG CTAAAAGAGA AGACCACGCA GATGGAGATT
GCCCAAACTG AGCGGCAGGG GTTGGAAGAG GCGCTATTAC ACAGCCAGAA ACTGAAAGCT
GTCGGGCAAC TGACGGGCGG GCTGGCCCAT GATTTTAATA ATCTGCTGGC GGTGATTATT
GGCAGCCTGG AGCTGGTGAA TCCGGACTCG CCAGACGCGC CGCGAATACA GCGGGCGCTG
CAAGCTGCCG AACGTGGGGC GCTACTGACA CAACGACTGC TGGCGTTTTC CCGCAAGCAG
TCCTTACATC CGCAAGCCGT TGAACTGAAA ACATTGCTGG AAGAGTTAAG CGAATTAATG
CGCCACTCCT TACCGGCAAC GCTGGCGCTG GAGATTGAAG CGCAGTCTCC GGCGTGGCCC
GCGTGGATTG ACATCAGCCA ACTGGAAAAT GCCATCATCA ACCTGGTGAT GAACGCCCGT
GACGCGATGG AAGGGCAGAC GGGAATCATC AAAATTCGTA CCTGGAATCA GCGCGTTACT
CGCAGCAGTG GGCAGCGTCA GGACATGGTG GCACTGGAGG TCATCGATCA TGGCTGCGGT
ATGTCGCAGG CGGTGAAAAG CCAGGTATTT GAGCCATTCT TTACCACCAA ACAGACGGGC
AGCGGCAGCG GGCTGGGTTT GTCGATGGTC TATGGTTTTG TGCGCCAGTC CGGTGGGCGC
GTTGAAATTG AAAGCGCGCC GGGGCAGGGG ACAACGGTCA GATTACAGCT TCCCCGCGCG
CTGACGGCGG CGAAAAACCT ATCGCCAGCG GCAGTAGAAC AGGCTTCTGT CAGCGGCGAT
AAACTGGTGT TAGTGCTGGA AGATGAAGCC GCGGTGCGCC AGACCATCTG CGAACAGTTG
CACCTGCTGG GCTATTTGAC GCTGGAAGCA AGCAGCGGCG AACAGGCGCT GGATCTGCTG
GCGGCATCAG CAGAAATCGA TATTTTTATC AGCGATTTAA TGCTCCCCGG CGGCATGAGC
GGCGCAGAAG TGGTCAATGC GGCCCGCAAA CTTTACCCGC ACCTGACGCT GCTACTCATC
AGCGGTCAGG ATTTACGCCC CAGCCATAAC CCCGCGCTGC CAGACGTGGC GCTGTTGCGC
AAACCCTTTA CCCGTGCGCA GTTGGCGCAG GCGCTGGGGC AAGAGAATTG A
 
Protein sequence
MAHPGRHFFA SARGRLLLLN LLVVAVTLMV SGVAVMGFRH ASQMQELVQQ QTVDDMTGSL 
NLARDTANVA TAAVRLSQVV GALEYKGEAE RLQETQRALK SSLAQLANAP LAQQEAGLVT
RIITRSNELQ TSVGGMLERG QRRHLERNAL LSSLYQNLSY LRHLQKVTHA QDDILLNEMN
RLIVAAIATP APQAIIHQLV GVMSALPTHS DTPLVNTLLN DFNDFNDEMR KLAPLSAALE
QSDLAISWYM FHIKALVAIL NGDINRYAEQ VATLSGQRVA QSYQDLRSGE RVIMIFALLA
VVITAFAGWY IYRNLGTNLT AISRAMSRLA HGEADVTFPA LQRRDELGEL ARAFNVFARN
TASLERTSRL LKEKTTQMEI AQTERQGLEE ALLHSQKLKA VGQLTGGLAH DFNNLLAVII
GSLELVNPDS PDAPRIQRAL QAAERGALLT QRLLAFSRKQ SLHPQAVELK TLLEELSELM
RHSLPATLAL EIEAQSPAWP AWIDISQLEN AIINLVMNAR DAMEGQTGII KIRTWNQRVT
RSSGQRQDMV ALEVIDHGCG MSQAVKSQVF EPFFTTKQTG SGSGLGLSMV YGFVRQSGGR
VEIESAPGQG TTVRLQLPRA LTAAKNLSPA AVEQASVSGD KLVLVLEDEA AVRQTICEQL
HLLGYLTLEA SSGEQALDLL AASAEIDIFI SDLMLPGGMS GAEVVNAARK LYPHLTLLLI
SGQDLRPSHN PALPDVALLR KPFTRAQLAQ ALGQEN
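
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to run basic sanity checks on a gene sequence (GC fraction, start codon, stop codon). The function names are hypothetical, and checking only for an ATG start is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq_text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length is a multiple of 3, starts with ATG, ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Short illustrative fragment, not the full gene sequence.
    fragment = normalize("ATGGCGCACC CC\nTGA")
    print(fragment)
    print(looks_like_cds(fragment))
    print(round(gc_fraction(fragment) * 100), "% GC")
```

Pasting the full gene sequence into `normalize` and passing the result to `gc_fraction` would allow the GC content field of the record to be compared against the sequence itself.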