Gene ECH74115_3915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3915 
Symbol 
ID6970904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3627047 
End bp3628381 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content54% 
IMG OID643387689 
Producttranscriptional regulator, GntR family 
Protein accessionYP_002272137 
Protein GI209400513 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.0753496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGTT ATCAGGATAT CGCCCGTCAG TTAAAAACGG TTATCGAGCA AGGAAAACTG 
AAACCTGGCG CAAGGCTACC TTCGAGCCGT ACCTGGTCGC AGGAGCTGGG TGTCTCTCGA
TCCACTGTCG AAAATGCTTA TGCTGAGCTG GTGGCGCAGG GATGGCTGGT CAGGCGGGGA
CAGGCGGGTA CCTTTGTCAG TGAGCAGATA CATCCGCAAC AGTCTGCTGT GGATGTCGTG
GCTTTTGCTG GTGAAAGTCA GCAGCCGCTG CCTTTTCAGA TGGGGCTACC TGCACTTGAT
CTTTTTCCGC GTGAATTGTG GGCGCGGGTG ATGGGCCGCC GTCTGCGTAC CCAGACGCGC
TTTGATTTGG CATTAGGCGA TGTCTGCGGC GAGGCGGCCT TGCGCGAGGC TATTGTTGAT
TATTTACGCG TTTCACGTGG GATTGATTGT CAGCCAGAGC AGGTCTTTAT CACTCACGGT
TATGCGGCCT CAATGGCTTT AATTCTGCAC GCTCTGGCGC AACCGGGAGA CGGGATGTGG
ATGGAAGATC CCGGCTTTCC GCTGATTCGC CCGATTGTCA CTCGCCACGG TGTGGAAATT
TTACCTGTGC CGGTTGATGA CAACGGACTG GATGTCACAA GCGGAATACA AAATTATCCT
GATGCGCGTT TTGCCCTGAT TACTCCGGCA CACCAAAGCC CGCTGGGTGT GGCACTCTCT
TTAGCGCGTA GGTATCAGAT ACTGGAATGG GCAGAGCGTA GTCAGGCATG GATTATTGAA
GATGATTACG ACAGTGAGTT TCGCTATCAC GGTAAGCCGT TACCGGCACT AAAAAGTCTC
GACGCGCCGC AGCGGGTGAT TTATGCCGGA ACATTCAGCA AAGCGCTATT TCCTGCATTG
CGCTGTGCGT GGCTGGTGGT GCCGGTGGAG CAGATTTCGC AATTCCGGCA GCAGGCGTCA
CTGGAACCAT GTGCTGTACC CGTACTGTGG CAGAACACAC TGGCAGATTT CATCCGTGAG
GGGCATTTCT GGCGGCATCT GAAGAAAATG CGCCAGCATT ATGCCCAGCG ACGGCAGTGG
ATTGAGCAGG CACTTACGCA GCAGGGATTT CAGGTTGTGC CGCAAAAAGG CGGTATCCAG
ATGGTGATCA AACTAGCAGG TAATGACGTG ACATTCGTGC ATAAAGCCAA TGCTGCTGGT
CTTGCCGTAC AGGCACTTAG CGACTGGCGT ATCCGCTCAA GCGAGGACGG AGGATTATTG
CTCTCGTTTA CGAATATCGT TAGCGAAAGT ATGGCGCGAC AGGTAGCACA GCAATTGCGC
AGTGCTTTAA ATTAA
 
Protein sequence
MPRYQDIARQ LKTVIEQGKL KPGARLPSSR TWSQELGVSR STVENAYAEL VAQGWLVRRG 
QAGTFVSEQI HPQQSAVDVV AFAGESQQPL PFQMGLPALD LFPRELWARV MGRRLRTQTR
FDLALGDVCG EAALREAIVD YLRVSRGIDC QPEQVFITHG YAASMALILH ALAQPGDGMW
MEDPGFPLIR PIVTRHGVEI LPVPVDDNGL DVTSGIQNYP DARFALITPA HQSPLGVALS
LARRYQILEW AERSQAWIIE DDYDSEFRYH GKPLPALKSL DAPQRVIYAG TFSKALFPAL
RCAWLVVPVE QISQFRQQAS LEPCAVPVLW QNTLADFIRE GHFWRHLKKM RQHYAQRRQW
IEQALTQQGF QVVPQKGGIQ MVIKLAGNDV TFVHKANAAG LAVQALSDWR IRSSEDGGLL
LSFTNIVSES MARQVAQQLR SALN