Gene ECH74115_5854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5854 
Symbol 
ID6971890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5506926 
End bp5508338 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content55% 
IMG OID643389475 
Producttranscriptional regulator, GntR family/aminotransferase, classes I and II 
Protein accessionYP_002273867 
Protein GI209397786 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.491083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGTT ATCAACATCT GGCGACTCTA CTTGCCGAGC GGATTGAGCA AGGGCTGTAT 
CGTCACGGGG AGAAATTGCC GTCGGTGCGC AGCTTAAGTC AGGAGCACGG CGTCAGCATC
AGCACCGTGC AGCAGGCGTA CCAGACGCTG GAGACGATGA AGCTCATCAC TCCGCAGCCG
CGTTCGGGTT ATTTTGTCGC ACAACGTAAA GCCCAGCCGC CAGTGCCGCC GATGACGCGT
CCGGTGCAGC GCCCAGTGGA AATTACCCAG TGGGATCTGG TGCTGGATAT GCTGGTGGCG
CATAGCGACA GTTCCATTGT TCCGTTAAGC AAAAGCACGC CGGATGTCGA AACGCCCAGC
CTGAAACCGC TGTGGCGGGA GCTAAGCCGG GTGGTGCAGC ATAATCTGCA AACCGTTCTC
GGTTATGACT TGCTAGCCGG TCAGCGGGTA TTGCGCGAGC AGGTTGCCCG CCTGATGCTC
GACAGCGGCT CGGTGGTCAC TGCCGATGAC ATCATCATCA CCAGCGGCTG CCATAACTCG
ATGTCGCTAG CGTTAATGGC GGTGTGTAAA CCGGGCGATA TTGTCGCGGT CGAATCCCCC
TGTTATTACG GTTCAATGCA GATGCTGCGC GGCATGGGCG TGAAAGTGAT TGAAATCCCA
ACCGATCCAG AAACAGGCAT CAGCGTTGAA GCGCTGGAAC TGGCGCTGGA ACAGTGGCCG
ATTAAAGGCA TCATTCTGGT GCCAAACTGT AATAATCCGC TGGGATTTAT TATGCCGGAC
GCGCGCAAAC GGGCCGTTCT CTCTCTCGCT CAGCGTCATG ATATTGTGAT TTTTGAAGAT
GATGTCTACG GCGAACTGGC AACGGAGTAT CCGCGCCCGC GGACCATTCA TTCCTGGGAT
ATCGACGGGC GAGTGCTGTT GTGCAGCTCG TTCAGTAAAA GTATTGCTCC AGGCCTGCGC
GTGGGTTGGG TCGCACCGGG GCGTTATCAC GATAAACTGA TGCATATGAA ATACGCCATC
AGCAGCTTTA ATGTGCCGTC CACGCAAATG GCGGCGGCAA CGTTTGTGCT GGAAGGTCAC
TATCATCGCC ATATCCGGCG GATGCGGCAG ATCTATCAGC GCAATTTGGC GCTTTATACC
TGCTGGATAC GGGAATATTT TCCCTGCGAA ATCTGTATTA CGCGCCCGAA AGGCGGATTT
TTACTGTGGA TAGAATTGCC TGAACAGGTC GATATGGTCT GCGTCACGCG GCAGCTGTGC
CGCATGAAAA TCCAGGTGGC GGCAGGCTCG ATTTTCTCAG CTTCCGGCAA ATACCGTAAC
TGCCTGCGCA TCAACTGTGC TTTGCCACTC AGCGAAACTT ATCGCGAAGC GCTGAAGCAA
ATTGGCGAGG CCGTGTATCG GGCAATGGAA TAA
 
Protein sequence
MTRYQHLATL LAERIEQGLY RHGEKLPSVR SLSQEHGVSI STVQQAYQTL ETMKLITPQP 
RSGYFVAQRK AQPPVPPMTR PVQRPVEITQ WDLVLDMLVA HSDSSIVPLS KSTPDVETPS
LKPLWRELSR VVQHNLQTVL GYDLLAGQRV LREQVARLML DSGSVVTADD IIITSGCHNS
MSLALMAVCK PGDIVAVESP CYYGSMQMLR GMGVKVIEIP TDPETGISVE ALELALEQWP
IKGIILVPNC NNPLGFIMPD ARKRAVLSLA QRHDIVIFED DVYGELATEY PRPRTIHSWD
IDGRVLLCSS FSKSIAPGLR VGWVAPGRYH DKLMHMKYAI SSFNVPSTQM AAATFVLEGH
YHRHIRRMRQ IYQRNLALYT CWIREYFPCE ICITRPKGGF LLWIELPEQV DMVCVTRQLC
RMKIQVAAGS IFSASGKYRN CLRINCALPL SETYREALKQ IGEAVYRAME