Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2044 |
Symbol | |
ID | 6969741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1940357 |
End bp | 1941763 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385956 |
Product | transcriptional regulator, GntR family/aminotransferase, classes I and II |
Protein accession | YP_002270445 |
Protein GI | 209399584 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00402201 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000000000000583102 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAT ACCAGCAGCT TGCAGAACAA TTACGCGAGC AGATTGCGTC GGGTATCTGG CAACCCGGCG ATCGTTTACC TTCGTTGCGT GACCAGGTGG CGCTTTCTGG CATGAGCTTT ATGACTGTCA GCCATGCCTA TCAGTTGCTC GAAAGTCAGG GATATATTAT CGCACGACCG CAGTCGGGTT ATTACGTTGC GCCACAGGCA ATAAAAATGC CGAAAGTGCC AGTCATTCCA GTCACTCGAG ATGAAGCAGT CGATATCAAC ACTTATATTT TTGATATGTT GCAGGCCAGT CGCGATCCGT CGGTCGTTCC GTTTGCCTCG GCCTTTCCCG ACCCGCGACT TTTCCCCCTC CAACAACTAA ACCGCTCGCT GGCGCAGGTA AGCAAAACCG CCACGGCGAT GAGCGTGATT GAAAACTTGC CGCCAGGAAA CGCAGAACTG CGTCAGGCTA TTGCCCGTCG CTATGCCTTA CAGGGGATCA CCATTTCTCC TGATGAAATT GTCATCACTG CCGGGGCGTT AGAGGCATTA AACCTCAGTT TGCAAGCGGT AACTGAACCG GGCGATTGGG TGATAGTAGA GAATCCTTGT TTCTACGGTG CGTTGCAGGC GCTGGAGCGG CTACGGCTGA AGGCGTTATC GGTGGCGACG GATGTTAAAG AAGGGATCGA TCTTCAGGCG CTGGAACTGG CGTTGCAGGA GTATCCGGTG AAAGCGTGCT GGCTGATGAC TAATAGCCAG AATCCACTCG GATTTACCTT AACGCCGCAA AAAAAAGCAC AACTGGTGGC GTTGCTCAAT CAGTACAACG TAACGCTGAT TGAAGATGAC GTTTACAGCG AACTTTATTT TGGACGGGAA AAACCGCTGC CTGCGAAAGC GTGGGATCGC CACGATGGCG TTTTGCATTG CTCTTCGTTT TCGAAATGTC TGGTGCCTGG TTTTCGTATT GGTTGGGTCG CCGCCGGAAA ACATGCACGT AAAATTCAAC GCTTGCAGCT GATGAGTACG CTTTCCACCA GCTCACCGAT GCAGCTTGCG CTGGTGGATT ACCTTTCCAC GCGCCGATAC GACGCCCATC TTCGTCGCCT GCGTCGCCAG CTTGCGGAAC GTAAACAACG TGCCTGGCAG GCACTGCTGC GTTATCTGCC TGCGGAAGTG AAAATTCATC ATAATGACAG CGGTTATTTT CTCTGGCTGG AGCTCCCCGA GCCGTTAGAT GCCGGCGAAT TGAGCCTGGC GGCACTGACG CATCATATCA GTATTGCGCC AGGTAAAATG TTTTCTACCG GTGAAAACTG GTCACGTTTT TTTCGTTTTA ATACCGCGTG GCTGTGGGGA GAACGTGAAG AACAGGCGGT AAAACAATTA GGCAAACTTA TTCAAGAACG GCTGTAA
|
Protein sequence | MKKYQQLAEQ LREQIASGIW QPGDRLPSLR DQVALSGMSF MTVSHAYQLL ESQGYIIARP QSGYYVAPQA IKMPKVPVIP VTRDEAVDIN TYIFDMLQAS RDPSVVPFAS AFPDPRLFPL QQLNRSLAQV SKTATAMSVI ENLPPGNAEL RQAIARRYAL QGITISPDEI VITAGALEAL NLSLQAVTEP GDWVIVENPC FYGALQALER LRLKALSVAT DVKEGIDLQA LELALQEYPV KACWLMTNSQ NPLGFTLTPQ KKAQLVALLN QYNVTLIEDD VYSELYFGRE KPLPAKAWDR HDGVLHCSSF SKCLVPGFRI GWVAAGKHAR KIQRLQLMST LSTSSPMQLA LVDYLSTRRY DAHLRRLRRQ LAERKQRAWQ ALLRYLPAEV KIHHNDSGYF LWLELPEPLD AGELSLAALT HHISIAPGKM FSTGENWSRF FRFNTAWLWG EREEQAVKQL GKLIQERL
|
| |