Gene Information |
Locus tag | EcHS_A4568 |
Symbol | |
ID | 5594540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4575180 |
End bp | 4576592 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923664 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001461104 |
Protein GI | 157163786 |
COG category | [E] Amino acid transport and metabolism; [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
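The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes one terminal stop codon (the gene sequence on this page ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the table above.
length = gene_length_bp(4575180, 4576592)
print(length)                              # 1413
print(expected_protein_length_aa(length))  # 470
```

Both results agree with the Gene Length (1413 bp) and Protein Length (470 aa) rows.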
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGTT ATCAACATCT GGCGACTCTA CTTGCCGAGC GGATTGAGCA AGGGCTGTAT CGTCACGGGG AGAAATTGCC GTCGGTGCGC AGCTTAAGTC AGGAGCACGG CGTCAGCATC AGCACCGTGC AACAGGCGTA TCAGACGCTG GAAACGATGA AGCTCATCAC TCCGCAGCCG CGTTCGGGTT ATTTTGTCGC ACAACGTAAA GCCCAGCCGC CAGTGCCGCC GATGACGCGT CCGGTGCAGC GCCCGGTGGA AATTACCCAG TGGGATCAGG TGCTGGATAT GCTGGTGGCG CATAGCGACA GTTCCATTGT TCCGTTAAGC AAAAGCACGC CGGATGTCGA AACGCCCAGC CTGAAACCGC TCTGGCGGGA GCTAAGCCGG GTGGTGCAGC ATAATCTGCA AACCGTGCTC GGTTATGACT TGTTAGCCGG TCAGCGGGTA TTGCGAGAGC AGATTGCCCG CCTGATGCTC GACAGCGGCT CGGTGGTCAC CGCCGATGAC ATCATCATCA CCAGCGGCTG CCATAACTCG ATGTCGCTGG CGTTAATGGC GGTGTGTAAA CCGGGCGATA TTGTCGCGGT CGAATCCCCC TGTTATTACG GTTCAATGCA GATGCTGCGC GTCATGGGCG TGAAAGTGAT TGAAATCCCA ACCGATCCAG AAACTGGCAT CAGCGTTGAA GCACTGGAAC TGGCGCTGGA ACAGTGGCCG ATTAAAGGCA TCATTCTGGT GCCAAACTGT AATAATCCGC TGGGATTTAT TATGCCGGAC GCACGCAAAC GGGCCGTTCT CTCTCTCGCT CAGCGTCATG ATATTGTGAT TTTTGAAGAT GATGTCTACG GCGAACTGGC AACGGAGTAT CCGCGCCCGC GGACCATCCA TTCATGGGAT ATCGACGGGC GAGTGTTGTT GTGCAGCTCG TTCAGTAAAA GTATTGCACC TGGCCTGCGC GTGGGCTGGG TCGCACCGGG GCGATATCAC GATAAACTGC TGCATATGAA ATATGCCACC AGCAGCTTTA ATGTACCGTC CACGCAAATG GCGGCGGCAA CGTTTGTGCT GGAAGGTCAC TATCATCGCC ATATCAGGCG GATGCGGCAG AACTATCAGC GCAATTTGGC GCTTTATACC TGCTGGATAC GGGAATATTT TCCCTGCGAA ATCTGTATTA CGCGCCCGAA AGGCGGATTT TTACTGTGGA TAGAATTGCC TGAACAGGTC GATATGGTCT GCGTTGCGCG GCAGCTGTGC CGCATGAATA TCCAGGTGGC AGCAGGCTCG ATTTTCTCGG CTTCCGGCAA ATACCGTAAT TGTCTACGCA TCAACTGCGC TTTGCCGCTC AGCGAAACCT ATCGCGAAGC ACTAAAGCAA ATTGGCGAGG CCGTGTATCG GGCAATGGAA TAA
|
Protein sequence | MTRYQHLATL LAERIEQGLY RHGEKLPSVR SLSQEHGVSI STVQQAYQTL ETMKLITPQP RSGYFVAQRK AQPPVPPMTR PVQRPVEITQ WDQVLDMLVA HSDSSIVPLS KSTPDVETPS LKPLWRELSR VVQHNLQTVL GYDLLAGQRV LREQIARLML DSGSVVTADD IIITSGCHNS MSLALMAVCK PGDIVAVESP CYYGSMQMLR VMGVKVIEIP TDPETGISVE ALELALEQWP IKGIILVPNC NNPLGFIMPD ARKRAVLSLA QRHDIVIFED DVYGELATEY PRPRTIHSWD IDGRVLLCSS FSKSIAPGLR VGWVAPGRYH DKLLHMKYAT SSFNVPSTQM AAATFVLEGH YHRHIRRMRQ NYQRNLALYT CWIREYFPCE ICITRPKGGF LLWIELPEQV DMVCVARQLC RMNIQVAAGS IFSASGKYRN CLRINCALPL SETYREALKQ IGEAVYRAME
|
| |
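The relationship between the Gene sequence and Protein sequence rows can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record. It assumes that translation table 11 shares the codon assignments of the standard genetic code (differing only in permitted start codons), and that the gene sequence is already shown in coding orientation, since it begins with ATG and ends with TAA even though the gene lies on the minus strand. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments of the standard genetic code, listed in TCAG order.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing (10-base blocks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon.

    Simplification: an alternative start codon (e.g. GTG) would be read as its
    normal amino acid here rather than as methionine.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACGCGTT ATCAACATCT GGCGACTCTA"))  # MTRYQHLATL
```

The printed peptide matches the first block of the Protein sequence row. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content row.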