Gene Information |
Locus tag | EcHS_A3637 |
Symbol | gntR |
ID | 5594325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3624767 |
End bp | 3625762 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922753 |
Product | transcriptional regulator GntR |
Protein accession | YP_001460234 |
Protein GI | 157162916 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
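The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming (as the record's own numbers suggest) that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3624767
end_bp = 3625762

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not appear in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 996 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields.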
| |
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 0.4011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAAGACCCGT ACTTCAGGAT GTGGCTGACC GTGTAGGCGT GACCAAAATG ACGGTCAGCC GTTTTTTACG CAACCCGGAG CAGGTTTCCG TCGCTCTACG CGGCAAGATT GCCGCGGCTC TTGATGAACT GGGCTATATT CCCAATCGTG CGCCCGATAT CCTCTCTAAC GCCACCAGCC GGGCGATTGG CGTCCTGTTA CCTTCTCTCA CCAACCAGGT TTTCGCGGAA GTATTACGCG GAATCGAAAG CGTCACCGAC GCGCACGGTT ATCAGACCAT GCTGGCGCAC TACGGTTATA AACCGGAAAT GGAGCAAGAA CGCCTCGAAT CCATGCTCTC CTGGAATATC GACGGCCTGA TCCTCACCGA ACGTACCCAC ACGCCGCGCA CCTTAAAGAT GATTGAAGTG GCGGGTATTC CCGTGGTGGA ACTGATGGAC AGCAAGTCGC CATGCCTTGA TATCGCCGTC GGTTTTGATA ACTTTGAAGC AGCACGCCAG ATGACCACTG CCATTATTGC TCGCGGGCAT CGCCACATTG CCTATCTCGG CGCACGTCTC GACGAACGTA CTATCATCAA ACAGAAGGGA TACGAACAGG CGATGCTGGA TGCAGGCCTG GTGCCATATA GCGTGATGGT TGAGCAATCT TCTTCTTACT CTTCCGGTAT TGAACTGATT CGCCAGGCGC GGCGGGAATA TCCGCAGCTG GATGGCGTGT TCTGTACGAA TGATGACCTG GCGGTCGGCG CGGCGTTTGA ATGTCAGCGT CTGGGGTTAA AAGTTCCTGA CGATATGGCG ATTGCCGGTT TCCACGGTCA TGACATTGGT CAGGTGATGG AGCCACGACT TGCGAGCGTG CTGACGCCGC GTGAGCGGAT GGGCAGTATT GGCGCTGAAC GCCTGCTGGC GCGTATTCGT GGCGAATCTG TGACACCGAA AATGTTAGAT TTAGGTTTCA CCTTGTCACC GGGCGGATCT ATTTAA
|
Protein sequence | MKKKRPVLQD VADRVGVTKM TVSRFLRNPE QVSVALRGKI AAALDELGYI PNRAPDILSN ATSRAIGVLL PSLTNQVFAE VLRGIESVTD AHGYQTMLAH YGYKPEMEQE RLESMLSWNI DGLILTERTH TPRTLKMIEV AGIPVVELMD SKSPCLDIAV GFDNFEAARQ MTTAIIARGH RHIAYLGARL DERTIIKQKG YEQAMLDAGL VPYSVMVEQS SSYSSGIELI RQARREYPQL DGVFCTNDDL AVGAAFECQR LGLKVPDDMA IAGFHGHDIG QVMEPRLASV LTPRERMGSI GAERLLARIR GESVTPKMLD LGFTLSPGGS I
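The sequences above are printed in space-separated blocks of 10 letters. A minimal sketch of how such a field might be cleaned up and its GC percentage computed is shown here; the helper names `ungroup` and `gc_percent` are hypothetical and not part of any database tool.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# First three blocks of the gene sequence above, used only as a short demo;
# paste the full Gene sequence field here to check the whole record.
prefix = "ATGAAAAAGA AAAGACCCGT ACTTCAGGAT"
print(len(ungroup(prefix)), round(gc_percent(prefix), 1))
```

Run on the full Gene sequence field, the length and GC percentage can be compared against the Gene Length and GC content fields of the record.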