Gene Information |
Locus tag | EcSMS35_3721 |
Symbol | gntR |
ID | 6144406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3791543 |
End bp | 3792538 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618547 |
Product | transcriptional regulator GntR |
Protein accession | YP_001745687 |
Protein GI | 170681394 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAAGACCCGT ACTTCAGGAT GTGGCTGACC GTGTAGGCGT GACCAAAATG ACGGTCAGCC GTTTTTTACG CAACCCGGAG CAGGTTTCCG TCGCTCTACG CGGCAAGATT GCCGCCGCTC TTGATGAACT GGGTTATATT CCCAATCGCG CACCCGATAT CCTCTCTAAC GCCACCAGCC GGGCGATTGG CGTCCTGTTA CCTTCTCTCA CCAACCAGGT TTTTGCGGAA GTATTACGCG GAATCGAAAG CGTCACCGAC GCGCACGGTT ATCAGACCAT GCTGGCCCAC TACGGTTATA AACCGGAAAT GGAGCAAGAA CGCCTCGAAT CCATGCTCTC CTGGAATATC GACGGCCTGA TCCTCACTGA ACGTACGCAC ACGCCGCGCA CCTTAAAGAT GATTGAAGTG GCGGGGATTC CAGTGGTGGA ACTGATGGAC AGCAAGTCGC CGTGCCTCGA TATTGCCGTC GGTTTTGATA ACTTTGAAGC GGCACGCCAG ATGACCACCG CCATTATTGC TCGCGGGCAT CGCCACATTG CCTATCTCGG CGCACGTCTC GACGAACGTA CTATCATCAA ACAGAAGGGA TACGAACAGG CGATGCTGGA TGCAGGCCTG GTGCCATATA GCGTGATGGT TGAGCAATCT TCTTCTTACT CTTCCGGTAT TGAATTGATT CGCCAGGCGA GGCGGGAATA TCCGCAGCTG GATGGCGTGT TCTGTACTAA TGATGACCTG GCGGTCGGCG CGGCGTTTGA ATGCCAGCGT CTGGGGTTGA AAGTTCCTGA CGATATGGCG ATTGCCGGTT TCCATGGTCA TGACATTGGT CAGGTGATGG AACCTCGGTT GGCGAGCGTA CTGACGCCGC GTGAGCGGAT GGGCAGTATT GGCGCTGAAC GCCTGCTGGC GCGTATTCGT GGCGAATCTG TGACACCGAA AATGTTAGAT TTAGGTTTCA CCTTGTCACC GGGCGGATCT ATTTAA
|
Protein sequence | MKKKRPVLQD VADRVGVTKM TVSRFLRNPE QVSVALRGKI AAALDELGYI PNRAPDILSN ATSRAIGVLL PSLTNQVFAE VLRGIESVTD AHGYQTMLAH YGYKPEMEQE RLESMLSWNI DGLILTERTH TPRTLKMIEV AGIPVVELMD SKSPCLDIAV GFDNFEAARQ MTTAIIARGH RHIAYLGARL DERTIIKQKG YEQAMLDAGL VPYSVMVEQS SSYSSGIELI RQARREYPQL DGVFCTNDDL AVGAAFECQR LGLKVPDDMA IAGFHGHDIG QVMEPRLASV LTPRERMGSI GAERLLARIR GESVTPKMLD LGFTLSPGGS I
|
| |
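The length fields in this record follow from the coordinates and the sequence: the gene spans End bp - Start bp + 1 bases, and the protein length is the number of codons minus the stop codon. The sketch below is a minimal consistency check, not part of the database record. The function names (gene_length, protein_length, gc_fraction, translate) are hypothetical helpers introduced for illustration. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the - strand) and that whitespace between the 10-base blocks can be ignored.

```python
# Values copied from the record above.
START_BP = 3791543
END_BP = 3792538

# Codon-to-amino-acid assignments in T, C, A, G order at each codon position.
# Translation table 11 uses the same assignments as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids for a CDS whose last codon is a stop codon."""
    return gene_len_bp // 3 - 1


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate complete codons; a stop codon is written as '*'."""
    s = clean(seq)
    codons = (s[i:i + 3] for i in range(0, len(s) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                                   # 996
    print(protein_length(length))                   # 331
    print(translate("ATGAAAAAGA AAAGACCCGT A"))     # MKKKRPV
```

Pasting the full Gene sequence value into translate and gc_fraction would let the result be compared against the Protein sequence and GC content fields; the short fragment here is just the first seven codons of the record.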