Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2793 |
Symbol | |
ID | 6143066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2874146 |
End bp | 2875480 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617662 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001744822 |
Protein GI | 170681638 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.102841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGCT ATCAGGATAT CGCCCGTCAG TTAAAAACGG CTATTGAGCA AGGGGAACTG AAACCCGGCG CAAGGCTGCC TTCAAGTCGT ACCTGGTCGC AGGAGCTGGG GGTGTCTCGA TCCACTGTCG AAAATGCGTA TGCTGAGCTG GTGGCGCAAG GATGGCTGAT CAGGCGGGGA CAGGCTGGCA CATTTGTCAG TGAGCGGATA TATCCGCAAC AATCTACTGT ACAAGTTGTA GCTTTTGCCG GTGAAAGTCA GCAACCGTTG CCATTTCAAA TGGGATTACC CGCACTCGAT CTCTTTCCGC GAGAGCTGTG GGCGCGGGTG ATGGGGCGTC GCCTTCGTAC CCAGACGCGC TTTGATTTGG CGTTAGGCGA TGTCTGCGGT GAGGCGGCGT TGCGCGAGGC AATAGTTGAT TACTTACGCG TTTCACGTGG GATTGATTGT CAGCCAGAGC AGGTCTTTAT CACTCACGGT TATGCGGCCT CAATAGCTTT AATTCTGCAC GCGCTGGCGA AACCGGGAAA CGGGATGTGG ATAGAAGATC CCGGCTTTCC ACTGATTCGC CCGATTGTCA CTCGCCATGA TGTGGAAATT TTGCCTGTGC CGGTTGATGA CAATGGACTG GATATCACAA GCGGAATACA AAATTATCCT GATGCGCGTT TTGCCCTGAT AACACCAGCA CACCAAAGTC CGCTGGGTGT GGCGCTCTCT TTAGCGCGCA GGCATCAGAT ACTGGAATGG GCAGATCGTA GTCAGGCATG GATTATTGAA GATGATTATG ACAGTGAGTT TCGCTATCAC GGTAAGCCGT TACCGGCGTT GAAAAGTCTC GACGCACCGC AGCGGGTAAT TTATGCCGGA ACATTCAGCA AAGCGCTATT TCCTGCATTG CGCTGTGCGT GGCTGGTGGT GCCGGTGAAG CAAATTGCAC AATTCCGCCA CCAGGCGTCA CTGGCTCCAT GTGCGGTACC TGTTCTATGG CAGAACACAC TGGCAGACTT TCTTCGCGAG GGGCATTTCT GGCGGCATCT GAAGAAAATG CGTCAGCATT ATGCTCAACG TCGGCAATGG ATTGAGCAAG CGTTGACGCA GCAAGGATTT CAGGTTGTGC CGCAGAAAGG TGGTATCCAG ATGGTGATCA GAATGATAGG CGATGATATT GCCCATGCGC GTAAAGCCAA TGCTGCAGGC CTTGCGGTGC AGGCACTTAG CGACTGGCGT ATCCGTTCAA GTGGGGAAGG TGGATTACTG CTTTCGTTTA CTAATATCGT TAACGAAGGT ATGGCGCGAC AGGTAGCACA ACAATTACGT AAAGCCTTAA GCTAA
|
Protein sequence | MPRYQDIARQ LKTAIEQGEL KPGARLPSSR TWSQELGVSR STVENAYAEL VAQGWLIRRG QAGTFVSERI YPQQSTVQVV AFAGESQQPL PFQMGLPALD LFPRELWARV MGRRLRTQTR FDLALGDVCG EAALREAIVD YLRVSRGIDC QPEQVFITHG YAASIALILH ALAKPGNGMW IEDPGFPLIR PIVTRHDVEI LPVPVDDNGL DITSGIQNYP DARFALITPA HQSPLGVALS LARRHQILEW ADRSQAWIIE DDYDSEFRYH GKPLPALKSL DAPQRVIYAG TFSKALFPAL RCAWLVVPVK QIAQFRHQAS LAPCAVPVLW QNTLADFLRE GHFWRHLKKM RQHYAQRRQW IEQALTQQGF QVVPQKGGIQ MVIRMIGDDI AHARKANAAG LAVQALSDWR IRSSGEGGLL LSFTNIVNEG MARQVAQQLR KALS
|
| |