Gene Information |
Locus tag | EcSMS35_4886 |
Symbol | |
ID | 6144120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 5003289 |
End bp | 5004701 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619690 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001746797 |
Protein GI | 170680887 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
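The coordinate and length fields in the table above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcSMS35_4886.
length_bp = gene_length(5003289, 5004701)
print(length_bp)                  # 1413
print(protein_length(length_bp))  # 470
```

Both results agree with the Gene Length (1413 bp) and Protein Length (470 aa) fields listed in the record.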
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.344602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.838171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGTT ATCAACATCT GGCGACTTTG CTTGCCGAGC GGATTGAGCA AGGGCTGTAT CGTCACGGGG AGAAATTGCC GTCGGTGCGC AGCTTAAGTC AGGAGCACGG CGTCAGCATC AGTACCGTGC AGCAGGCGTA CCAGACGCTG GAAACGATGA AGCTCATCAC TCCGCAGCCG CGTTCGGGTT ATTTTGTCGC ACAACGTAAA GCCCAGCCGC CAGTACCGCC GATGACGCGT CCGGTGCAGC GCCCGGTGGA AATTACCCAG TGGGATCAGG TGCTGGATAT GCTGGAGGCG CATAGCGACA GTTCCATTGT TCCGTTAAGT AAAAGCACGC CGGATGTCGA AACGCCCAGC CTGAAACCGC TGTGGCGGGA GCTAAGCCGG GTGGTGCAGC ATAATCTGCA AACCGTTCTC GGTTATGACT TGTTAGCCGG CCAGCGGGTA TTGCGCGAGC AGATTGCCCG CCTGATGCTC GACAGCGGCT CGGTGGTCAC CGCCGATGAC ATCATCATCA CCAGCGGCTG CCATAACTCG ATGTCGCTGG CGTTGATGGC AGTGTGTAAA CCGGGCGATA TTGTCGCGGT CGAATCCCCC TGTTATTACG GTTCGATGCA GATGCTGCGC GGCATGGGCG TGAAAGTGAT TGAAATCCCC ACCGATCCAG AAACAGGCAT CAGCGTTGAA GCGCTGGAAC TGGCGCTGGA GCAGTGGCCG ATTAAAGGCA TCATTCTGGT GCCAAACTGT AATAATCCGC TGGGATTTAT TATGCCGGAC GCGCGCAAAC GGGCCGTTCT CTCTCTCGCT CAGCGTCATG ATATTGTGAT TTTTGAAGAT GATGTCTACG GCGAACTGGC GACGGAGTAT CCGCGCCCGC GGACCATTCA TTCCTGGGAT ATCGACGGGC GAGTGCTGTT GTGCAGCTCG TTCAGTAAAA GTATTGCTCC AGGCCTGCGC GTGGGTTGGG TCGCACCGGG GCGCTATCAC GATAAACTGC TGCATATGAA ATATGCCATC AGCAGCTTTA ATGTGCCGTC CACGCAAATG GCGGCGGCAA CGTTTGTGCT GGAAGGTCAC TATCATCGCC ATATCCGGCG GATGCGGCAG ATCTATCAGC GCAATTTGGC GCTTTATACC TGCTGGATAC GGGAATATTT TCCCTGCGAA ATCTGTATTA CGCGCCCGAA AGGCGGATTT TTACTGTGGA TCGAATTGCC TGAACAGGTC GATATGGTTT GCGTTGCGCG GCAGCTATAC CGCATGAAAA TCCAGGTGGC GGCAGGCTCG ATTTTCTCAG CTTCCGGCAA ATACCGTAAT TGTCTGCGCA TCAACTGCGC TTTGCCGCTC AGCGAAACCT ATCGCGAAGC ACTGAAGCAA ATTGGCGAAG CCGTGTATCG GGCAATGGAA TGA
|
Protein sequence | MTRYQHLATL LAERIEQGLY RHGEKLPSVR SLSQEHGVSI STVQQAYQTL ETMKLITPQP RSGYFVAQRK AQPPVPPMTR PVQRPVEITQ WDQVLDMLEA HSDSSIVPLS KSTPDVETPS LKPLWRELSR VVQHNLQTVL GYDLLAGQRV LREQIARLML DSGSVVTADD IIITSGCHNS MSLALMAVCK PGDIVAVESP CYYGSMQMLR GMGVKVIEIP TDPETGISVE ALELALEQWP IKGIILVPNC NNPLGFIMPD ARKRAVLSLA QRHDIVIFED DVYGELATEY PRPRTIHSWD IDGRVLLCSS FSKSIAPGLR VGWVAPGRYH DKLLHMKYAI SSFNVPSTQM AAATFVLEGH YHRHIRRMRQ IYQRNLALYT CWIREYFPCE ICITRPKGGF LLWIELPEQV DMVCVARQLY RMKIQVAAGS IFSASGKYRN CLRINCALPL SETYREALKQ IGEAVYRAME
|
| |
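The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the sketch below shows one way to translate it and to compute its GC fraction in Python. The codon table uses the standard amino-acid assignments, which translation table 11 shares with the standard code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 nucleotides of the record are used to keep the example short.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments (also used by translation table 11).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record.
prefix = "ATGACGCGTT ATCAACATCT GGCGACTTTG"
print(translate(prefix))  # MTRYQHLATL
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.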