Gene Information |
Locus tag | EcSMS35_1735 |
Symbol | |
ID | 6145643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1739827 |
End bp | 1741233 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616611 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001743789 |
Protein GI | 170682466 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
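The coordinate and length fields above can be cross-checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1739827
end_bp = 1741233

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1407
print(protein_length)   # 468
```

Both results agree with the Gene Length (1407 bp) and Protein Length (468 aa) fields.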
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.453488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000000000634797 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAT ACCAGCAGCT TGCAGAACAA TTACGCGAGC AGATTGCGTC GGGTATCTGG CAACCCGGAG ATCGTTTGCC TTCGTTGCGT GACCAGGTGG CACTTTCTGG CATGAGCTTT ATGACCGTGA GCCATGCCTA TCAGTTGCTC GAAAGTCAGG GATATATTAT CGCACGACCG CAGTCGGGTT ATTACGTTGC GCCACAGGCA ATAAAAATGC CGAAAGCGCC AGTCATTCCA GTCACTCGAG ATGAAGCAGT CGATATCAAC ACTTATATTT TTGATATGTT GCAGGCCAGT CGCGATCCGT CGGTCGTTCC GTTTGCCTCG GCCTTTCCCG ACCCGCGACT TTTCCCCCTC CAACAACTAA ACCGCTCGCT GGCGCAGGTA AGCAAAACCG CCACAGCGAT GAGCGTGATT GAAAACTTAC CGCCAGGAAA CGCAGAACTG CGTCAGGCTA TTGCTCGTCG CTATGCCTTA CAGGGCATCA CTATTTCTCC AGATGAAATT GCCATTACCG CCGGGGCGTT AGAGGCATTA AACCTCAGTT TGCAAGTGGT AACTGAACCG GGCGATTGGG TGATAGTAGA GAATCCTTGT TTCTACGGCG CTTTGCAGGC GCTGGAGCGG TTACGGCTGA AGGCGTTATC GGTGGCGACG GACGTTAAAG AAGGGATCGA TCTTCAGGCG CTGGAACTGG CGTTGCAGGA GTATCCGGTG AAAGCGTGCT GGCTGATGAC TAATAGCCAG AATCCACTCG GATTTACCTT AACACCGCAA AAAAAAGCAC AACTGGTGGC GTTGCTCAAT CAGTACAACG TAACGCTGAT TGAAGATGAC GTTTACAGCG AACTTTATTT TGGACGGGAA AAACCGCTGC CTGCGAAAGC GTGGGATCAC CACGATGGTG TTTTGCATTG CTCTTCGTTT TCGAAATGTC TGGTGCCTGG TTTTCGTATT GGTTGGGTCG CCGCCGGAAA ACATGCACGT AAAATTCAAC GCTTGCAGTT GATGAGTACG CTTTCCACCA GCTCACCGAT GCAGCTTGCG CTGGTGGATT ACCTTTCTAC TCGCCGATAC GATGCTCATC TTCGTCGTCT GCGTCGCCAG CTTGCGGAAC GTAAACAACG CGCCTGGCAG GCTTTGCTGC GTTATCTGCC TGCGGAAGTC AAAATTCATC ACAATGACAG CGGTTATTTT CTCTGGCTGG AGCTCCCCGA GCCGTTAGAT GCAGGAGAAT TAAGCCTGGT GGCACTGACG CATCATATCA GTATTGCGCC GGGTAAAATG TTTTCTACCG GTGAAAACTG GTCACGTTTT TTTCGTTTTA ATACCGCGTG GCAGTGGGGA GAACGTGAAG AACAGGCGGT AAAACAATTA GGCAAACTTA TTCAAGAACG GCTGTAA
|
Protein sequence | MKKYQQLAEQ LREQIASGIW QPGDRLPSLR DQVALSGMSF MTVSHAYQLL ESQGYIIARP QSGYYVAPQA IKMPKAPVIP VTRDEAVDIN TYIFDMLQAS RDPSVVPFAS AFPDPRLFPL QQLNRSLAQV SKTATAMSVI ENLPPGNAEL RQAIARRYAL QGITISPDEI AITAGALEAL NLSLQVVTEP GDWVIVENPC FYGALQALER LRLKALSVAT DVKEGIDLQA LELALQEYPV KACWLMTNSQ NPLGFTLTPQ KKAQLVALLN QYNVTLIEDD VYSELYFGRE KPLPAKAWDH HDGVLHCSSF SKCLVPGFRI GWVAAGKHAR KIQRLQLMST LSTSSPMQLA LVDYLSTRRY DAHLRRLRRQ LAERKQRAWQ ALLRYLPAEV KIHHNDSGYF LWLELPEPLD AGELSLVALT HHISIAPGKM FSTGENWSRF FRFNTAWQWG EREEQAVKQL GKLIQERL
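As a consistency check on the two sequence fields, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. The listed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch in plain Python: `translate` and `gc_content` are hypothetical helper names, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons). Only the first 30 and last 27 bases of the record are used.

```python
# Build the standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA string codon by codon; stop codons become '*'."""
    dna = clean(seq)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 27 bases of the gene sequence, copied from the record.
head = "ATGAAAAAAT ACCAGCAGCT TGCAGAACAA"
tail = "GGCAAACTTA TTCAAGAACG GCTGTAA"

print(translate(head))  # MKKYQQLAEQ
print(translate(tail))  # GKLIQERL*
```

The translated fragments match the start (`MKKYQQLAEQ`) and end (`GKLIQERL`) of the protein sequence. Passing the complete gene sequence to `gc_content` would give the figure to compare against the GC content field.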