Gene Information |
Locus tag | ECH74115_0076 |
Symbol | sgrR |
ID | 6969269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 81450 |
End bp | 83108 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643384155 |
Product | transcriptional regulator SgrR |
Protein accession | YP_002268678 |
Protein GI | 209397844 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
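The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, inferred from the fact that the numbers then agree with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 81450
end_bp = 83108

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the result matches the record: 1659 bp and 552 aa.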
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCTG CTCGTCTGCA ACAACAGTTC ATCCGCCTGT GGCAATGCTG CGAGGGTAAA TCGCAGGACA CAACGCTCAA CGAACTGGCA GCGTTATTGA GCTGCTCGCG TCGTCATATG CGTACCCTGC TCAACACCAT GCAGGATCGC GGCTGGCTGA CGTGGGAAGC GGAAGTCGGG CGTGGTAAAC GCTCGCGTCT GACATTCCTC TATACCGGGC TGGCGCTTCA GCAACAGCGG GCGGAAGACC TGCTGGAGCA GGACCGTATC GACCAACTAG TGCAGTTAGT TGGCGACAAA GCGACTGTGA GGCAAATGCT GGTTTCTCAT CTGGGCCGCA GCTTCCGCCA GGGGCGGCAC ATCCTGCGCG TGCTCTACTA TCGTCCGTTG CGTAATCTGC TACCTGGCAG CGCATTGCGC CGTTCCGAAA CCCATATCGC CCGGCAAATC TTCAGTTCGC TAACGCGCAT AAATGAGGAA AATGGGGAAC TGGAAGCAGA CATCGCCCAC CACTGGCAGC AAATATCACC GCTTCACTGG CGTTTCTTTT TGCGTCCAGG AGTCCATTTT CATCATGGTC GTGAACTGGA AATGGACGAC GTGATCGCCT CTTTAAAACG AATCAATACG CTGCCGCTCT ATTCGCATAT TGCTGACATT GTGTCGCCGA CGCCCTGGAC GCTGGATATC CACCTCACGC AGCCGGATCG CTGGTTGCCG TTACTGCTGG GACAAGTTCC GGCGATGATC CTGCCGCGCG AATGGGAAAC CCTCAGTAAC TTTGCCAGCC ATCCCATCGG CACCGGTCCG TATGCGGTGA TTCGCAACAG CACCAATCAA CTGAAAATTC AGGCATTCGA TGACTTCTTC GGTTACCGGG CATTAATCGA TGAAGTTAAC GTCTGGGTTC TGCCGGAAAT TGCCGACGAG CCAGCCGGAG GGCTGATGCT AAAAGGTCCA CAGGGCGAGG AAAAAGAGAT TGAAAGCCGC CTGGAGGAAG GTTGCTACTA TTTACTGTTC GATAGCCGCA CCCATCGCGG GGCGAATCAG CAAGTCAGGG ACTGGGTAAG CTATGTGCTT TCTCCAACTA ATCTGGTCTA TTTCGCTGAG GAACAGTACC AGCAACTGTG GTTCCCGGCT TATGGACTGC TCCCCCGTTG GCATCATGCC CGCACCATAA CGAGCGAAAA ACCGGCTGGT CTGGAAAGCC TCACCCTGAC CTTTTATCAG GATCACAGTG AGCATCGGGT GATTGCCGGG ATCATGCAGC AGATTCTGGC AAGCCACCAG GTCACACTGG AAATCAAAGA GATCAGCTAC GATCAGTGGC ATGAAGGAGA GATCGAGAGC GATATCTGGC TTAACAGCGC CAACTTTACG CTGCCGCTGG ATTTTTCGCT GTTCGCGCAC CTGTGCGAGG TACCGCTGCT CCAACACTGT ATTCCAATCG ACTGGCAAGT CGACGCCGCC CGCTGGCGCA ATGGCGAAAT GAACCTGGCG AACTGGTGCC AACAACTGGT CGCCAGCAAA GCAATGGTGC CACTTATCCA CCACTGGCTG ATCATTCAGG GACAACGCAG TATGCGCGGC CTGCGCATGA ACACCCTCGG CTGGTTTGAT TTTAAATCAG CGTGGTTTGC GCCGCCGGAT CCAGAGTAG
|
Protein sequence | MPSARLQQQF IRLWQCCEGK SQDTTLNELA ALLSCSRRHM RTLLNTMQDR GWLTWEAEVG RGKRSRLTFL YTGLALQQQR AEDLLEQDRI DQLVQLVGDK ATVRQMLVSH LGRSFRQGRH ILRVLYYRPL RNLLPGSALR RSETHIARQI FSSLTRINEE NGELEADIAH HWQQISPLHW RFFLRPGVHF HHGRELEMDD VIASLKRINT LPLYSHIADI VSPTPWTLDI HLTQPDRWLP LLLGQVPAMI LPREWETLSN FASHPIGTGP YAVIRNSTNQ LKIQAFDDFF GYRALIDEVN VWVLPEIADE PAGGLMLKGP QGEEKEIESR LEEGCYYLLF DSRTHRGANQ QVRDWVSYVL SPTNLVYFAE EQYQQLWFPA YGLLPRWHHA RTITSEKPAG LESLTLTFYQ DHSEHRVIAG IMQQILASHQ VTLEIKEISY DQWHEGEIES DIWLNSANFT LPLDFSLFAH LCEVPLLQHC IPIDWQVDAA RWRNGEMNLA NWCQQLVASK AMVPLIHHWL IIQGQRSMRG LRMNTLGWFD FKSAWFAPPD PE
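The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the usual compact TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard table, differing only in permitted start codons, so the standard assignments are used here. The function name `translate` is hypothetical, and the sketch assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), even though the gene lies on the minus strand of the replicon.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCATCTG CTCGTCTGCA ACAACAGTTC"))  # MPSARLQQQF
# Last 9 bases of the gene sequence above, including the TAG stop codon.
print(translate("CCAGAGTAG"))  # PE
```

Both outputs agree with the start (MPSARLQQQF) and end (PE) of the listed protein sequence.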