Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0074 |
Symbol | sgrR |
ID | 6145976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 83496 |
End bp | 85154 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641614975 |
Product | transcriptional regulator SgrR |
Protein accession | YP_001742191 |
Protein GI | 170681588 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.492998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCTG CTCGTCTGCA ACAACAGTTC ATCCGCCTGT GGCAATGCTG CGAGGGTAAA TCGCAGGACA CAACGCTCAA CGAACTGGCA GCGTTATTGA GCTGCTCGCG TCGTCATATG CGCACCCTGC TCAACACCAT GCAGGATCGC GGCTGGCTGA CGTGGGAAGC GGAAGTCGGG CGCGGTAAAC GCTCGCGTCT GACATTCCTC TATACCGGGC TGGCGCTTCA GCAACAGCGG GCGGAAGACC TGCTGGAGCA GGATCGTATC GATCAACTGG TGCAACTGGT TGGCGACAAA GCGACTGTGC GGCAAATGCT GGTTTCTCAT CTGGGCCGCA GCTTCCGCCA GGGGCGACAC ATCCTGCGTG TGCTCTACTA TCGTCCGTTG CGTAATCTGC TACCTGGCAG CGCATTACGC CGTTCCGAAA CCCATATCGC CCGGCAAATC TTCAGTTCGC TAACGCGCAT AAATGAGGAA AATGGGGAAC TGGAAGCAGA CATCGCCCAC CACTGGCAGC AAATATCACC GCTTCACTGG CGTTTCTTTT TGCGTCCAGG AGTCCATTTT CATCATGGTC GTGAACTGGA AATGGACGAT GTGATCGCCT CTTTAAAACG GATCAATACG CTGCCGCTCT ATTCGCATAT TGCTGAAATT GTATCGCCGA CGCCCTGGAC GCTGGATATC CACCTCACGC AGCCGGATCG CTGGTTGCCG TTACTGCTGG GGCAAGTTCC GGCGATGATC CTGCCGCGCG AATGGGAAAC CCTCAGTAAC TTTGCCAGCC ATCCCATCGG CACCGGTCCG TATGCGGTGA TCCGCAACAG CACCAATCAA CTGAAAATTC AGGCATTCGA TGACTTCTTC GGTTACCGGG CATTAATCGA CGAAGTTAAC GTCTGGGTTC TGCCGGAAAT TGCCGACGAG CCAGCCGGAG GGCTGATGCT AAAAGGGCCA CAGGGCGAGG AAAAAGAGAT TGAAAGCCGC CTGGAGGAAG GTTGCTACTA TTTACTGTTC GACAGCCGCA CCCATCGCGG GGCGAATCAG CAAGTCAGGG ACTGGGTAAG CTATGTGCTT TCTCCAACTA ATCTGGTCTA TTTCGCTGAG GAACAGTACC AGCAACTGTG GTTCCCGGCT TATGGACTGC TCCCCCGTTG GCACCATGCT CGTCCGACAC ATTGCGAAAA ACCCGCCGGG CTGGAAAGCC TCACCCTGAC CTTTTATCAG GATCATATTG AGCATCGAGT GATTGCCGGG ATCATGCAGC AGATTCTGGC AAGTCACCAG GTCACGCTGG AAATCAAAGA GATCAGCTAC GATCAGTGGC ATGAAGGAGA GATCGAGAGC GATATCTGGC TTAACAGCGC CAACTTTACG CTGCCGCTGG ATTTTTCGCT GTTCGCGCAC TTGTGCGAGG TGCCGCTGCT CCAACACTGT CTTCCGATCG ACTGGCAAGC CGACGCCGCC CGCTGGCGCA ATGGCGAAAT GAACCTGGCG AACTGGTGCC AGCAACTGGT CGCCAGCAAA GCAATGGTGC CGCTTATCCA CCACTGGCTG ATCATTCAGG GGCAACGCAG TATGCGCGGT CTACGTATGA ACACCCTCGG CTGGTTCGAT TTTAAATCAG CGTGGTTTGC GCCGCCGGAT CCAGAGTAG
|
Protein sequence | MPSARLQQQF IRLWQCCEGK SQDTTLNELA ALLSCSRRHM RTLLNTMQDR GWLTWEAEVG RGKRSRLTFL YTGLALQQQR AEDLLEQDRI DQLVQLVGDK ATVRQMLVSH LGRSFRQGRH ILRVLYYRPL RNLLPGSALR RSETHIARQI FSSLTRINEE NGELEADIAH HWQQISPLHW RFFLRPGVHF HHGRELEMDD VIASLKRINT LPLYSHIAEI VSPTPWTLDI HLTQPDRWLP LLLGQVPAMI LPREWETLSN FASHPIGTGP YAVIRNSTNQ LKIQAFDDFF GYRALIDEVN VWVLPEIADE PAGGLMLKGP QGEEKEIESR LEEGCYYLLF DSRTHRGANQ QVRDWVSYVL SPTNLVYFAE EQYQQLWFPA YGLLPRWHHA RPTHCEKPAG LESLTLTFYQ DHIEHRVIAG IMQQILASHQ VTLEIKEISY DQWHEGEIES DIWLNSANFT LPLDFSLFAH LCEVPLLQHC LPIDWQADAA RWRNGEMNLA NWCQQLVASK AMVPLIHHWL IIQGQRSMRG LRMNTLGWFD FKSAWFAPPD PE
|
| |