Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0074 |
Symbol | sgrR |
ID | 5595447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 77399 |
End bp | 79054 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640919262 |
Product | transcriptional regulator SgrR |
Protein accession | YP_001456857 |
Protein GI | 157159539 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 68 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCTG CTCGTCTGCA ACAACAGTTC ATCCGCCTGT GGCAATGCTG CGAGGGTAAA TCGCAGGACA CAACGCTCAA CGAACTGGCA GCGTTATTGA GCTGCTCGCG TCGTCATATG CGTACCCTGC TCAACACCAT GCAGGATCGC GGCTGGCTGA CGTGGGAAGC GGAAGTCGGG CGCGGTAAAC GCTCGCGTCT GACATTCCTC TATACCGGGC TGGCGCTTCA GCAACAGCGG GCGGAAGACC TGCTGGAGCA GGATCGTATC GACCAACTAG TACAGTTGGT TGGCGACAAA GCGACTGTGC GGCAAATGCT GGTTTCTCAT CTGGGCCGCA GCTTCCGCCA GGGGCGGCAC ATCCTGCGCG TGCTCTACTA TCGTCCGTTG CGTAATCTGC TACCTGGCAG CGCATTGCGC CGTTCCGAAA CCCATATTGC CCGGCAAATC TTCAGTTCGC TAACGCGCAT AAATGAGGAA AATGGGGAAC TGGAAGCAGA CATCGCCCAC CACTGGCAGC AAATATCACC GCTTCGCTGG CGTTTCTTTT TGCGTCCAGG AGTCCATTTT CATCATGGTC GTGAACTGGA AATGGACGAC GTGATCGCCT CTTTAAAACG AATCAATACG CTGCCGCTCT ATTCGCATAT TGCTGACATT GTGTCGCCGA CGCCCTGGAC GCTGGATATC CATCTCACGC AACCGGACCG CTGGTTACCG TTATTGCTGG GGCAAGTTCC GGCGATGATC CTGCCGCGCG AATGGGAAAC CCTCAGTAAC TTTGCCAGCC ATCCCATCGG CACCGGTCCG TATGCGGTGA TCCGCAACAG CACCAATCAA CTGAAAATTC AGGCATTCGA TGACTTCTTC GGTTACCGGG CATTAATCGA CGAAGTTAAC GTCTGGGTTC TGCCGGAAAT TGCCGACGAG CCAGCCGGAG GTCTGATGCT AAAAGGTCCA CAGGGCGAGG AAAAAGAGAT TGAAAGCCGC CTGGAGGAAG GTTGCTACTA TTTACTATTC GACAGCCGCA CCCATCGCGG GGCGAATCAG CAAGTCAGGG ACTGGGTAAG CTATGTGCTT TCTCCAACTA ATCTGGTCTA TTTCGCTGAG GAACAGTACC AGCAACTGTG GTTCCCGGCT TATGGACTGC TCCCCCGTTG GCACCATGCC CGCACCATAA AGAGCGAAAA ACCGGCTGGT CTGGAAAGCC TCACCCTAAC CTTTTATCAG GATCACAGTG AGCATCGGGT GATTGCCGGG ATCATGCAGC AGATTCTGGC AAGTCACCAG GTCACGCTGG AAATCAAAGA GATCGACTAC GATCAGTGGC ATACAGGAGA GATCGAAAGT GATATCTGGC TAAACAGCGC CAACTTTACC CTGCCGCTGG ACTTCTCTGT TTTCGCACAT TTATGCGAAG TGCCACTGCT ACAACATTGC ATTCCCATTG ACTGGCAAGC CGACGCTGCT CGCTGGCGCA ATGGCGAGAT GAATCTGGCG AACTGGTGCC AGCAACTGGT CGCCAGCAAA GCGATGGTGC CATTATTGCA CCACTGGCTG ATCATTCAGG GGCAACGCAG TATGCGCGGC CTGCGCATGA ACACCCTCGG CTGGTTCGAT TTTAAATCAG CGTGGTTTGC GCCACCGGAT CCATGA
|
Protein sequence | MPSARLQQQF IRLWQCCEGK SQDTTLNELA ALLSCSRRHM RTLLNTMQDR GWLTWEAEVG RGKRSRLTFL YTGLALQQQR AEDLLEQDRI DQLVQLVGDK ATVRQMLVSH LGRSFRQGRH ILRVLYYRPL RNLLPGSALR RSETHIARQI FSSLTRINEE NGELEADIAH HWQQISPLRW RFFLRPGVHF HHGRELEMDD VIASLKRINT LPLYSHIADI VSPTPWTLDI HLTQPDRWLP LLLGQVPAMI LPREWETLSN FASHPIGTGP YAVIRNSTNQ LKIQAFDDFF GYRALIDEVN VWVLPEIADE PAGGLMLKGP QGEEKEIESR LEEGCYYLLF DSRTHRGANQ QVRDWVSYVL SPTNLVYFAE EQYQQLWFPA YGLLPRWHHA RTIKSEKPAG LESLTLTFYQ DHSEHRVIAG IMQQILASHQ VTLEIKEIDY DQWHTGEIES DIWLNSANFT LPLDFSVFAH LCEVPLLQHC IPIDWQADAA RWRNGEMNLA NWCQQLVASK AMVPLLHHWL IIQGQRSMRG LRMNTLGWFD FKSAWFAPPD P
|
| |