Gene Information |
Locus tag | ECH74115_5499 |
Symbol | sorC |
ID | 6966636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5146805 |
End bp | 5147752 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643389143 |
Product | sorbitol operon regulator SorC |
Protein accession | YP_002273540 |
Protein GI | 209396331 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | |
|
|
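The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, not statements from the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5146805
END_BP = 5147752
GENE_LENGTH_BP = 948
PROTEIN_LENGTH_AA = 315


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons match protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.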
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAACA GTGACGATAT CCGTTTGATT GTGAAGATTG CCCAACTCTA TTACGAACAG GATATGACGC AGGCGCAAAT CGCGCGCGAA CTGGGTATTT ACCGCACCAA CATCAGCCGC TTGCTTAAAC GAGGCCGCGA TCAGGGAATT GTCACCATCG CCATCAACTA TGACTACAAC GAAAATCTCT GGCTGGAGCA GCAACTGAAG CAAAAGTTTG GCCTGAAAGA CGTTGTGGTG GTGTCGGGAA ATGATGAGGA TGAAGAGACT CAACTGGCGA TGATGGGGTT ACACGGCGCG CAACTGCTGG ATCGCTTGCT GGAGCCTGGC GATATTGTCG GTTTTTCCTG GGGTCGCGCG GTGAGCGCAC TGGTTGAAAA CTTGCCGCAG GCGGGGCAAT CGCGGCAGTT AATCTGCGTA CCAATTATTG GCGGACCGTC CGGTAAACTC GAAAGCCGCT ATCACGTAAA CACATTAACC TACAGCGCGG CAGCGAAGCT GAAAGGGGAA TCGCATCTCG CGGATTTTCC GGCTCTACTG GATAACCCAT TAATTCGTAA TGGGATCATG CAGTCTCAGC ACTTTAAAAC CATCTCTGCC TACTGGGATA ATCTGGATGT CGCCCTGGTG GGAATTGGCT CACCGGCCAT TCGCGACGGC GCTAACTGGC ATGCGTTTTA TGGTGGTGAA GAGAGTGACG ACCTGAATGC CCGCCAGGTT GCTGGCGATA TTTGCTCGCG CTTTTTTGAT ATTCACGGCG AAATGGTTGA AACGAATATG AGCGAAAAAA CACTCTCTAT CGAAATGAAT AAATTAAAGC AGGCACGATA TTCCATTGGC ATTGCCATGA GTGAAGAAAA ATACAGCGGA ATTATTGGTG CACTGCGTGG AAAATATATT AATTGTCTGG TAACGAATAG CAGCACAGCT GAACTATTAC TGAAATAA
|
Protein sequence | MENSDDIRLI VKIAQLYYEQ DMTQAQIARE LGIYRTNISR LLKRGRDQGI VTIAINYDYN ENLWLEQQLK QKFGLKDVVV VSGNDEDEET QLAMMGLHGA QLLDRLLEPG DIVGFSWGRA VSALVENLPQ AGQSRQLICV PIIGGPSGKL ESRYHVNTLT YSAAAKLKGE SHLADFPALL DNPLIRNGIM QSQHFKTISA YWDNLDVALV GIGSPAIRDG ANWHAFYGGE ESDDLNARQV AGDICSRFFD IHGEMVETNM SEKTLSIEMN KLKQARYSIG IAMSEEKYSG IIGALRGKYI NCLVTNSSTA ELLLK
|
| |
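The gene and protein sequences above can be related with a short translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the listed gene sequence is given in coding orientation (it begins with ATG), so no reverse complement is applied despite the minus strand. It uses the standard amino acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled. The helper names `normalize`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def normalize(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = normalize(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = normalize(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First and last blocks of the gene sequence listed in this record.
    print(translate("ATGGAAAACA GTGACGAT"))
    print(translate("GAACTATTAC TGAAATAA"))
```

Pasting the full Gene sequence field into `translate` and comparing the result with the Protein sequence field (after `normalize`) gives a quick consistency check; `gc_percent` can be compared with the GC content field in the same way.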