Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4482 |
Symbol | sorC |
ID | 6145562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4578738 |
End bp | 4579685 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619298 |
Product | sorbitol operon regulator SorC |
Protein accession | YP_001746410 |
Protein GI | 170679743 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.543234 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAACA GTGACGATAT CCGTTTGATT GTGAAGATTG CCCAACTCTA TTACGAACAG GATATGACGC AGGCGCAAAT CGCGCGCGAA CTGGGTATTT ACCGCACCAC CATCAGCCGC TTGCTTAAAC GAGGCCGCGA TCAGGGAATT GTCACCATCG CCATCAACTA TGACTACAAC GAAAATCTCT GGCTGGAGCA GCAACTGAAG CAAAAGTTTG GCCTGAAAGA CGTTGTGGTG GTGTCGGGAA ATGATGAGGA TGAAGAGACT CAACTGGCGA TGATGGGGTT ACACGGCGCG CAACTGCTGG ATCGCTTGCT GGAACCTGGC GATATTGTCG GTTTTTCCTG GGGCCGCGCG GTGAGCGCAC TGGTTGAAAA CTTGCCGCAG GCGGGGCAAT CGCGGCAGTT AATTTGCGTG CCGATTATTG GCGGCCCGTC CGGTAAACTC GAAAGCCGCT ATCACGTAAA CACATTAACC TACAGCGCGG CAGCGAAGCT GAAAGGGGAA TCGCATCTCG CGGATTTTCC GGCTCTGCTG GATAACCCAT TAATTCGTAA TGGGATCATG CAGTCTCAGC ACTTTAAAAC CATCTCTGCC TACTGGGATA ATCTGGATGT CGCCCTGGTG GGAATTGGCT CACCGGCCAT TCGCGACGGC GCTAACTGGC ATGCGTTTTA TGGTGGTGAA GAGAGTGACG ACCTGAATGC CCGCCAGGTT GCTGGCGATA TTTGCTCGCG CTTTTTTGAT ATTCACGGCG CAATGGTTGA AACGAATATG AGCGAAAAAA CACTCTCTAT CGAAATGAAT AAATTAAAGC AGGCACGGTA TTCCATTGGC ATTGCCATGA GCGAAGAAAA ATACAGCGGA ATTGTTGGTG CACTGCGTGG AAAATATATT AATTGTCTGG TAACGAATAG CAGCACAGCT GAACTATTAC TGAAATAA
|
Protein sequence | MENSDDIRLI VKIAQLYYEQ DMTQAQIARE LGIYRTTISR LLKRGRDQGI VTIAINYDYN ENLWLEQQLK QKFGLKDVVV VSGNDEDEET QLAMMGLHGA QLLDRLLEPG DIVGFSWGRA VSALVENLPQ AGQSRQLICV PIIGGPSGKL ESRYHVNTLT YSAAAKLKGE SHLADFPALL DNPLIRNGIM QSQHFKTISA YWDNLDVALV GIGSPAIRDG ANWHAFYGGE ESDDLNARQV AGDICSRFFD IHGAMVETNM SEKTLSIEMN KLKQARYSIG IAMSEEKYSG IVGALRGKYI NCLVTNSSTA ELLLK
|
| |