Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4524 |
Symbol | sorC |
ID | 6271445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4231035 |
End bp | 4231982 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641728309 |
Product | sorbitol operon regulator SorC |
Protein accession | YP_001882707 |
Protein GI | 187730424 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACA GTGACGATAT CCGTTTGATT GTGAAGATTG CCCAACTCTA TTACGAACAG GATATGACGC AGGCGCAAAT CGCGCGCGAA CTGGGTATTT ACCGCACCAC CATCAGCCGC TTGCTTAAAC GAGGCCGCGA TCAGGGAATT GTCACCATCG CCATCAACTA TGACTACAAC GAAAATCTCT GGCTGGAGCA GCAACTGAAG CAAAAGTTTG GCCTGAAAGA CGTTGTGGTG GTGTCGGGAA ATGATGAGGA TGAAGAGACT CAACTGGCGA TGATGGGGTT ACACGGCGCG CAACTGCTGG ATCGCTTGCT GGAGCCTGGC GATATTGTCG GTTTTTCCTG GGGTCGCGCG GTGAGCGCAC TGGTTGAAAA CTTGCCGCAG GCGGGGCAAT CGCGGCAGTT AATCTGCGTA CCAATTATTG GCGGACCGTC CGGTAAACTC GAAAGCCGCT ATCATGTAAA CACATTAACC TACAGCGCGG CAGCGAAGCT GAAAGGGGAA TCGCATCTCG CGGATTTTCC GGCTCTGCTG GATAACCCAT TAATTCGTAA TGGGATCATG CAGTCTCAGC ACTTTAAAAC CATCTCTGCC TACTGGGATA ATCTGGATGT CGCCCTGGTG GGAATTGGCT CACCGGCCAT TCGCGACGGC GCTAACTGGC ATGCGTTTTA TGGTGGTGAA GAGAGTGACG ACCTGAATGC CCGCCAGGTT GCTGGCGATA TTTGCTCGCG CTTTTTTGAT ATTCACGGCG CAATGGTTGA AACGAATATG AGCGAAAAAA CACTCTCTAT CGAAATGAAT AAATTAAAGC AAGCACGATA TTCCATTGGC ATTGCCATGA GTGAAGAAAA ATACAGCGGA ATTGTTGGTG CACTGCGTGG AAAATATATT AATTGTCTGG TAACGAATAG CAGCACAGCT GAACTATTAC TGAAATAA
|
Protein sequence | MENSDDIRLI VKIAQLYYEQ DMTQAQIARE LGIYRTTISR LLKRGRDQGI VTIAINYDYN ENLWLEQQLK QKFGLKDVVV VSGNDEDEET QLAMMGLHGA QLLDRLLEPG DIVGFSWGRA VSALVENLPQ AGQSRQLICV PIIGGPSGKL ESRYHVNTLT YSAAAKLKGE SHLADFPALL DNPLIRNGIM QSQHFKTISA YWDNLDVALV GIGSPAIRDG ANWHAFYGGE ESDDLNARQV AGDICSRFFD IHGAMVETNM SEKTLSIEMN KLKQARYSIG IAMSEEKYSG IVGALRGKYI NCLVTNSSTA ELLLK
|
| |