Gene EcSMS35_4482 details


Gene Information

Locus tag: EcSMS35_4482
Symbol: sorC
ID: 6145562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4578738
End bp: 4579685
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 50%
IMG OID: 641619298
Product: sorbitol operon regulator SorC
Protein accession: YP_001746410
Protein GI: 170679743
COG category: [K] Transcription
COG ID: [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain
TIGRFAM ID:
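
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as below. This is only an illustration under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the gene length counts the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(4578738, 4579685)
print(length)                     # 948
print(protein_length_aa(length))  # 315
```

Both values match the Gene Length and Protein Length fields of this record.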


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.543234
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAACA GTGACGATAT CCGTTTGATT GTGAAGATTG CCCAACTCTA TTACGAACAG 
GATATGACGC AGGCGCAAAT CGCGCGCGAA CTGGGTATTT ACCGCACCAC CATCAGCCGC
TTGCTTAAAC GAGGCCGCGA TCAGGGAATT GTCACCATCG CCATCAACTA TGACTACAAC
GAAAATCTCT GGCTGGAGCA GCAACTGAAG CAAAAGTTTG GCCTGAAAGA CGTTGTGGTG
GTGTCGGGAA ATGATGAGGA TGAAGAGACT CAACTGGCGA TGATGGGGTT ACACGGCGCG
CAACTGCTGG ATCGCTTGCT GGAACCTGGC GATATTGTCG GTTTTTCCTG GGGCCGCGCG
GTGAGCGCAC TGGTTGAAAA CTTGCCGCAG GCGGGGCAAT CGCGGCAGTT AATTTGCGTG
CCGATTATTG GCGGCCCGTC CGGTAAACTC GAAAGCCGCT ATCACGTAAA CACATTAACC
TACAGCGCGG CAGCGAAGCT GAAAGGGGAA TCGCATCTCG CGGATTTTCC GGCTCTGCTG
GATAACCCAT TAATTCGTAA TGGGATCATG CAGTCTCAGC ACTTTAAAAC CATCTCTGCC
TACTGGGATA ATCTGGATGT CGCCCTGGTG GGAATTGGCT CACCGGCCAT TCGCGACGGC
GCTAACTGGC ATGCGTTTTA TGGTGGTGAA GAGAGTGACG ACCTGAATGC CCGCCAGGTT
GCTGGCGATA TTTGCTCGCG CTTTTTTGAT ATTCACGGCG CAATGGTTGA AACGAATATG
AGCGAAAAAA CACTCTCTAT CGAAATGAAT AAATTAAAGC AGGCACGGTA TTCCATTGGC
ATTGCCATGA GCGAAGAAAA ATACAGCGGA ATTGTTGGTG CACTGCGTGG AAAATATATT
AATTGTCTGG TAACGAATAG CAGCACAGCT GAACTATTAC TGAAATAA
 
Protein sequence
MENSDDIRLI VKIAQLYYEQ DMTQAQIARE LGIYRTTISR LLKRGRDQGI VTIAINYDYN 
ENLWLEQQLK QKFGLKDVVV VSGNDEDEET QLAMMGLHGA QLLDRLLEPG DIVGFSWGRA
VSALVENLPQ AGQSRQLICV PIIGGPSGKL ESRYHVNTLT YSAAAKLKGE SHLADFPALL
DNPLIRNGIM QSQHFKTISA YWDNLDVALV GIGSPAIRDG ANWHAFYGGE ESDDLNARQV
AGDICSRFFD IHGAMVETNM SEKTLSIEMN KLKQARYSIG IAMSEEKYSG IVGALRGKYI
NCLVTNSSTA ELLLK
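
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. It is a minimal illustration, not the method used to generate this record. It assumes the sequences are pasted as shown, in space-separated blocks of ten, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares. Table 11 differs in its set of permitted start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. All names (`CODON_TABLE`, `clean`, `translate`, `gc_content`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGGAAAACA GTGACGATAT CCGTTTGATT GTGAAGATTG CCCAACTCTA TTACGAACAG"
print(translate(first_line))  # MENSDDIRLIVKIAQLYYEQ
```

The first 60 bases translate to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the complete protein sequence and the GC content field to be checked in the same way.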