Gene EcSMS35_0376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0376 
SymbollacI 
ID6145584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp390646 
End bp391728 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content56% 
IMG OID641615272 
Productlac repressor 
Protein accessionYP_001742479 
Protein GI170679780 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000396722 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACCAG TAACGCTATA CGATGTCGCA GAGTATGCCG GTGTCTCTTA TCAGACCGTT 
TCCCGCGTGG TGAACCAGGC CAGCCACGTT TCTGCGAAAA CGCGGGAAAA AGTGGAAGCG
GCGATGGCGG AGCTGAATTA CATTCCCAAC CGCGTGGCAC AACAACTGGC AGGCAAACAG
TCGTTGCTGA TTGGCGTTGC CACCTCCAGT CTGGCCCTGC ACGCGCCGTC GCAAATTGTC
GCCGCGATTA AATCTCGCGC CGATCAACTG GGTGCCAGCG TGGTGGTGTC GATGGTAGAA
CGAAGCGGCG TCGAAGCCTG TAAAGCGGCG GTACACAATC TCCTCGCGCA ACGCGTCAGT
GGGCTGATCA TTAACTATCC GCTGGATGAC CAGGATGCCA TTGCTGTGGA AGCTGCCTGC
GCTAATGTTC CGGCTTTATT TCTTGATGTC TCTGACCAGA CACCCATCAA CAGTATTATT
TTCTCCCATG AAGACGGTAC GCGACTGGGC GTGGAGCATC TGGTCGCATT GGGTCACCAG
CAAATCGCGC TGTTAGCGGG TCCATTAAGT TCTGTCTCGG CACGTCTGCG TCTGGCGGGC
TGGCATAAAT ATCTCACTCG CAATCAAATT CAGCCGATAG CGGAACGGGA AGGCGACTGG
AGTGCCATGT CCGGTTTTCA ACAAACCATG CAAATGCTGA ATGAGGACAT CGTTCCTACT
GCGATGCTGG TTGCCAACGA TCAGATGGCG CTGGGCGCAA TGCGCGCCAT TACCGAGTCC
GGGCTGCGCG TTGGTGCGGA TGTCTCGGTA GTGGGATACG ACGATACCGA AGACAGCTCG
TGTTATATCC CGCCGTTAAC CACCATCAAA CAGGATTTTC GCCTGCTGGG GCAAACCAGC
GTGGACCGCT TGCTGCAACT CTCTCAGGGC CAGGCGGTGA AGGGCAATCA GCTGTTGCCC
GTCTCACTGG TGAAAAGAAA AACCACCCTT CCGCCCAATA CGCAAACCGC CTCTCCCCGC
GCGTTGGCAG ATTCTTTAAT GCAGCTGGCA CGACAAGTTT CCCGACTGGA AAGCGGGCAG
TGA
 
Protein sequence
MKPVTLYDVA EYAGVSYQTV SRVVNQASHV SAKTREKVEA AMAELNYIPN RVAQQLAGKQ 
SLLIGVATSS LALHAPSQIV AAIKSRADQL GASVVVSMVE RSGVEACKAA VHNLLAQRVS
GLIINYPLDD QDAIAVEAAC ANVPALFLDV SDQTPINSII FSHEDGTRLG VEHLVALGHQ
QIALLAGPLS SVSARLRLAG WHKYLTRNQI QPIAEREGDW SAMSGFQQTM QMLNEDIVPT
AMLVANDQMA LGAMRAITES GLRVGADVSV VGYDDTEDSS CYIPPLTTIK QDFRLLGQTS
VDRLLQLSQG QAVKGNQLLP VSLVKRKTTL PPNTQTASPR ALADSLMQLA RQVSRLESGQ