Gene ECH74115_0418 details


Gene Information

Locus tag: ECH74115_0418
Symbol: lacI
ID: 6971684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 426803
End bp: 427885
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 56%
IMG OID: 643384470
Product: lac repressor
Protein accession: YP_002268984
Protein GI: 209395817
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
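
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(426803, 427885)
print(length, protein_length(length))  # 1083 360
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of this record.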


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 8.22114e-09
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAACCAG TAACGCTATA CGATGTCGCA GAGTATGCCG GTGTCTCTTA TCAGACCGTT 
TCCCGCGTGG TGAATCAGGC CAGCCACGTT TCTGCGAAAA CGCGGGAAAA AGTGGAAGCG
GCGATGGCGG AGCTGAATTA CATTCCCAAC CGCGTGGCAC AACAACTGGC GGGGAAACAG
TCGTTGCTGA TTGGCGTTGC CACCTCCAGT CTGGCCCTGC ACGCGCCGTC GCAAATTGTC
GCGGCGATTA AATCTCGCGC CGATCAACTG GGTGCCAGCG TGGTGGTGTC GATGGTAGAA
CGAAGCGGCG TCGAAGCCTG TAAAGCGGCA GTGCACAATC TTCTCGCGCA ACGCGTCAGT
GGGCTGATCA TTAACTATCC GCTGGATGAC CAGGATGCCA TTGCTGTGGA AGCTGCCTGC
GCTAATGTTC CGGCGTTATT TCTTGATGTC TCTGACCAGA CTCCCATCAA CAGTATTATT
TTCTCCCATG AAGACGGTAC GCGACTGGGC GTGGAGCATC TGGTCGCATT GGGTCACCAG
CAAATCGCGC TGTTAGCGGG CCCATTAAGT TCTGTCTCGG CGCGTCTGCG TCTGGCGGGC
TGGCATAAAT ATCTCACTCG CAATCAAATT CAGCCGATAG CGGAACGGGA GGGCGACTGG
AGTGCCATGT CCGGTTTTCA ACAAACCATG CAAATGCTAA ATGAGGGCAT CGTTCCCACT
GCGATGCTGG TTGCCAACGA TCAGATGGCG CTGGGCGCAA TGCGCGCCAT TACCGAGTCC
GGGTTGCGCG TTGGTGCGGA TATCTCGGTA GTGGGATACG ACGATACCGA AGACAGCTCG
TGTTATATCC CGCCGTTAAC CACCATCAAA CAGGATTTTC GCCTGCTGGG GCAAACCAGC
GTGGACCGCT TGCTGCAACT CTCTCAGGGC CAGGCGGTGA AGGGCAATCA GCTGTTGCCC
GTCTCACTGG TGAAAAGAAA AACCACCCTG GCGCCCAAGA CGCAAACCGC TTCTCCCCGC
GCGTTGGCCG ATTCATTAAT GCAGCTGGCA CGACAAGTTT CCCGACTGGA AAGCGGGCAG
TGA
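
A GC percentage can be recomputed from a sequence printed in the blocked layout used above. This is a minimal sketch that assumes GC content means the share of G and C bases among all bases in the gene sequence; `gc_content` is an illustrative helper, not a database function.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc / len(cleaned)


# First 10-base block of the gene sequence: 3 G + 2 C out of 10 bases.
print(round(gc_content("GTGAAACCAG")))  # 50
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field in the Gene Information section.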
 
Protein sequence
MKPVTLYDVA EYAGVSYQTV SRVVNQASHV SAKTREKVEA AMAELNYIPN RVAQQLAGKQ 
SLLIGVATSS LALHAPSQIV AAIKSRADQL GASVVVSMVE RSGVEACKAA VHNLLAQRVS
GLIINYPLDD QDAIAVEAAC ANVPALFLDV SDQTPINSII FSHEDGTRLG VEHLVALGHQ
QIALLAGPLS SVSARLRLAG WHKYLTRNQI QPIAEREGDW SAMSGFQQTM QMLNEGIVPT
AMLVANDQMA LGAMRAITES GLRVGADISV VGYDDTEDSS CYIPPLTTIK QDFRLLGQTS
VDRLLQLSQG QAVKGNQLLP VSLVKRKTTL APKTQTASPR ALADSLMQLA RQVSRLESGQ
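
The gene sequence begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiation codon under translation table 11. The sketch below illustrates this with the standard codon assignments and a set of start codons assumed from NCBI table 11; `translate_cds` is an illustrative helper, not the tool used to produce this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; an alternative start codon in first position is read as M."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAACCAG TAACGCTATA CGATGTCGCA"))  # MKPVTLYDVA
```

The printed residues match the first block of the protein sequence, and the final codons GGG CAG TGA translate to GQ followed by a stop, matching its last two residues.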