Gene ECH74115_3595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3595 
Symbol 
ID6970845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3309981 
End bp3310976 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content52% 
IMG OID643387392 
Productsugar binding transcriptional regulator, LacI family 
Protein accessionYP_002271851 
Protein GI209396088 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00083156 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTTCAT TAAAGGATGT CGCACGCCTG GCGGGAGTGT CGATGATGAC AGTCTCCCGG 
GTGATGCATA ATGCAGAATC TGTGCGTCCT GCAACGCGTG ACCGCGTATT GCAGGCAATC
CAGACCCTGA ATTATGTTCC TGATCTTTCC GCCCGTAAGA TGCGCGCTCA AGGACGTAAG
CCGTCGACTC TCGCCGTGCT GGCGCAGGAC ACTGCTACCA CTCCTTTCTC TGTTGATATT
CTGCTTGCCA TTGAGCAAAC CGCCAGCGAG TTCGGCTGGA ATAGTTTTTT AATCAATATT
TTTTCTGAAG ATGACGCTGC CCGCGCGGCA CGTCAGCTGC TTGCCCACCG TCCGGATGGC
ATTATCTATA CTACAATGGG GCTGCGACAT ATCACGCTAC CTGAGTCTCT GTATGGTGAA
AATATTGTAT TGGCGAACTG TGTGGCGGAT GACCCAGCGT TACCCAGTTA TATCCCTGAT
GATTACACTG CACAATATGA ATCAACACAG CATTTGCTCG CGGCGGGCTA TCGTCAACCG
TTATGCTTCT GGCTACCGGA AAGTGCGTTG GCAACAGGGT ATCGTCGGCA GGGATTTGAG
CAGGCCTGGC GTGATGCTGG ACGAGATCTG GCTGAGGTGA AACAATTTCA CATGGCAACA
GGTGATGATC ACTACACCGA TCTCGCAAGT TTACTCAATG ACCACTTCAA ATCGGGCAAA
CCAGATTTTG ATGTTCTGAT ATGTGGTAAC GATCGCGCAG CCTTTGTGGC TTATCAGGTT
CTCCTGGCTA AGGGGGTACG TATCCCGCAG GATGTCGCCG TAATGGGCTT TGATAATCTG
GTTGGCGTCG GGCATCTGTT TTTACCGCCG CTGACCACAA TTCAGCTTCC ACATGACATT
ATCGGGCGGG AAGCTGCATT GCATATTATT GAAGGTCGTG AAGGGGGAAG TGTGACGCGG
ATCCCTTGCC CGCTGTTGAT CCGTTGTTCC ACCTGA
 
Protein sequence
MASLKDVARL AGVSMMTVSR VMHNAESVRP ATRDRVLQAI QTLNYVPDLS ARKMRAQGRK 
PSTLAVLAQD TATTPFSVDI LLAIEQTASE FGWNSFLINI FSEDDAARAA RQLLAHRPDG
IIYTTMGLRH ITLPESLYGE NIVLANCVAD DPALPSYIPD DYTAQYESTQ HLLAAGYRQP
LCFWLPESAL ATGYRRQGFE QAWRDAGRDL AEVKQFHMAT GDDHYTDLAS LLNDHFKSGK
PDFDVLICGN DRAAFVAYQV LLAKGVRIPQ DVAVMGFDNL VGVGHLFLPP LTTIQLPHDI
IGREAALHII EGREGGSVTR IPCPLLIRCS T