Gene ECH74115_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0419 
Symbol 
ID6966674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp427922 
End bp428875 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content52% 
IMG OID643384471 
Productarac-family transcriptional regulator 
Protein accessionYP_002268985 
Protein GI209396742 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000111496 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCGC TAACTGTTGC CATTATCGCC GTTGCCGGTT TTAGTCCTTT TCACCTTTCC 
GTACCGTTTA TCGTGTTTAG TGAAAAGATG GCGGAGAAAA AACGCTTTCA CGTGATTATT
TGCGCTGAAA AGCCGGGAAA CGTGGACTCT GCCGATGGGT TTTCCGTAAC CGCCACCCAT
GACTATACCG CAGTCATCCA GGCAGATATT GTGATAATTC CTTACTGGGG AACCATTACA
CAAAAACCGC CACAAAAACT GCTGGAAGCC TTAACGACCG CACGGGATAA CGGCGCACAG
ATTGTCGGGC TTTGCCTGGG CACGTTTGTG CTCGGCTATG CAGGTTTACT GAAAAATAAG
CGTGCCGCCA CGCACTGGGA GTTCGAGCGT GAATTTCAGG CACGTTTTCC ACAAACACAT
CTGGATATTA ACGCGTTGTA CGTAGACGAT GACGGCATTA TTACCTCTGC CGGTACTGCC
GCGGCGCTGG ATTGCTGTTT GTATATTGTT CGGCAACATT TTGGCAGCGA CTATGCTAAC
CATATTGCCC GACGGATGGT CGTACCGCCA TATCGCACCG GCGGTCAGGC GCAGTTTATT
GAGCAACCGG TGCCGAAAAA TACCCATGAT GAACGCATAA ACCTCCTGCT GGATTACCTG
CGGCAAAACA TTGCGCAACA GCATGATCTC GACTCGCTGG CGCAGCGAGT AATGATGAGT
CGCCGCACAT TAACTCGCCA TTTTATGAAA GCGACCGGTT CGAGTATCGC CGAATGGCTC
ATTACTGAAC GCTTACGCCG TAGCCAGGAA CTGTTGGGAT CCAGTCAGTT GCCCGTTGAG
CGGATAGCGG CTGAGGTGGG TTTTCTCTCA CCTGTGACCT GGCGTCAGCA TTTTAAATCT
CACTTCGGCG TCAGCCCCGC CGAATGGCGC AAAACCTTTC GCGGTATGGC ATGA
 
Protein sequence
MRPLTVAIIA VAGFSPFHLS VPFIVFSEKM AEKKRFHVII CAEKPGNVDS ADGFSVTATH 
DYTAVIQADI VIIPYWGTIT QKPPQKLLEA LTTARDNGAQ IVGLCLGTFV LGYAGLLKNK
RAATHWEFER EFQARFPQTH LDINALYVDD DGIITSAGTA AALDCCLYIV RQHFGSDYAN
HIARRMVVPP YRTGGQAQFI EQPVPKNTHD ERINLLLDYL RQNIAQQHDL DSLAQRVMMS
RRTLTRHFMK ATGSSIAEWL ITERLRRSQE LLGSSQLPVE RIAAEVGFLS PVTWRQHFKS
HFGVSPAEWR KTFRGMA