Gene ECH74115_4159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4159 
Symbol 
ID6971683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3846736 
End bp3848514 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content46% 
IMG OID643387905 
Productsigma-54 dependent transcriptional regulator, Fis family 
Protein accessionYP_002272344 
Protein GI209400999 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.778459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTTG CTACTACGCA GTCAGTATTG ATGCAAATTC AACCGACAAT TCAGCGTTTT 
GCCAGAATGC TTGCCAGCGT TTTGCAGCTT GAGGTTGAGA TCGTTGATGA AAACTTGTAT
CGCGTCGCCG GAACGGGCGC GTATGGGAAG TTTCTTGGTC GCCAGTTGAG CGGCAACTCA
CGCCTGCTCT GCCACGTCCT GGAAACGAAA ACTGAAAAAG TTGTGACACA GTCTCGCTTC
GATCCCCTTT GCGAAGGTTG CGATAGTAAA GAAAATTGCC GCGAAAAAGC ATTTCTGGGT
ACGCCTGTCA TTTTACAGGA TCGTTGTGTT GGGGTGATAA GTTTGATTGC CGTTACCCAC
GAGCAACAAG AGCATATCAG TGATAATTTA CGCGAATTTT CTGATTATGT TCGCCATATA
TCCACCATTT TTGTTTCGAA ACTTCTGGAG GATCAGGGGC CAGGAGATAA CATCAGTAAA
ATATTCGCGA CCATGATCGA TAATATGGAT CAGGGCGTAT TAGTTGTTGA TGATGAAAGT
CGGGTTCAGT TTGTTAATCA GACTGCCTTA AAAACACTTG GTGTTGTGCA AAATAATATT
ATTGGGAAAC CTATCCGTTT CAGACCATTA ACATTTGAGA GTAATTTTAC CCATGGACAT
ATGCAGCATA TTGTTTCGTG GGACGATAAA AGTGAATTAA TCATTGGTCA ATTGCATAAC
ATTCAGGGCC GACAATTATT TTTAATGGCG TTTCACCAAT CGCATACCAG TTTTTCTGTA
GCAAATGCAC CTGATGAACC GCATATTGAA CAATTGGTTG GCGAGTGCCG TGTTATGCGG
CAATTAAAAC GACTCATTAG CCGTATTGCA CCCAGCCCAT CCAGCGTTAT GGTGGTTGGT
GAAAGCGGCA CGGGTAAAGA AGTTGTCGCC CGTGCAATCC ATAAGTTGAG CGGAAGACGG
AATAAACCCT TTATTGCTAT CAACTGTGCC GCGATTCCGG AGCAGCTTCT GGAGAGCGAA
CTGTTCGGTT ATGTTAAAGG CGCATTTACT GGCGCTTCTG CCAACGGTAA AACAGGGTTG
ATTCAGGCGG CGAATACGGG CACGCTGTTT CTCGATGAAA TAGGTGATAT GCCATTAATG
TTGCAGGCTA AATTACTGCG CGCTATTGAG GCACGTGAAA TTCTGCCGAT TGGTGCCAGT
AGCCCAATAC AAGTCGACAT TCGCATCATT TCTGCAACTA ACCAGAATTT GGCCCAGTTC
ATTGCCGAAG GTAAATTCCG CGAAGATCTC TTCTACCGAC TTAATGTTAT CCCGATAACT
CTGCCACCGC TGCGTGAACG TCAGGAAGAT ATTGAACTAC TGGTGCATTA CTTTTTACAT
CTGCATACCC GTCGTCTGGG ATCGGTTTAT CCTGGCATTG CTCCCGATGT CGTCGAAATA
TTGCGTAAGC ATCGTTGGCC CGGAAACCTG CGCGAGTTAA GCAATTTGAT GGAATATCTG
GTTAACGTTG TTCCTTCAGG TGAAGTTATC GACAGCACGC TATTGCCGCC AAATCTGCTG
AATAATGGCA CAACGGAGCA AAGTGATGTA ACAGAGGTTA CTGAGGCACA CCTGTCACTC
GATGATGCGG GCGGCACGGC GCTGGAGGAG ATGGAAAAGC AAATGATCCG CGAGGCGCTT
TCACGTCATA ACAGCAAGAA GCAAGTTGCT GATGAACTGG GCATCGGCAT TGCTACGCTC
TATCGCAAGA TTAAGAAATA TGAGTTGTTA AACACATAA
 
Protein sequence
MELATTQSVL MQIQPTIQRF ARMLASVLQL EVEIVDENLY RVAGTGAYGK FLGRQLSGNS 
RLLCHVLETK TEKVVTQSRF DPLCEGCDSK ENCREKAFLG TPVILQDRCV GVISLIAVTH
EQQEHISDNL REFSDYVRHI STIFVSKLLE DQGPGDNISK IFATMIDNMD QGVLVVDDES
RVQFVNQTAL KTLGVVQNNI IGKPIRFRPL TFESNFTHGH MQHIVSWDDK SELIIGQLHN
IQGRQLFLMA FHQSHTSFSV ANAPDEPHIE QLVGECRVMR QLKRLISRIA PSPSSVMVVG
ESGTGKEVVA RAIHKLSGRR NKPFIAINCA AIPEQLLESE LFGYVKGAFT GASANGKTGL
IQAANTGTLF LDEIGDMPLM LQAKLLRAIE AREILPIGAS SPIQVDIRII SATNQNLAQF
IAEGKFREDL FYRLNVIPIT LPPLRERQED IELLVHYFLH LHTRRLGSVY PGIAPDVVEI
LRKHRWPGNL RELSNLMEYL VNVVPSGEVI DSTLLPPNLL NNGTTEQSDV TEVTEAHLSL
DDAGGTALEE MEKQMIREAL SRHNSKKQVA DELGIGIATL YRKIKKYELL NT