Gene ECH74115_3713 details


Gene Information

Locus tag: ECH74115_3713
Symbol:
ID: 6970178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3432160
End bp: 3434172
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 53%
IMG OID: 643387507
Product: formate hydrogenlyase transcriptional activator
Protein accession: YP_002271960
Protein GI: 209396720
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
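
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3432160, 3434172)
print(length)                           # 2013
print(expected_protein_length(length))  # 670
```

Both values agree with the Gene Length and Protein Length fields of this record.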


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.574322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATGT CAGACGAGGC GATGTTTGCC CCGCCGCAAG GAATAACAAT TGAAGCGGTA 
AACGGAATGC TCGCAGAGCG GTTAGCGCAG AAACACGGCA AGGCGTCTTT ATTACGCGCC
TTCATCCCGC TGCCGCCGCC GTTCAGCCCG GTACAACTTA TTGAACTGCA TGTTCTCAAA
AGCAACTTCT ATTACCGCTA CCATGATGAT GGCAGCGATG TGACGGCAAC AACAGAGTAT
CAGGGCGAGA TGGTCGATTA TTCGCGTCAC GCCGTCCTTC TCGGCAGTAG TGGAATGGCG
GAGCTACGCT TTATTCGCAC CCACGGCAGT CGTTTTACTC CCCAGGATTG CACACTGTTT
AACTGGCTGG CACGCATTAT CACCCCGGTT CTGCAATCAT GGCTCAATGA TGAAGAACAA
CAGGTGGCGC TGCGTTTGCT GGAGAAAGAT CGCGATCATC ATCGGGTACT GGTGGATATC
ACTAATGCAG TGCTGTCACA TCTTGATCTC GACGATCTGA TCGCTGACGT CGCTCGTGAG
ATCCATCATT TTTTCGGTCT GGCTTCAGTC AGTATGGTAC TGGGCGATCA TCGAAAGAAC
GAGAAGTTCA GCCTGTGGTG CAGCGATCTT TCTGCCTCAC ATTGTGCGTG TCTGCCACGC
AATATGCCTG GCGACAGTGT ATTGCTGACA CAAACGCTAC AAACCCGACA ACCGACCTTG
ACGCACCGTG CAGACGATCT GTTTCTCTGG CAACGCGACC CGTTATTACA CTTACTTGCA
TCTAACGGCT GCGAATCTTC GCTCCTGATA CCGCTTACCT TTGGCAACCA TACACCGGGT
GCATTGTTGC TGGCGCATAC CTCTACCACT CTCTTTAGTG AGGAAAACTG CCAGCTACTA
CAACACATAG CCGATCGCAT CGCTATTGCC GTTGGCAATG CCGATGCCTG GCGTAGCATG
ACCGATTTGC AGGAAAGTTT GCAGCAAGAA AACCACCAGC TTAGCGAGCA GCTCCTTTCG
AATCTGGGCG TCGGTGACAT TATCTATCAA AGCCAGGCAA TGGAAGACCT GCTCCAGCAG
GTAGATATTG TGGCGAAGAG CGACAGTACG GTGTTGATTT GTGGTGAAAC CGGAACCGGC
AAAGAGGTGA TCGCCAGAGC GATCCATCAA CTTAGCCCGC GACGCGACAA GCCGCTGGTC
AAAATCAACT GCGCTGCCAT CCCCGCCAGT CTTCTGGAAA GTGAGTTATT CGGTCATGAC
AAAGGGGCCT TTACTGGTGC GATTAATACC CATCGTGGTC GTTTTGAAAT TGCCGATGGC
GGTACGTTGT TTCTCGATGA AATTGGCGAT CTGCCGTTAG AACTTCAGCC TAAATTGCTG
CGCGTATTGC AGGAGCGGGA AATTGAGCGT CTCGGCGGGA GTAGAACGAT CCCGGTGAAT
GTCAGAGTCA TTGCCGCCAC CAACCGTGAT TTGTGGCAAA TGGTTGAAGA TCGCCAGTTT
CGCAGCGATC TCTTTTATCG CCTGAATGTC TTCCCACTGG AATTGCCGCC GCTGCGAGAC
CGTCCGGAAG ATATCCCTCT TTTAGCAAAA CATTTCACGC AAAAAATGGC GCGCCATATG
AATCGCGCAA TTGACGCCAT CCCGACCGAG GCACTACGCC AGTTGATGTC GTGGGATTGG
CCGGGCAACG TGCGCGAGCT GGAAAACGTG ATTGAGCGAG CGGTACTGCT GACTCGCGGT
AACAGTCTGA ATTTACATCT TAATGTCCGA CAAAGCCGTT TACTGCCGAC GCTAAATGAA
GATTCAGCGC TTCGCAGTTC AATGGCGCAG TTGCTGCACC CGACGACGCC AGAGAATGAC
GAAGAAGAAC GTCAGCGCAT TGTTCAGGTA TTGCGAGAAA CCAATGGCAT TGTTGCCGGG
CCCCGTGGCG CGGCGACACG ATTAGGGATG AAGCGCACCA CGCTGCTGTC ACGAATGCAG
CGTCTGGGGA TCTCGGTTCG CGAGGTGTTG TAA
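
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. A minimal sketch of how the GC content field could be recomputed from such text follows. The helper names are hypothetical, and only the first three blocks of the sequence are used as sample input, so the printed percentage describes that fragment rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence above (sample input only).
fragment = clean_sequence("ATGGCTATGT CAGACGAGGC GATGTTTGCC")
print(len(fragment))  # 30
print(round(gc_percent(fragment), 1))
```

Passing the full 2013 bp sequence through the same two functions gives a value to compare against the GC content reported in the Gene Information section.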
 
Protein sequence
MAMSDEAMFA PPQGITIEAV NGMLAERLAQ KHGKASLLRA FIPLPPPFSP VQLIELHVLK 
SNFYYRYHDD GSDVTATTEY QGEMVDYSRH AVLLGSSGMA ELRFIRTHGS RFTPQDCTLF
NWLARIITPV LQSWLNDEEQ QVALRLLEKD RDHHRVLVDI TNAVLSHLDL DDLIADVARE
IHHFFGLASV SMVLGDHRKN EKFSLWCSDL SASHCACLPR NMPGDSVLLT QTLQTRQPTL
THRADDLFLW QRDPLLHLLA SNGCESSLLI PLTFGNHTPG ALLLAHTSTT LFSEENCQLL
QHIADRIAIA VGNADAWRSM TDLQESLQQE NHQLSEQLLS NLGVGDIIYQ SQAMEDLLQQ
VDIVAKSDST VLICGETGTG KEVIARAIHQ LSPRRDKPLV KINCAAIPAS LLESELFGHD
KGAFTGAINT HRGRFEIADG GTLFLDEIGD LPLELQPKLL RVLQEREIER LGGSRTIPVN
VRVIAATNRD LWQMVEDRQF RSDLFYRLNV FPLELPPLRD RPEDIPLLAK HFTQKMARHM
NRAIDAIPTE ALRQLMSWDW PGNVRELENV IERAVLLTRG NSLNLHLNVR QSRLLPTLNE
DSALRSSMAQ LLHPTTPEND EEERQRIVQV LRETNGIVAG PRGAATRLGM KRTTLLSRMQ
RLGISVREVL
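
The protein sequence can be checked against the gene sequence by translating the CDS. The sketch below builds the codon table from the compact 64-letter amino acid string used for the standard genetic code, which assigns the same amino acids as translation table 11. It is an assumption-laden illustration: it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and the `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence: matches the first 10 residues above.
print(translate("ATGGCTATGT CAGACGAGGC GATGTTTGCC"))      # MAMSDEAMFA
# Last line of the gene sequence (in frame, since 1980 bp precede it).
print(translate("CGTCTGGGGA TCTCGGTTCG CGAGGTGTTG TAA"))  # RLGISVREVL
```

Translating the full gene sequence the same way should yield 670 residues if the two sequences in this record are consistent.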