Gene ECH74115_3983 details

Gene Information

Locus tag: ECH74115_3983
Symbol: fhlA
ID: 6967917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3682467
End bp: 3684545
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 52%
IMG OID: 643387752
Product: formate hydrogenlyase transcriptional activator
Protein accession: YP_002272195
Protein GI: 209396082
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
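
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3682467
end_bp = 3684545
reported_gene_length_bp = 2079
reported_protein_length_aa = 692

# Assumption: Start bp and End bp are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not
# counted in the protein length.
codon_count, remainder = divmod(span_bp, 3)
predicted_protein_length_aa = codon_count - 1

print(span_bp == reported_gene_length_bp)
print(remainder == 0)
print(predicted_protein_length_aa == reported_protein_length_aa)
```

For this record all three checks print True: the span is 2079 bp, which is 693 codons, or 692 amino acids plus a stop codon.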


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.933844
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATATA CACCGATGAG TGATCTCGGA CAGCAAGGGT TGTTCGACAT CACTCGGACA 
CTATTGCAGC AGCCCGATCT GGCCTCGCTG TGTGAGGCTC TTTCGCAACT GGTAAAGCGT
TCTGCGCTCG CCGACAACGC GGCTATTGTG TTGTGGCAAG CGCAGACTCA ACGTGCGTCT
TATTACGCAT CGCGTGAAAA AGACACCCCC ATTAAATATG AAGACGAAAC TGTTCTGGCA
CACGGTCCGG TACGCAGCAT TTTGTCGCGC CCTGATACGT TGCATTGCAG TTACGAAGAA
TTTTGTGAAA CCTGGCCGCA GCTGGCCGCA GGTGGGCTAT ACCCAAAATT TGGTCACTAT
TGCCTGATGC CACTGGCGGC GGAAGGGCAT ATTTTTGGTG GCTGTGAATT TATTCGTTAT
GACGATCGCC CCTGGAGCGA AAAAGAGTTC AATCGTCTGC AAACATTTAC GCAGATCGTT
TCTGTCGTCA CCGAACAAAT CCAGAGTCGC GTCGTTAACA ATGTCGACTA TGAGTTGTTA
TGCCGGGAAC GCGATAACTT CCGCATCCTG GTCGCCATCA CCAACGCGGT GCTTTCCCGC
CTGGATATGG ACGAACTGGT CAGCGAAGTC GCCAAAGAAA TCCATTACTA TTTCGATATT
GACGATATCA GTATCGTCTT ACGCAGCCAC CGTAAAAACA AACTCAACAT CTACTCCACT
CACTATCTTG ATAAACAGCA TCCCGCCCAC GAACAGAGCG AAGTCGATGA AGCCGGAACC
CTCACCGAAC GCGTGTTCAA AAGTAAAGAG ATGCTGCTGA TTAATCTCCA CGAGCGGGAT
GATTTAGCCC CCTATGAACG CATGTTGTTC GATACCTGGG GCAACCAGAT TCAAACCTTG
TGCCTGTTAC CGCTGATGTC TGGCGACACC ATGCTGGGCG TGCTGAAACT GGCGCAATGT
GAAGAGAAAG TGTTTACCAC TACCAATCTG AATTTACTGC GCCAGATTGC CGAACGTGTG
GCAATCGCTG TCGATAACGC CCTCGCCTAT CAGGAAATCC ATCGTCTGAA AGAACGGCTG
GTTGATGAAA ACCTCGCCCT GACCGAGCAG CTCAACAATG TTGATAGTGA ATTTGGCGAG
ATTATTGGCC GCAGCGAAGC CATGTACAGC GTGCTTAAAC AAGTTGAAAT GGTGGCGCAA
AGTGACAGTA CCGTGCTGAT CCTCGGTGAA ACTGGCACGG GTAAAGAGCT GATTGCCCGT
GCTATCCATA ATCTCAGTGG GCGTAATAAT CGCCGCATGG TCAAAATGAA CTGCGCGGCG
ATGCCTGCCG GATTGCTGGA GAGCGATCTG TTTGGTCATG AGCGTGGGGC TTTTACCGGT
GCCAGCGCCC AGCGTATCGG TCGTTTTGAA CTGGCGGATA AAAGCTCCCT GTTCCTCGAC
GAAGTGGGCG ATATGCCACT GGAGTTACAG CCGAAGTTGC TGCGTGTATT GCAGGAACAG
GAGTTTGAAC GCCTCGGCAG CAACAAAATC ATTCAGACGG ACGTGCGTCT AATCGCCGCG
ACTAACCGCG ATCTGAAAAA AATGGTCGCC GACCGTGAGT TCCGTAGCGA TCTCTATTAC
CGCCTGAACG TATTCCCGAT TCACCTGCCG CCACTACGCG AGCGTCCGGA AGATATTCCG
CTGCTGGCGA AAGCCTTTAC CTTCAAAATT GCCCGTCGTC TGGGGCGCAA TATCGACAGC
ATTCCTGCCG AGACGTTGCG CACCTTGAGC AATATGGAGT GGCCGGGTAA CGTACGCGAA
CTGGAAAACG TCATTGAGCG CGCGGTATTG CTAACACGCG GCAACGTGCT GCAGCTGTCA
TTGCCAGATA TTGCTTTACC GGAGCCTGAA ACGCCGCCTG CCGCAACGGT TGTCGCTCAG
GAGGGCGAAG ATGAATATCA GTTGATTGTT CGCGTGCTGA AAGAAACTAA CGGCGTGGTT
GCCGGGCCTA AAGGCGCTGC GCAACGTCTG GGGCTGAAAC GCACGACCCT GCTGTCACGG
ATGAAGCGAC TGGGAATTGA TAAATCGGCA TTGATTTAA
 
Protein sequence
MSYTPMSDLG QQGLFDITRT LLQQPDLASL CEALSQLVKR SALADNAAIV LWQAQTQRAS 
YYASREKDTP IKYEDETVLA HGPVRSILSR PDTLHCSYEE FCETWPQLAA GGLYPKFGHY
CLMPLAAEGH IFGGCEFIRY DDRPWSEKEF NRLQTFTQIV SVVTEQIQSR VVNNVDYELL
CRERDNFRIL VAITNAVLSR LDMDELVSEV AKEIHYYFDI DDISIVLRSH RKNKLNIYST
HYLDKQHPAH EQSEVDEAGT LTERVFKSKE MLLINLHERD DLAPYERMLF DTWGNQIQTL
CLLPLMSGDT MLGVLKLAQC EEKVFTTTNL NLLRQIAERV AIAVDNALAY QEIHRLKERL
VDENLALTEQ LNNVDSEFGE IIGRSEAMYS VLKQVEMVAQ SDSTVLILGE TGTGKELIAR
AIHNLSGRNN RRMVKMNCAA MPAGLLESDL FGHERGAFTG ASAQRIGRFE LADKSSLFLD
EVGDMPLELQ PKLLRVLQEQ EFERLGSNKI IQTDVRLIAA TNRDLKKMVA DREFRSDLYY
RLNVFPIHLP PLRERPEDIP LLAKAFTFKI ARRLGRNIDS IPAETLRTLS NMEWPGNVRE
LENVIERAVL LTRGNVLQLS LPDIALPEPE TPPAATVVAQ EGEDEYQLIV RVLKETNGVV
AGPKGAAQRL GLKRTTLLSR MKRLGIDKSA LI
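
The sequences above are printed in blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the spacing, compute a GC fraction to compare against the GC content field, and run a basic sanity check on the coding sequence. The function names are hypothetical helpers, not part of any database tool. The check for an ATG start is a simplification, since translation table 11 also permits alternative start codons.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Stop codons of NCBI translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG,
    and it ends with a stop codon. ATG-only is a simplification."""
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in STOP_CODONS


# First and last lines of the gene sequence above, used only as a short demo.
first_line = "ATGTCATATA CACCGATGAG TGATCTCGGA CAGCAAGGGT TGTTCGACAT CACTCGGACA"
last_line = "ATGAAGCGAC TGGGAATTGA TAAATCGGCA TTGATTTAA"
demo = clean_sequence(first_line + "\n" + last_line)

print(len(demo), round(gc_fraction(demo), 2), looks_like_complete_cds(demo))
```

To check the full record, paste the complete gene sequence into `clean_sequence` and compare `gc_fraction` with the reported GC content and `len` with the reported gene length.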