Gene EcSMS35_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2638 
SymbolhyfR 
ID6145879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2696228 
End bp2698240 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content53% 
IMG OID641617509 
Producthydrogenase-4 transcriptional regulator 
Protein accessionYP_001744674 
Protein GI170683782 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATGT CAGACGAGGC GATGTTTGCC CCGCCGCAAG GAATAACAAT TGAAGCGGTA 
AACGGAATGC TCGCGGAGCG GTTAGCGCAG AAACACGGCA AGGCGTCTTT ATTACGCGCC
TTCATCCCGC TGCCGCCGCC GTTCAGCCCG GTACAACTTA TTGAACTGCA TGTTCTCAAA
AGCAACTTCT ATTACCGCTA CCATGATGAT GGCAGCGATG TGACGGCAAC AACAAAGTAT
CAGGGTGAGA TGGTCGATTA TTCGCGCCAC GCCGTCCTTC TCGGCAGTAG TGGAATGGCG
GAGCTACGCT TTATTCGCAC CCACGGCAGT CGTTTTACTC CCCAGGATTG CACACTGTTT
AACTGGCTGG CACACATAAT CACTCCGGTT CTGCAATCAT GGCTCAATGA TGAAGAACAG
CAGGTGGCGC TGCGTTTGCT GGAGAAAGAT CGCGATCATC ATCGGGTACT GGTGGATATC
ACTAATGCAG TGTTGTCACA TCTTGATCTC GACAATCTGA TCGCTGACGT CGCTCGTGAG
ATCCATCATT TTTTCGGTCT GGCTTCAGTC AGTATGGTAC TGGGCGATCA TCGAAAGAAC
GAGAAGTTCA GCCTGTGGTG CAGCGATCTT TCTGCCTCAC ATTGTGCGTG TCTGCCACGC
AATATGGCTG GCGACAGTGT ATTGCTGACA CAAACGCTAC AAACCCGACA ACCGACCTTG
ACGCACCGTG CAGACGATCT GTTTCTCTGG CAACGCGACC CGTTATTACA CTTACTTGCA
TCTAACGGCT GCGAATCTGC GCTCCTTATA CCGCTCACCT TTGGCAACCA TACACCGGGT
GCATTGTTGC TGGCGCATAC CTCTTCCACT CTCTTTAGTG AGGAAAACTG CCAGCTACTA
CAACATATTG CCGATCGCAT CGCTATTGCC GTTGGCAATG CCGATGCCTG GCGCAGCATG
ACCGATTTGC AGGAAAGCTT GCAGCAAGAA AACCACCAGC TTAGCGAGCA GCTCCTTTCG
AATCTGGGCG TCGGTGACAT TATCTATCAA AGCCAGGCAA TGGAAGACCT GCTCCAGCAG
GTAGATATTG TGGCGAAGAG CGACAGTACG GTGTTGATTT GTGGTGAAAC CGGAACCGGC
AAAGAGGTGA TCGCCAGAGC GATCCATCAA CTTAGCCCAC GACGCGACAA GCCACTGGTC
AAAATCAACT GCGCTGCCAT CCCCGCCAGT CTTCTGGAAA GTGAGTTATT CGGTCATGAC
AAAGGGGCAT TTACTGGTGC GATTAATACC CATCGTGGTC GTTTTGAAAT TGCCGATGGC
GGCACGTTGT TTCTCGATGA AATTGGCGAT CTGCCGTTAG AACTTCAGCC TAAACTGCTG
CGCGTATTGC AGGAACGGGA GATTGAGCGT CTCGGCGGGA GTAGAACGAT CCCGGTAAAT
GTCAGAGTCA TTGCCGCCAC CAACCGTGAT TTGTGGCAAA TGGTTGAAGA TCGCCAGTTT
CGCAGCGATC TCTTTTATCG CCTGAATGTC TTCCCACTGG AACTGCCGCC ACTGCGCGAC
CGTCCGGAAG ATATCCCTCT TTTAGCAAAG CATTTCACGC AAAAAATGGC GCGCCATATG
AATCGCGCAA TTGACGCCAT CCCGACCGAG GCGCTACGCC AGTTGATGTC GTGGGACTGG
CCGGGCAACG TGCGCGAGCT GGAAAACGTG ATTGAGCGAG CGGTACTACT GACTCGCGGT
AACAGTCTGA ATTTACATCT TAATGTCCGA CAAAGCCGTT TACTGCCGAC GCTAAATGAA
GATTCAGCGC TTCGCAGTTC AATGGCGCAG TTGCTGCACC CGACGACGCC AGAGAATGAC
GAAGAAGAAC GTCAGCGCAT TGTTCAGGTA TTGCGAGAAA CCAATGGCAT TGTTGCCGGG
CCCCGTGGCG CGGCGACACG ATTAGGGATG AAGCGCACCA CGCTGCTGTC ACGAATGCAG
CGTCTGGGGA TCTCGGTTCG CGAGGTGTTG TAA
 
Protein sequence
MAMSDEAMFA PPQGITIEAV NGMLAERLAQ KHGKASLLRA FIPLPPPFSP VQLIELHVLK 
SNFYYRYHDD GSDVTATTKY QGEMVDYSRH AVLLGSSGMA ELRFIRTHGS RFTPQDCTLF
NWLAHIITPV LQSWLNDEEQ QVALRLLEKD RDHHRVLVDI TNAVLSHLDL DNLIADVARE
IHHFFGLASV SMVLGDHRKN EKFSLWCSDL SASHCACLPR NMAGDSVLLT QTLQTRQPTL
THRADDLFLW QRDPLLHLLA SNGCESALLI PLTFGNHTPG ALLLAHTSST LFSEENCQLL
QHIADRIAIA VGNADAWRSM TDLQESLQQE NHQLSEQLLS NLGVGDIIYQ SQAMEDLLQQ
VDIVAKSDST VLICGETGTG KEVIARAIHQ LSPRRDKPLV KINCAAIPAS LLESELFGHD
KGAFTGAINT HRGRFEIADG GTLFLDEIGD LPLELQPKLL RVLQEREIER LGGSRTIPVN
VRVIAATNRD LWQMVEDRQF RSDLFYRLNV FPLELPPLRD RPEDIPLLAK HFTQKMARHM
NRAIDAIPTE ALRQLMSWDW PGNVRELENV IERAVLLTRG NSLNLHLNVR QSRLLPTLNE
DSALRSSMAQ LLHPTTPEND EEERQRIVQV LRETNGIVAG PRGAATRLGM KRTTLLSRMQ
RLGISVREVL