Gene Information |
Locus tag | EcSMS35_2638 |
Symbol | hyfR |
ID | 6145879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2696228 |
End bp | 2698240 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617509 |
Product | hydrogenase-4 transcriptional regulator |
Protein accession | YP_001744674 |
Protein GI | 170683782 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | |
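The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 2696228
end_bp = 2698240

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2013 0 670
```

The printed values match the Gene Length (2013 bp) and Protein Length (670 aa) fields, and the zero remainder confirms the span is a whole number of codons.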
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTATGT CAGACGAGGC GATGTTTGCC CCGCCGCAAG GAATAACAAT TGAAGCGGTA AACGGAATGC TCGCGGAGCG GTTAGCGCAG AAACACGGCA AGGCGTCTTT ATTACGCGCC TTCATCCCGC TGCCGCCGCC GTTCAGCCCG GTACAACTTA TTGAACTGCA TGTTCTCAAA AGCAACTTCT ATTACCGCTA CCATGATGAT GGCAGCGATG TGACGGCAAC AACAAAGTAT CAGGGTGAGA TGGTCGATTA TTCGCGCCAC GCCGTCCTTC TCGGCAGTAG TGGAATGGCG GAGCTACGCT TTATTCGCAC CCACGGCAGT CGTTTTACTC CCCAGGATTG CACACTGTTT AACTGGCTGG CACACATAAT CACTCCGGTT CTGCAATCAT GGCTCAATGA TGAAGAACAG CAGGTGGCGC TGCGTTTGCT GGAGAAAGAT CGCGATCATC ATCGGGTACT GGTGGATATC ACTAATGCAG TGTTGTCACA TCTTGATCTC GACAATCTGA TCGCTGACGT CGCTCGTGAG ATCCATCATT TTTTCGGTCT GGCTTCAGTC AGTATGGTAC TGGGCGATCA TCGAAAGAAC GAGAAGTTCA GCCTGTGGTG CAGCGATCTT TCTGCCTCAC ATTGTGCGTG TCTGCCACGC AATATGGCTG GCGACAGTGT ATTGCTGACA CAAACGCTAC AAACCCGACA ACCGACCTTG ACGCACCGTG CAGACGATCT GTTTCTCTGG CAACGCGACC CGTTATTACA CTTACTTGCA TCTAACGGCT GCGAATCTGC GCTCCTTATA CCGCTCACCT TTGGCAACCA TACACCGGGT GCATTGTTGC TGGCGCATAC CTCTTCCACT CTCTTTAGTG AGGAAAACTG CCAGCTACTA CAACATATTG CCGATCGCAT CGCTATTGCC GTTGGCAATG CCGATGCCTG GCGCAGCATG ACCGATTTGC AGGAAAGCTT GCAGCAAGAA AACCACCAGC TTAGCGAGCA GCTCCTTTCG AATCTGGGCG TCGGTGACAT TATCTATCAA AGCCAGGCAA TGGAAGACCT GCTCCAGCAG GTAGATATTG TGGCGAAGAG CGACAGTACG GTGTTGATTT GTGGTGAAAC CGGAACCGGC AAAGAGGTGA TCGCCAGAGC GATCCATCAA CTTAGCCCAC GACGCGACAA GCCACTGGTC AAAATCAACT GCGCTGCCAT CCCCGCCAGT CTTCTGGAAA GTGAGTTATT CGGTCATGAC AAAGGGGCAT TTACTGGTGC GATTAATACC CATCGTGGTC GTTTTGAAAT TGCCGATGGC GGCACGTTGT TTCTCGATGA AATTGGCGAT CTGCCGTTAG AACTTCAGCC TAAACTGCTG CGCGTATTGC AGGAACGGGA GATTGAGCGT CTCGGCGGGA GTAGAACGAT CCCGGTAAAT GTCAGAGTCA TTGCCGCCAC CAACCGTGAT TTGTGGCAAA TGGTTGAAGA TCGCCAGTTT CGCAGCGATC TCTTTTATCG CCTGAATGTC TTCCCACTGG AACTGCCGCC ACTGCGCGAC CGTCCGGAAG ATATCCCTCT TTTAGCAAAG CATTTCACGC AAAAAATGGC GCGCCATATG AATCGCGCAA TTGACGCCAT CCCGACCGAG GCGCTACGCC AGTTGATGTC GTGGGACTGG CCGGGCAACG TGCGCGAGCT GGAAAACGTG ATTGAGCGAG CGGTACTACT GACTCGCGGT AACAGTCTGA ATTTACATCT TAATGTCCGA CAAAGCCGTT TACTGCCGAC GCTAAATGAA 
GATTCAGCGC TTCGCAGTTC AATGGCGCAG TTGCTGCACC CGACGACGCC AGAGAATGAC GAAGAAGAAC GTCAGCGCAT TGTTCAGGTA TTGCGAGAAA CCAATGGCAT TGTTGCCGGG CCCCGTGGCG CGGCGACACG ATTAGGGATG AAGCGCACCA CGCTGCTGTC ACGAATGCAG CGTCTGGGGA TCTCGGTTCG CGAGGTGTTG TAA
Protein sequence | MAMSDEAMFA PPQGITIEAV NGMLAERLAQ KHGKASLLRA FIPLPPPFSP VQLIELHVLK SNFYYRYHDD GSDVTATTKY QGEMVDYSRH AVLLGSSGMA ELRFIRTHGS RFTPQDCTLF NWLAHIITPV LQSWLNDEEQ QVALRLLEKD RDHHRVLVDI TNAVLSHLDL DNLIADVARE IHHFFGLASV SMVLGDHRKN EKFSLWCSDL SASHCACLPR NMAGDSVLLT QTLQTRQPTL THRADDLFLW QRDPLLHLLA SNGCESALLI PLTFGNHTPG ALLLAHTSST LFSEENCQLL QHIADRIAIA VGNADAWRSM TDLQESLQQE NHQLSEQLLS NLGVGDIIYQ SQAMEDLLQQ VDIVAKSDST VLICGETGTG KEVIARAIHQ LSPRRDKPLV KINCAAIPAS LLESELFGHD KGAFTGAINT HRGRFEIADG GTLFLDEIGD LPLELQPKLL RVLQEREIER LGGSRTIPVN VRVIAATNRD LWQMVEDRQF RSDLFYRLNV FPLELPPLRD RPEDIPLLAK HFTQKMARHM NRAIDAIPTE ALRQLMSWDW PGNVRELENV IERAVLLTRG NSLNLHLNVR QSRLLPTLNE DSALRSSMAQ LLHPTTPEND EEERQRIVQV LRETNGIVAG PRGAATRLGM KRTTLLSRMQ RLGISVREVL