Gene Information |
Locus tag | ECH74115_3708 |
Symbol | |
ID | 6966919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3427144 |
End bp | 3428709 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387502 |
Product | hydrogenase 4 subunit F |
Protein accession | YP_002271955 |
Protein GI | 209400562 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
|
|
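The coordinate, length, and protein fields in the table above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table.
length = gene_length_bp(3427144, 3428709)
print(length)                     # 1566
print(protein_length_aa(length))  # 521
```

Both results match the "Gene Length" and "Protein Length" rows of the record.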
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.332962 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGCTT TACTCCTGCT CACTCCGCTG CTTTTTTCGC TGCTCTGTTT TGCCTGCCGG AAACGAGGGC ACTCTGCGAC TCGCACGGTG ACAGTATTAC ATAGCTTAGG GATCACACTG CTGCTGATTC TGGCACTCTG GGTGGTCCAA ACTGCCGCTG ATGCAGGAGA AATATTCGCT GCGGGACTGT GGCTTCATAT TGATGGTCTG GGCGGTTTGT TCCTCGCCAT TCTTGGTGTG ATTGGCTTTC TCACCGGTGT TTACTCGATT GGCTACATGC GTCATGAAGT TGAGCACGGC GAGCTTTCAC CCGTTACGCT GTGCGATTAC TACGGTTTCT TCCATCTGTT TTTGTTCACC ATGCTGCTGG TTGTTACCAG CAATAACCTG ATTGTGATGT GGGCGGCGAT CGAAGCCACC ACCTTAAGCT CGGCGTTTCT GGTAGGCATT TACGGTCAGC GTTCATCGCT GGAAGCAGCA TGGAAGTACA TCATTATTTG TACTGTTGGT GTCGCTTTTG GTCTGTTCGG TACCGTGCTA GTATACGCCA ACGCCGCCAG CGTTATGCCG CAGGCAGAAA TGGCGATATT CTGGAGCGAG GTTCTTAAGC AATCGTCCTT GCTTGATCCA ACATTAATGC TGTTGGCCTT TGTGTTTGTG CTAATTGGCT TTGGTACCAA AACCGGGCTA TTTCCCATGC ACGCCTGGCT GCCGGATGCT CACAGTGAAG CGCCGAGTCC GGTCAGCGCC CTACTCTCCG CCGTATTGCT GAACTGCGCG CTGTTGGTGC TGATTCGCTA TTACATCATT ATTTGCCAGG CCATCGGCAG CGATTTCCCC AACCGGTTGT TGCTCATCTT CGGCATGTTG TCGGTTGCCG TGGCGGCATT TTTCATTCTG GTACAGCGGG ACATTAAGCG TCTGCTGGCG TACTCCAGCG TGGAGAACAT GGGACTGGTC GCGGTGGCGC TAGGCATTGG CGGGCCGCTG GGAATTTTTG CCGCGCTGCT GCACACCTTA AACCACAGTC TGGCAAAAAC GCTGCTGTTC TGCGGTTCCG GCAATGTACT GCTCAAGTAC GGCACGCGCG ATCTCAACGT CGTCTGTGGG ATGCTCAAAA TCATGCCATT TACCGCCGTG CTGTTTGGCG GCGGTGCGCT GGCGCTGGCA GGGATGCCGC CCTTCAACAT TTTTCTTAGC GAATTTATGA CCGTTACCGC CGGACTGGCA CGTAATCACC TGCTGATTAT CGTCCTGCTG TTATTGCTGT TAACGCTGGT GCTGGCGGGC CTGGTACGGA TGGCTGCGCG GGTGTTAATG GCGAAACCGC CGCAGGCCGT TAACCGGGGT GATCTCGGCT GGTTGACCAC CTCGCCAATG GTGATTCTGC TGGTCATGAT GCTGGCGATG GGAACGCATA TTCCACAACC TGTCATCAGG ATCCTGGCGG GCGCTTCCAC TATAGTCCTC TCAGGGACGC ACGACCTGCC TGCACAACGT AGCACCTGGC ATGATTTTTT GCCTTCAGGC ACCGCATCTG TTTCGGAGAA ACACAGTGAA CGTTAA
|
Protein sequence | MFALLLLTPL LFSLLCFACR KRGHSATRTV TVLHSLGITL LLILALWVVQ TAADAGEIFA AGLWLHIDGL GGLFLAILGV IGFLTGVYSI GYMRHEVEHG ELSPVTLCDY YGFFHLFLFT MLLVVTSNNL IVMWAAIEAT TLSSAFLVGI YGQRSSLEAA WKYIIICTVG VAFGLFGTVL VYANAASVMP QAEMAIFWSE VLKQSSLLDP TLMLLAFVFV LIGFGTKTGL FPMHAWLPDA HSEAPSPVSA LLSAVLLNCA LLVLIRYYII ICQAIGSDFP NRLLLIFGML SVAVAAFFIL VQRDIKRLLA YSSVENMGLV AVALGIGGPL GIFAALLHTL NHSLAKTLLF CGSGNVLLKY GTRDLNVVCG MLKIMPFTAV LFGGGALALA GMPPFNIFLS EFMTVTAGLA RNHLLIIVLL LLLLTLVLAG LVRMAARVLM AKPPQAVNRG DLGWLTTSPM VILLVMMLAM GTHIPQPVIR ILAGASTIVL SGTHDLPAQR STWHDFLPSG TASVSEKHSE R
|
| |
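The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be translated and its GC fraction computed. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (table 11 differs only in which codons may act as start codons), and it builds the codon table from the standard NCBI ordering. All function names are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence from the record.
prefix = "ATGTTCGCTT TACTCCTGCT CACTCCGCTG"
print(translate(prefix))  # MFALLLLTPL
```

Pasting the full "Gene sequence" value into `translate` should reproduce the "Protein sequence" row once the spacing in both is removed, and `gc_fraction` on the same input can be compared with the reported GC content.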