Gene ECH74115_3708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3708 
Symbol 
ID6966919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3427144 
End bp3428709 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content54% 
IMG OID643387502 
Producthydrogenase 4 subunit F 
Protein accessionYP_002271955 
Protein GI209400562 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.332962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCTT TACTCCTGCT CACTCCGCTG CTTTTTTCGC TGCTCTGTTT TGCCTGCCGG 
AAACGAGGGC ACTCTGCGAC TCGCACGGTG ACAGTATTAC ATAGCTTAGG GATCACACTG
CTGCTGATTC TGGCACTCTG GGTGGTCCAA ACTGCCGCTG ATGCAGGAGA AATATTCGCT
GCGGGACTGT GGCTTCATAT TGATGGTCTG GGCGGTTTGT TCCTCGCCAT TCTTGGTGTG
ATTGGCTTTC TCACCGGTGT TTACTCGATT GGCTACATGC GTCATGAAGT TGAGCACGGC
GAGCTTTCAC CCGTTACGCT GTGCGATTAC TACGGTTTCT TCCATCTGTT TTTGTTCACC
ATGCTGCTGG TTGTTACCAG CAATAACCTG ATTGTGATGT GGGCGGCGAT CGAAGCCACC
ACCTTAAGCT CGGCGTTTCT GGTAGGCATT TACGGTCAGC GTTCATCGCT GGAAGCAGCA
TGGAAGTACA TCATTATTTG TACTGTTGGT GTCGCTTTTG GTCTGTTCGG TACCGTGCTA
GTATACGCCA ACGCCGCCAG CGTTATGCCG CAGGCAGAAA TGGCGATATT CTGGAGCGAG
GTTCTTAAGC AATCGTCCTT GCTTGATCCA ACATTAATGC TGTTGGCCTT TGTGTTTGTG
CTAATTGGCT TTGGTACCAA AACCGGGCTA TTTCCCATGC ACGCCTGGCT GCCGGATGCT
CACAGTGAAG CGCCGAGTCC GGTCAGCGCC CTACTCTCCG CCGTATTGCT GAACTGCGCG
CTGTTGGTGC TGATTCGCTA TTACATCATT ATTTGCCAGG CCATCGGCAG CGATTTCCCC
AACCGGTTGT TGCTCATCTT CGGCATGTTG TCGGTTGCCG TGGCGGCATT TTTCATTCTG
GTACAGCGGG ACATTAAGCG TCTGCTGGCG TACTCCAGCG TGGAGAACAT GGGACTGGTC
GCGGTGGCGC TAGGCATTGG CGGGCCGCTG GGAATTTTTG CCGCGCTGCT GCACACCTTA
AACCACAGTC TGGCAAAAAC GCTGCTGTTC TGCGGTTCCG GCAATGTACT GCTCAAGTAC
GGCACGCGCG ATCTCAACGT CGTCTGTGGG ATGCTCAAAA TCATGCCATT TACCGCCGTG
CTGTTTGGCG GCGGTGCGCT GGCGCTGGCA GGGATGCCGC CCTTCAACAT TTTTCTTAGC
GAATTTATGA CCGTTACCGC CGGACTGGCA CGTAATCACC TGCTGATTAT CGTCCTGCTG
TTATTGCTGT TAACGCTGGT GCTGGCGGGC CTGGTACGGA TGGCTGCGCG GGTGTTAATG
GCGAAACCGC CGCAGGCCGT TAACCGGGGT GATCTCGGCT GGTTGACCAC CTCGCCAATG
GTGATTCTGC TGGTCATGAT GCTGGCGATG GGAACGCATA TTCCACAACC TGTCATCAGG
ATCCTGGCGG GCGCTTCCAC TATAGTCCTC TCAGGGACGC ACGACCTGCC TGCACAACGT
AGCACCTGGC ATGATTTTTT GCCTTCAGGC ACCGCATCTG TTTCGGAGAA ACACAGTGAA
CGTTAA
 
Protein sequence
MFALLLLTPL LFSLLCFACR KRGHSATRTV TVLHSLGITL LLILALWVVQ TAADAGEIFA 
AGLWLHIDGL GGLFLAILGV IGFLTGVYSI GYMRHEVEHG ELSPVTLCDY YGFFHLFLFT
MLLVVTSNNL IVMWAAIEAT TLSSAFLVGI YGQRSSLEAA WKYIIICTVG VAFGLFGTVL
VYANAASVMP QAEMAIFWSE VLKQSSLLDP TLMLLAFVFV LIGFGTKTGL FPMHAWLPDA
HSEAPSPVSA LLSAVLLNCA LLVLIRYYII ICQAIGSDFP NRLLLIFGML SVAVAAFFIL
VQRDIKRLLA YSSVENMGLV AVALGIGGPL GIFAALLHTL NHSLAKTLLF CGSGNVLLKY
GTRDLNVVCG MLKIMPFTAV LFGGGALALA GMPPFNIFLS EFMTVTAGLA RNHLLIIVLL
LLLLTLVLAG LVRMAARVLM AKPPQAVNRG DLGWLTTSPM VILLVMMLAM GTHIPQPVIR
ILAGASTIVL SGTHDLPAQR STWHDFLPSG TASVSEKHSE R