Gene ECH74115_3709 details

Gene Information

Locus tag: ECH74115_3709
Symbol:
ID: 6971771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3428699
End bp: 3430414
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 55%
IMG OID: 643387503
Product: hydrogenase-4, G subunit
Protein accession: YP_002271956
Protein GI: 209399980
COG category: [C] Energy production and conversion
COG ID: [COG3261] Ni,Fe-hydrogenase III large subunit
        [COG3262] Ni,Fe-hydrogenase III component G
TIGRFAM ID:
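
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3428699, 3430414
gene_length = span_length(start_bp, end_bp)
protein_length = coding_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1716 571
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.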


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 78
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAACGTTA ATTCATCGTC AAATCGTGGC GAAGCGATTC TCGCCGCCCT GAAAACGCAG 
TTCCCCGGCG CGGTGCTGGA TGAAGAGCGA CAAACGCCTG AACAGGTCAC CATTACGGTG
AAAATCAATC TGCTGCCTGA CGTTGTACAG TATCTTTATT ATCAACATGA TGGCTGGCTT
CCGGTCCTGT TTGGCAACGA CGAGCGGACA CTTAACGGTC ATTACGCGGT TTATTATGCC
CTTTCAATGG AAGGGGCCGA AAAATGCTGG ATTGTGGTGA AGGCGCTGGT CGATGCCGAC
AGTCGGGAGT TTCCGTCAGT CACACCGCGC GTCCCTGCCG CGGTCTGGGG CGAGCGAGAA
ATTCGTGATA TGTACGGGCT GATTCCGGTT GGCCTGCCGG ATCAGCGTCG CCTGGTGTTG
CCCGATGACT GGCCGGAAGA TATGCATCCG CTGCGCAAAG ATGCGATGGA TTATCGACTG
CGCCCTGAAC CGACGACTGA TTCCGAAACG TATCCGTTTA TCAATGAGGG CAACAGCGGT
GCGCGGGTGA TCCCTGTCGG CCCGCTGCAT ATCACCTCCG ATGAACCGGG TCACTTCCGC
TTGTTTGTGG ATGGCGAGCA AATTGTCGAT GCTGATTACC GTCTGTTTTA TGTCCATCGC
GGCATGGAGA AACTGGCAGA AACGCGGATG GGCTACAACG AAGTGACCTT CTTATCGGAC
CGCGTGTGTG GGATTTGCGG TTTTGCCCAC AGTGTGGCCT ATACCAACTC GGTTGAAAAT
GCACTGGGGA TTGAGGTGCC GCAACGAGCG CATACCATTC GCTCGATTCT GCTGGAAGTC
GAACGGCTAC ACAGTCATTT GCTCAACCTT GGCCTCTCCT GCCATTTTGT TGGATTTGAT
ACCGGCTTTA TGCAATTTTT CCGCGTGCGG GAAAAGTCGA TGACAATGGC GGAATTGCTG
ACCGGGTCGC GTAAAACCTA CGGTCTGAAT CTGATTGGTG GTGTTCGCCG CGATATTCTC
AAAGAGCAAC GTCTGCAAAC GCTGAAACTG GTGCGCGAGA TGCGCGCCGA CGTGTCGGAG
CTGGTAGAAA TGCTGCTTGC CACGCCGAAT ATGGAACAAC GCACTCAGGG CATTGGCATT
CTCGACCGAC AAATCGCCCG TGATTATAGC CCTGTAGGGC CGCTGATCCG CGGCAGTGGT
TTTGCCCGTG ATTTGCGCTT TGATCACCCC TACGCCGACT ACGGCAATAT TCCAAAAACA
CTGTTTACCT TCACCGGCGG CGATGTTTTC TCCCGCGTGA TGGTCCGTGT CAAAGAGACG
TTTGATTCGC TGGCAATGCT GGAATTTGCT CTCGACAACA TGCCGGATAC CCCACTGCTG
ACCGAAGGCT TTAGCTATAA ACCTCACGCA TTCGCGCTCG GCTTTGTTGA AGCGCCACGC
GGTGAAGACG TGCACTGGAG CATGCTCGGT GATAACCAAA AATTGTTCCG CTGGCGCTGC
CGTGCCGCCA CCTACGCCAA CTGGCCGGTG TTGCGTTACA TGCTGCGCGG CAATACCGTT
TCTGACGCAC CGCTGATTAT CGGTAGCCTT GATCCCTGCT ACTCCTGTAC CGACCGTGTG
ACGCTGGTTG ATGTGCGCAA GCGCCAGTCA AAAACCGTGC CGTATAAAGA GATCGAACGC
TACGGCATTG ATCGTAACCG TTCGCCGCTG AAGTAA
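
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any calculation. This is a minimal sketch of how a GC percentage like the one in the record could be recomputed. The helper names are hypothetical, and the record's figure appears to be rounded to a whole percent.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "GTGAACGTTA ATTCATCGTC AAATCGTGGC GAAGCGATTC TCGCCGCCCT GAAAACGCAG"
print(len(clean_sequence(first_line)))  # prints: 60
print(round(gc_percent(first_line)))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.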
 
Protein sequence
MNVNSSSNRG EAILAALKTQ FPGAVLDEER QTPEQVTITV KINLLPDVVQ YLYYQHDGWL 
PVLFGNDERT LNGHYAVYYA LSMEGAEKCW IVVKALVDAD SREFPSVTPR VPAAVWGERE
IRDMYGLIPV GLPDQRRLVL PDDWPEDMHP LRKDAMDYRL RPEPTTDSET YPFINEGNSG
ARVIPVGPLH ITSDEPGHFR LFVDGEQIVD ADYRLFYVHR GMEKLAETRM GYNEVTFLSD
RVCGICGFAH SVAYTNSVEN ALGIEVPQRA HTIRSILLEV ERLHSHLLNL GLSCHFVGFD
TGFMQFFRVR EKSMTMAELL TGSRKTYGLN LIGGVRRDIL KEQRLQTLKL VREMRADVSE
LVEMLLATPN MEQRTQGIGI LDRQIARDYS PVGPLIRGSG FARDLRFDHP YADYGNIPKT
LFTFTGGDVF SRVMVRVKET FDSLAMLEFA LDNMPDTPLL TEGFSYKPHA FALGFVEAPR
GEDVHWSMLG DNQKLFRWRC RAATYANWPV LRYMLRGNTV SDAPLIIGSL DPCYSCTDRV
TLVDVRKRQS KTVPYKEIER YGIDRNRSPL K
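
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the standard codon table, whose amino acid assignments translation table 11 shares, and reads the first codon as methionine when it is a recognised start codon. The set of start codons listed is an assumption about table 11 and should be checked against the NCBI genetic code tables. `translate_cds` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order (standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed alternative start codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]  # KeyError on ambiguous bases such as N
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACGTTA ATTCATCGTC AAATCGTGGC"))  # prints: MNVNSSSNRG
```

The printed residues match the first ten residues of the protein sequence, which begins with M even though the gene starts with GTG.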