Gene Information |
Locus tag | ECH74115_3709 |
Symbol | |
ID | 6971771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3428699 |
End bp | 3430414 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387503 |
Product | hydrogenase-4, G subunit |
Protein accession | YP_002271956 |
Protein GI | 209399980 |
COG category | [C] Energy production and conversion |
COG ID | [COG3261] Ni,Fe-hydrogenase III large subunit; [COG3262] Ni,Fe-hydrogenase III component G |
TIGRFAM ID | |
|
|
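The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state the convention) and that the final codon is an untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3428699
end_bp = 3430414

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1716, matching "Gene Length"
print(protein_length)  # 571, matching "Protein Length"
```

Under these assumptions the reported gene length (1716 bp) and protein length (571 aa) agree with the start and end positions.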
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGTTA ATTCATCGTC AAATCGTGGC GAAGCGATTC TCGCCGCCCT GAAAACGCAG TTCCCCGGCG CGGTGCTGGA TGAAGAGCGA CAAACGCCTG AACAGGTCAC CATTACGGTG AAAATCAATC TGCTGCCTGA CGTTGTACAG TATCTTTATT ATCAACATGA TGGCTGGCTT CCGGTCCTGT TTGGCAACGA CGAGCGGACA CTTAACGGTC ATTACGCGGT TTATTATGCC CTTTCAATGG AAGGGGCCGA AAAATGCTGG ATTGTGGTGA AGGCGCTGGT CGATGCCGAC AGTCGGGAGT TTCCGTCAGT CACACCGCGC GTCCCTGCCG CGGTCTGGGG CGAGCGAGAA ATTCGTGATA TGTACGGGCT GATTCCGGTT GGCCTGCCGG ATCAGCGTCG CCTGGTGTTG CCCGATGACT GGCCGGAAGA TATGCATCCG CTGCGCAAAG ATGCGATGGA TTATCGACTG CGCCCTGAAC CGACGACTGA TTCCGAAACG TATCCGTTTA TCAATGAGGG CAACAGCGGT GCGCGGGTGA TCCCTGTCGG CCCGCTGCAT ATCACCTCCG ATGAACCGGG TCACTTCCGC TTGTTTGTGG ATGGCGAGCA AATTGTCGAT GCTGATTACC GTCTGTTTTA TGTCCATCGC GGCATGGAGA AACTGGCAGA AACGCGGATG GGCTACAACG AAGTGACCTT CTTATCGGAC CGCGTGTGTG GGATTTGCGG TTTTGCCCAC AGTGTGGCCT ATACCAACTC GGTTGAAAAT GCACTGGGGA TTGAGGTGCC GCAACGAGCG CATACCATTC GCTCGATTCT GCTGGAAGTC GAACGGCTAC ACAGTCATTT GCTCAACCTT GGCCTCTCCT GCCATTTTGT TGGATTTGAT ACCGGCTTTA TGCAATTTTT CCGCGTGCGG GAAAAGTCGA TGACAATGGC GGAATTGCTG ACCGGGTCGC GTAAAACCTA CGGTCTGAAT CTGATTGGTG GTGTTCGCCG CGATATTCTC AAAGAGCAAC GTCTGCAAAC GCTGAAACTG GTGCGCGAGA TGCGCGCCGA CGTGTCGGAG CTGGTAGAAA TGCTGCTTGC CACGCCGAAT ATGGAACAAC GCACTCAGGG CATTGGCATT CTCGACCGAC AAATCGCCCG TGATTATAGC CCTGTAGGGC CGCTGATCCG CGGCAGTGGT TTTGCCCGTG ATTTGCGCTT TGATCACCCC TACGCCGACT ACGGCAATAT TCCAAAAACA CTGTTTACCT TCACCGGCGG CGATGTTTTC TCCCGCGTGA TGGTCCGTGT CAAAGAGACG TTTGATTCGC TGGCAATGCT GGAATTTGCT CTCGACAACA TGCCGGATAC CCCACTGCTG ACCGAAGGCT TTAGCTATAA ACCTCACGCA TTCGCGCTCG GCTTTGTTGA AGCGCCACGC GGTGAAGACG TGCACTGGAG CATGCTCGGT GATAACCAAA AATTGTTCCG CTGGCGCTGC CGTGCCGCCA CCTACGCCAA CTGGCCGGTG TTGCGTTACA TGCTGCGCGG CAATACCGTT TCTGACGCAC CGCTGATTAT CGGTAGCCTT GATCCCTGCT ACTCCTGTAC CGACCGTGTG ACGCTGGTTG ATGTGCGCAA GCGCCAGTCA AAAACCGTGC CGTATAAAGA GATCGAACGC TACGGCATTG ATCGTAACCG TTCGCCGCTG AAGTAA
|
Protein sequence | MNVNSSSNRG EAILAALKTQ FPGAVLDEER QTPEQVTITV KINLLPDVVQ YLYYQHDGWL PVLFGNDERT LNGHYAVYYA LSMEGAEKCW IVVKALVDAD SREFPSVTPR VPAAVWGERE IRDMYGLIPV GLPDQRRLVL PDDWPEDMHP LRKDAMDYRL RPEPTTDSET YPFINEGNSG ARVIPVGPLH ITSDEPGHFR LFVDGEQIVD ADYRLFYVHR GMEKLAETRM GYNEVTFLSD RVCGICGFAH SVAYTNSVEN ALGIEVPQRA HTIRSILLEV ERLHSHLLNL GLSCHFVGFD TGFMQFFRVR EKSMTMAELL TGSRKTYGLN LIGGVRRDIL KEQRLQTLKL VREMRADVSE LVEMLLATPN MEQRTQGIGI LDRQIARDYS PVGPLIRGSG FARDLRFDHP YADYGNIPKT LFTFTGGDVF SRVMVRVKET FDSLAMLEFA LDNMPDTPLL TEGFSYKPHA FALGFVEAPR GEDVHWSMLG DNQKLFRWRC RAATYANWPV LRYMLRGNTV SDAPLIIGSL DPCYSCTDRV TLVDVRKRQS KTVPYKEIER YGIDRNRSPL K
|
| |
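The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several alternative initiation codons and is read as methionine when it starts a CDS. The following is a minimal sketch of that rule, not the method used to produce this record; the function name `translate` and its `is_start` parameter are hypothetical, and the input fragments are the first 30 and last 12 bases of the gene sequence as printed above.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, is_start=True):
    """Translate a DNA fragment; whitespace between 10-base blocks is ignored.

    If is_start is True, an alternative start codon in the first position
    is read as M. Translation stops at the first stop codon.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = translate("GTGAACGTTA ATTCATCGTC AAATCGTGGC")
tail = translate("CCGCTG AAGTAA", is_start=False)
print(head)  # MNVNSSSNRG
print(tail)  # PLK
```

The two outputs match the first ten residues (MNVNSSSNRG) and the last three residues (PLK) of the protein sequence listed above, with TAA acting as the stop codon.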