Gene Information |
Locus tag | ECH74115_1209 |
Symbol | hyaB |
ID | 6968164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1217407 |
End bp | 1219200 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643385204 |
Product | hydrogenase 1 large subunit |
Protein accession | YP_002269699 |
Protein GI | 209400020 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
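The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the values in this record) and that the CDS ends with a single stop codon; the function names `feature_length` and `protein_length` are illustrative only:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = feature_length(1217407, 1219200)
print(gene_len)                  # 1794
print(protein_length(gene_len))  # 597
```

Both results agree with the Gene Length (1794 bp) and Protein Length (597 aa) fields.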
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTC AGTACGAAAC TCAGGGATAC ACCATCAATA ATGCCGGACG CCGCCTGGTG GTCGACCCGA TTACGCGCAT TGAAGGCCAC ATGCGCTGCG AAGTGAATAT TAACGATCAG AATGTAATCA CCAATGCCGT CTCCTGCGGC ACCATGTTTC GCGGGCTGGA GATCATCCTG CAAGGGCGCG ACCCGCGCGA TGCGTGGGCG TTCGTTGAAC GTATCTGCGG CGTCTGTACT GGCGTACACG CTCTGGCTTC GGTTTACGCC ATCGAAGATG CCATCGGTAT TAAAGTGCCG GACAACGCCA ATATCATCCG CAACATTATG CTGGCAACGC TCTGGTGCCA CGATCATCTG GTGCACTTCT ATCAGCTTGC CGGGATGGAC TGGATCGATG TGTTGGATGC GCTGAAAGCC GACCCGCGGA AAACCTCCGA ACTGGCGCAA AGTCTCTCCT CTTGGCCGAA ATCATCCCCT GGCTATTTCT TCGACGTACA AAACCGCCTG AAGAAATTCG TTGAAGGCGG GCAGTTGGGG ATCTTCCGCA ATGGCTACTG GGGGCACCCG CAGTACAAGT TGCCGCCAGA AGCCAACCTG ATGGGCTTTG CCCACTATCT CGAAGCTCTC GATTTCCAGC GTGAAATTGT CAAAATCCAC GCGGTCTTTG GCGGTAAAAA CCCGCATCCA AACTGGATTG TCGGCGGGAT GCCTTGCGCC ATTAACATTG ACGAAAGCGG CGCGGTCGGG GCGGTCAATA TGGAACGCCT GAACCTGGTG CAGTCGATTA TCACCCGCAC GGCAGACTTC ATTAACAACG TGATGATCCC CGACGCCTTA GCCATCGGTC AGTTCAACAA ACCGTGGAGC GAAATCGGCA CGGGTCTTTC TGATAAATGT GTCCTCAGCT ACGGCGCATT CCCGGATATT GCCAACGACT TTGGTGAGAA AAGTCTGCTG ATGCCTGGCG GCGCGGTGAT TAACGGCGAC TTCAACAATG TGCTGCCAGT GGATTTGGTT GATCCGCAGC AGGTGCAGGA GTTTGTCGAT CACGCCTGGT ATCGTTATCC CAACGATCAG GTCGGGCGTC ATCCGTTCGA TGGCATCACC GACCCGTGGT ACAACCCCGG CGATGTCAAA GGCAGCGATA CCAACATTCA GCAGCTGAAT GAACAGGAAC GCTACTCGTG GATCAAAGCG CCGCGCTGGC GCGGTAACGC GATGGAAGTG GGGCCGCTGG CACGCACGTT AATCGCTTAT CACAAAGGCG ATGCTGCGAC CGTTGAGTCG GTCGATCGCA TGATGTCGGC GCTGAACCTG CCGCTTTCCG GTATCCAGTC AACGTTGGGC CGCATTTTGT GCCGCGCGCA CGAAGCACAG TGGGCCGCAG GTAAGTTGCA GTATTTCTTC AACAAGCTGA TGACTAACCT GAAAAACGGC AATCTTGCCA CCGCCTCCAC GGAAAAATGG GAACCCACAA CCTGGCCGAC AGAGTGCCGT GGTGTCGGTT TTACCGAAGC GCCGCGCGGG GCGTTAGGCC ACTGGGCCGC CATTCGCGAT GGCAAGATTG ATCTCTACCA GTGCGTGGTG CCGACCACCT GGAACGCCAG CCCGCGCGAT CCCAAAGGGC AGATTGGCGC TTATGAAGCG GCGCTGATGA ACACCAAAAT GGCGATCCCC GAGCAACCGC TGGAGATCCT GCGTACTCTG CACAGCTTTG ATCCGTGCCT CGCCTGTTCA ACACACGTGC TGGGCGACGA CGGTAGCGAG CTGATCTCCG TGCAGGTGCG TTAA
|
Protein sequence | MSTQYETQGY TINNAGRRLV VDPITRIEGH MRCEVNINDQ NVITNAVSCG TMFRGLEIIL QGRDPRDAWA FVERICGVCT GVHALASVYA IEDAIGIKVP DNANIIRNIM LATLWCHDHL VHFYQLAGMD WIDVLDALKA DPRKTSELAQ SLSSWPKSSP GYFFDVQNRL KKFVEGGQLG IFRNGYWGHP QYKLPPEANL MGFAHYLEAL DFQREIVKIH AVFGGKNPHP NWIVGGMPCA INIDESGAVG AVNMERLNLV QSIITRTADF INNVMIPDAL AIGQFNKPWS EIGTGLSDKC VLSYGAFPDI ANDFGEKSLL MPGGAVINGD FNNVLPVDLV DPQQVQEFVD HAWYRYPNDQ VGRHPFDGIT DPWYNPGDVK GSDTNIQQLN EQERYSWIKA PRWRGNAMEV GPLARTLIAY HKGDAATVES VDRMMSALNL PLSGIQSTLG RILCRAHEAQ WAAGKLQYFF NKLMTNLKNG NLATASTEKW EPTTWPTECR GVGFTEAPRG ALGHWAAIRD GKIDLYQCVV PTTWNASPRD PKGQIGAYEA ALMNTKMAIP EQPLEILRTL HSFDPCLACS THVLGDDGSE LISVQVR
|
| |
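The sequence fields can be checked against the GC content and protein fields with a short script. This is a sketch under stated assumptions, not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares. Table 11 additionally allows alternative start codons, which this sketch does not handle; that does not matter for this gene, since it starts with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(text: str) -> str:
    """Drop the spaces and line breaks between 10-base blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCACTC AGTACGAAAC TCAGGGATAC"))  # MSTQYETQGY
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields.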