Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2144 |
Symbol | hyaB |
ID | 6146942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2152534 |
End bp | 2154327 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617020 |
Product | hydrogenase 1 large subunit |
Protein accession | YP_001744195 |
Protein GI | 170683907 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.375749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.757738 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTC AGTACGAAAC TCAGGGATAC ACCATCAATA ATGCCGGACG CCGCCTGGTG GTCGACCCGA TTACGCGCAT TGAAGGCCAC ATGCGCTGCG AAGTGAATAT TAACGATCAG AATGTGATCA CCAATGCCGT CTCCTGCGGC ACCATGTTTC GCGGGCTGGA GATCATCCTG CAAGGGCGTG ACCCGCGCGA TGCGTGGGCG TTCGTTGAAC GTATCTGCGG CGTCTGTACT GGCGTACACG CCCTGGCTTC GGTTTACGCC ATCGAAGATG CCATCGGTAT TAAAGTGCCG GACAACGCCA ATATCATCCG CAACATTATG CTGGCAACGC TCTGGTGCCA CGATCATCTG GTGCACTTCT ATCAGCTTGC CGGGATGGAT TGGATCGATG TGTTAGATGC GCTGAAAGCC GACCCGCGGA AAACCTCCGA ACTGGCGCAA AGCCTCTCCT CATGGCCGAA ATCATCCCCT GGCTATTTCT TCGACGTACA AAACCGCCTG AAGAAATTCG TTGAAGGCGG GCAGTTGGGG ATCTTCCGCA ATGGCTACTG GGGGCACCCG CAGTACAAAC TGCCGCCAGA AGCCAACCTG ATGGGCTTTG CCCACTATCT CGAAGCTCTC GATTTCCAGC GTGAAATTGT CAAAATCCAC GCGGTCTTTG GCGGTAAAAA CCCACATCCA AACTGGATTG TCGGCGGGAT GCCTTGTGCC ATTAACATTG ACGAAAGCGG CGCGGTCGGG GCGGTCAATA TGGAACGCCT GAACCTGGTG CAGTCAATCA TTACCCGTAC GGCGGATTTC ATTAACAACG TGATGATCCC CGACGCCTTA GCCATCGGTC AGTTCAACAA GCCGTGGAGC GAAATCGGCA CTGGCCTTTC GGATAAATGC GTTCTCAGTT ACGGCGCGTT CCCGGATATC GCCAACGACT TTGGCGAGAA AAGTCTGCTG ATGCCTGGCG GCGCGGTGAT TAACGGCGAC TTCAACAATG TGCTGCCAGT GGATTTGGTT GATCCGCAGC AGGTGCAGGA GTTTGTCGAT CACGCCTGGT ATCGTTATCC CAACGATCAG GTCGGGCGTC ATCCGTTCGA TGGTATTACC GACCCGTGGT ACAACCCCGG CGATGTCAAA GGCAGCGATA CCAACATTCA GCAGCTGAAT GAACAGGAAC GCTACTCGTG GATCAAAGCG CCACGCTGGC GCGGTAACGC GATGGAAGTG GGGCCGCTGG CACGCACGTT AATCGCTTAT CACAAAGGCG ATGCGGCGAC CGTTGAGTCG GTCGATCGCA TGATGTCGGC GTTGAACCTA CCGCTTTCCG GCATCCAGTC AACGTTAGGC CGCATTTTGT GTCGCGCGCA CGAAGCGCAG TGGGCCGCAG GCAAGTTGCA ATATTTCTTC GACAAGCTGA TGACCAATCT CAAAAACGGC AATCTCGCCA CCGCCTCCAC TGAGAAGTGG GAACCAGCAA CCTGGCCGAC AGAGTGCCGT GGTGTCGGCT TTACCGAAGC GCCGCGCGGA GCGTTAGGCC ACTGGGCCGC CATTCGCGAT GGCAAGATTG ATCTCTACCA GTGCGTGGTG CCGACCACCT GGAACGCCAG CCCGCGCGAT CCTAAAGGGC AAATTGGCGC TTATGAAGCG GCGCTGATGA ACACCAAAAT GGCGATCCCC GAGCAACCGC TGGAGATCCT GCGTACTCTG CACAGCTTTG ATCCGTGCCT CGCCTGTTCA ACACACGTGC TGGGCGACGA CGGTAGCGAG CTGATCTCCG TGCAGGTGCG TTAA
|
Protein sequence | MSTQYETQGY TINNAGRRLV VDPITRIEGH MRCEVNINDQ NVITNAVSCG TMFRGLEIIL QGRDPRDAWA FVERICGVCT GVHALASVYA IEDAIGIKVP DNANIIRNIM LATLWCHDHL VHFYQLAGMD WIDVLDALKA DPRKTSELAQ SLSSWPKSSP GYFFDVQNRL KKFVEGGQLG IFRNGYWGHP QYKLPPEANL MGFAHYLEAL DFQREIVKIH AVFGGKNPHP NWIVGGMPCA INIDESGAVG AVNMERLNLV QSIITRTADF INNVMIPDAL AIGQFNKPWS EIGTGLSDKC VLSYGAFPDI ANDFGEKSLL MPGGAVINGD FNNVLPVDLV DPQQVQEFVD HAWYRYPNDQ VGRHPFDGIT DPWYNPGDVK GSDTNIQQLN EQERYSWIKA PRWRGNAMEV GPLARTLIAY HKGDAATVES VDRMMSALNL PLSGIQSTLG RILCRAHEAQ WAAGKLQYFF DKLMTNLKNG NLATASTEKW EPATWPTECR GVGFTEAPRG ALGHWAAIRD GKIDLYQCVV PTTWNASPRD PKGQIGAYEA ALMNTKMAIP EQPLEILRTL HSFDPCLACS THVLGDDGSE LISVQVR
|
| |