Gene Information |
Locus tag | Hoch_4994 |
Symbol | |
ID | 8547404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 6886953 |
End bp | 6889811 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646389670 |
Product | 2-oxoglutarate dehydrogenase, E1 subunit |
Protein accession | YP_003269376 |
Protein GI | 262198167 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
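
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(6886953, 6889811)
print(length)                  # 2859
print(protein_length(length))  # 952
```

Both values match the Gene Length and Protein Length fields reported in this record.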
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA CCGACCGAGG CGGCAACGAC CCCCTGAACA GTTCGAGCCT TACGTTCGCC GAGGACCTGT ATCAGACCTA CCTCGACGAC CCGCAGGCGG TCCCGGCCGA CTGGCGGGTT TATTTCGACC AGTTGGACGG CAAAGGCACG GGGTCGGGCA GCGGCGCCGG CGCCAACGGG CCCAGCTTCC CGTGGCGCAG CCTGTTCCAC GGCGGCGCGC GCGCGGGCAA TGGCGCCACG CGCGCAGGCG CGGTCGCGGC CGAGATGCCG CCGAGCGGCG ACGCCGACCT GCAGCACCGC GTCGATATGA TGATCCGCAA CTACCGGGTG CGCGGTCACG AGGTCGCGAC CATCAACCCG CTCGGCGGCG ATGTGCCCGA GATCCCCGAG CTGGCCACCG ACTACTACGG CTTCCGCGAG TCGGACTTCG AGCTGCCGCT GGCGCCCAAC ACCCTGCCCG GCTGCGCCCA CCTGCGCGAT GTGTATAACG CGCTGCGCGC CACCTATACC CGATCGATCG GCGCCGAGTA CATGCACATC AGCAACGGCG ATGTGCGCCG CTGGCTGTCC GATCGCATGG AGCGCGGCCG CAACCGCATC GAGCTGTCGC GCGCGACCCA GCTCAGCATC CTCACCAAGC TCACCGACGC CGAGATCTTC GAGGAGTTCA TCCAGAAGAA GTTCGTCGGC GCCAAGCGCT TCTCGCTCGA GGGCGGCGAG AGCCTGATCC CGCTGCTCGA CATGGCCATC GAGAAGGCGG CCAACTCCGG GGTCAAGGAG ATCGTGCTGG GCATGGCCCA CCGCGGCCGG CTCAACGTGC TGGCCAACAT CATGGGCAAG AACCCGCGCA CCATCTTCCG CGAGTTCGAG GACAAGAACC CCGAGCGCCA CTTCGGCTCG GGCGACGTCA AGTATCACCT CGGCTACAGC GCCGAGTGGG TGTCGGCCGA GAACCACGCC CTGCACATGT CGCTGGCCTT CAACCCCTCG CACCTCGAGT TCGTCAACCC GGTGGTGATG GGCCGCGTGC GCGCCAAGCA GGACCGCTTC GGCGACACCG ATCGCACCTG CGGGCTGGCC ATCCTCATCC ACGGCGACGC CGCCTTCATC GGCGAGGGCG TGGTGCAGGA GACGCTGAAC ATGTCGGAGC TCGACGGTTA CGCCGTCGGC GGCACCCTGC ACGTCATCGT CAACAATCAG CTCGGCTTCA CCACCGGCTC CGACCAGAGC CGCAGCACGG TGTACGCCAG CGACATCGCC AAGATGCTGC AGAGTCCGAT CTTCCACGTC AACGGCGAGG ATCCCGAGGC CGTGGCGCAG ACCATCGAGC TGGCCATGGA CTTCCGCGCC GAGTTTGGCC GCGACGTGGT CATCGACATG TACTGCTACC GCCGCCACGG CCACAACGAG GGCGACGAGC CGGCCTTCAC CCAGCCGCTG ATGTACAGCG AGATCCGCCA GCGCCCGACC GTGCGCGAGA GCTACATCGA GCACCTGCTC AAGCTGGGCG AGATCACGGG TGACGAGGCC ACCGAAATCG CCGACGCGCG TCGCGCGCAC CTCGAGGACG AGCTGTCGGT GGCCCGCAGC GAGGACTTCC AGCCCCACTA CTCGGCCGGC GAAGGCATCT GGCAGCCCTA CCACGGCGGC GCCGACGTGC GCACCGACGA TGTCGAGACC GGCATCCACG AGGACGACGC GCGCTCGCTG CTGCAGCGGC TCACCGAGGT GCCCGAGGAG TTCCACCAGC ACCCCAAGAT CACGCGCGGG CTCAAGCAGC GCCGGGCCAT GGCCGAGGGC 
GAGCATCCGC TCGACTGGTC GGCGGCCGAG GCCCTGGCCC TGGCCAGCCT GCTCACCACC GGCACGCGCG TGCGCATGAC CGGACAGGAC GCCGAGCGCG GAACCTTCAG CCAGCGCCAC GCGGTGCTGC ACGACGTCAA CAGCGACGCG CGCTTCATGC CGCTGGCGCA TCTGGCGCCC GATCAGGCGC CGATCGAGAT CCACAACAGC CCGCTGTCCG AGGCCGGCGT GCTCGGCTTC GAGTACGGCT ACAGCCTCGA CACCCCGGAC GGGCTGGTGC TGTGGGAGGC CCAGTACGGC GACTTCGTCA ACGCCGCCCA GGTCATCATC GACCAGTTCA TCTCCTCGGC CGAGGACAAG TGGAACCGGC TCTCGGGCCT GGTCATGCTG CTGCCGCACG GCTTCGAGGG CAGCGGCCCC GAGCACTCCA GCGCGCGGCT CGAGCGCTTC TTGCAGCTGT GCGCCGAGGA CAACATCCAG GTCGCCAACC CGAGCACGCC GAGCCAGTAC TTCCACCTGC TGCGCCGCCA GGTGCGGCGC CCGGCGCGCA AGCCGCTGGT GGTGATGACG CCCAAGAGCT TGCTGCGCCA TCACAAGGCG CAGTCGCCGC TGTCCGAATT CACCGACGGC CGCTTCGAGC GCGTCCTGGC CGACGAGCTT GAGCCCGCTC GCGTCAAGCA CGTCCTGCTG TGTTCGGGGA AGGTGTACTA CGATCTGCTG GCCGAGCGCG ACGCCGAGGA GCGCAAGGAC GTGGCCATCA TTCGCCTCGA GCAGCTCTAC CCGCTGGCCA TGGAGGAGCT CGAGCGCGTG CTCTCGCCCT ACGCCGCCGG CACGCCCGTG TACTGGGTGC AGGAAGAGCC GGCCAACATG GGCGCGTGGT GGTTTCTGCG GGTGCAGTGG GGCAGCCAGG TCCTCGGCCA TCCCTTCTCC GGCATCAGCC GCCGCGCCTC GGCCAGCCCG GCCACCGGCT CGGGCACCAG CCACAAACTC GAGCAGACCG CTCTGGTCCG CGCGGCCATC CTCGGCGCCG AGAGCTCGCT GGTGACGACC ACCAGCTAA
|
Protein sequence | MSDTDRGGND PLNSSSLTFA EDLYQTYLDD PQAVPADWRV YFDQLDGKGT GSGSGAGANG PSFPWRSLFH GGARAGNGAT RAGAVAAEMP PSGDADLQHR VDMMIRNYRV RGHEVATINP LGGDVPEIPE LATDYYGFRE SDFELPLAPN TLPGCAHLRD VYNALRATYT RSIGAEYMHI SNGDVRRWLS DRMERGRNRI ELSRATQLSI LTKLTDAEIF EEFIQKKFVG AKRFSLEGGE SLIPLLDMAI EKAANSGVKE IVLGMAHRGR LNVLANIMGK NPRTIFREFE DKNPERHFGS GDVKYHLGYS AEWVSAENHA LHMSLAFNPS HLEFVNPVVM GRVRAKQDRF GDTDRTCGLA ILIHGDAAFI GEGVVQETLN MSELDGYAVG GTLHVIVNNQ LGFTTGSDQS RSTVYASDIA KMLQSPIFHV NGEDPEAVAQ TIELAMDFRA EFGRDVVIDM YCYRRHGHNE GDEPAFTQPL MYSEIRQRPT VRESYIEHLL KLGEITGDEA TEIADARRAH LEDELSVARS EDFQPHYSAG EGIWQPYHGG ADVRTDDVET GIHEDDARSL LQRLTEVPEE FHQHPKITRG LKQRRAMAEG EHPLDWSAAE ALALASLLTT GTRVRMTGQD AERGTFSQRH AVLHDVNSDA RFMPLAHLAP DQAPIEIHNS PLSEAGVLGF EYGYSLDTPD GLVLWEAQYG DFVNAAQVII DQFISSAEDK WNRLSGLVML LPHGFEGSGP EHSSARLERF LQLCAEDNIQ VANPSTPSQY FHLLRRQVRR PARKPLVVMT PKSLLRHHKA QSPLSEFTDG RFERVLADEL EPARVKHVLL CSGKVYYDLL AERDAEERKD VAIIRLEQLY PLAMEELERV LSPYAAGTPV YWVQEEPANM GAWWFLRVQW GSQVLGHPFS GISRRASASP ATGSGTSHKL EQTALVRAAI LGAESSLVTT TS
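
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation and a rough completeness check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the stop codon set is the one assumed for translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove display whitespace (spaces, line breaks) and uppercase the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length is a multiple of 3 and the last codon is a stop codon."""
    stop_codons = {"TAA", "TAG", "TGA"}  # assumed stop codons for table 11
    return len(seq) % 3 == 0 and seq[-3:] in stop_codons


# First three 10-base blocks of the gene sequence shown above.
prefix = clean_sequence("ATGAGTGACA CCGACCGAGG CGGCAACGAC")
print(len(prefix))
print(round(gc_fraction(prefix), 2))
```

Applying `clean_sequence` to the full Gene sequence field and then calling `gc_fraction` should give a value that can be compared with the GC content field of the record. The short prefix used here is only for illustration and is not expected to match that figure.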