Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2118 |
Symbol | |
ID | 6975545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2345059 |
End bp | 2347926 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643391647 |
Product | 2-oxoglutarate dehydrogenase, E1 subunit |
Protein accession | YP_002276492 |
Protein GI | 209544263 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGCG TAGATATCCT GTCGACCGCG TTCAGCGGAG CAAACACGGC CTATCTGGCC GAACTGTACG CCCGCTGGGC TTCGGACCCC GGCAGCGTCG ATCCATCCTT CGCATCCCTG TTCTCGGCGA TGGACGAGGA AGGGGCGGCG ATCCTGCATG ACGCGGAAGG GGCCTCGTGG TCCCCGCGCG AGTCGATGAT CGATGGCGGC GAGGCCCCGC CGGCGGCATC CAAGGGCGGC CCCGTTTCGG TGGCGAGCCT GCATGCGGCG GCGGACGACA GCCTGCGCGC CACGCAACTG ATCCGCGCCT ATCGCGTGCG CGGCCATCTC GAAGCCCGGC TGGACCCGTT GGGCCTGCAG ATCCCCAAAC CCCATGCCGA CCTGGACCCC GCGACGTACG GGTTCGGGCT GCAGGATCTC GATCGCCCGA TCTATCTGGG CCATATCGTG GCCAACCTGA TCGGCACGCA GACCGCGACG ATCAACCAGG TGCTGGACGC GCTGCGCGCC GTCTATTGCG GGCCGATCGG CGCCGAATTC ATGCATGTCC AGGACCCCGA ACATCGCAAC TGGCTGCAGA AGCGGCTGGA GGGCGACAAC TGGCGCGCCG GCGTGTCGGC GGACGAGAAG AAGGTCATCC TGCACCACCT GACCGAGGCC GAGGGGTTCG AGGCCTTCTG CCAGAAGCGC TATGTCGGCA CGAAGCGCTT CGGGCTGGAG GGCGAGGACG TCACCATTCC CGCGCTGCAT GCGATGATCG ACCAGGTGGC CAAGGACGGC GTGCGCACCG TTGCGATCGG CATGCCGCAT CGCGGCCGCC TGAACACGCT GGTAAACGTG GTGCGCAAGC CCTACACCGC CATCTTCAGC GAATTCGCCG GCGCGTCCTT CAAGCCTGAC GACGTGCAGG GCTCGGGCGA CGTGAAATAC CATCTCGGCA CCTCGACCGA CGTGGATATC GACGGCAACC CGGTCCACAT CTCGCTGCAG CCCAACCCGT CGCACCTGGA AGCGGTCGAT CCGGTGGTGA TCGGCAAGGT GCGCGCGACG CAGGACGATG ACGACCCGCA TGCCCGCAGC CGCCACATGG CGCTGCTGCT GCATGGTGAC GCCGCCTTCG CCGGCCAGGG CCTGGTGTAC GAGACCATGG CGATGTCGCA GCTGATCGGC TATCGCACCG GCGGCACCAT CCATGTCGTG GTGAACAACC AGATCGGCTT CACCACCGTG TCGGCCCATG CCTATTCCGG GCTGTACTGC ACGGACATCG CCAAGGCGGT GCAGGCGCCG ATCCTGCACG TCAACGGCGA CGAGCCCGAA GCCGTGGTGT ATTGCGCCCG CCTGGCCGCC GATTTCCGCC AGAAATTCGC GACCGACATC GTGCTGGACA TCGTGGGCTA CCGCCGTCAC GGCCATAACG AGTCGGACGA GCCGTCCTTC ACCCAGCCGA CGATGTACAA GGCCATCGCC GCCCGCCCGA CGGTGCGCAC GCTGTATGCC GACCGCCTGG TGCGCGAAAG CGTGGTGACC GAGGCCGAGG CCACCGCGCA GTGGGATGCG TTCCAGGACC GGCTGGAGGA ATCCTATCAG GCGGCGCAGA CCTACAAGCC CAACAAGGCC GACTGGCTGG AAGGTGCCTG GACCGGGCTG AAGCCCCCGC CGGTGGGCGC GGTGGATGCC GAACCTGCGA CCGGCGTCGC GGTCGAGGCG CTGCGCAAGA TCGGCGAGGC GCTCAGCACC GCGCCGTCCG ATTTCAACAT CAATCCCAAG ATCGCCCGCC AGCTCAAGGC CAAGGCCGCG ATGTTCCAGT CGGGCGAAGG CATCGACTGG GCGACCGGCG AGGCGCTGGG CTTCGGCTCC CTGGTGCTGG AAAAGCATCG CGTCCGCCTG TCGGGCGAGG ACTGCCAGCG CGGCACGTTC AGCCAGCGCC ATGCCGTGCT GACGGACCAG GTCAACCAGA ACACCTATGT GCCGCTGAAC AACATCGACG CCGGCCAGGG CGTGTTCGAG GTCTATAACT CGCTGCTTTC GGAATTCGGC GTGCTGGGCT TCGAATATGG CTATTCGCTG GCCGATCCGA ACGCGCTGGT GCTGTGGGAA GGCCAGTTCG GCGACTTCGC CAACGGCGCG CAGGTCATCA TCGACCAGTT CATCGCCTCG GGCGAGACCA AGTGGCTGCG CATGTCGGGC CTGGTGCTGC TGCTGCCGCA CGGGTACGAG GGCCAGGGTC CGGAACATTC CTCGGCCCGC CTGGAACGGT ACCTGCAGCT CTGCGCCGAA AACAACATGC GGGTCTGCAA CCTGACCACG CCGGCGAACT ATTTCCACGC CCTGCGCCGC CAGCTCAAGC TGGACTACCG CAAGCCGCTG GTCATCATGA CGCCGAAATC GCTTCTGCGG CACAAGCTGG CGGTCTCGAA CCTCGAGGAG TTCGCCTCGG GCACGACGTT CCGTCCGGTG ATCGGCGAAA TCGATCCGAT CGCCAATGGC GACGCGATCG AACGGGTCGT GATCTGCTCG GGCAAGGTCT ATTACGACCT GCTGGCCGAA CGGCGCGAGC GTGCGCTGGA CAAGGTCGCG ATCCTGCGGC TGGAACAGTT CTACCCGTTC CCGGAAAAGC TGCTGGCCGA GCAACTGGCC CTGTATCCGA AGGCGAAAGT CATCTGGTGC CAGGAAGAGC CCGAGAACAT GGGGGGATGG ACGTTCGTCG ATCGCCTGAT CGAAGGCGTG ATGGCCAAGG CGGGTCGCAA GGGCGGCCGT CCGACCTATG TCGGCCGTGT CGCGGCGGCC AGCCCGGCCA CCGGCCTGGC CCGCGTCCAT GCCAGCGAGC AGGCGGCCCT GGTCGCCCAG GCGCTGGGCG TCGGCTGA
|
Protein sequence | MAGVDILSTA FSGANTAYLA ELYARWASDP GSVDPSFASL FSAMDEEGAA ILHDAEGASW SPRESMIDGG EAPPAASKGG PVSVASLHAA ADDSLRATQL IRAYRVRGHL EARLDPLGLQ IPKPHADLDP ATYGFGLQDL DRPIYLGHIV ANLIGTQTAT INQVLDALRA VYCGPIGAEF MHVQDPEHRN WLQKRLEGDN WRAGVSADEK KVILHHLTEA EGFEAFCQKR YVGTKRFGLE GEDVTIPALH AMIDQVAKDG VRTVAIGMPH RGRLNTLVNV VRKPYTAIFS EFAGASFKPD DVQGSGDVKY HLGTSTDVDI DGNPVHISLQ PNPSHLEAVD PVVIGKVRAT QDDDDPHARS RHMALLLHGD AAFAGQGLVY ETMAMSQLIG YRTGGTIHVV VNNQIGFTTV SAHAYSGLYC TDIAKAVQAP ILHVNGDEPE AVVYCARLAA DFRQKFATDI VLDIVGYRRH GHNESDEPSF TQPTMYKAIA ARPTVRTLYA DRLVRESVVT EAEATAQWDA FQDRLEESYQ AAQTYKPNKA DWLEGAWTGL KPPPVGAVDA EPATGVAVEA LRKIGEALST APSDFNINPK IARQLKAKAA MFQSGEGIDW ATGEALGFGS LVLEKHRVRL SGEDCQRGTF SQRHAVLTDQ VNQNTYVPLN NIDAGQGVFE VYNSLLSEFG VLGFEYGYSL ADPNALVLWE GQFGDFANGA QVIIDQFIAS GETKWLRMSG LVLLLPHGYE GQGPEHSSAR LERYLQLCAE NNMRVCNLTT PANYFHALRR QLKLDYRKPL VIMTPKSLLR HKLAVSNLEE FASGTTFRPV IGEIDPIANG DAIERVVICS GKVYYDLLAE RRERALDKVA ILRLEQFYPF PEKLLAEQLA LYPKAKVIWC QEEPENMGGW TFVDRLIEGV MAKAGRKGGR PTYVGRVAAA SPATGLARVH ASEQAALVAQ ALGVG
|
| |