Gene Gdia_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2118 
Symbol 
ID6975545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2345059 
End bp2347926 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content67% 
IMG OID643391647 
Product2-oxoglutarate dehydrogenase, E1 subunit 
Protein accessionYP_002276492 
Protein GI209544263 
COG category[C] Energy production and conversion 
COG ID[COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes 
TIGRFAM ID[TIGR00239] 2-oxoglutarate dehydrogenase, E1 component 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCG TAGATATCCT GTCGACCGCG TTCAGCGGAG CAAACACGGC CTATCTGGCC 
GAACTGTACG CCCGCTGGGC TTCGGACCCC GGCAGCGTCG ATCCATCCTT CGCATCCCTG
TTCTCGGCGA TGGACGAGGA AGGGGCGGCG ATCCTGCATG ACGCGGAAGG GGCCTCGTGG
TCCCCGCGCG AGTCGATGAT CGATGGCGGC GAGGCCCCGC CGGCGGCATC CAAGGGCGGC
CCCGTTTCGG TGGCGAGCCT GCATGCGGCG GCGGACGACA GCCTGCGCGC CACGCAACTG
ATCCGCGCCT ATCGCGTGCG CGGCCATCTC GAAGCCCGGC TGGACCCGTT GGGCCTGCAG
ATCCCCAAAC CCCATGCCGA CCTGGACCCC GCGACGTACG GGTTCGGGCT GCAGGATCTC
GATCGCCCGA TCTATCTGGG CCATATCGTG GCCAACCTGA TCGGCACGCA GACCGCGACG
ATCAACCAGG TGCTGGACGC GCTGCGCGCC GTCTATTGCG GGCCGATCGG CGCCGAATTC
ATGCATGTCC AGGACCCCGA ACATCGCAAC TGGCTGCAGA AGCGGCTGGA GGGCGACAAC
TGGCGCGCCG GCGTGTCGGC GGACGAGAAG AAGGTCATCC TGCACCACCT GACCGAGGCC
GAGGGGTTCG AGGCCTTCTG CCAGAAGCGC TATGTCGGCA CGAAGCGCTT CGGGCTGGAG
GGCGAGGACG TCACCATTCC CGCGCTGCAT GCGATGATCG ACCAGGTGGC CAAGGACGGC
GTGCGCACCG TTGCGATCGG CATGCCGCAT CGCGGCCGCC TGAACACGCT GGTAAACGTG
GTGCGCAAGC CCTACACCGC CATCTTCAGC GAATTCGCCG GCGCGTCCTT CAAGCCTGAC
GACGTGCAGG GCTCGGGCGA CGTGAAATAC CATCTCGGCA CCTCGACCGA CGTGGATATC
GACGGCAACC CGGTCCACAT CTCGCTGCAG CCCAACCCGT CGCACCTGGA AGCGGTCGAT
CCGGTGGTGA TCGGCAAGGT GCGCGCGACG CAGGACGATG ACGACCCGCA TGCCCGCAGC
CGCCACATGG CGCTGCTGCT GCATGGTGAC GCCGCCTTCG CCGGCCAGGG CCTGGTGTAC
GAGACCATGG CGATGTCGCA GCTGATCGGC TATCGCACCG GCGGCACCAT CCATGTCGTG
GTGAACAACC AGATCGGCTT CACCACCGTG TCGGCCCATG CCTATTCCGG GCTGTACTGC
ACGGACATCG CCAAGGCGGT GCAGGCGCCG ATCCTGCACG TCAACGGCGA CGAGCCCGAA
GCCGTGGTGT ATTGCGCCCG CCTGGCCGCC GATTTCCGCC AGAAATTCGC GACCGACATC
GTGCTGGACA TCGTGGGCTA CCGCCGTCAC GGCCATAACG AGTCGGACGA GCCGTCCTTC
ACCCAGCCGA CGATGTACAA GGCCATCGCC GCCCGCCCGA CGGTGCGCAC GCTGTATGCC
GACCGCCTGG TGCGCGAAAG CGTGGTGACC GAGGCCGAGG CCACCGCGCA GTGGGATGCG
TTCCAGGACC GGCTGGAGGA ATCCTATCAG GCGGCGCAGA CCTACAAGCC CAACAAGGCC
GACTGGCTGG AAGGTGCCTG GACCGGGCTG AAGCCCCCGC CGGTGGGCGC GGTGGATGCC
GAACCTGCGA CCGGCGTCGC GGTCGAGGCG CTGCGCAAGA TCGGCGAGGC GCTCAGCACC
GCGCCGTCCG ATTTCAACAT CAATCCCAAG ATCGCCCGCC AGCTCAAGGC CAAGGCCGCG
ATGTTCCAGT CGGGCGAAGG CATCGACTGG GCGACCGGCG AGGCGCTGGG CTTCGGCTCC
CTGGTGCTGG AAAAGCATCG CGTCCGCCTG TCGGGCGAGG ACTGCCAGCG CGGCACGTTC
AGCCAGCGCC ATGCCGTGCT GACGGACCAG GTCAACCAGA ACACCTATGT GCCGCTGAAC
AACATCGACG CCGGCCAGGG CGTGTTCGAG GTCTATAACT CGCTGCTTTC GGAATTCGGC
GTGCTGGGCT TCGAATATGG CTATTCGCTG GCCGATCCGA ACGCGCTGGT GCTGTGGGAA
GGCCAGTTCG GCGACTTCGC CAACGGCGCG CAGGTCATCA TCGACCAGTT CATCGCCTCG
GGCGAGACCA AGTGGCTGCG CATGTCGGGC CTGGTGCTGC TGCTGCCGCA CGGGTACGAG
GGCCAGGGTC CGGAACATTC CTCGGCCCGC CTGGAACGGT ACCTGCAGCT CTGCGCCGAA
AACAACATGC GGGTCTGCAA CCTGACCACG CCGGCGAACT ATTTCCACGC CCTGCGCCGC
CAGCTCAAGC TGGACTACCG CAAGCCGCTG GTCATCATGA CGCCGAAATC GCTTCTGCGG
CACAAGCTGG CGGTCTCGAA CCTCGAGGAG TTCGCCTCGG GCACGACGTT CCGTCCGGTG
ATCGGCGAAA TCGATCCGAT CGCCAATGGC GACGCGATCG AACGGGTCGT GATCTGCTCG
GGCAAGGTCT ATTACGACCT GCTGGCCGAA CGGCGCGAGC GTGCGCTGGA CAAGGTCGCG
ATCCTGCGGC TGGAACAGTT CTACCCGTTC CCGGAAAAGC TGCTGGCCGA GCAACTGGCC
CTGTATCCGA AGGCGAAAGT CATCTGGTGC CAGGAAGAGC CCGAGAACAT GGGGGGATGG
ACGTTCGTCG ATCGCCTGAT CGAAGGCGTG ATGGCCAAGG CGGGTCGCAA GGGCGGCCGT
CCGACCTATG TCGGCCGTGT CGCGGCGGCC AGCCCGGCCA CCGGCCTGGC CCGCGTCCAT
GCCAGCGAGC AGGCGGCCCT GGTCGCCCAG GCGCTGGGCG TCGGCTGA
 
Protein sequence
MAGVDILSTA FSGANTAYLA ELYARWASDP GSVDPSFASL FSAMDEEGAA ILHDAEGASW 
SPRESMIDGG EAPPAASKGG PVSVASLHAA ADDSLRATQL IRAYRVRGHL EARLDPLGLQ
IPKPHADLDP ATYGFGLQDL DRPIYLGHIV ANLIGTQTAT INQVLDALRA VYCGPIGAEF
MHVQDPEHRN WLQKRLEGDN WRAGVSADEK KVILHHLTEA EGFEAFCQKR YVGTKRFGLE
GEDVTIPALH AMIDQVAKDG VRTVAIGMPH RGRLNTLVNV VRKPYTAIFS EFAGASFKPD
DVQGSGDVKY HLGTSTDVDI DGNPVHISLQ PNPSHLEAVD PVVIGKVRAT QDDDDPHARS
RHMALLLHGD AAFAGQGLVY ETMAMSQLIG YRTGGTIHVV VNNQIGFTTV SAHAYSGLYC
TDIAKAVQAP ILHVNGDEPE AVVYCARLAA DFRQKFATDI VLDIVGYRRH GHNESDEPSF
TQPTMYKAIA ARPTVRTLYA DRLVRESVVT EAEATAQWDA FQDRLEESYQ AAQTYKPNKA
DWLEGAWTGL KPPPVGAVDA EPATGVAVEA LRKIGEALST APSDFNINPK IARQLKAKAA
MFQSGEGIDW ATGEALGFGS LVLEKHRVRL SGEDCQRGTF SQRHAVLTDQ VNQNTYVPLN
NIDAGQGVFE VYNSLLSEFG VLGFEYGYSL ADPNALVLWE GQFGDFANGA QVIIDQFIAS
GETKWLRMSG LVLLLPHGYE GQGPEHSSAR LERYLQLCAE NNMRVCNLTT PANYFHALRR
QLKLDYRKPL VIMTPKSLLR HKLAVSNLEE FASGTTFRPV IGEIDPIANG DAIERVVICS
GKVYYDLLAE RRERALDKVA ILRLEQFYPF PEKLLAEQLA LYPKAKVIWC QEEPENMGGW
TFVDRLIEGV MAKAGRKGGR PTYVGRVAAA SPATGLARVH ASEQAALVAQ ALGVG