Gene Gdia_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3501 
Symbol 
ID6976953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3833399 
End bp3834523 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content70% 
IMG OID643393021 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_002277840 
Protein GI209545611 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0571139 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGATT CCATTACGCC CATCACGATG CCCAAGTTCG GCCTGGCGAT GACCGAGGGC 
AAGCTGGCCG GCTGGATGGT CCGCCCCGGT GCCAGCGTGA AGGCCGGCGA CGACCTGGCG
GATATCGAGA CCAGCAAGAT CACCAACGCC TATGAAAGCC CGGCCGGCGG CGTGCTGCGC
CGCCAGGTGG CGGCGGAAGG CGACACGTTG CCGGTGGGCG CGCTGATCGG CGTGCTGGCC
GATGCCAGCG TGCCGGATGC CGAGATCGAC GCCTTCATCA CGCGCTTCAA CGCCGAATTC
TCCTCCGGCG CCGGCGGCGA GGCCGAGGAA GCGGCCCCGG AACCGAAACT GGTGGACGTG
CAGGGCAACA GCCTGCGGGT CCTGGACCTG GGCCATGGGG ACGCCACGCC CATCGTGCTG
GTCCACGGCT TCGGCGGTGA TATCGGGAAC TGGCTGTTCA ACCACGCGGC CCTGGCCGCC
GGCCGGCGCG TCATCGCCTT CGACCTGCCC GGGCATGGCG GATCGACGAA GGATGTGGGC
GCGGGGTCGC TGGATTTCTT CGCCGGCATC GTCGTCGGGC TGCTGGACAC GCTGGGCATC
CCGCAGGCCC ACCTGGTGGG GCATTCGCTG GGCGGCGGCG TTGCCTTGAC GGTGGCCCGG
ACCGCACCCG CGCGCGTCGC CTCCCTGGCG CTGATCGCGC CGGCCGGCAT GGGCCCCGAG
ATCAACATGG ACTTCATCAC CGGGTTCATC ACGGCCGACC GGCAGAAGAC GATCCAGCCG
GTCCTGGCCA TGCTGGTCCA TGACAAGACG CTGGTGGGGC GCAAGATGGC GGACGACGTC
CTGCGCTACA AGCGCCTGGA CGGCGCCGTC GCCGCCCTGA CGCAGATTGC CGCCACCTGC
TTCCCCGACG GCAAGCAGGC CGACGACCTG CGGCCCGTGC TGGAACAGGG CGACGTGAGG
GCGCTGATCC TGTGGGGCGA GGACGACGAG ATCCTGCCCG CGAAGCAGTC CCGGGGCCTG
CCGGGCCGCG TCACGATCGA CCTCCTGCCC GGGGTCGGCC ATATGCCGCA GATGGAGCGG
GCGGCCGACA TCAACAAGGC GATCGCGGCG TTCGTCGCGA AATAG
 
Protein sequence
MTDSITPITM PKFGLAMTEG KLAGWMVRPG ASVKAGDDLA DIETSKITNA YESPAGGVLR 
RQVAAEGDTL PVGALIGVLA DASVPDAEID AFITRFNAEF SSGAGGEAEE AAPEPKLVDV
QGNSLRVLDL GHGDATPIVL VHGFGGDIGN WLFNHAALAA GRRVIAFDLP GHGGSTKDVG
AGSLDFFAGI VVGLLDTLGI PQAHLVGHSL GGGVALTVAR TAPARVASLA LIAPAGMGPE
INMDFITGFI TADRQKTIQP VLAMLVHDKT LVGRKMADDV LRYKRLDGAV AALTQIAATC
FPDGKQADDL RPVLEQGDVR ALILWGEDDE ILPAKQSRGL PGRVTIDLLP GVGHMPQMER
AADINKAIAA FVAK