Gene Cwoe_5079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5079 
Symbol 
ID8735545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5428433 
End bp5429779 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content77% 
IMG OID646505704 
Productcatalytic domain of components of various dehydrogenase complexes 
Protein accessionYP_003396863 
Protein GI284046523 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.490323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.370367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCA CGCTGACCGC GGTCGCGATC ACGATGCCGA AGCTGTCGGA CTCGATGGAG 
GAGGGGACGA TCGCCGCGTG GCTGAAGGCC CCGGGCGACC CGGTCGCCGT CGGCGACGCG
CTCGCCGAGA TCGAGACCGA CAAGGCGACG ATGACCTACG AGGCCGAGCA CGCCGGCGTG
ATGGGCGAGC TGCTGGCGGC CGAGGGCGAA GCGGTCGCGC TCGGCGCGCC GATGGCGCAG
CTGCTGGTGG AGGGCGGCGC TGCGGAGGCG GCCGCTGCCC CGGCCGCCGC GCCCGCTGCG
GCCGCGAGCG AGGCCGCCGC GCCCGCCCCC GCGAGCGCCG CTGGCCCGGC CGCCGCGCCC
GCTCCGCCCG CGGGCCCCGC GCCGCTCGCC CACGCCGCGC CGCTCGCCCC CACCGCCGCC
GCCCGCGTGA GCGCGTCGCC GGTCGCGCGG CGGATCGCGC GGGAGCTGGG CGTCGACCTC
GCGACCGTCC GCGGAAGCGG GCCGCGCGGG CGGATCGTGC GGCGCGACGT GGAGCAGATC
GCGGCCGCGT CACCGGCCGC GCTCACGCCG CCGGCCGCCG CCTCGCCCGC GGCGGCCCCG
CCCCTCGCGC GCCCCACCGT CACCACCGCT CCCGCCGACG AGCACGTCGC GCTCAGCAGC
GTGCAGCGGA CGATCGCGAA GCGGATGGTC GCCTCGCGCA CGGAGATCCC GGAGTTCACG
CTCGTCGCCG AGGTCGACAT GACCGCGGCG CTGCGGCTGC GGCGGGAGCT GCGCGAGGCG
CGGCCCGACG CGCCGATCTC CGTCAACGAC CTCGTCGTGA AGGCAGCCGC GCTCGTGCTG
CGCGAGCAGC CGGTGCTCAA CGCCTCGTGG GCGGGCGACC ACGTCGTGCG GCACGCGCGC
GTGAACGTCG GGATCGCGGT CGCGGCCGAG GGTGCGCTGC TGGTGCCGAC GATCTTCGAC
GCCGACGTGC GCGGCGTCGC GGAGATCGCG GCGTCGGCGC GTGCCGCCGG CGAGCGCGCC
CGGAGCGGCA GAGCGACCCC CGCGGAGCTG AGCGGCGGGA CGTTCACCGT CACGAACCTC
GGGATGTTCG GCGTGCAGCA GTTCCATGCC GTGATCAACG CGCCGCAGGT CGCGATCCTC
GCGGTCGGCG GCGTCAGACG CACCCCCGCG TTCGCGCCCG ACGGCGCGGT CGTCGCACAG
GAGCTGATGC ACGTCAGCCT CAGCTGCGAC CATCGCGCCG TCTACGGCGC CGACGCCGCG
CGTTTCCTCG CCCGGCTGCG CGAGGTCCTG GAGCAGCCGT TGTCGTTGTT GTTGCCTGCG
GGCGTGGAAG AGGGAGCATC ACGGTGA
 
Protein sequence
MSATLTAVAI TMPKLSDSME EGTIAAWLKA PGDPVAVGDA LAEIETDKAT MTYEAEHAGV 
MGELLAAEGE AVALGAPMAQ LLVEGGAAEA AAAPAAAPAA AASEAAAPAP ASAAGPAAAP
APPAGPAPLA HAAPLAPTAA ARVSASPVAR RIARELGVDL ATVRGSGPRG RIVRRDVEQI
AAASPAALTP PAAASPAAAP PLARPTVTTA PADEHVALSS VQRTIAKRMV ASRTEIPEFT
LVAEVDMTAA LRLRRELREA RPDAPISVND LVVKAAALVL REQPVLNASW AGDHVVRHAR
VNVGIAVAAE GALLVPTIFD ADVRGVAEIA ASARAAGERA RSGRATPAEL SGGTFTVTNL
GMFGVQQFHA VINAPQVAIL AVGGVRRTPA FAPDGAVVAQ ELMHVSLSCD HRAVYGADAA
RFLARLREVL EQPLSLLLPA GVEEGASR