Gene Cwoe_5072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5072 
Symbol 
ID8735538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5420187 
End bp5421158 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content73% 
IMG OID646505697 
ProductPyruvate dehydrogenase (acetyl-transferring) 
Protein accessionYP_003396856 
Protein GI284046516 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0496091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.175509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACG GGGCGGTGCG CATGACCGAC GCCGAGCTGC TCGGGATGCT GCGGCGGATG 
ATCGAGATCC GCGGCTTCGA GGACGAGATC CAGCGCGCGT TCACGAAGAA CCTCGTACGC
GGCTCGACGC ACCTCTGCAA CGGCCAGGAG GCGTGCGTCG TCGGCGCCTG CGGCGCGCTG
CGCGAGGGCG ACTCGATGGT CTGCACCTAC CGCGGCCACG GCGCCGTGCT GGCGATGGGC
GCGCCGCTGG AGGGGACGTT CGCCGAGATC CTCGGCCGCG AGACGGGCCT CTGTCGCGGC
AAGGGCGGCT CGATGCACCT GACCGACGTC GGCGTCGGCG CCTACGGCTC GTTCGCGATC
GTCGGCGCGC ACCTGCCGAT CGCGACCGGC CTCGCCCTCG CCGCCAAGCT CGACAGAAGC
GAGGCCGTCA GCCTCTGCTT CTTCGGCGAC GGCAGCATGA ACATCGGCGC GGTCCACGAG
GCGATGAACC TCGCCGGGAT CTGGAAGCTG CCGGTGATCT TCTTCTGCGA GAACAACCTC
TACGGCGAGT ACTCGCCGCT CGCCACGACG ACGCCGGTCG AGGAGCTGGC CGCGCGCGCG
GCCGGCTACG GGATGCCGGG CGTGCGCGTC GACGGCAACG ACGTCGTCGC CGTCCACGCG
GTCGTCTCCG AGGCCGTCCG GCGCGCCCGC TCCGGCGAGG GGCCGACGTT CGTCGAGGGC
CTGACCTACC GCCACCGCGG CCACTCGCGC ACCGACCCGG CGAGATACCG GCCGGAGGGC
GAGCTGGAGC GGTGGCTGGA GCTGGACCCG ATCCCGCGGC TGGAGGCGCT GCTGCGCGAG
CGCGGCGTCG CGGACGGCGC CGTCACGCAG GCGCGCGCCG ACGCGGAGGA GGCCGTCGCG
ACGGCGTACG CGGCGGCGCT CGCCGCGCCC GCGCCCGGCC TGGAGCTGAT CTACGAGGAC
GTCTACGCAT GA
 
Protein sequence
MKDGAVRMTD AELLGMLRRM IEIRGFEDEI QRAFTKNLVR GSTHLCNGQE ACVVGACGAL 
REGDSMVCTY RGHGAVLAMG APLEGTFAEI LGRETGLCRG KGGSMHLTDV GVGAYGSFAI
VGAHLPIATG LALAAKLDRS EAVSLCFFGD GSMNIGAVHE AMNLAGIWKL PVIFFCENNL
YGEYSPLATT TPVEELAARA AGYGMPGVRV DGNDVVAVHA VVSEAVRRAR SGEGPTFVEG
LTYRHRGHSR TDPARYRPEG ELERWLELDP IPRLEALLRE RGVADGAVTQ ARADAEEAVA
TAYAAALAAP APGLELIYED VYA