Gene Cwoe_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1034 
Symbol 
ID8731469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1090536 
End bp1091534 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content72% 
IMG OID646501652 
ProductTransketolase central region 
Protein accessionYP_003392842 
Protein GI284042502 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.775121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.29633 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGA CGATCGACCA GACCACCACG GTCACGTACA AGCAGGCGAT CACGCGGGCG 
CTGGCCGACG CGATGGAGGA GGACGCGCGC GTCTGCCTCT TCGGCGAGGA CGTCGCGGCG
GCGGGCGGCG TCTTCAAGGT CACCGACGGG CTCCACGAGC GCTTTGGCGA ACGGCGCGTG
CGCGACACGC CGATCGCTGA GCAGGCGATC ATCGGCACCG CGATCGGCGC GGGTCTGTCC
GGCCTGCGTC CCGTCGCCGA GATCATGTTC GCCGACTTCG CCGGCGTCTG CTTCGACGGG
ATCGCCAACG AGCTGGCCAA GTACCGCTAC ATGACCGGCG GGCAGGCCGC GATGCCGGTG
ACCGTCCGGC TCGGCAACGG CGCCGGCGGC GGCTTCGGTG CGCAGCACTC GCAGTCGGTC
GAGAACTGGT TCCTCAACGT GCCGGGTCTG AAGATGGTCG CGCCGGCGAC GCCGGCTGAC
GCATACGGGC TGCTGCGGGC GGCGATTCGC GATCCCGACC CGGTCCTCTA CTTCGAGCAC
AAGAACCTCT ACGGCGCGAG AGGCGAGCTG GCGGCGGACC CCGAGATCCC GCCGATCGGC
AAGGCGGCCG TGGTGCGCGC TGGTACCGAC GTGACGCTGG TGGCCACGCA GCTGATGCGG
CTGCGCGCCG AGGAGGCGGC CGAGCTGCTG GCGCGCGAGG GCACCTCGGT CGAGCTGATC
GACCCGCGCA CGATCGCCCC GCTCGACGTC GAGACGATCG CCGCCTCGCT GGCGCGCACG
AACCGCCTCG TCGTCGCGCA GGAGTGCAGC CACGCGGGCA GTTGGGGCGC CTCGCTGGTC
TCGAGCCTGG TCGCCGAGCA CTTCGAGTCG CTCGACGCGC CGCCGCTGGT CGTGAGCGGC
GAGGAGACCC CGATCCCCTA CGCGACTCCG TTGGAGGCGC TGTGGATCCC AAGCGTCGAG
CGGATCGCCG ACGGCGTGCG CCGGGCGCTC GCGTCCTGA
 
Protein sequence
MSATIDQTTT VTYKQAITRA LADAMEEDAR VCLFGEDVAA AGGVFKVTDG LHERFGERRV 
RDTPIAEQAI IGTAIGAGLS GLRPVAEIMF ADFAGVCFDG IANELAKYRY MTGGQAAMPV
TVRLGNGAGG GFGAQHSQSV ENWFLNVPGL KMVAPATPAD AYGLLRAAIR DPDPVLYFEH
KNLYGARGEL AADPEIPPIG KAAVVRAGTD VTLVATQLMR LRAEEAAELL AREGTSVELI
DPRTIAPLDV ETIAASLART NRLVVAQECS HAGSWGASLV SSLVAEHFES LDAPPLVVSG
EETPIPYATP LEALWIPSVE RIADGVRRAL AS