Gene Cwoe_4098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4098 
Symbol 
ID8734560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4352501 
End bp4354288 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content76% 
IMG OID646504725 
Productthiamine pyrophosphate protein domain protein TPP-binding protein 
Protein accessionYP_003395888 
Protein GI284045548 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0963685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACGC CACGGTACGG CTCCGACGTC GTCGTCGAGT TCCTGCGGGC GGCGGGGATC 
GACCACGTCG TGCTGAACCC GGGCGCGACG TTCCGCGCCC TGCACGACTC GCTCGCGGTC
GCGGACGCGC CGGAGCTGAT CGTGGCGTTG CACGAGGAGA TCGCCGTCGC GCTCGCGCAC
GGCTACGCGA AGTCGGCGGG GAGGCCGATG GCGGTCTTCG TGCACGACCA GGTCGGTCTC
CAGCACGCGA CGATGGCGTT GTTCAACGCC TACGTCGACG CGGTCCCGAT GCTCGTGATC
GGCGGGTCGG GGCCGCGCGA CGTCGCGCGG CGGCGGCCGT GGATCGACTG GATCCACAGC
GGCAGCCCGG AGTCGGCCGT GATCCGCGAC GTCGTCAAGT GGGACGACGA GCCGGCGAGC
GTCGAGGCGA TCCCGCTGTC GCTGGAGCGC GCGTGGCGGA TCGCGACGAC GCCGCCGATG
GGGCCGGTCT ACGTCGCGCT GGACATCCTG CTGCAGGAGG CCGACGCGTC GCACATGCCG
GTGCCGAGCG CGCCCCGCCA CCGGCCGCCG CCCGTGACCG CGCCGCGCGC GGTCGTGGCC
GAGCTGGCCG CCGCACTGGC GGCGGCGTCG GACCCGGTCC TGCTCGTCGA CCGCGGCACG
CCCGGCTGCG CCGCGGCGCT CGTCGCGCTG GCGGAGCGGA CCGGCGCGGC GGTCGTCGAC
CTCGGCGGCC GCTCGTTCCC GGCCGAGCAT CCGGCCGACC GCACGTCCGC CGCGGCGGAC
GTGCTCGGCG CCGCCGACCT CGTGCTCGCG ATCGAGCTGC GCGACGTGAC GTGGGGGCTG
ACGACCGTCG ATCTGAAAGA CCGCTCGACG CGCAGCCTGC TGGCGCCCGG CACGCGCGTC
GTCGCGATCG GCCTGACTGA GCTGCGCCAC CGCGGCTTCG TCCAGTTCGA GGCGCTCGCG
GGCGACGTCG AGCCGATCGT CGCCGAGCCG GCGACGTTGC TGGGGGAGCT GGTGGAGGAG
CTGGAGGCGC GCGCAGCTGA CGTCGCTGCC GCCAACGCCG CTGCCGGCGG CGCCGGTGCG
GACGCGGCGC GCATCGCTCC GCCCGCCGCG GCCCGCGAGC GCGCCGAGCG CCACGCCGCC
GCGCACGCGG CCGAGCGCGC CGGCCACCGA GAGCGCGCCG CTTCGCAGGC GGCCGAGCGG
CCGATCGCGC CGGCTCACCT CGCGGCGGTC CTCGGCGACG TGCTGGGGGA GGACGGCTGG
CAGCTCGCCA ATGGCCTGCT GGGCGGCTGG CCGCGGCGGC TGTGGCCGCT GCGCGACGAG
ACGACCTACC TCGGCCGCTC CGGCGGCGAA GGGCTCGGCT ACGGCCTGCC GGCGTCGATC
GGTGCCGCGC TCGCGCAGCG CGACAACGAC GACCTGCTCG TCCTCGACGT CCAGTCCGAC
GGCGACATGA TGTACACGCC GCAGGCGCTC TGGACCGCCG CGCACCACAG GCTGCCGCTG
CTGATCGTCG TGCACAACAA CCGCACGTAC GGCCGCGATG AGACGCACCA GCTGGAGATC
GCGCACGCGC GCGGGCGTCC GATCGAGGTG CCGCCGGAGG GGATCCGGAT CGAGCAGCCG
CACATCGACT TCGCCTCGCT CGCCCGCGCG CAGGGCGTCG AGGCGATCGG CCCGGTCGAG
GATCCGGCCG CGCTCGACGG CGTGCTCGAA GACGCCGCGC GGCGCGTCCG CGACGAGCGC
CGGCCGCTGC TCGTGGACGT GATCTGCTCG CGCGATCTGT CGCGCTGA
 
Protein sequence
MTTPRYGSDV VVEFLRAAGI DHVVLNPGAT FRALHDSLAV ADAPELIVAL HEEIAVALAH 
GYAKSAGRPM AVFVHDQVGL QHATMALFNA YVDAVPMLVI GGSGPRDVAR RRPWIDWIHS
GSPESAVIRD VVKWDDEPAS VEAIPLSLER AWRIATTPPM GPVYVALDIL LQEADASHMP
VPSAPRHRPP PVTAPRAVVA ELAAALAAAS DPVLLVDRGT PGCAAALVAL AERTGAAVVD
LGGRSFPAEH PADRTSAAAD VLGAADLVLA IELRDVTWGL TTVDLKDRST RSLLAPGTRV
VAIGLTELRH RGFVQFEALA GDVEPIVAEP ATLLGELVEE LEARAADVAA ANAAAGGAGA
DAARIAPPAA ARERAERHAA AHAAERAGHR ERAASQAAER PIAPAHLAAV LGDVLGEDGW
QLANGLLGGW PRRLWPLRDE TTYLGRSGGE GLGYGLPASI GAALAQRDND DLLVLDVQSD
GDMMYTPQAL WTAAHHRLPL LIVVHNNRTY GRDETHQLEI AHARGRPIEV PPEGIRIEQP
HIDFASLARA QGVEAIGPVE DPAALDGVLE DAARRVRDER RPLLVDVICS RDLSR