Gene Information |
Locus tag | Cwoe_4046 |
Symbol | |
ID | 8734507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 4297240 |
End bp | 4299003 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646504674 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_003395838 |
Protein GI | 284045498 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
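The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The helper name cds_lengths is hypothetical and is not part of any database tool.

```python
# Values copied from the record above.
START_BP = 4297240
END_BP = 4299003


def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1764 587
```

Under these assumptions the result agrees with the Gene Length (1764 bp) and Protein Length (587 aa) fields.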
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.26689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.848458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCCC GAACCGGCGG CGAGATCGTC GCCGACCACC TGATCGCGGA GGGCGTCCCG TACGTGCTCG GCATCCCCGG CCACGGCGAC ATGGGCGTCT TCGACGCGTT CAAGGACCGC ACCGACAGAA TCGAGACGAT CCAGGTCCGG CACGAGCAGT CGGCCGGCCA CATCGCCGAC GCCTACTACC GCGTCAAGGG CGAGCCGCTC GCGACGATCA CCTCGATCGG CCCCGGCTCC GCCAACATGG CGATGGCGCT CGCGACGGCG TACGTCGACT CGCAGGCGAT GATCTCGATC ACCGGCGCCG TGCAGACGTA CATGGACGGC TGCGGCGTGC TGCAGGAGAT CGAGCGCCAG CGCGACGCCG ACTTCTCCAG CATGCTGCGC CCGGTCGTCA AGCGCGGCTT CTACCCGCAC CGCGTCGACC AGGTCCAGCG CGTGATGCAC CGCGCGTTCA ACTCGGCTGT CACCGGCCGC CCGGGCCCGG TCCACATCGA GCTGCCGATC GACGTCCAGT CGCACGCCGC CGAGATCCCC GAGATCGACC TCGGCAACCA CCGCGCGACG CACACCGTCG TCCACCCCGA CCCGGACGGC ATCGAGCAGA TCGCCGCGAT CCTCGCCGGC GCCAAGCGCC CGGTGATCCT CGCCGGCGGC GGCTGCATCG CGGCCCGCGC GCACGCTGAG CTGCGCTCGG TCGCCGAGCT GCTCGGCGCG CCGGTCATCA CGACGATGAT GGGCAAGGGC GTCTTCCCCG AGGACCACCC CCTCTGCGCC CAGCACACCG GCGCCAACGG CACCGCGTGC GGCAACCAGA TCGCCAGAAC CGCCGACGTG ATCCTCGCGA TCGGCACGCG CTTCGCCGAG CAGAACGCGT CCTCCTACCA GTTCGGCGCG TCGTACTCGA TGCCGGGCAC GACGCTGCTG CACGTCGACG TCGACCCGCG CGAGATCGGC AAGAACTACC CGACCGAGGT CGGCGTCGTC TCCGACGCGC GCACCGGCCT CGCCGCGCTC CACACGGCGC TCCAGGAGCG GTCGCTGCCG GACCTGACCG ACTACCAGGC CGAGGTCACG GCCGAGCGCG AGAAGTGGGA CGCGATGGTG CGCGGCCGCT GGACCGCGAA CGGCCTGTCG CTGACGAAGT CGCTCGCCGC GCTGCGCGAG CTGATGCCGC GCGAGGGGAT CGTGATCGCG TCCGCCGGCC ATCCGCAGAT CCAGGCGTTC CAGGAGTACC CAGCGTACGA GCCGCGCACG TGGCTGACGC CCGGCGGCTA CTCGACGATG GGCTTCACCG TGCCGGCCGC GATCGGCGCC AAGCTCGCTG CGCCGGACGT GCCGGTCGTC GGCGTCGCCG GCGACGGCGA CTTCCTGATG ACGCTGCAGG AGCTGGCGCT CGCCGTCCAG CTGAACCTGA ACGTCGTCTA TCTCGTGATG AACAACGCCG GCTGGACCTC GATCCGTGAC TTCCAGCGCG GGCTGTTCGG CGAGGACCGC GCCTTCTTCA CCGAGTTCCG CGGCCGCAGA GGCGACCTTC AGACGCCCGA CTTCACCGCG ATCGCGCAGG GCTTCGGAGC GACCGGGATC AAGGTCGAGT CGTTCGACCA GCTCAAGCCG GCGCTGGAGC GTGCGCTGCG GACCGAGGGC CCGGTCGTCG TCGAGGCGAT GCAGGACCGC GACCCCGCCA ACACGACAGG CATCAACAGC GGCTACTGGG ATCTGCCGAA GCCCGAATAC CTGACCGCTG AGGGAGCGCG GTAG
|
Protein sequence | MSSRTGGEIV ADHLIAEGVP YVLGIPGHGD MGVFDAFKDR TDRIETIQVR HEQSAGHIAD AYYRVKGEPL ATITSIGPGS ANMAMALATA YVDSQAMISI TGAVQTYMDG CGVLQEIERQ RDADFSSMLR PVVKRGFYPH RVDQVQRVMH RAFNSAVTGR PGPVHIELPI DVQSHAAEIP EIDLGNHRAT HTVVHPDPDG IEQIAAILAG AKRPVILAGG GCIAARAHAE LRSVAELLGA PVITTMMGKG VFPEDHPLCA QHTGANGTAC GNQIARTADV ILAIGTRFAE QNASSYQFGA SYSMPGTTLL HVDVDPREIG KNYPTEVGVV SDARTGLAAL HTALQERSLP DLTDYQAEVT AEREKWDAMV RGRWTANGLS LTKSLAALRE LMPREGIVIA SAGHPQIQAF QEYPAYEPRT WLTPGGYSTM GFTVPAAIGA KLAAPDVPVV GVAGDGDFLM TLQELALAVQ LNLNVVYLVM NNAGWTSIRD FQRGLFGEDR AFFTEFRGRR GDLQTPDFTA IAQGFGATGI KVESFDQLKP ALERALRTEG PVVVEAMQDR DPANTTGINS GYWDLPKPEY LTAEGAR
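The sequence fields are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how such a field could be cleaned, checked for GC content, and translated. The function names are hypothetical. The codon table is the standard one, which translation table 11 shares for elongation; alternative start codons permitted by table 11 are not handled here, which does not matter for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAGCTCCC GAACCGGCGG CGAGATCGTC"
print(translate(prefix))  # MSSRTGGEIV
```

The translated prefix matches the first block of the protein sequence. Passing the full Gene sequence field to gc_fraction would give a value to compare against the GC content field.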
|
| |