Gene Information |
Locus tag | Tmz1t_2876 |
Symbol | |
ID | 7873778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3114027 |
End bp | 3115094 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699797 |
Product | CDP-glucose 4,6-dehydratase |
Protein accession | YP_002889852 |
Protein GI | 237653538 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02622] CDP-glucose 4,6-dehydratase |
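
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, which the following Python sketch spells out. It assumes 1-based, inclusive coordinates (the convention under which the listed values agree) and a single terminal stop codon that is not counted in the protein. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one untranslated terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# Values taken from the record above.
length = gene_length(3114027, 3115094)
print(length, protein_length(length))
```

With the coordinates in this record, the sketch reports 1068 bp and 355 aa, matching the Gene Length and Protein Length fields.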
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACC TGGTGAGCCC TGCTTTCTGG CGGGGGCGCC GCGTCTTCGT CACGGGTCAT ACCGGCTTCA AGGGCGGTTG GCTCAGCCTG TGGCTGCAGA GCATGGGGGC GGATGTTTGC GGCTTCGCAT TGCCGCCCGC CGCCGCGCCG GCCTTGTTCC ATGTCGCAGA TGTCGAGCGC GGCATGCACT CCGTGCTCGG CGATGTGCGG GACTATGAAA GGTTGAGACA GGCACTCGCG GCAGCAAGGC CGGAGATCGT GCTTCATCTC GCCGCGCAAC CGCTCGTGCC CTACTCGTAC GCCGAACCTG TCGAGACCTT CTCGACCAAC GCCATGGGCA CCGTGAATCT GCTCGAGGCC TGTCGGCACC AGCCCGACCT GAAAGCCGTC GTCGTGGTCA GCAGCGACAA GTGCTACGAA AACCGGGAGC AACTCTGGGG CTATCGTGAG ACGGATCCGA TGGGCGGATA CGACCCGTAT TCGGCCAGCA AGGGCTGCAC TGAACTCGTC GTCGCGTCGT ACCGTCGCTC CTTCCTCGCC GCTCGCGGCG TTGCCCTCGC CAGCGCTCGG GCCGGCAACG TGATCGGAGG CGGAGACTGG ACCCCCAGTC GTCTGGTGCC CGACGTCCTG GCGGCCTTTG CACGCAACGA AGCCGTTGTT CTGCGCAATC CTGACGCGAT CCGGCCCTGG CAACACGTGC TGGAGCCCTT GGCGGGTTAT CTGCTCTTGG CCCAGCACCT CGTCGAGCAT GGAGAGGCCT TCGCCGAGGG CTGGAACTTT GGGCCCGACG AAACCGACGC GCGTACAGTC GCGTGGATCG TGGAGATGCT TGCCGCCGGC TGGGGCGCCG GTGCCGACTG GCAGCCATCC GGCGAGCCGC GGATCCATGA GGCGCACACC CTCAAGCTGG ACTGCACCAA GGCCCGCGTC CGGCTGGGCT GGCGGCCTCG CTGGCAAGCG GAAGACGCAG TCACCCGCAG CCTCGCCTGG TATCAAGCCT GGCGTGCCGG TGCCGATATG CATCGATACA CTCTCGACGA AATCTCCGCA TTCACAGCCA GCTCATGA
|
Protein sequence | MENLVSPAFW RGRRVFVTGH TGFKGGWLSL WLQSMGADVC GFALPPAAAP ALFHVADVER GMHSVLGDVR DYERLRQALA AARPEIVLHL AAQPLVPYSY AEPVETFSTN AMGTVNLLEA CRHQPDLKAV VVVSSDKCYE NREQLWGYRE TDPMGGYDPY SASKGCTELV VASYRRSFLA ARGVALASAR AGNVIGGGDW TPSRLVPDVL AAFARNEAVV LRNPDAIRPW QHVLEPLAGY LLLAQHLVEH GEAFAEGWNF GPDETDARTV AWIVEMLAAG WGAGADWQPS GEPRIHEAHT LKLDCTKARV RLGWRPRWQA EDAVTRSLAW YQAWRAGADM HRYTLDEISA FTASS
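
The protein sequence above is the translation of the gene sequence under translation table 11. A minimal Python sketch of that step, together with a GC-fraction helper, follows. It assumes the spaces between 10-character groups are display formatting to be stripped, and the function names are hypothetical. Table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), so the standard codon table is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGGAAAACC TGGTGAGCCC TGCTTTCTGG"))
```

Run on the first three 10-base groups of the gene sequence, the sketch prints MENLVSPAFW, the first ten residues of the listed protein. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.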