Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3802 |
Symbol | |
ID | 7874044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4194955 |
End bp | 4196082 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643700744 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_002890768 |
Protein GI | 237654454 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTCC TGATCACCGG CGCCGGCGGC TTCGTCGGCA AGAACCTGCA GCAGCACCTG GCCGAGCGCA AGGACGTGGA GGTGGTGTGC TTCACGCGTG CCAACACCGT GGCCGAGCTG CCGCGCCTGC TCGACGGGGT GGAGTTCGTC TTCCACCTGG CCGGGGTGAA TCGACCGCAG GATCCGCAGG AGTTCGTCAC CGGCAACGCC GACCTGACCG CGGCGCTGGT GGCGGCGGTG GAAGGTGAGA TGCAGGCAAG CGGGCGCAGG ATCGCCATCG TCTGCAGCTC GTCCACCCAG GCCGCCCGCG ACAACCCCTA CGGCGCCAGC AAGCGCGCGG CCGAAGCGGC GCTGCAGGCC TTTGCGGCGC GCAGCGGCGC GGCGGCGCAC GTCTTCCGCC TGCCCAACGT GTTCGGCAAG TGGTGCCGGC CGAACTACAA CTCGGCGGTG GCGACCTTCT GCCACAACAT CGCCCGTGGG CTGCCGATCC AGATCAACGA CCCGGCCGCA CCGGTGACGC TGGTGTATGT GGACGACGTG GTCGAGCGCT TCATCGAGCT GATGGACGGC GCCGATGCGG CGGTGGACGC CGAGGGCTTC GCCACGGTGA CGCCGCAATA CACCACCACG GTGGGCGAGC TGGCGCGGTT GATCGAGACC TTCCGCGCCA GCCGCGACAC GCTGGTGACC GAACGCGTGG GCACCGGCCT GGTGCGCGCG CTGTATTCCA CCTACGTGAG CTACCTCCCG CCGGAACTCT TCGCCTACTC CGTGCCGATG CACGGCGACG CGCGCGGCGT GTTCGTGGAG ATGCTGAAGA CGCCCGACTG CGGGCAGTTC TCCTTCTTCA CCGCGCACCC GGGCATCACC CGCGGCGGCC ACTACCACCA CACCAAGACC GAGAAGTTCC TGGTGATCAA GGGCGAGGCC CGCTTCAAGT TCCGCCACAT GCAGACGGGC GAGACGCACG AGCGGGTGAC CAGCGGCAGC AAGGCCGAGA TCGTGGAGAC GGTGCCGGGG TGGACGCACG ACATCACCAA CATCGGCAGC GACGAGATGG TGGTGATGCT GTGGGCGAAC GAGGTGTTCG ACCGGGCGAA GCCGGATACG TATGCGTGTC CGTTGTAA
|
Protein sequence | MKVLITGAGG FVGKNLQQHL AERKDVEVVC FTRANTVAEL PRLLDGVEFV FHLAGVNRPQ DPQEFVTGNA DLTAALVAAV EGEMQASGRR IAIVCSSSTQ AARDNPYGAS KRAAEAALQA FAARSGAAAH VFRLPNVFGK WCRPNYNSAV ATFCHNIARG LPIQINDPAA PVTLVYVDDV VERFIELMDG ADAAVDAEGF ATVTPQYTTT VGELARLIET FRASRDTLVT ERVGTGLVRA LYSTYVSYLP PELFAYSVPM HGDARGVFVE MLKTPDCGQF SFFTAHPGIT RGGHYHHTKT EKFLVIKGEA RFKFRHMQTG ETHERVTSGS KAEIVETVPG WTHDITNIGS DEMVVMLWAN EVFDRAKPDT YACPL
|
| |