Gene Tmz1t_1116 details

Gene Information

Locus tag: Tmz1t_1116
Symbol:
ID: 7084645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1220586
End bp: 1221863
Gene length: 1278 bp
Protein length: 425 aa
Translation table: 11
GC content: 64%
IMG OID: 643698131
Product: nucleotide sugar dehydrogenase
Protein accession: YP_002354771
Protein GI: 217969537
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
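
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the terminal stop.

    Assumes the gene length includes exactly one stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1220586, 1221863)
print(length, protein_length_aa(length))  # prints: 1278 425
```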


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.416504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACACCC TCGACACCCT CAAGCTCGCC ATCATCGGCC TCGGTTACGT CGGCCTGCCG 
CTCGCGGTCG AATTCGCCAA GAAGCGCTCC GTCGTCGGCT TCGACATCAA CCAGGCCCGC
ATCGACGCCC TCAAGACCGG CCACGACGCC ACCCTCGAGG TCTCCGACGA AGAACTGCGC
GAAGCCACCG GCCTGCAATA CAGCGCCAAC CTGCAGGACC TCGCCGCCTG CAACACCTTC
ATCGTCACCG TCCCCACCCC CATCGACGAG CACAAGCAGC CCGACCTCAC CCCGCTGGTC
AAGGCCAGCG AGACCATCGG CAAGGTGCTC AAGAAGGGCG ACATCGTCAT CTACGAATCC
ACGGTCTACC CCGGCGCCAC TGAGGAAGAC TGCGTCCCGG TGCTGGAGAA GTTTTCCGGC
CTCAAATTCA ACGTCGACTT CTACGCCGGC TACAGCCCCG AGCGCATCAA CCCGGGCGAC
AAGGAACACC GCGTCTCCAC CATCAAGAAG GTCACCTCCG GCTCCACCCC CGAAGTGGCC
GAGCTGGTCG ACCAGCTCTA CCGCCAGATC ATCGTCGTCG GCACCCACAA GGCCGAAAGC
ATCAAGGTGG CCGAAGCCGC CAAGGTCATC GAGAACACCC AGCGCGACGT CAACATCGCC
CTCATCAACG AGCTGGCCAT CATCTTCAAC AAGATGGGCA TCGACACCGA GGCCGTGCTG
CAGGCCGCCG GCAGCAAGTG GAACTTCCTG CCCTTCCGTC CGGGCCTGGT CGGCGGCCAC
TGCATCGGCG TGGACCCCTA CTACCTCACC CACAAGGCGC AGTCCATCGG CTACCACCCC
GAGATCATCC TCGCCGGCCG CCGCCTCAAC GACGGCATGG GCGCCTACGT GGTGTCGCAG
CTCGTCAAGG CCATGCTCAA GCGCCGCATC ACCGTCGAAG GCGCGCGCGT GCTGGTCATG
GGCCTCACCT TCAAGGAAAA CTGCCCGGAC CTGCGCAACA CCCGCATCGT CGACATCGTC
AAGGAACTCG GCGAGTACAA CATCCAGGCC GACGTGTACG ACCCGTGGGT GGACGTGGCC
GAGGCCCAGC ACGAATACGG GCTCACTCCG ATCGACAAGC CGGAGCCCGG CGCCTACGAC
GCGATCATCG TCGGCGTGGC GCATCAGCAG TTCAAGGACA TGGGAGCCGA GGCCATCCGC
GCGCTCGGCA AGCCGGAACA TGTGGTGTAT GACCTCAAGT ATGTGATGCC GAGGAATGCG
GCGGATCTGC GACTTTAA
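
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The sketch below is one possible way to do so, assuming the sequence is pasted as a string in the 10-base block layout used here; `gc_fraction` is a hypothetical helper, not part of any database tool.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks.

    Hypothetical helper: assumes the input contains only A, C, G, T
    plus whitespace from the block layout.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 6 of its 10 bases are G or C.
print(gc_fraction("ATGCACACCC"))  # prints: 0.6
```

Passing the full gene sequence as a single multi-line string yields the gene-wide fraction, which can then be compared with the rounded percentage in the record.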
 
Protein sequence
MHTLDTLKLA IIGLGYVGLP LAVEFAKKRS VVGFDINQAR IDALKTGHDA TLEVSDEELR 
EATGLQYSAN LQDLAACNTF IVTVPTPIDE HKQPDLTPLV KASETIGKVL KKGDIVIYES
TVYPGATEED CVPVLEKFSG LKFNVDFYAG YSPERINPGD KEHRVSTIKK VTSGSTPEVA
ELVDQLYRQI IVVGTHKAES IKVAEAAKVI ENTQRDVNIA LINELAIIFN KMGIDTEAVL
QAAGSKWNFL PFRPGLVGGH CIGVDPYYLT HKAQSIGYHP EIILAGRRLN DGMGAYVVSQ
LVKAMLKRRI TVEGARVLVM GLTFKENCPD LRNTRIVDIV KELGEYNIQA DVYDPWVDVA
EAQHEYGLTP IDKPEPGAYD AIIVGVAHQQ FKDMGAEAIR ALGKPEHVVY DLKYVMPRNA
ADLRL
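
The protein sequence can be cross-checked against the gene sequence by translating it. Translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ only in permitted start codons), so a standard codon table suffices for this sketch. The `translate` function and its whitespace handling are assumptions made for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Hypothetical helper: whitespace is ignored, and trailing bases that
    do not complete a codon are dropped.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCACACCC TCGACACCCT CAAGCTCGCC"))  # prints: MHTLDTLKLA
# Last line of the gene sequence, which ends in the TAA stop codon.
print(translate("GCGGATCTGC GACTTTAA"))  # prints: ADLRL
```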