Gene Tmz1t_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3802 
Symbol 
ID7874044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4194955 
End bp4196082 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content68% 
IMG OID643700744 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002890768 
Protein GI237654454 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTCC TGATCACCGG CGCCGGCGGC TTCGTCGGCA AGAACCTGCA GCAGCACCTG 
GCCGAGCGCA AGGACGTGGA GGTGGTGTGC TTCACGCGTG CCAACACCGT GGCCGAGCTG
CCGCGCCTGC TCGACGGGGT GGAGTTCGTC TTCCACCTGG CCGGGGTGAA TCGACCGCAG
GATCCGCAGG AGTTCGTCAC CGGCAACGCC GACCTGACCG CGGCGCTGGT GGCGGCGGTG
GAAGGTGAGA TGCAGGCAAG CGGGCGCAGG ATCGCCATCG TCTGCAGCTC GTCCACCCAG
GCCGCCCGCG ACAACCCCTA CGGCGCCAGC AAGCGCGCGG CCGAAGCGGC GCTGCAGGCC
TTTGCGGCGC GCAGCGGCGC GGCGGCGCAC GTCTTCCGCC TGCCCAACGT GTTCGGCAAG
TGGTGCCGGC CGAACTACAA CTCGGCGGTG GCGACCTTCT GCCACAACAT CGCCCGTGGG
CTGCCGATCC AGATCAACGA CCCGGCCGCA CCGGTGACGC TGGTGTATGT GGACGACGTG
GTCGAGCGCT TCATCGAGCT GATGGACGGC GCCGATGCGG CGGTGGACGC CGAGGGCTTC
GCCACGGTGA CGCCGCAATA CACCACCACG GTGGGCGAGC TGGCGCGGTT GATCGAGACC
TTCCGCGCCA GCCGCGACAC GCTGGTGACC GAACGCGTGG GCACCGGCCT GGTGCGCGCG
CTGTATTCCA CCTACGTGAG CTACCTCCCG CCGGAACTCT TCGCCTACTC CGTGCCGATG
CACGGCGACG CGCGCGGCGT GTTCGTGGAG ATGCTGAAGA CGCCCGACTG CGGGCAGTTC
TCCTTCTTCA CCGCGCACCC GGGCATCACC CGCGGCGGCC ACTACCACCA CACCAAGACC
GAGAAGTTCC TGGTGATCAA GGGCGAGGCC CGCTTCAAGT TCCGCCACAT GCAGACGGGC
GAGACGCACG AGCGGGTGAC CAGCGGCAGC AAGGCCGAGA TCGTGGAGAC GGTGCCGGGG
TGGACGCACG ACATCACCAA CATCGGCAGC GACGAGATGG TGGTGATGCT GTGGGCGAAC
GAGGTGTTCG ACCGGGCGAA GCCGGATACG TATGCGTGTC CGTTGTAA
 
Protein sequence
MKVLITGAGG FVGKNLQQHL AERKDVEVVC FTRANTVAEL PRLLDGVEFV FHLAGVNRPQ 
DPQEFVTGNA DLTAALVAAV EGEMQASGRR IAIVCSSSTQ AARDNPYGAS KRAAEAALQA
FAARSGAAAH VFRLPNVFGK WCRPNYNSAV ATFCHNIARG LPIQINDPAA PVTLVYVDDV
VERFIELMDG ADAAVDAEGF ATVTPQYTTT VGELARLIET FRASRDTLVT ERVGTGLVRA
LYSTYVSYLP PELFAYSVPM HGDARGVFVE MLKTPDCGQF SFFTAHPGIT RGGHYHHTKT
EKFLVIKGEA RFKFRHMQTG ETHERVTSGS KAEIVETVPG WTHDITNIGS DEMVVMLWAN
EVFDRAKPDT YACPL