Gene Tmz1t_3393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3393 
Symbol 
ID7873884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3712121 
End bp3713269 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content69% 
IMG OID643700332 
Productradical SAM enzyme, Cfr family 
Protein accessionYP_002890364 
Protein GI237654050 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACTC CCAATCCGGT CAATCTGCTC GATTTCGACG TAGACGGCCT CGTCGCCTGG 
TTCGCCGGGC TGGGCGAGAA GCCGTTCCGC GCCCGCCAGG TGATGCGCTG GATGCATCAC
GAGGGCTGCG ACGACTTCGA CGCCATGACC GACGTCGCCA AGTCCCTGCG CGCGAAGCTG
AAGGACCTCG CCGTGATCCG CCCGCCGGTG CCGGTGCGCG ACTCGATCTC GGCCGACGGC
ACGCGCAAGT GGCTGCTCGA CGTGGGCAAC GCCAACGCGG TCGAGACCGT GTTCATTCCC
GAGACCAGCC GCGGCACGCT GTGCGTGTCC TCGCAGGCGG GCTGCGCGCT CGACTGCGCG
TTCTGCTCCA CCGGCAAGCA GGGCTTCAAC CGCAACCTGT CGGCCGCGGA AATCATCGGC
CAGCTCTGGC TGGCCAACAA GCTGCTCGGC GCGGCGCGGG CCGACGCCGA GGAGCACGCC
ACGGACCTCG AGGCCGGCGA GAAGGACAAC GGCCGCATCA TCAGCAACGT GGTGATGATG
GGCATGGGCG AGCCGCTCGC CAACTTCGAC AACGTCGTCA CCGCGCTGCG CCTCATGCTG
GACGACCACG CCTACGGCCT GTCGCGCCGC CGGGTCACGG TGTCGACCTC GGGCATCGTG
CCGGCGATGG ACCGCCTGCG CGACGAATGT CCGGTGGCGC TGGCGGTGTC GCTGCATGCG
TCCAACGACG CGCTGCGCGA CCGCCTCGTG CCGATCAATC AGAAGTACCC GCTGCGCGAG
CTCATGGCCG CCTGCCAGCG CTACCTCGAG CGCGCGCCGC GCGACTTCGT CACCTTCGAG
TACGTGATGC TCGAAGGCGT CAACGACAGC GACGCGCACG CTCGCGAACT CGTCGCGCTG
GTGCGCGACA CGCCGTGCAA GTTCAACCTG ATCCCGTTCA ACCCCTTCCC GGACTCCGGC
TTCCAGCGCT CGCCCGCGGA GCGTATCCGG CGTTTTGCCG GTATCCTGAT CGACGCCGGT
ATCGTGACCA CCACGCGCAA GACGCGTGGC GACGATGTCG ACGCTGCGTG CGGCCAGCTC
GCCGGCCAGG TCCAGGACAG GACGCGGCGT ACCGTGCGCC TGCACCAGGC GAGGGAGAAC
ATGCGATGA
 
Protein sequence
MSTPNPVNLL DFDVDGLVAW FAGLGEKPFR ARQVMRWMHH EGCDDFDAMT DVAKSLRAKL 
KDLAVIRPPV PVRDSISADG TRKWLLDVGN ANAVETVFIP ETSRGTLCVS SQAGCALDCA
FCSTGKQGFN RNLSAAEIIG QLWLANKLLG AARADAEEHA TDLEAGEKDN GRIISNVVMM
GMGEPLANFD NVVTALRLML DDHAYGLSRR RVTVSTSGIV PAMDRLRDEC PVALAVSLHA
SNDALRDRLV PINQKYPLRE LMAACQRYLE RAPRDFVTFE YVMLEGVNDS DAHARELVAL
VRDTPCKFNL IPFNPFPDSG FQRSPAERIR RFAGILIDAG IVTTTRKTRG DDVDAACGQL
AGQVQDRTRR TVRLHQAREN MR