Gene Tmz1t_3096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3096 
Symbol 
ID7874566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3352633 
End bp3353967 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content63% 
IMG OID643700019 
Productbenzoate 1,2-dioxygenase, large subunit 
Protein accessionYP_002890071 
Protein GI237653757 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCACCA CGGAATATCT GGACTCACTC ATCGAAAAGG ACAAGGAAAA GGGCCTTTTC 
CGCTGCAAGC GTGAGATGTT CACCAACAGG GAGCTGTTCG ACCTCGAGAT GGAACACATC
TTCGAGGGCA ACTGGATCTA CCTCGCCCAC GAGAGCCAGC TGCCGAAGAT CAACGACTAC
TACACGACCA AGATCGGTCG TCAGCCGGTG TTCATCACCC GCAACAAGGA CAACCAGCTC
AACTGTTTCA TCAACGCCTG CGCCCACCGC GGCGCCACGC TGGCACGCTT CAAGCACGGC
AACAAGGCCA CCTACACCTG CCCGTTCCAC GGCTGGACCT TCAACAACTC GGGCAAGCTG
CTGAAGATCA AGGACCCGTC CGAGACCGGC TACCCCGAGA GCTTCAACTG CGAGGGCTCG
CACGATCTCA AGAAGATCGC GCGCTTCGAG TCCTACCGCG GCTTCCTGTT CGGCAGCCTG
AAGGCCGACG TGCAGCCGCT GGAAGACTTC CTCGGCGAGT CGAAGAAGAT CATCGACATG
GTCGTCGACC AGTCGCCCGA GGGCCTGGAA GTGCTGCGCG GCTCCTCGAC CTACGTCTAT
GAAGGCAACT GGAAGCTGCA GGCCGAGAAC GGCGCCGACG GCTACCACGT CACCGCCACG
CACTGGAACT ACGCCGCCAC CCAGGCGCAG CGCAAGAGCC GCGACGCCGG CGACGACATC
AAGGCCATGA GCGCCGGCGG CTGGGCCAAG AAGGGCGGCG GCTCGTACTC GTTCGAGAAC
GGCCACCTGC TGCTGTGGAC GCGCTGGGAC AACCCCGAAG ACCGTCCGCT GATGGAACAG
CGCGACCGTC TGGTCAAGGA ATTCGGCGAA GCCAAGGCCG ATTGGATGAT CGGCAATTCG
CGCAACCTGT GCCTGTACCC GAACGTGTAC CTGATGGACC AGTTCAGCTC GCAGATCCGC
GTGTTCCGCC CGCTCGACGT CAACCGCACC GAAGTCACCA TCTACTGCAT CGCGCCCAAG
GGCGAGTCGG CCGAGGCGCG TGCCCGCCGC ATCCGCCAGT ACGAGGACTT CTTCAACGCC
TCGGGCATGG CCACGCCGGA CGACCTCGAA GAATTCCGCG CCTGCCAGGA AGGCTTCATG
GGCCGTGCGC TGGAGTGGAA CGACATGTCG CGCGGCGCCA CGCACTGGGT CGAAGGCCCG
GACGAAGAAG CGGACAAGAT CGACCTCAAG CCCATCCTGT CGGGCGTGAA GACCGAGGAC
GAGGGCCTCT ACGTCGCCCA GCACACCTAC TGGCTTGAAG AGATTCGCAA GGCTGCCGCG
GCCAAGAACG CATAA
 
Protein sequence
MITTEYLDSL IEKDKEKGLF RCKREMFTNR ELFDLEMEHI FEGNWIYLAH ESQLPKINDY 
YTTKIGRQPV FITRNKDNQL NCFINACAHR GATLARFKHG NKATYTCPFH GWTFNNSGKL
LKIKDPSETG YPESFNCEGS HDLKKIARFE SYRGFLFGSL KADVQPLEDF LGESKKIIDM
VVDQSPEGLE VLRGSSTYVY EGNWKLQAEN GADGYHVTAT HWNYAATQAQ RKSRDAGDDI
KAMSAGGWAK KGGGSYSFEN GHLLLWTRWD NPEDRPLMEQ RDRLVKEFGE AKADWMIGNS
RNLCLYPNVY LMDQFSSQIR VFRPLDVNRT EVTIYCIAPK GESAEARARR IRQYEDFFNA
SGMATPDDLE EFRACQEGFM GRALEWNDMS RGATHWVEGP DEEADKIDLK PILSGVKTED
EGLYVAQHTY WLEEIRKAAA AKNA