Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3096 |
Symbol | |
ID | 7874566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3352633 |
End bp | 3353967 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643700019 |
Product | benzoate 1,2-dioxygenase, large subunit |
Protein accession | YP_002890071 |
Protein GI | 237653757 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | [TIGR03229] benzoate 1,2-dioxygenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCACCA CGGAATATCT GGACTCACTC ATCGAAAAGG ACAAGGAAAA GGGCCTTTTC CGCTGCAAGC GTGAGATGTT CACCAACAGG GAGCTGTTCG ACCTCGAGAT GGAACACATC TTCGAGGGCA ACTGGATCTA CCTCGCCCAC GAGAGCCAGC TGCCGAAGAT CAACGACTAC TACACGACCA AGATCGGTCG TCAGCCGGTG TTCATCACCC GCAACAAGGA CAACCAGCTC AACTGTTTCA TCAACGCCTG CGCCCACCGC GGCGCCACGC TGGCACGCTT CAAGCACGGC AACAAGGCCA CCTACACCTG CCCGTTCCAC GGCTGGACCT TCAACAACTC GGGCAAGCTG CTGAAGATCA AGGACCCGTC CGAGACCGGC TACCCCGAGA GCTTCAACTG CGAGGGCTCG CACGATCTCA AGAAGATCGC GCGCTTCGAG TCCTACCGCG GCTTCCTGTT CGGCAGCCTG AAGGCCGACG TGCAGCCGCT GGAAGACTTC CTCGGCGAGT CGAAGAAGAT CATCGACATG GTCGTCGACC AGTCGCCCGA GGGCCTGGAA GTGCTGCGCG GCTCCTCGAC CTACGTCTAT GAAGGCAACT GGAAGCTGCA GGCCGAGAAC GGCGCCGACG GCTACCACGT CACCGCCACG CACTGGAACT ACGCCGCCAC CCAGGCGCAG CGCAAGAGCC GCGACGCCGG CGACGACATC AAGGCCATGA GCGCCGGCGG CTGGGCCAAG AAGGGCGGCG GCTCGTACTC GTTCGAGAAC GGCCACCTGC TGCTGTGGAC GCGCTGGGAC AACCCCGAAG ACCGTCCGCT GATGGAACAG CGCGACCGTC TGGTCAAGGA ATTCGGCGAA GCCAAGGCCG ATTGGATGAT CGGCAATTCG CGCAACCTGT GCCTGTACCC GAACGTGTAC CTGATGGACC AGTTCAGCTC GCAGATCCGC GTGTTCCGCC CGCTCGACGT CAACCGCACC GAAGTCACCA TCTACTGCAT CGCGCCCAAG GGCGAGTCGG CCGAGGCGCG TGCCCGCCGC ATCCGCCAGT ACGAGGACTT CTTCAACGCC TCGGGCATGG CCACGCCGGA CGACCTCGAA GAATTCCGCG CCTGCCAGGA AGGCTTCATG GGCCGTGCGC TGGAGTGGAA CGACATGTCG CGCGGCGCCA CGCACTGGGT CGAAGGCCCG GACGAAGAAG CGGACAAGAT CGACCTCAAG CCCATCCTGT CGGGCGTGAA GACCGAGGAC GAGGGCCTCT ACGTCGCCCA GCACACCTAC TGGCTTGAAG AGATTCGCAA GGCTGCCGCG GCCAAGAACG CATAA
|
Protein sequence | MITTEYLDSL IEKDKEKGLF RCKREMFTNR ELFDLEMEHI FEGNWIYLAH ESQLPKINDY YTTKIGRQPV FITRNKDNQL NCFINACAHR GATLARFKHG NKATYTCPFH GWTFNNSGKL LKIKDPSETG YPESFNCEGS HDLKKIARFE SYRGFLFGSL KADVQPLEDF LGESKKIIDM VVDQSPEGLE VLRGSSTYVY EGNWKLQAEN GADGYHVTAT HWNYAATQAQ RKSRDAGDDI KAMSAGGWAK KGGGSYSFEN GHLLLWTRWD NPEDRPLMEQ RDRLVKEFGE AKADWMIGNS RNLCLYPNVY LMDQFSSQIR VFRPLDVNRT EVTIYCIAPK GESAEARARR IRQYEDFFNA SGMATPDDLE EFRACQEGFM GRALEWNDMS RGATHWVEGP DEEADKIDLK PILSGVKTED EGLYVAQHTY WLEEIRKAAA AKNA
|
| |