Gene Information |
Locus tag | Tmz1t_2817 |
Symbol | |
ID | 7873225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3045174 |
End bp | 3046982 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699738 |
Product | PQQ-dependent dehydrogenase, methanol/ethanol family |
Protein accession | YP_002889793 |
Protein GI | 237653479 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family |
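
The start, end, gene length, and protein length listed above are consistent with one another, and the arithmetic is easy to check. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(3045174, 3046982)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1809 602
```

Under these assumptions, 1809 bp gives 603 codons, and removing the stop codon leaves the 602 aa reported in the table.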
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAGA ACGACAGACG CGCGCGCACC CTGCCGGCGA TGCTGACCGC CCTGACGGTC GCGCTGGCCG CGGCGGGCAT GGGCGCCACG GCGCAGGCCG CGAGCGTGAG CGACGAGGAC ATCCTGAAGG ATGCCGAGAC GACGCATCAG GTCGTCACCA ACGGCCTCGG CACCAAGGGT CAGCGCTACA GCCCGCTCGC CAAGGTCAAT GTCGACACCG TGAAGGACCT CGTCCCGGTG TGGGCCTTCT CCTTCGGCGG CGAGAAGCAG CGCGGCCAGC AGTCGCAGCC GTTGATCCAC GACGGCAAGA TGTTCGTGAC CGCGTCCTAC AGCCGCATCT ACGCGCTCGA CGCCAAGACC GGCCAGAAGC TGTGGAAGTA CGAGCACCGC CTGCCCGAGG GCATCATGCC GTGCTGCGAC GTGATCAACC GCGGCGCCGC GCTCTACGGC GACCTGGTGA TCTTCGGCAC CCTGGACGCG CAGCTCGTCG CGCTCAACCG CGACACCGGC AAGGTGGTGT GGCGCGAGAA GATCGACGAC TACAAGGCCG GCTACTCGAT GACCGCCGCG CCGCAGATCG TCAAGGGCAT GGTGATCACC GGGGTGTCGG GCGGCGAGTT CGGCGTCGTC GGCCGCGTCG AGGCGCGCGA CGCCAAGACC GGCAAGAAGG TCTGGGTGCG CCCGGTGGTC GAAGGCCACA TGGGCTACAA GTACGACAAG GACGGCAAGG AGATCGAGAA CGGCATCTCG GGCACGACCA ACGCCACCTG GCCGGGCGAC CTGTGGAAGA CCGGCGGCGC CGCGACCTGG CAGACCGCCT ACTACGATCC GGACGTGAAC CTGATCTTCA TGGGCACCGG CAACCCGGCG CCGTGGAACT CCTGGCTGCG TCCGGGCGAC AACCTGTACT CGTCGTCCAC GGTCGCGGTC GACCCGGACA CCGGCAAGAT CGTCTGGCAC TACCAGAACA CGCCGCACGA CGGCTGGGAC TTCGACGGGG TGAACGAATT CATCGCCTTC GAGTACAAGG ACCCGAAGAG CGGCAAGCTC GTCAAGGCGG GCGCCAAGGC CGACCGCAAC GGCTTCTTCT TCGTCAATGA CCGGACCAAC GGCAAGCTGC TCGCCGCCTA CCCCTTCCTG ACCCGCATCG ACTGGGCCAA GGGCTACGAC CTCGAGACCG GTCGTCCGAT CACCAACGAC GACAAGCGTC CGCCCAATCC GTTTGCCGAG GGCGCGGCCG GCGAGGGCGG CAAGGGCAAG CCGGTGTTCG CCGCGCCGTC CTTCCTCGGC GGCAAGAACC AGATGCCGAT GGGCTACAGC CCGGACACCG GCCTGTTCTA CGTGCCCGCC AACGAGTGGG GCATGGACAT CTGGAACGAG CCGGTGAGCT ACAAGCGCGG CGCGGCCTAC CTGGGCGCGG GCTTCACCAT CAAGCCGCTG TACGAGGACT ACATCGGCGC GATGCGCGCG ATCGACCCGG TGAGCGGCAA GATCGTCTGG GAGGTCAAGA ACAACGCGCC GCTGTGGGGC GGGGTGCTCA GCACCGCCGG CAACCTGGTG TTCTATGGCA CGCCCGAGGG CTACCTGAAG GCGGTCAACG CGAAGACCGG CGAGGAGGCG TGGAAATTCC AGACCGGCTC GGGCGTGATC GCGCCGCCGG TGACCTGGGA AGCCGACGGC GAGCAGTACG TCGCGGTGGT GTCGGGCTGG GGCGGGGCGG TACCGCTGTG GGGCGGCGAC GTCGCCAAGC GGGTGAACTT CCTCGAGCAG GGCGGCACCG TGTGGGTGTT CAAGCTGCAC 
AAGAGCTGA
|
Protein sequence | MMKNDRRART LPAMLTALTV ALAAAGMGAT AQAASVSDED ILKDAETTHQ VVTNGLGTKG QRYSPLAKVN VDTVKDLVPV WAFSFGGEKQ RGQQSQPLIH DGKMFVTASY SRIYALDAKT GQKLWKYEHR LPEGIMPCCD VINRGAALYG DLVIFGTLDA QLVALNRDTG KVVWREKIDD YKAGYSMTAA PQIVKGMVIT GVSGGEFGVV GRVEARDAKT GKKVWVRPVV EGHMGYKYDK DGKEIENGIS GTTNATWPGD LWKTGGAATW QTAYYDPDVN LIFMGTGNPA PWNSWLRPGD NLYSSSTVAV DPDTGKIVWH YQNTPHDGWD FDGVNEFIAF EYKDPKSGKL VKAGAKADRN GFFFVNDRTN GKLLAAYPFL TRIDWAKGYD LETGRPITND DKRPPNPFAE GAAGEGGKGK PVFAAPSFLG GKNQMPMGYS PDTGLFYVPA NEWGMDIWNE PVSYKRGAAY LGAGFTIKPL YEDYIGAMRA IDPVSGKIVW EVKNNAPLWG GVLSTAGNLV FYGTPEGYLK AVNAKTGEEA WKFQTGSGVI APPVTWEADG EQYVAVVSGW GGAVPLWGGD VAKRVNFLEQ GGTVWVFKLH KS
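
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that display formatting, compute a GC fraction, and translate codons. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acid to each codon as the standard code. All helper names are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for the standard code in TCAG codon order; table 11 uses
# the same codon-to-amino-acid assignments ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; stops appear as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


head = clean("ATGATGAAGA ACGACAGACG")  # first two blocks of the gene
tail = clean("AAGAGCTGA")              # final block of the gene
print(translate(head))  # MMKNDR
print(translate(tail))  # KS*
```

The translated fragments match the start (MMKNDR...) and end (...KS) of the listed protein sequence. Running `gc_fraction` on the full cleaned gene sequence would be the way to compare against the GC content given in the record.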