Gene Information |
Locus tag | Tmz1t_3093 |
Symbol | |
ID | 7874563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3347429 |
End bp | 3348358 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700016 |
Product | catechol 2,3 dioxygenase |
Protein accession | YP_002890068 |
Protein GI | 237653754 |
COG category | [R] General function prediction only |
COG ID | [COG2514] Predicted ring-cleavage extradiol dioxygenase |
TIGRFAM ID | [TIGR03211] catechol 2,3 dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.396841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTA CCGGAGTGAT GCGTCCTGGC CACATCCAGC TGCGCGTGCT GGATCTGGAT GAAAGCGTGG CCTTTTACAA GGACGTGCTC GGGCTCGTCG AGACCGGCCG CGATGCAGCC GGCCGTGTGT ATTTCAAGGC CTGGGACGAG CGCGACCACA ATAGCGTGAT CCTGCGCCAG GCCGACCGTG CCGGCGTGGA CCTGATCGGC TTCAAGGTCA AGGACAAGGC GACGCTCGAG CAACTCGACA AGGACCTGCA GGCCTACGGC GTCGCCACCG AGCGCATCCC GGCCGGCGAA CTGCTCGAGA CCGGCGAGCG CGTGCGCTTC ACCATCCCGA CCGGCCACGT CATGGAGCTG TATGCCGAAA AGACCGACGT CGGCAACGGC CAGCCCTACA CCAACCCGGA TCCGTGGATC GCCGCCTCCG AGCACGGCAT CGCTCCGCAC CGCTTCGACC ATTGCCTGCT CTACGGCCCG AACCTCGAGG AGAACCTCAA GCTGTTCACC GAGGTGCTCG GCTTCCACCT CGTCGAGCGC GTGCTGCTCG AAGACGGCAA GTCCCTGCTC GCCACCTTCA TTTCCTGCTC GCACAAGGCG CACGACCTGG CCTTCGTCGC CCACCCCGAG CCGGGCAAGC TGCACCACCT GTCCTTCCTG CTCGACAGCT GGGAGAAGGT GCTGCGCGCG GCCGACATCA TGTCGATGAA CCGGGTGTCG ATCGACATCG GCCCGACCCG CCACGGCATC ACCCGCGGCT CGACGATCTA CGCCTTCGAC CCCTCGGGCA ACCGCTTCGA GACCTTCTGC GGCGGCTACG AGACCTATCC CGACCACGCG CCGATCACCT GGACCTTCGA CGAGGTCGGC GCCGGCATCT TCTACCACGA CCGCAAGCTC AACGAGCGCT TCCTGACCGT CGTGACCTGA
|
Protein sequence | MATTGVMRPG HIQLRVLDLD ESVAFYKDVL GLVETGRDAA GRVYFKAWDE RDHNSVILRQ ADRAGVDLIG FKVKDKATLE QLDKDLQAYG VATERIPAGE LLETGERVRF TIPTGHVMEL YAEKTDVGNG QPYTNPDPWI AASEHGIAPH RFDHCLLYGP NLEENLKLFT EVLGFHLVER VLLEDGKSLL ATFISCSHKA HDLAFVAHPE PGKLHHLSFL LDSWEKVLRA ADIMSMNRVS IDIGPTRHGI TRGSTIYAFD PSGNRFETFC GGYETYPDHA PITWTFDEVG AGIFYHDRKL NERFLTVVT
|
| |
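The numeric fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table above.
    length = gene_length(3347429, 3348358)
    print(length)                           # 930
    print(expected_protein_length(length))  # 309
```

With the coordinates listed above, the sketch gives 930 bp and 309 aa, matching the Gene Length and Protein Length fields. Passing the Gene sequence string to `gc_percent` would let the GC content field be checked in the same way.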