Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3109 |
Symbol | |
ID | 7874578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3363846 |
End bp | 3364775 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700031 |
Product | catechol 2,3 dioxygenase |
Protein accession | YP_002890083 |
Protein GI | 237653769 |
COG category | [R] General function prediction only |
COG ID | [COG2514] Predicted ring-cleavage extradiol dioxygenase |
TIGRFAM ID | [TIGR03211] catechol 2,3 dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0110217 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATGA CAGGCGTACT TCGTCCGGGA CACATCTCCC TGCGTGTCCT CGACCTCGAG GAAGGCATCA ACTTCTACAA GAACACCCTC GGCCTGGTCG AAACCGGCCG CGACGGCCAG GGCCGCGTCT ACTTCAAGGC CTGGGACGAG CGCGAGCACA ACAGCGTACT GATCCGCGAG GCCGAGAGCG CCGGCGTGGA CTTCTTCGCC TTCAAGGTCG CCGACAAGGC GACGCTCGAC AAGCTCGACG CCGACCTGCA GGCCTTCGGC CTGAAGACCG AGCGCATCCC CGCCGGCGAG ATGCTCGAGA CCGGCGAGCG CGTGCGCTTC GAGGTGCCGT CGGGCCACCT GATCGAGCTC TATGCCGAGA AGACCGACAT CGGCAACGGC CAGTCCTATG TGAACCCGGA CCCGTGGATC CCGGATGCCG AGCGTGGCAT CGCGCCCTCG CGCATGGACC ACTGCCTGCT GTACGGGCCG GACATCGAGA AGGTGCAGGA GATCTTCGAG AAGGTGCTCG GCTTCTACCT CGTCGAGCAC GTGGTGATGG AAGACGGCAA GACCGACCTG GCGATCTGGC TGTCGTGCTC GACCAAGGCC CACGGCATCG CCTTCGTGCG CCACCCCGAG CCGGGCAAGC TGCACCACAT CTCGTTCAAG CTCGACAGCT GGGAGAAGGT GCTGCGCGCG GCCGACATCA TGTCGATGAA CCGCATCTCG ATCGACATCG GCCCGACCCG CCACGGCGTC ACCCGCGGCA CCACGATCTA TGCCTTCGAC CCCTCGGGCA ACCGCTTCGA GACCTTCTGC GGCGGCTACG ACACCTACCC CGACTACAAG ACCATCACGT GGACCTGGGA CGAGGTGGGC GCCGGGATCT TCTATCACGA CCGCAAGCTC AACGAGCGCT TCCTGAGCGT GGTGTCCTGA
|
Protein sequence | MAMTGVLRPG HISLRVLDLE EGINFYKNTL GLVETGRDGQ GRVYFKAWDE REHNSVLIRE AESAGVDFFA FKVADKATLD KLDADLQAFG LKTERIPAGE MLETGERVRF EVPSGHLIEL YAEKTDIGNG QSYVNPDPWI PDAERGIAPS RMDHCLLYGP DIEKVQEIFE KVLGFYLVEH VVMEDGKTDL AIWLSCSTKA HGIAFVRHPE PGKLHHISFK LDSWEKVLRA ADIMSMNRIS IDIGPTRHGV TRGTTIYAFD PSGNRFETFC GGYDTYPDYK TITWTWDEVG AGIFYHDRKL NERFLSVVS
|
| |