Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2593 |
Symbol | |
ID | 7873334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2799646 |
End bp | 2800653 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699516 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_002889572 |
Protein GI | 237653258 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.144879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCTACG AACTCGCCCG CCCCCTCCTG TTCGCGCTCG ACCCCGAGAC CGCCCACAAC CTCACCCTGC ATGGCCTGCA CTATGCCGGC AAGCTGCTGC CCGCCGCCGA GCCGGAGCCG GCGAGCGCGG TCGAGGTCAT GGGCCTGCGC TTTCCCAACC GCGTCGGCCT CGCCGCCGGG CTCGACAAGA ACGGCGAGGC GATCGACGGC CTCGCCCGCC TCGGTTTCGG CTTCCTCGAG ATCGGCACGA TCACGCCGCG CCCGCAGCCG GGCAACCCCC GCCCGCGCAT GTTCCGCCTG CCCGAGGTGC GGGGCATCAT CAATCGCATG GGCTTCAACA ACCACGGCGT CGACGTCCTG CTCGCGCACG TGCGTGCGGC GAAGTACCGC GGCATCCTCG GCATAAACAT CGGCAAGAAC TTCGACACCC CGATCGAGCG TGCCGCCGAC GACTACCTCG CCTGCCTGGA AAAAGTGTAC GCCCTGGCGA GCTACGTCAC GGTGAACATC TCCTCGCCCA ACACCAAGAA CCTGCGCCAG CTGCAGGGCG AATCCGAACT CGACGACCTC CTCGGCCGCC TCAAGGCGGC GCAGACCCGC CTCACCGAGC AACATGGGCG CTACGTCCCG CTCACCCTCA AGATCGCCCC CGACCTCGAC GACGCGCAGG TGAGCAACAT CGCCGACGCG CTGCGCCGCC ACCGCATCGA CGGCGTGATC GCCACCAATA CCACGATCGC GCGCGACAAG GTGCAGGGCA TCGCCCACGG CAACGAACAG GGCGGCCTGT CGGGCGCGCC GGTGTTCGAG GCTTCCACCG CCGTGGTGCG CAAGCTCGCG CACGCGCTCG CCGGCGAGTT GCCGATCATC GCCGCCGGAG GCGTGCTCGA GGGCGCACAG GCGCGCGCCA AGCTCGAGGC CGGCGCCGCT TTGGTTCAGC TCTACAGCGG ACTCATCTAC CGCGGCCCGG CACTGGTGCG CGAATGCGTG CGCGCCACCG CGGGGTGA
|
Protein sequence | MLYELARPLL FALDPETAHN LTLHGLHYAG KLLPAAEPEP ASAVEVMGLR FPNRVGLAAG LDKNGEAIDG LARLGFGFLE IGTITPRPQP GNPRPRMFRL PEVRGIINRM GFNNHGVDVL LAHVRAAKYR GILGINIGKN FDTPIERAAD DYLACLEKVY ALASYVTVNI SSPNTKNLRQ LQGESELDDL LGRLKAAQTR LTEQHGRYVP LTLKIAPDLD DAQVSNIADA LRRHRIDGVI ATNTTIARDK VQGIAHGNEQ GGLSGAPVFE ASTAVVRKLA HALAGELPII AAGGVLEGAQ ARAKLEAGAA LVQLYSGLIY RGPALVRECV RATAG
|
| |