Gene Information |
Locus tag | Tmz1t_3438 |
Symbol | |
ID | 7873929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3762676 |
End bp | 3763716 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700378 |
Product | dihydroorotase |
Protein accession | YP_002890409 |
Protein GI | 237654095 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0418] Dihydroorotase |
TIGRFAM ID | [TIGR00856] dihydroorotase, homodimeric type |
|
|
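The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3762676
END_BP = 3763716
GENE_LENGTH_BP = 1041
PROTEIN_LENGTH_AA = 346


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1041
    print(expected_protein_length(GENE_LENGTH_BP))  # 346
```

Under these assumptions both results agree with the Gene Length and Protein Length fields, which is consistent with an unspliced CDS ending in a stop codon.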
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.577495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCGA TTACCCTGAT CCGCCCCGAC GACTGGCACC TGCATGTGCG CGACGACGCC GCGCTCGACG CGGTGGTGCC GCACACCGCC GCCCGCTTCG GCCGGGCGCT GATCATGCCC AACCTGAAGC CGCCGGTCAC CACCACCGCG CAGGCGCTCG CCTACCGCGA GCGCATCCTC GCCGCGGCGC GCGGGTCGAG GTTCGAGCCG CTGATGAGCC TCTACCTCAC CGACAACCTG TCGCCGGACG AGATCGACCG CGCCCGCGCG AGCGGCCACG TGGTCGCCTG CAAGCTCTAT CCGGCGGGCG CGACCACGAA CTCCGACGCC GGCGTCACCG CGATCGACAA GATCTATGCG GTGCTCGAAC GCATGGAGAA GCTCGGCATG GTGCTGTGCG TGCATGGCGA GGCCACCGGG GCGGAGGTTG ACGTGTTCGA CCGCGAGCGC GTCTTCGTCG AGCGCACGCT GTCGCCGCTG GTGCGGCGTT TCCCGGGGCT CAAGGTGGTG TTCGAGCACA TCACCACCGC GGAGGCGGCG CAGTTCGTGC GCGCAGCGGG CTCGCACGTG GCCGCCACGG TCACCGCCCA CCACTTGCTG CTCAACCGCA ACGCGATCTT CGCCGGCGGC ATCCGCCCGC ACCACTACTG TCTGCCGGTC CTCAAGCGCG AGAGCCACCG CGAGGCGCTG ATCGCCGCGG TCACCTCGGG CAACGCGCGC TTTTTCCTTG GCACCGACTC CGCGCCGCAT GCGCGCAGCA CCAAGGAAAA CGCCTGCGGC TGCGCAGGCT GCTACACCGC GCACGCGGGC ATCGAGCTCT ACGCCGAGGT CTTCGACGCC GCCGGCGCGC TCGATCGCCT CGAGGCCTTC GCCAGCCTCA ACGGCCCGGC CTTCTACGGC CTGGCGCCGA GCAGCGACAC CATCACGCTG CGGCGCGAGC CCTGGAGCGT GCCGGCGAGC TACCCCTACC TGGGCGAAGA TCCGCTGGTG CCCTTGCGCG CCGGCGAGCA GGTGGGCTGG AAGCTGGTGG AAGGGGCCTG A
|
Protein sequence | MQSITLIRPD DWHLHVRDDA ALDAVVPHTA ARFGRALIMP NLKPPVTTTA QALAYRERIL AAARGSRFEP LMSLYLTDNL SPDEIDRARA SGHVVACKLY PAGATTNSDA GVTAIDKIYA VLERMEKLGM VLCVHGEATG AEVDVFDRER VFVERTLSPL VRRFPGLKVV FEHITTAEAA QFVRAAGSHV AATVTAHHLL LNRNAIFAGG IRPHHYCLPV LKRESHREAL IAAVTSGNAR FFLGTDSAPH ARSTKENACG CAGCYTAHAG IELYAEVFDA AGALDRLEAF ASLNGPAFYG LAPSSDTITL RREPWSVPAS YPYLGEDPLV PLRAGEQVGW KLVEGA
|
| |
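As a rough illustration of how the gene and protein sequences above relate, the following sketch translates short excerpts of the gene sequence and computes GC percentage. It is a minimal example, not the method used to produce this record. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two differ only in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 30 and last 21 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in the conventional TCAG codon order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Excerpts copied from the Gene sequence field above.
GENE_START = "ATGCAATCGA TTACCCTGAT CCGCCCCGAC"
GENE_END = "AAGCTGGTGG AAGGGGCCTG A"

if __name__ == "__main__":
    print(translate(GENE_START))  # MQSITLIRPD
    print(translate(GENE_END))    # KLVEGA
```

The two excerpts translate to the first ten and last six residues of the protein sequence shown above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.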