Gene Information |
Locus tag | Tmz1t_3699 |
Symbol | |
ID | 7873698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4065246 |
End bp | 4066523 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700645 |
Product | dihydroorotase |
Protein accession | YP_002890669 |
Protein GI | 237654355 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
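The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed 1278 bp), and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4065246, 4066523)
print(length)                  # 1278
print(protein_length(length))  # 425
```

Both values agree with the Gene Length and Protein Length fields, which is consistent with the record describing a complete, unspliced CDS.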
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCC TGATTTCCAA CGGCCGCGTC GTCGATCCGG CCAATCGCAC CGATGCGGTG CAGAACGTCT ACGTCGCAGG AGGCAAGATC GTCGCCCTCG GCCAGGCGCC GGACGGCTTC GTCGCCGAGC GCACGATCGA CGCCGGCGGT CTCGTGGTCG CCCCCGGCTT CATCGACCTC GCCGCGCGCC TGCGCGAACC CGGCTACGAG TACCGCGCCA CGCTCGAATC CGAAATGGAG GCGGCGATGG CCGGCGGCGT CACCAGCCTG GCGATCCCGC CCGACACCGA CCCCGTGCTC GACGAGCCCG GCCTGGTCGA GATGCTGACC TATCGCGCCA AGAAGCTGAA CCGCGCCCAC ATCTATCCGG TCGGCGCACT CACCATCGGC CTTCAGGGCG AGCGCCTGTC CGAGATGGCC GAACTGGTCG AGGCCGGCTG CGTCGCCTTC TCGCAGGCCA ACGTGCCGCT GGTCGACAAC ACCGTGCTGA TGCGCGCGCT GCAATACGCC GCCACTTTCG GCTTCCGCGT CTGGCTGCAG CCGCTGGCGC CCTTCCTGTC GCAGGTGGGC CACGCCCACG ACGGCGAGGT GGCGACGCGG CTGGGCCTGT CGGGCATCCC GGTCGCCGCC GAAACCGTGG CGCTCTACAC CTACCTCGAG CTCGCGCGCA TCACCGGCGC CCGCCTGCAC ATCACCCGCC TGTCCTCGGC CGCCGGCCTC GCGCTCATCG ACCAGGCGCG CGCAGAAGGC ATGGACGTGA CCTGCGACGT GTCGATCAAC CATGTGCACC TGTGCGACAT GGACATCGGC TACTTCAACC CCAACTGCCA CCTCGTCCCG CCGCTGCGCA GCCAGCGCGA CCGCGAGGCA CTCGCCCGGG GCCTGGCCGA GGGCCGCATC GACGCGCTGT GCTCGGACCA CACCCCGGTG GACGACGACG CCAAGCAGAC GCCGTTCTCC GAATCCGAAC CCGGCGCCAC CGGCCTCGAG CTGCTGCTGC CGCTGACGCT GAAGTGGGCC GACCGCGCCG GGCTGGCGCT GCTGGACGGG CTGGCCCGCA TCACCTCGGA CGCGGCGAAG ATCGTCGGCA TCACCAAGGC CGGCCACCTC TCGGTGGGCG CGCGCGCCGA CGTGTGCGTG TTCGACCCCG CGACCCACGT CACCATCACC CGCGAGGGCC TCCGGAGCCA GGGCAAGAAC ACGCCCTTCC TCGGCATGGA GCTGCCGGGC AAGGTGCGCT ACACGCTGGT CGAGGGGCAG GTGATGTTCG AGGGCTGA
|
Protein sequence | MNILISNGRV VDPANRTDAV QNVYVAGGKI VALGQAPDGF VAERTIDAGG LVVAPGFIDL AARLREPGYE YRATLESEME AAMAGGVTSL AIPPDTDPVL DEPGLVEMLT YRAKKLNRAH IYPVGALTIG LQGERLSEMA ELVEAGCVAF SQANVPLVDN TVLMRALQYA ATFGFRVWLQ PLAPFLSQVG HAHDGEVATR LGLSGIPVAA ETVALYTYLE LARITGARLH ITRLSSAAGL ALIDQARAEG MDVTCDVSIN HVHLCDMDIG YFNPNCHLVP PLRSQRDREA LARGLAEGRI DALCSDHTPV DDDAKQTPFS ESEPGATGLE LLLPLTLKWA DRAGLALLDG LARITSDAAK IVGITKAGHL SVGARADVCV FDPATHVTIT REGLRSQGKN TPFLGMELPG KVRYTLVEGQ VMFEG
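The gene and protein sequences above can be related to each other with a short script. This is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI amino-acid string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGAACATCC TGATTTCCAA"))  # MNILIS
```

Passing the full gene sequence to `translate` and `gc_content` should allow the Protein sequence and GC content fields to be checked against the nucleotide data.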