Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1302 |
Symbol | |
ID | 3831788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1344863 |
End bp | 1346464 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829238 |
Product | dihydroorotase |
Protein accession | YP_430158 |
Protein GI | 83590149 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3653] N-acyl-D-aspartate/D-glutamate deacylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0050942 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.521882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCCC TGCTGATTAA GGGCGGCACG GTTGTCGACG GTACCGGTCG CCCTCCTTTT CGGGCCGATG TAGGTTTGCA AGGAGCAAAG ATCGCCGCCC TGGGCCTGTT TGATGGCGTA CCCGCGGGGA AGGTCGTGGA TGCCACCGGC CTGGTAGTTG CACCCGGGTT TATTGACTTT CACAGCCATG CTGACGCCGA ACTCCTGCGG GACCCGGAGG ATCGGGCCAA GCTGGCCCAG GGAGTCACTA CCGAAGTCAT CGGTAACTGT GGTATGTCCC TGGTCCCGGG GTCCCAGGAT ACCCGTCCCC TCCTGGCCGC GTATACCAGC CCGGTGCTGG GGGAGATCCC CCCGGATTTT CAGGCCACCG GCCTGGCTGA ATACCATTCC CTCTTAAGGC GCCAGGGAAT AGCTGTCAAT GTTGCCACTC TGGCCGGTCA CGGTTCAATC CGCCTGGCGG TCATGGGTAT GGCCGATCGC CGTGCCCCAA GGGCTGAGCT GGAGGAGATG TGCTCCCTCC TCAGGCAGGC GATGGCGGAA GGCGCCTGGG GCCTGTCCAG CGGTCTCCTC TACCCGCCGG GATGTTATGC CCCCACTGGG GAGTTAATCA CCCTCTGCCG GGTAGTCCGC CAGTACGGTG GTTTTTATGT CAGCCATATT CGTAATGAAT CCGACGGTGT CCTGGAGGCG GTAGAGGAAG CCCTGGAAAT CGGGCGGGAG GCCGGTGTTC CCGTCCACAT CTCCCACCTT AAAGCCTGCG GCTCCCGCAA CTGGCCCAAA ATACCCCGGG CTTTGGCCCT GCTGGACGCT GCCCGGGCTA AAGGCCAGGA CGTAAGCTGG GACGTCTACC CTTATACTGC CGGTTCCACC ACTGCCGCCT CCCTGCTGCC CCCGTGGGCT GTTGCCGGAG GCACTGCCGC CCTTCAGGAA CGCCTGCATT CACCGGAGGT TCGCCAGGAA ATTAAGAAAG CCTGGCAGGA AGGCTTGCCG GGCTGGGACA ACATGGTCAG TTCCCTTGGT TACGACCATT TAATAATTAA TGCTGTGAGC CACCGGGAAA ATAAGGACTG CGTGGGCCTG AGCCTGGCGC AAATCGGCCA ACAAAGGAGT CTGGACCCAG GTGACGCCCT GCTGGACCTG TTACAGAGCG AAGGTGGTAA CCTGGCCATT GAAACCTACC ACGCCTGCGA GGAGACCCTG GGAATGATCT TGCAGCACCC GGTAACGATC ATCGGCAGCG ATGGGATTTA TTCCGGCGAA CATGCTCATC CCCGCCTATA CGGCACCTTT GCCAGGGTTT TGGGCCGTTA TGTCCGGGAG CGAAAACTCC TCTCCCTGGA GGAGGCCATA GCCAAAATGA CTTCCAAACC GGCGGCCAGG CTCGGCCTCC GGTACCGGGG GCGGGTAACG CCAGGCTATT ATGCCGACCT GATCCTCTTT GACCAGGAAA CTATTGCCGA CCGGGCTACC TTCCAGGAAC CGGCCCGGAC ACCTTCTGGC ATCAAGGCTG TTATCGTCAA CGGGCGGGTG GCCTACCAGG AAGGGCGGTT TACCGGCGAA CGGGCCGGAA TTATTCTCAC CAGTCATACT ACCGGTGTTT AG
|
Protein sequence | MFSLLIKGGT VVDGTGRPPF RADVGLQGAK IAALGLFDGV PAGKVVDATG LVVAPGFIDF HSHADAELLR DPEDRAKLAQ GVTTEVIGNC GMSLVPGSQD TRPLLAAYTS PVLGEIPPDF QATGLAEYHS LLRRQGIAVN VATLAGHGSI RLAVMGMADR RAPRAELEEM CSLLRQAMAE GAWGLSSGLL YPPGCYAPTG ELITLCRVVR QYGGFYVSHI RNESDGVLEA VEEALEIGRE AGVPVHISHL KACGSRNWPK IPRALALLDA ARAKGQDVSW DVYPYTAGST TAASLLPPWA VAGGTAALQE RLHSPEVRQE IKKAWQEGLP GWDNMVSSLG YDHLIINAVS HRENKDCVGL SLAQIGQQRS LDPGDALLDL LQSEGGNLAI ETYHACEETL GMILQHPVTI IGSDGIYSGE HAHPRLYGTF ARVLGRYVRE RKLLSLEEAI AKMTSKPAAR LGLRYRGRVT PGYYADLILF DQETIADRAT FQEPARTPSG IKAVIVNGRV AYQEGRFTGE RAGIILTSHT TGV
|
| |