Gene Information |
Locus tag | Moth_2127 |
Symbol | |
ID | 3833278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2225266 |
End bp | 2226324 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637830052 |
Product | dihydroorotate oxidase B, catalytic subunit |
Protein accession | YP_430962 |
Protein GI | 83590953 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
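
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record for Moth_2127.
length = gene_length_bp(2225266, 2226324)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

Both results agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields. The strand does not affect this check, because the length depends only on the span between the two coordinates.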
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.95796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAGG CTGAGAAGAG GCCCGGAGTG AATCCTAAAG AAGAAGACGA GAAGTTGGGG GCCAGAGGTA AGGGACTTGC AGGAACAGGC TGCGCCGGTT CCGGGCTAAA GTCACGATCT ACTCCTCAAG AGCCGGAAAC CGGTGCCGGG GTCAACCTGG AGGTACAGCT TGGGGACCTG ACCCTGCCGA ACCCCGTCAT GCCGGCCTCC GGTACCTTCG GCTTTGGCGA GGAATACGCA CCTTTTCTGG ACCTCAACCG CCTGGGGGCC CTGGTTGTGA AGACCATTAC CCTGGAGCCG ACCCCCGGCA ACCCGCCGCC GCGTTTAATG GAGACTCCCT CCGGTTTACT CAACTCCATC GGGCTCCAGA ACCCGGGCCT GGAGGTTTTT CTCCGGGAGA AACTGCCTTA TTTGCGCCGG TTTACTCCCC CGCTGATTGT AAATATTGCC GGCAGGACGG TAGAGGAATA CGGCGAGCTG GCTGCCAGGC TAAGCGCGGC GGAGGGTATT GCCGCCCTGG AGGTCAACAT CTCCTGCCCC AATGTCCGGG AGGGAGGTAT TGTCTTCGGC ACGGTGCCGG AAATGGCCGC CCGGGTGACG GCAACGGTAA AAGGGCAAAC CCACCTGCCG GTGATAGTTA AACTGACGCC CAATGTCACC GATATTACGG TCCTGGCCAG GGCGGTGGAG GACGCCGGGG CCGATGCCCT TTCCCTGATC AATACCCTTC AGGGTATGGC CATCGACCTG GAGACCCGAC GGCCGGCCCT GGCCAATATC GTCGGGGGCC TGAGCGGCCC GGCCATTAAA CCGGTGGCCC TCTGTGCCGT CTGGCGGGTG GCCCGGGCGG TAAAGATCCC GGTAATCGGC ATGGGCGGTA TCGTGACCGC CCGGGATGCC CTGGAGTTCC TCCTGGCCGG GGCCAGGGCC GTGGCGGTAG GCACAGCCGG CCTGGTGAAT CCCCGGGCTA TAATCGAAGT GATTGACGGG ATTAAAGCCT ACCTGCAGGA ACAGGGTTTG CAGGATGTCA ACGAACTGGT AGGCGCCTTG CGAATTTAA
|
Protein sequence | MEQAEKRPGV NPKEEDEKLG ARGKGLAGTG CAGSGLKSRS TPQEPETGAG VNLEVQLGDL TLPNPVMPAS GTFGFGEEYA PFLDLNRLGA LVVKTITLEP TPGNPPPRLM ETPSGLLNSI GLQNPGLEVF LREKLPYLRR FTPPLIVNIA GRTVEEYGEL AARLSAAEGI AALEVNISCP NVREGGIVFG TVPEMAARVT ATVKGQTHLP VIVKLTPNVT DITVLARAVE DAGADALSLI NTLQGMAIDL ETRRPALANI VGGLSGPAIK PVALCAVWRV ARAVKIPVIG MGGIVTARDA LEFLLAGARA VAVGTAGLVN PRAIIEVIDG IKAYLQEQGL QDVNELVGAL RI