Gene Moth_2127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2127 
Symbol 
ID3833278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2225266 
End bp2226324 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content62% 
IMG OID637830052 
Productdihydroorotate oxidase B, catalytic subunit 
Protein accessionYP_430962 
Protein GI83590953 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.95796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAGG CTGAGAAGAG GCCCGGAGTG AATCCTAAAG AAGAAGACGA GAAGTTGGGG 
GCCAGAGGTA AGGGACTTGC AGGAACAGGC TGCGCCGGTT CCGGGCTAAA GTCACGATCT
ACTCCTCAAG AGCCGGAAAC CGGTGCCGGG GTCAACCTGG AGGTACAGCT TGGGGACCTG
ACCCTGCCGA ACCCCGTCAT GCCGGCCTCC GGTACCTTCG GCTTTGGCGA GGAATACGCA
CCTTTTCTGG ACCTCAACCG CCTGGGGGCC CTGGTTGTGA AGACCATTAC CCTGGAGCCG
ACCCCCGGCA ACCCGCCGCC GCGTTTAATG GAGACTCCCT CCGGTTTACT CAACTCCATC
GGGCTCCAGA ACCCGGGCCT GGAGGTTTTT CTCCGGGAGA AACTGCCTTA TTTGCGCCGG
TTTACTCCCC CGCTGATTGT AAATATTGCC GGCAGGACGG TAGAGGAATA CGGCGAGCTG
GCTGCCAGGC TAAGCGCGGC GGAGGGTATT GCCGCCCTGG AGGTCAACAT CTCCTGCCCC
AATGTCCGGG AGGGAGGTAT TGTCTTCGGC ACGGTGCCGG AAATGGCCGC CCGGGTGACG
GCAACGGTAA AAGGGCAAAC CCACCTGCCG GTGATAGTTA AACTGACGCC CAATGTCACC
GATATTACGG TCCTGGCCAG GGCGGTGGAG GACGCCGGGG CCGATGCCCT TTCCCTGATC
AATACCCTTC AGGGTATGGC CATCGACCTG GAGACCCGAC GGCCGGCCCT GGCCAATATC
GTCGGGGGCC TGAGCGGCCC GGCCATTAAA CCGGTGGCCC TCTGTGCCGT CTGGCGGGTG
GCCCGGGCGG TAAAGATCCC GGTAATCGGC ATGGGCGGTA TCGTGACCGC CCGGGATGCC
CTGGAGTTCC TCCTGGCCGG GGCCAGGGCC GTGGCGGTAG GCACAGCCGG CCTGGTGAAT
CCCCGGGCTA TAATCGAAGT GATTGACGGG ATTAAAGCCT ACCTGCAGGA ACAGGGTTTG
CAGGATGTCA ACGAACTGGT AGGCGCCTTG CGAATTTAA
 
Protein sequence
MEQAEKRPGV NPKEEDEKLG ARGKGLAGTG CAGSGLKSRS TPQEPETGAG VNLEVQLGDL 
TLPNPVMPAS GTFGFGEEYA PFLDLNRLGA LVVKTITLEP TPGNPPPRLM ETPSGLLNSI
GLQNPGLEVF LREKLPYLRR FTPPLIVNIA GRTVEEYGEL AARLSAAEGI AALEVNISCP
NVREGGIVFG TVPEMAARVT ATVKGQTHLP VIVKLTPNVT DITVLARAVE DAGADALSLI
NTLQGMAIDL ETRRPALANI VGGLSGPAIK PVALCAVWRV ARAVKIPVIG MGGIVTARDA
LEFLLAGARA VAVGTAGLVN PRAIIEVIDG IKAYLQEQGL QDVNELVGAL RI