Gene Mext_2106 details

Gene Information

Locus tag: Mext_2106
Symbol:
ID: 5831462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2365089
End bp: 2366165
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 73%
IMG OID: 641367903
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_001639572
Protein GI: 163851529
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
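
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2365089
END_BP = 2366165


def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


gene_length = feature_length(START_BP, END_BP)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.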


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0121785
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACG CGCTCTTCCC CCTCGCCCGG CCGCTGCTGC ACGGGCTCGA CGCCGAGACG 
GCGCACGACG TGACGATTCG CGGCCTGTCG CTGCTGCCGC CGCGGCGTCC GCCGGCCGAC
GACGCGTCCC TCGCCGTCGA GTTGTTCGGG CAGAGCTTTC CCAATCCGGT CGGCCTCGCC
GCCGGATTCG ACAAGGGCGC GCGGGTGGCC GACGCGCTGC TCGGCCTCGG TTTCGGCTTC
GTCGAGGTCG GCGGAGTCGT GCCGCAGCCC CAGCCCGGCA ATCCACGCCC GCGGGTGTTC
CGCCTCCCCC GCGACCGGGC GGTGATCAAC CGCTTCGGCC TCAACAGCGA GGGGCTCGAC
GCGGTGGCCG ACCGGCTCAA GGCCCGCGCC GGCCGCGAGG GGATCGTCGG CGTCAACATC
GGCGCCAACA AGGAATCGGC GGACCGCCTC GCCGACTACG TCGCCTGCAC CGCGCGGCTC
GCCCCGCATG TCGCCTTCAT CACCGTCAAC GTCTCCTCGC CCAACACGCC GGGCCTGCGC
GACCTTCAGG GCGAAGCCTT CCTCGACGAC CTGCTCGCCC GCGTCGTCGC CGCCCGCGAC
GCCAGCGGAT CGAGCGCCGC CGTGCTCCTC AAGATCGCGC CCGACATCGC GCTCGAAGGG
CTCGACGCCA TGACGGCGAC GGCGCTTCGG CGCGGCATCC AGGGCCTCGT CGTTTCGAAC
ACGACGATCG CCCGGCCGAC GTCCCTCGTG GAATCCTCCG TCGCAAAGGA AACCGGCGGC
CTGTCCGGAC GGCCGCTGTT CGGCCCGTCG ACGCGGCTGC TGGCCGAGAC CTATCTGCGC
GTCGGCGACC GGATCCCGCT GATCGGCGTC GGCGGCATCG ATTCGGCGGA GGCCGCCTGG
ACCAAGATCC GGGCCGGTGC GCGCCTCGTC CAGCTCTACT CCGCCCTCGT CTACGAGGGA
CCGGGGCTGG TCGGCACGAT TAAGCGCGGC CTGAGCCAGC GGCTGCGGGC GGAGGGCCTG
ACGAGCCTCG CCCCGGTCGT CGGGCGGGAC GCGGCCGCCC TCGCGCGGGA CGCCTAA
 
Protein sequence
MIDALFPLAR PLLHGLDAET AHDVTIRGLS LLPPRRPPAD DASLAVELFG QSFPNPVGLA 
AGFDKGARVA DALLGLGFGF VEVGGVVPQP QPGNPRPRVF RLPRDRAVIN RFGLNSEGLD
AVADRLKARA GREGIVGVNI GANKESADRL ADYVACTARL APHVAFITVN VSSPNTPGLR
DLQGEAFLDD LLARVVAARD ASGSSAAVLL KIAPDIALEG LDAMTATALR RGIQGLVVSN
TTIARPTSLV ESSVAKETGG LSGRPLFGPS TRLLAETYLR VGDRIPLIGV GGIDSAEAAW
TKIRAGARLV QLYSALVYEG PGLVGTIKRG LSQRLRAEGL TSLAPVVGRD AAALARDA
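
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any calculation. The following is a minimal sketch of how the GC fraction and the translation could be recomputed from the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 is assumed here to share (the two tables differ in their permitted start codons, not in these assignments).

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate whole codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence above; trailing partial codon is ignored.
first_blocks = "ATGATCGACG CGCTCTTCCC"
print(translate(first_blocks))
print(gc_fraction("ATGATCGACG"))
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way allows a comparison with the GC content field and the protein sequence listed in this record.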