Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2106 |
Symbol | |
ID | 5831462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2365089 |
End bp | 2366165 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641367903 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_001639572 |
Protein GI | 163851529 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0121785 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACG CGCTCTTCCC CCTCGCCCGG CCGCTGCTGC ACGGGCTCGA CGCCGAGACG GCGCACGACG TGACGATTCG CGGCCTGTCG CTGCTGCCGC CGCGGCGTCC GCCGGCCGAC GACGCGTCCC TCGCCGTCGA GTTGTTCGGG CAGAGCTTTC CCAATCCGGT CGGCCTCGCC GCCGGATTCG ACAAGGGCGC GCGGGTGGCC GACGCGCTGC TCGGCCTCGG TTTCGGCTTC GTCGAGGTCG GCGGAGTCGT GCCGCAGCCC CAGCCCGGCA ATCCACGCCC GCGGGTGTTC CGCCTCCCCC GCGACCGGGC GGTGATCAAC CGCTTCGGCC TCAACAGCGA GGGGCTCGAC GCGGTGGCCG ACCGGCTCAA GGCCCGCGCC GGCCGCGAGG GGATCGTCGG CGTCAACATC GGCGCCAACA AGGAATCGGC GGACCGCCTC GCCGACTACG TCGCCTGCAC CGCGCGGCTC GCCCCGCATG TCGCCTTCAT CACCGTCAAC GTCTCCTCGC CCAACACGCC GGGCCTGCGC GACCTTCAGG GCGAAGCCTT CCTCGACGAC CTGCTCGCCC GCGTCGTCGC CGCCCGCGAC GCCAGCGGAT CGAGCGCCGC CGTGCTCCTC AAGATCGCGC CCGACATCGC GCTCGAAGGG CTCGACGCCA TGACGGCGAC GGCGCTTCGG CGCGGCATCC AGGGCCTCGT CGTTTCGAAC ACGACGATCG CCCGGCCGAC GTCCCTCGTG GAATCCTCCG TCGCAAAGGA AACCGGCGGC CTGTCCGGAC GGCCGCTGTT CGGCCCGTCG ACGCGGCTGC TGGCCGAGAC CTATCTGCGC GTCGGCGACC GGATCCCGCT GATCGGCGTC GGCGGCATCG ATTCGGCGGA GGCCGCCTGG ACCAAGATCC GGGCCGGTGC GCGCCTCGTC CAGCTCTACT CCGCCCTCGT CTACGAGGGA CCGGGGCTGG TCGGCACGAT TAAGCGCGGC CTGAGCCAGC GGCTGCGGGC GGAGGGCCTG ACGAGCCTCG CCCCGGTCGT CGGGCGGGAC GCGGCCGCCC TCGCGCGGGA CGCCTAA
|
Protein sequence | MIDALFPLAR PLLHGLDAET AHDVTIRGLS LLPPRRPPAD DASLAVELFG QSFPNPVGLA AGFDKGARVA DALLGLGFGF VEVGGVVPQP QPGNPRPRVF RLPRDRAVIN RFGLNSEGLD AVADRLKARA GREGIVGVNI GANKESADRL ADYVACTARL APHVAFITVN VSSPNTPGLR DLQGEAFLDD LLARVVAARD ASGSSAAVLL KIAPDIALEG LDAMTATALR RGIQGLVVSN TTIARPTSLV ESSVAKETGG LSGRPLFGPS TRLLAETYLR VGDRIPLIGV GGIDSAEAAW TKIRAGARLV QLYSALVYEG PGLVGTIKRG LSQRLRAEGL TSLAPVVGRD AAALARDA
|
| |