Gene Mext_3313 details


Gene Information

Locus tag: Mext_3313
Symbol:
ID: 5833072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3674533
End bp: 3675837
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 71%
IMG OID: 641369113
Product: dihydroorotase, multifunctional complex type
Protein accession: YP_001640771
Protein GI: 163852728
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
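
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is counted in the gene length but not in the protein length. The function names are hypothetical and are not part of the database.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one stop codon at the end of the CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3674533, 3675837
gene_length = cds_length(start_bp, end_bp)

print(gene_length)                           # 1305
print(expected_protein_length(gene_length))  # 434
```

Both results agree with the Gene Length (1305 bp) and Protein Length (434 aa) fields.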


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.582859
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGC TCGTCCTCTC CAACGCCCAT CTCCTCGATC CGGCCAGCGG CCGCGAGGGT 
CCCGGCGCCG TGCTCGTGCG CGACGGCCGC ATCGCCGACA TCGCCTGGAG TGCCACCCCC
GACGCGCCGG AGGGCGCCCA AAAGATCGAT TGCGGGGGTC TCACCCTCGC GCCAGGCCTG
ATCGACCTGC GCGCCTTCGT CGGCGAGCCG GGCGCAGAAC ACCGCGAGAC CCTAGCCTCG
GCGAGCGCGG CGGCGGCCGC GGGCGGCGTG ACGACCCTGG TCTGCATGCC CGACACCAAC
CCCGTCATCG ACGGGCCGGC CATCGTCGAT TTCGTGCTGC GCCGCGCCCG CGACACCGCG
AGCGTCAACG TGCTGCCGGC CGCCGCCATC ACCAAGGGTC TGGCCGGCCG CGAGATGACC
GAGTTCGGCC TGCTGGCGGA AGCCGGCGCG GTCGCCTTCA CCGATGGGCT GAAAGCCGTC
ACCAACGCGC AGGTGATGCG CCGCGCGCTG ACCTACGCCC GTGATTTCGG CGCGCTGCTG
ATGCAGCATG TCGAGGAGCC GGATCTCGTC GGCGAAGGCG TGATGAACGA GGGCGAGATG
GCCTCCCGCC TCGGCCTGAT CGGCATCCCC CGCGAGGCCG AGACGGTGAT GCTGGAGCGC
GACATCCGCC TCGTGCGCCT CACCGGCGCG CGCTACCACG CGGCGATGAT CTCTTGCGCC
GATTCCGTCG AGATCGTGCG GCGGGCCAAG GAGGCGGGGC TGCCCGTCAC CTGCGGCGTC
TCGGTCAACA ACCTCGTGCT CAACGAGGGT GACATCGGCC ACTACCGCAC CTTCTGCAAG
CTCTCGCCGC CCTTGCGCCG CGAGGACGAC CGGCAGGCGG TGATCGCGGC ACTGAATGAA
GGCGTGATCG ACGTCATCGT CTCCGACCAC AACCCGCAGG ACGTCGAGAC CAAGCGCCTG
CCCTTCGCCG AAGCCGCGGA CGGCGCGCTC GGCATCGAGA CCTTGCTCGG CGCGAGCCTT
CGCCTGCTCC ATACCGGCGA CGTGACGCTG GGAAGGCTGC TGAAGGTGCT CTCGGCCAAC
CCCGCGGCGC TGCTCGGCCG CGAGGCGGGG CGCCTGGAGA AGGGCGCCCC CGCCGACCTC
GTGCTGATCG ACCCCGATCT GCCCTACCTG CTCGACAAGC GGCAGCTGAA GTCGCGCTCC
AAGAACTCGC CCTTCGACGA GGCCCGGCTC CAGGGCGCCG CGGTGCTGAC GCTGGTCGGC
GGCCGCATCG TCCACCGCTC CGACCTCTCG GCCCTGGCCG CATGA
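
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the whole sequence; the page does not state how its value was derived. The function name gc_fraction is hypothetical. The sequence is pasted in the same 10-base block layout used above, so whitespace is stripped first.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace and case are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Small illustrative input (not taken from the record): 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence as a single string gives a value to compare with the reported GC content.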
 
Protein sequence
MSALVLSNAH LLDPASGREG PGAVLVRDGR IADIAWSATP DAPEGAQKID CGGLTLAPGL 
IDLRAFVGEP GAEHRETLAS ASAAAAAGGV TTLVCMPDTN PVIDGPAIVD FVLRRARDTA
SVNVLPAAAI TKGLAGREMT EFGLLAEAGA VAFTDGLKAV TNAQVMRRAL TYARDFGALL
MQHVEEPDLV GEGVMNEGEM ASRLGLIGIP REAETVMLER DIRLVRLTGA RYHAAMISCA
DSVEIVRRAK EAGLPVTCGV SVNNLVLNEG DIGHYRTFCK LSPPLRREDD RQAVIAALNE
GVIDVIVSDH NPQDVETKRL PFAEAADGAL GIETLLGASL RLLHTGDVTL GRLLKVLSAN
PAALLGREAG RLEKGAPADL VLIDPDLPYL LDKRQLKSRS KNSPFDEARL QGAAVLTLVG
GRIVHRSDLS ALAA
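
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation in plain Python. It builds the codon table from the compact amino-acid string of the NCBI genetic code tables, with bases in T, C, A, G order; for internal codons, table 11 assigns the same amino acids as the standard code. The sketch does not special-case alternative start codons, which is harmless here because this gene begins with ATG. The names CODON_TABLE and translate are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGCGCGC TCGTCCTCTC CAACGCCCAT CTCCTCGATC CGGCCAGCGG CCGCGAGGGT"
print(translate(first_line))  # MSALVLSNAHLLDPASGREG
```

The output matches the first 20 residues of the protein sequence.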