Gene Mpop_4820 details


Gene Information

Locus tag: Mpop_4820
Symbol:
ID: 6310221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium populi BJ001
Kingdom: Bacteria
Replicon accession: NC_010725
Strand:
Start bp: 5146347
End bp: 5147687
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 72%
IMG OID: 642653499
Product: dihydroorotase
Protein accession: YP_001927451
Protein GI: 188584006
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
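
The start, end, and length fields in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the protein length excludes the stop codon. Both readings are assumptions here, not something the record states. A minimal sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 5146347
end_bp = 5147687

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # gene length in base pairs
print(protein_length, "aa")  # protein length in amino acids
```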


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.475472
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCGC CTTTCGACCT CGTCCTGTCC GGCGGCACGG TGGTGAACCA CGACGGCGCC 
GCCCTCGCCG ACCTCGGCAT CCGCGGCGGA CGTGTGGCGG CCATCGGCGA CCTCGCGGGC
GCGGCGGCGC AGGAGCGGCG GGACTGCCGC GGCCTGCACC TCCTGCCCGG CGTGATCGAC
AGCCAAGTGC ATTTCCGCGA GCCCGGCCTC GATCACAAGG AGGATCTGGA GACGGGCTCG
CGCGCGGCGG TGATGGGCGG CGTCACGGCG GTGTTCGAGA TGCCCAACAC CAACCCGCAG
ACGACCAGCG CCGCGGCGCT CGCCGACAAG GTCGCCCGCG CCCATCACCG GATGCATTGC
GACTTCGCCT TCTGGGTCGG GGGCACCCAC GAGAACGCGG CGGAGGTGGC CGAGCTGGAG
CGGCTGCCGG GAGCGGCCGG GATCAAGGTG TTCATCGGCT CCTCCACCGG CTCGCTGCTC
GTCGAGGACG ATGCCGGCGT GCGCGCGATC CTGAGCCGCA TCCGCCGCCG GGCCGCCTTC
CATTCGGAGG ACGAGCCGAT GCTGCGCGAG CGCAAGGAGT TGCGCGTGCC GGGTGACCCG
TCTTCGCACC CGGTCTGGCG CTCGCCGGAG GTGGCGGTGA AGGCCACCCG CCGCCTGATC
GCGATCGCGC GGGAGACGGG CACGCGCATC CACATCCTGC ACATCTCGAC CGCGGACGAG
ATGCCGATCC TGGCGGACGC CAAGGACGTG GCGAGCGTCG AGGTGACGCC GCACCACCTG
ACGATCGACG GCGGCGAGGC CTACGCCCGG CTCGGGACGC TGGTGCAGAT GAACCCGCCG
GTGCGCGACG CCGGGCATCG CGACGGGATC TGGCGCGGTC TTGGCGACGG CGTCGTCGAC
GTTCTCGGCT CCGACCACGC GCCGCACACG CTCGAGGAGA AGGCCAAGCC CTATCCGGAT
TCCCCCTCGG GGATGACCGG CGTGCAGACG CTGGTGCCGA TCATGCTCGA CCACGTGAAT
GCCGGACGGC TCTCGCTCGC CCGGCTCGTC GACCTGACGA GCGCCGGCCC CAAGCGCCTG
TTCGGCATCG CCCGGAAAGG GCGGCTCGCC GTGGGCTACG ACGCCGACGT GACGGTGGTG
GACCTCAAGC GCCGAGAGAC CATCCGCAAC GCGTGGATCG CCTCGAAATG CGGCTGGACG
CCCTATGACG GCGTCACCGT CACCGGCTGG CCGGTGGGCA CCGTGGTGCG CGGCACCCCG
GTGATGTGGC AGGGCGAATT AACCAACCCG TCTCGGGGCG AAGCGGTCGT GTTCGAGGAG
GCGCTGCCGG CCGCGGGTTA A
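
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is included as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty input)."""
    seq = clean_sequence(sequence)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAAGCGC CTTTCGACCT CGTCCTGTCC GGCGGCACGG TGGTGAACCA CGACGGCGCC"

print(len(clean_sequence(first_line)))       # number of bases on this line
print(round(gc_fraction(first_line) * 100))  # GC percentage of this line only
```

Running the same functions on the full sequence should give a length and GC percentage that can be compared with the Gene Length and GC content fields.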
 
Protein sequence
MEAPFDLVLS GGTVVNHDGA ALADLGIRGG RVAAIGDLAG AAAQERRDCR GLHLLPGVID 
SQVHFREPGL DHKEDLETGS RAAVMGGVTA VFEMPNTNPQ TTSAAALADK VARAHHRMHC
DFAFWVGGTH ENAAEVAELE RLPGAAGIKV FIGSSTGSLL VEDDAGVRAI LSRIRRRAAF
HSEDEPMLRE RKELRVPGDP SSHPVWRSPE VAVKATRRLI AIARETGTRI HILHISTADE
MPILADAKDV ASVEVTPHHL TIDGGEAYAR LGTLVQMNPP VRDAGHRDGI WRGLGDGVVD
VLGSDHAPHT LEEKAKPYPD SPSGMTGVQT LVPIMLDHVN AGRLSLARLV DLTSAGPKRL
FGIARKGRLA VGYDADVTVV DLKRRETIRN AWIASKCGWT PYDGVTVTGW PVGTVVRGTP
VMWQGELTNP SRGEAVVFEE ALPAAG
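
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string. Translation table 11 assigns the same amino acids as the standard code and differs only in which start codons it permits, so this sketch does not model start codons. The function name `translate` is hypothetical, and only the first and last fragments of the gene are used as sample input.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final 21 bases of the gene sequence in the record.
print(translate("ATGGAAGCGC CTTTCGACCT CGTCCTGTCC"))  # MEAPFDLVLS
print(translate("GCGCTGCCGG CCGCGGGTTA A"))           # ALPAAG
```

Both fragments match the first and last residues of the protein sequence listed above.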