Gene M446_5637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5637 
Symbol 
ID6131328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6185569 
End bp6187056 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content68% 
IMG OID641645751 
Productdihydropyrimidinase 
Protein accessionYP_001772365 
Protein GI170743710 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCGT TCGATCTCTT CGTCACCAAT GGCACCGTCG TGACCGCGCA CGGTGAGTAC 
CGAGGCTCCA TCGCGGTCGT GGACGGGCGG ATCGCCGCGA TCGTGGCGCC AGGCGTCGAC
CTGCCGGCCG TCCGGACCAT CGACGCGCGC GGCCGGCACG TCCTGCCGGG CCTGTGGCAC
GTGCACTGCC ATTTCCGGGA GCCGGGCCAC ACCTACAAGG AGGATTTCGA GAGCGGCAGC
AGGGCGGCCG CGGCGGGCGG CATCGCCTTC TGCATCGACA TGACCAACAA CACCCCGCAT
CCCACCACGC TCGAAACCTT CGAGATGAAG AAAGCGGCGA TCGCTCCCAA AGCCCACGTC
GATTACGCGA TCTACGGCGG CGGCCTCTAC CCGAAGACGT GCCTCGACCT CGGCAAGGCG
GGGGCGATCG GTATCAAGAT CTTCAACACG CGGCACGTCA AGGAGGTCTA CCCCTACATC
ACCGAACTCG GGGTGGTCGA TCACGGCATC CTCTACGAAC TCTACGAGGC GATCGCCGAG
ACCGGCCTCG TCGGCTCGGT GCACCACGAC GATACGGAAT GGTGCAAGCG CCTGACGTTT
CGGGATTACA TCAATCCTGG AAAGGTTGAG AACAAATACT ACATGGAGTG CTACGAGCGC
GGCTACATGT ACGGACACGG CATGGTCGCC GGTCTCGCCA GCTCGCTCTA CTACGCGCGC
CTGACCAAGC TGCGCCTGCA CGTGCTTCAC CTGGGGGTTA TGCCGGTCGG CGCGTACGAG
ATGCTCCGGC ATGCCAAGTT CGACCTGAAG CAGGACGTCA CCGCGGAGCT TGAGGCCGCC
TCCTTGTTCA TGTCGCGCGA GCAGGCGGAG CGGGTCGGCC CCTTCGCGTA TCTCTGGGCG
CACAGCCCGG AGGCGGGCTG GACCAGTCTC AGGGACGGCG TCGCCGACAT GCTGGTCGGC
GAGCACGCGC CCCACTCCGT CGAGGACGTC GAGCCCGGCT GGGAGGACAA TTTCTCCGTC
CCGCTCGGGA TCACGGGTGC CCAGGAATTC GTGCCGCTGA TGCTCAACGC GGTGAACGAG
GGGCGGATGA CGCTCCAGGA CATCGCGCGG TTCTGCGCCC TCCAGCCCGC CCAGCGCTTC
GGCCTCTACC CGCGCAAGGG CGCGCTCGAA CTCGGTGCCG ACGCCGACAT CACGATCGTC
GACCTCGCCC GGGAGACGGT CTTGCGCAAG GAGGACATGC ACAGCCGCGC GGGCCACACC
TCCTGGGAGG GGATGCGGGT CCGCGGCATG CCGGTCGCCA CGATCGTCCG TGGGCAAGTG
GTGATGGAGG AGGCCCGCAT CGTCGGCGAG CCCGGGCTCG GCCGGTTCAC GCCCGGCATC
CTCGGCCGGG ACGGCGGACC GGCGCCGGAC GCCGCGCGGC TCCATCGGGC CGAGGCCGTC
GCCCAAGCCG CCTCGCGGCA GGTAGCGGCC GGCCCGGTCC TCGAATAG
 
Protein sequence
MKPFDLFVTN GTVVTAHGEY RGSIAVVDGR IAAIVAPGVD LPAVRTIDAR GRHVLPGLWH 
VHCHFREPGH TYKEDFESGS RAAAAGGIAF CIDMTNNTPH PTTLETFEMK KAAIAPKAHV
DYAIYGGGLY PKTCLDLGKA GAIGIKIFNT RHVKEVYPYI TELGVVDHGI LYELYEAIAE
TGLVGSVHHD DTEWCKRLTF RDYINPGKVE NKYYMECYER GYMYGHGMVA GLASSLYYAR
LTKLRLHVLH LGVMPVGAYE MLRHAKFDLK QDVTAELEAA SLFMSREQAE RVGPFAYLWA
HSPEAGWTSL RDGVADMLVG EHAPHSVEDV EPGWEDNFSV PLGITGAQEF VPLMLNAVNE
GRMTLQDIAR FCALQPAQRF GLYPRKGALE LGADADITIV DLARETVLRK EDMHSRAGHT
SWEGMRVRGM PVATIVRGQV VMEEARIVGE PGLGRFTPGI LGRDGGPAPD AARLHRAEAV
AQAASRQVAA GPVLE