Gene M446_1753 details

Gene Information

Locus tag: M446_1753
Symbol:
ID: 6133518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1967845
End bp: 1969185
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 73%
IMG OID: 641642008
Product: dihydroorotase
Protein accession: YP_001768677
Protein GI: 170740022
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
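
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched in a few lines of Python. This is a minimal sketch, not part of the record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1967845, 1969185)
    print(length, "bp")                   # Gene Length field
    print(protein_length(length), "aa")   # Protein Length field
```

Under those assumptions the values reproduce the Gene Length (1341 bp) and Protein Length (446 aa) fields.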


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA GCTTCGACCT GATCCTGCGC GGCGGCACGC TCGTCAACCA CGACGGCGTC 
GGGCCGGGCG ATCTCGGCAT CCGGGCCGGG CGGGTCGCGG CCATCGGCGA CCTCGCCACC
GCGGCGGCCG GCGAGGTCCG GGACTGCACC GGCCTGCATC TCCTGCCGGG GGTGATCGAC
AGCCAGGTGC ATTTCCGCGA GCCCGGGCTC GACCACAAGG AGGATCTGGA GACCGGCTCG
CGCGCCGCCG TGATGGGAGG CGTCACGACG GTGTTCGAGA TGCCCAACAC CAATCCCCAG
ACGACCGGCC CGGAGGCGCT CGCCGACAAG CTCCGCCGCG CCCATCACCG CATGCACTGC
GACTTCGCCT TCTGGGTCGG GGGCACGCAC GAGAACGCCG CCGAGGTGGC GGAGCTGGAG
CGGCTGCCGG GAGCGGCGGG GATCAAGGTC TTCATCGGCT CCTCGACGGG CTCGCTCCTG
GTCGAGGACG ATGCGGGGAT CACCGAGATC CTGAAGCGCA TCCGCCGCCG CGCCGCCTTC
CACGCCGAGG ACGAGGCGAT GCTGCGCGCC CGCAAGGGCC TGCGCGTGCC GGGCGACCCG
TCCTCGCACC CGGTCTGGCG CTCGCCCGAG GCGGCGCTCA CGGCGACGCA GCGCCTGGTG
CGGATCGCCC GGGAGACCGG CGCGCGGATC CACATCCTGC ACATCTCCAC CGCCGAGGAG
ATGCGCTTCC TCGCCGCGCA CAAGGACGTG GCGACCGTGG AGGTGACGCC CCACCACCTC
ACCCTGGACG GCGCGGAGGC CTATCCGCGC CTCGGCACGC TGGTGCAGAT GAACCCGCCG
GTGCGCGACG CCGCCCATCG CGACGGGATC TGGTGGGGTC TCTCCCAGGG CGTCGCGGAC
GTGCTCGGCT CCGACCACGC CCCCCACACC CTGGAGGAGA AGGCCAAGCC CTACCCGGAT
TCCCCCTCCG GCATGACCGG GGTGCAGACG CTGGTGCCGA TCATGCTCGA CCACGTGGCG
GCGGGGCGCC TGAGCCTCGC GCGCTTCGTC GACCTGACGA GCGCGGGGCC GCAGCGCGCC
TTCGGGCTCG CCCGCAAGGG CCGCCTCGCG GTCGGCTACG ACGCGGACGT CACGGTGGTG
GACCTGAAGC GCCGCGAGAC GATCCGCAAC GCCTGGATCG CCAGCCGCTG CGGCTGGACG
CCCTACGACG GCACGACGGT CACGGGCTGG CCGGTCGGCA CGCTGGTGCG CGGCGCGACC
GTGATGTGGG AGGGCAGCCT GGTGACGCCC GCCTCCGGCG AGGCGGCCCG GTTCGAGGAG
GCTTTTCCGG CGCGCGCCTG A
 
Protein sequence
MTQSFDLILR GGTLVNHDGV GPGDLGIRAG RVAAIGDLAT AAAGEVRDCT GLHLLPGVID 
SQVHFREPGL DHKEDLETGS RAAVMGGVTT VFEMPNTNPQ TTGPEALADK LRRAHHRMHC
DFAFWVGGTH ENAAEVAELE RLPGAAGIKV FIGSSTGSLL VEDDAGITEI LKRIRRRAAF
HAEDEAMLRA RKGLRVPGDP SSHPVWRSPE AALTATQRLV RIARETGARI HILHISTAEE
MRFLAAHKDV ATVEVTPHHL TLDGAEAYPR LGTLVQMNPP VRDAAHRDGI WWGLSQGVAD
VLGSDHAPHT LEEKAKPYPD SPSGMTGVQT LVPIMLDHVA AGRLSLARFV DLTSAGPQRA
FGLARKGRLA VGYDADVTVV DLKRRETIRN AWIASRCGWT PYDGTTVTGW PVGTLVRGAT
VMWEGSLVTP ASGEAARFEE AFPARA
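
The sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The following Python sketch strips the spacing, computes a GC fraction, and translates a nucleotide string codon by codon. It is an illustration under stated assumptions, not the database's own method: the function names are hypothetical, the codon table is the standard assignment (which translation table 11 shares; table 11 differs in the alternative start codons it permits, which this sketch ignores), and translation simply stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    start = clean_sequence("ATGACCCAGA GCTTCGACCT GATCCTGCGC")
    print(translate(start))
    print(round(gc_fraction(start), 2))
```

Applied to the first three blocks of the gene sequence, the translation matches the first ten residues of the protein sequence (MTQSFDLILR). Passing the full cleaned gene sequence to `gc_fraction` is one way to check the GC content field.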