Gene Msil_3849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3849 
Symbol 
ID7092545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4216121 
End bp4217440 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content67% 
IMG OID643467134 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_002364093 
Protein GI217979946 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.058395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTC CCGTCTCTCC GTCGGTCGCG CATCAGCCGC TGGCGCTGGT CAACGGCCGT 
CTTGTCGATG GGCAGACCTA TGATTGCGTG CGCGGCGGAA TTCTCATCCT CGACGGGAAA
ATTCTCGATC TCGGCCCCGA AGTCGCGCCG AAAAACCTGC CCGTTCATTC GCGCGTGATC
GACTGCGGCG GCGATTTCAT CGCGCCGGGC CTCATCGACA TGCGCGCCTT TGTCGGCGAG
CCGGGCGGCG AGCATCGTGA AACGATCGCC ACCGCGACGG CCGCGGCGGC AGCGGGCGGC
GTCACAACGA TTCTGGCGCG GCCCGACACC AATCCGCCGG TCGATGAGCC CGCCGTCGTC
GATTTTCTGC TGCGCCGCGC CCGCGACACC GGCCGCGTGC GGCTCATTCC CTGCGCGGCG
ATGACGCAAG GGCTGCGCGG CGAGGAGATC GCCGAGATCG GGCTGTTGCA GCAGGCGGGC
GCGCTCGCTT TTTCGGACGG CGCCCATTCC ATCGCAAACT CCCGCGTGCT GCGCCGCGTG
CTCTCCTATG CGCGCGATTT CGACGCGCTT ATCATTCATT ATGCCGAGGA TCGCGACCTC
GCCGCCGAGG GCGTCATGAA TGAGGGCGAA TTCGCCACAA GGCTCGGCCT CTCTGGCATC
CCGCGCGAGG CGGAGGCGAT CGCGCTCGAC CGCGACATCC GCCTCGTCAA CCTCACCGGC
GCGCGCTATC ACGCCGCGCT GGTGACGACG ACGCTGTCGC TCGACATTAT CGAACGCGCC
AAGGCGGCCG GACTGCCGGT CACCGCTGGA ACCTCGATCA ATCATCTGAC GCTGAACGAA
AGCGATATCG GCGATTACCG CACCTTTCTA AAGCTTGCGC CGCCGCTGCG GCGCGAGGAC
GAGCGGCGCG CGCTCGTGGA GGCGCTGTCG TCCGGCCTGA TCGACGTCAT CGTGTCCGAC
CACAATCCGC AGGACGTCGA GACCAAGCGC CTGCCTTTCG CCGAAGCCGA GAATGGCGCG
ATCGGGCTCG AGACCATGCT GGCGGCGGGG TTGCGGCTTG TCGCCTCCGG CGAAGTCTCG
CTGCAACGGC TGATCGGCGC CATGACGCTG CGTCCCGCCG AAATTTTGGG CCTGCCGCAG
GGCCGGCTGC GGGTTGGCGC CCCGGCCGAC GTCATCCGCT TCGATGCGGA GGCCGCCTAT
GTGGTCGATC CCTCAAAACT GCGCTCGCGC TCCAAGAACA CGCCCTTCGA CGAGGCGACC
ATGGAAGGCC GCGTGAAGCT GACGCTGGTC GAGGGGCGGA TTGTGTTCGA GGAGGAGTGA
 
Protein sequence
MNIPVSPSVA HQPLALVNGR LVDGQTYDCV RGGILILDGK ILDLGPEVAP KNLPVHSRVI 
DCGGDFIAPG LIDMRAFVGE PGGEHRETIA TATAAAAAGG VTTILARPDT NPPVDEPAVV
DFLLRRARDT GRVRLIPCAA MTQGLRGEEI AEIGLLQQAG ALAFSDGAHS IANSRVLRRV
LSYARDFDAL IIHYAEDRDL AAEGVMNEGE FATRLGLSGI PREAEAIALD RDIRLVNLTG
ARYHAALVTT TLSLDIIERA KAAGLPVTAG TSINHLTLNE SDIGDYRTFL KLAPPLRRED
ERRALVEALS SGLIDVIVSD HNPQDVETKR LPFAEAENGA IGLETMLAAG LRLVASGEVS
LQRLIGAMTL RPAEILGLPQ GRLRVGAPAD VIRFDAEAAY VVDPSKLRSR SKNTPFDEAT
MEGRVKLTLV EGRIVFEEE