Gene M446_3258 details

Gene Information

Locus tag: M446_3258
Symbol:
ID: 6130653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3607711
End bp: 3608790
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 77%
IMG OID: 641643445
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_001770097
Protein GI: 170741442
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
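
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, although they agree with the values shown. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3607711, 3608790)
print(length, protein_length_aa(length))  # prints: 1080 359
```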


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.107503
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTCGCCA GCCTGTTCCC CCTCGCCCGG CCCGTTCTCC ACGCCCTCGA CGCCGAGACG 
GCCCATCGCC TGACCCTGCG GGCGCTCGCG CTCCTGCCCC CCGGACCACC GCCCGCCGAC
GACCCGGCCC TGGCGGTCGC GGCCTTCGGC AGGCGCTTCC CGAACCCGGT CGGGCTCGCG
GCGGGATTCG ACAAGGGGGC CGAGGTGCCG GACGCGCTCC TGCGCCTCGG CTTCGGCTTC
GTGGAGGTCG GCGGCGTGGT GCCGCTGCCC CAGCCCGGCA ACCCGCGCCC GCGGGTCTTC
CGCCTGCCCC GCGACGGCGC CGTCATCAAC CGGTTCGGGC TCAACAGCGA GGGGCTCGCC
ACCGTGGCGG CGCGACTCGC CGCGCGGGCC GGCCGGCCCG GCCTGATCGG CGCGAATATC
GGGGCCAACA AGGAGGCGGC CGACCGGCTG GCCGATTACG TGACCTGCAC GCGGGCGCTC
GCGGGCCTCG TCGACTTCAT CACCGTGAAC GTCTCCTCGC CCAACACGCC GGGCCTGCGC
GACCTGCAGG GCGAGGCCTT CCTGGACGAG CTCCTGGCCC GGGTCGTGGA GGCCCGCGAC
GCGGCAGGGG GCGGGCGCCG CGCCGCCATC CTGCTCAAGA TCGCGCCCGA CATCACCCTC
GGCGCCCTCG ACGCCATCGC GGCCACGGCC CTGCGGCGCG GGGTGGAGGG GCTCGTCGTC
TCGAACACCA CGGTGGCGCG GCCCGCCGGC CTCGCGGAGG CGGCGCGCGC CCGCGAGGCC
GGCGGGCTCT CCGGCCGGCC GCTCTTCTCG CCCTCGACGC GGCTCCTCGC CGAGACCTTC
CTGCGGGTCG GCACCCGCCT GCCCCTGGTC GGGGTCGGCG GGATCGATTC GGCCGAGGCC
GCCTGGACCA AGATCCGGGC CGGCGCCAGC CTCCTGCAGC TCTACTCGGC CCTGGTCTAT
GCGGGCCCCG GGCTCGTCGG GACGATCAAG CGCGGGCTCG CCGCGCGCCT CGCCGACCAG
GGCCGGCCGC TCGCCGCGTG GGTCGGCCGC GACGCGGCCG AACTCGCGCG CAGCGCCTGA
 
Protein sequence
MLASLFPLAR PVLHALDAET AHRLTLRALA LLPPGPPPAD DPALAVAAFG RRFPNPVGLA 
AGFDKGAEVP DALLRLGFGF VEVGGVVPLP QPGNPRPRVF RLPRDGAVIN RFGLNSEGLA
TVAARLAARA GRPGLIGANI GANKEAADRL ADYVTCTRAL AGLVDFITVN VSSPNTPGLR
DLQGEAFLDE LLARVVEARD AAGGGRRAAI LLKIAPDITL GALDAIAATA LRRGVEGLVV
SNTTVARPAG LAEAARAREA GGLSGRPLFS PSTRLLAETF LRVGTRLPLV GVGGIDSAEA
AWTKIRAGAS LLQLYSALVY AGPGLVGTIK RGLAARLADQ GRPLAAWVGR DAAELARSA
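
The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own method, that strips the display whitespace, computes GC content, and translates a CDS under translation table 11. The amino acid string follows the NCBI codon ordering (bases in T, C, A, G order), and the start codon set is the one defined for table 11. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers introduced for illustration.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# as defined for NCBI translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons of table 11; as the first codon they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove display spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, reading a table 11 start codon as Met."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residues.append(AMINO_ACIDS[index])
    protein = "".join(residues)
    if seq[:3] in START_CODONS:
        protein = "M" + protein[1:]
    # Drop the trailing stop codon symbol, if any.
    return protein.rstrip("*")


# First three blocks of the gene sequence listed above.
first_blocks = "TTGCTCGCCA GCCTGTTCCC CCTCGCCCGG"
print(translate(first_blocks))  # prints: MLASLFPLAR
```

The gene begins with TTG, which codes for leucine internally but is read as methionine when used as an initiation codon, which is consistent with the protein sequence starting with M. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content reported in the record.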