Gene Mmar10_0518 details


Gene Information

Locus tag: Mmar10_0518
Symbol:
ID: 4285834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 607909
End bp: 608940
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 65%
IMG OID: 638139983
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_755749
Protein GI: 114569069
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
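
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons, remainder = divmod(span, 3)   # whole codons and leftover bases
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record above.
report = check_cds_lengths(607909, 608940, 1032, 343)
print(report)
```

With the values in this record, the inclusive span is 1032 bp, which is 344 codons: 343 amino acids plus one stop codon.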


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.565795
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCACG ACCTGGCGAC CCGGATGCTG CATGGCCTCG ACCCGGAAAC CGCGCACCGT 
GTCGGCATTC TCGGTCTGAA GGCCGGGCTG GGGCCACGCC AGTTCCGACC GGACCCCGCC
ATCCTGCGGA CCCGACTCGT CGGCCTAGAT CTGCCGAATC CGGTCGGCCT TGCGGCCGGT
TTCGACAAGA ATGCCGAGGC GCCGGATGCC CTCCTTGCAG CGGGTTTCGG CTTCGTCGAA
TGCGGCGCCG TGACCCCGCT TGCCCAGGAT GGCAAGCCGC GACCGCGGAT ATTCCGGCTC
GACGCGGACC GGGCGGTCAT CAATCGCATG GGCTTTCCCA ATCAGGGATT GGCGCTGTTT
CATCAGCGAC TGGTGCGTCG CTCGGCGCGG CTCGGCGTGG TCGGCGTCAA TCTGGGCGCC
AATCTCGAGA GTGAGGACCG GATCGCTGAC TATGTCGCCT GTCTCGACGC GCTCAAGGAC
CTGGCCCAGT TCTTCACGGT CAATGTGTCT TCTCCGAACA CGCCCGGCCT GCGCACGCTG
CAATCATCAG GCGCGCTCGA TGATCTGCTG GCCGCCGTTG CCGCGGTCGG TGCCAAGGCG
CCGGTCTTCC TGAAGATTGC GCCGGATATC GAAGATGCCG AGGCCGATGT CATGGTCGCC
GCGATCACGC GTCACAAGCT CGACGGCATC ATCATTTCCA ACACCACCAT CACCCGCCCG
GAAACCCTCG TCAGTGCGAA TATGGGTGAG GGGGGCGGCC TGTCCGGTCC GCCAGTCTTT
GCCCGCTCGA CCGAACTCGT GCGCGCTTTC CGCAAGGCCG CGGGACCGGA CATGGCAATC
ATCGGTGTCG GCGGCGTGTC CTGTGCCGAA ACCGCCTATG CCAAGATCCG GGCCGGTGCC
AATGCGATCC AGCTCTATAC CGCGATGATT TATGAGGGGC CGGGCCTGAT CCAGCGGATC
AAGCGCGGAC TGGTGGAACG GCTTCAGGTC GACGGGTTCG CATCGGTTGC CGACGCTGTC
GGCGCCGAGT GA
 
Protein sequence
MIHDLATRML HGLDPETAHR VGILGLKAGL GPRQFRPDPA ILRTRLVGLD LPNPVGLAAG 
FDKNAEAPDA LLAAGFGFVE CGAVTPLAQD GKPRPRIFRL DADRAVINRM GFPNQGLALF
HQRLVRRSAR LGVVGVNLGA NLESEDRIAD YVACLDALKD LAQFFTVNVS SPNTPGLRTL
QSSGALDDLL AAVAAVGAKA PVFLKIAPDI EDAEADVMVA AITRHKLDGI IISNTTITRP
ETLVSANMGE GGGLSGPPVF ARSTELVRAF RKAAGPDMAI IGVGGVSCAE TAYAKIRAGA
NAIQLYTAMI YEGPGLIQRI KRGLVERLQV DGFASVADAV GAE
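
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, under stated assumptions: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs from the standard code in which start codons it permits, not in the amino acids encoded).

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(sequence.split()).upper()


def gc_content(sequence):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not sequence:
        return 0.0
    gc = sequence.count("G") + sequence.count("C")
    return 100.0 * gc / len(sequence)


def translate(sequence):
    """Translate a cleaned DNA sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        aa = CODON_TABLE.get(sequence[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
fragment = clean("ATGATCCACG ACCTGGCGAC CCGGATGCTG")
print(translate(fragment))   # MIHDLATRML
print(gc_content(fragment))
```

Passing the full gene sequence through `clean` and then `translate` should, under these assumptions, yield a string that can be compared with the cleaned protein sequence, and `gc_content` can be compared with the GC content field in the record.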