Gene Mmar10_0814 details


Gene Information

Locus tag: Mmar10_0814
Symbol:
ID: 4284790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 908611
End bp: 909948
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 65%
IMG OID: 638140280
Product: dihydroorotase
Protein accession: YP_756045
Protein GI: 114569365
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
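
The length fields above are consistent with the coordinates: 909948 - 908611 + 1 = 1338 bp, and 1338 / 3 = 446 codons, one of which is the stop codon, leaving 445 aa. A minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive (the record does not state the convention; the variable names are illustrative only):

```python
# Coordinates copied from the record; 1-based inclusive positions are an assumption.
start_bp = 908611
end_bp = 909948

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# The final codon is the stop codon and is not counted in the protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1338 0 445
```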


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.000170879
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCCAGA CATTCGATGT GATCCTCAAG GGCGGCGAGA TCGTCAATCA TGACGGCCGC 
GGCCATGCCG ATATCGGCAT CATTGACGGC AAGACCGCGG CGATCGGCGA CCTGTCGCAG
GCTTCGGCCG GCGAGGTGGT CGACTGTTCC GGCCTGATGG TCCTGCCGGG CGTGATCGAC
ACCCAGGTCC ATTTCCGCGA GCCGGGCATG GAGTGGAAGG AAGACCTGCA ATCGGGGTCA
TTGTGCGCGG TGATGGGCGG CGTCACGGCG GTGTTCGAAA TGCCGAACAC CAATCCCACC
ACCACCGTGC CGGACATGCT GACCGACAAG CTGGCCCGGG CGAAGAACCG CATGCATTGC
GACCACGCCT TTTATGCCGG CGCGACCAAT GAGAATGCCG ACATATTGCC CGAGATGGAG
CGCATGCTGG GCTGTTGCGG TGTGAAAGTC TTCATGGGCG CTTCGACCGG CTCGCTGCTG
GTGGCGGATG ATGAGGGCGT GGAACGGGTC CTGCGCGCGA TCAAGCGCCG CGCCGCCTTC
CATTCCGAAG ACGAGTACCG CCTCGCCGAG CGCCGCGAAC TGGCCGTCGA GGGCGACTGG
ACCAGCCACC CGCATGTGCG CGATGCCGAG GCCGCCATCA TGTCGACCAA GCGCCTGGTG
CGGCTGGCCC GCAAGACCGG CAAGCGGATC CACGTGCTGC ATATTTCGAC CGCCGAGGAA
ATGGACTTCC TCGCCGAGCA TCGCGACATC GCCTCGGTCG AGGCGACACC GCAGCACCTC
ACCTTGGAAG GCCCCGAAAT CTATGAGCGC ATCAAGGGCC GGGCCCAGAT GAACCCGCCC
ATGCGCGATG CCCGCCACCG GGCCGGGCTG TGGCGCGGCA TTCAGCGCGG CATTGTCGAT
GTGATCGGCT CCGACCACGC GCCGCACACG CTGGAAGAGA AAGCCAAGCC CTATCCGCAA
AGCCCGTCCG GCATGCCCGG CGTGCAGACG CTGGTCCCGG TCATGCTCGA TCATGTAAAC
GCCGGCCGCC TGAGCATTGA ACGCTTTGTC GACCTGACCT CGGCCGGCGC CCAGCGCATT
TTCGGCATTG CCGGCAAGGG CCGCATGGCG GTCGGCTGGG ATGCCGATTT CACCCTGGTC
GACATGAAGC GCAAAGAGAC CATCACCGAT GCCTGGTCGG CGTCGAAGTC CGGCTGGACC
CCGTTTGACG GCATGTCGGT CACCGGCTGG CCGGTCGGCA CGATCATTCG CGGCCGTTCG
GTGATGCGTG ATGGCGAGCT TGTTGCATCG GGCAAGGGCG AACCGGTGCG CTTCATGGAA
GCCCTGCCGC ACGACTGA
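
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before comparing it with the Gene Length and GC content fields. A small Python sketch of how that could be done; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the source database:

```python
def clean_sequence(text):
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above: six blocks of ten bases.
first_line = "ATGAGCCAGA CATTCGATGT GATCCTCAAG GGCGGCGAGA TCGTCAATCA TGACGGCCGC"
print(len(clean_sequence(first_line)))  # prints: 60
```

Passing the full sequence text to `clean_sequence` and `gc_percent` gives values that can be compared with the Gene Length (1338 bp) and GC content (65%) fields.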
 
Protein sequence
MSQTFDVILK GGEIVNHDGR GHADIGIIDG KTAAIGDLSQ ASAGEVVDCS GLMVLPGVID 
TQVHFREPGM EWKEDLQSGS LCAVMGGVTA VFEMPNTNPT TTVPDMLTDK LARAKNRMHC
DHAFYAGATN ENADILPEME RMLGCCGVKV FMGASTGSLL VADDEGVERV LRAIKRRAAF
HSEDEYRLAE RRELAVEGDW TSHPHVRDAE AAIMSTKRLV RLARKTGKRI HVLHISTAEE
MDFLAEHRDI ASVEATPQHL TLEGPEIYER IKGRAQMNPP MRDARHRAGL WRGIQRGIVD
VIGSDHAPHT LEEKAKPYPQ SPSGMPGVQT LVPVMLDHVN AGRLSIERFV DLTSAGAQRI
FGIAGKGRMA VGWDADFTLV DMKRKETITD AWSASKSGWT PFDGMSVTGW PVGTIIRGRS
VMRDGELVAS GKGEPVRFME ALPHD
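
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and because this gene begins with ATG no special handling of the first codon is needed. The function name `translate` is illustrative.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text):
    """Translate a blocked DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCCAGA CATTCGATGT GATCCTCAAG"))  # prints: MSQTFDVILK
```

The output matches the first block of the protein sequence. The last line of the gene sequence, GCCCTGCCGC ACGACTGA, starts in frame and translates to ALPHD followed by the TGA stop codon, which matches the end of the protein.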