Gene Mlg_0345 details


Gene Information

Locus tag: Mlg_0345
Symbol:
ID: 4268333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 388568
End bp: 389848
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 73%
IMG OID: 638125076
Product: dihydroorotase
Protein accession: YP_741190
Protein GI: 114319507
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
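
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(388568, 389848)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed above.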


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCAT GGAGCATCAC GGGGGCCCGG GTGGTGGACC CGGCCAGCGA CCGGGATGAG 
GTCGTGGATC TGCACATCGC CGATGGGCGC ATCGCCGCCC TGGGGGCACC GCCCAGCGGC
TGGCAACCGG AGCATATCCT CCAGGCCACC GGCCTGGTGG CCTGCCCGGG GCTGATCGAC
CTGGCGGCGC GGACCCGAGA GCCGGGGCAG GCGCGCAAGG CCAATATCGC CAGTGAGGCC
CGGGCCGCGG CGGCCGGGGG AATCACCACG CTGATCTGCC CGCCGGACAC CCGCCCCGTC
ACCGATACGC CTTCGGTGGT GGAGCTGATC CGCAACCGCT CCGCAGCGGC GGGCGGGGCC
CGGGTGCTGC CGCTGGGGGC GCTGACCCGG AACCTGGACG GGGGGCAGTT GAGCGAGATG
GTGGCCCTGA GCGAGGCGGG CTGCCCCGGG CTGTCCGACG GCGGCCGGCC GATCGCCGAC
AGCCTGGTGC TGCGCCGGGC GCTGGAGTAC GCCGCCACCT TTGATCTGCC CGTGCACCTG
ACCCCGGAGG AGCCCATCCT GGCCCAGGGC CTGGCCCACG AGGGGCAGCT GGCCACCCGG
ATGGGCCTGC CCGGGATCCC GGTGGCCGCC GAGACGGCCG GGCTCGGCCG CATGCTCGCC
CTGGCCGAGG AGATCGGGGC CCGGGTCCAC TTCGGCCGGT TGTCCAGCCG CCGCGGTCTC
GAGCTGATCC TGGCCGCCCA GCGCAACGGT CAGCCGGTGA CCGCCGACGC CGCCATTCAT
CAGCTGTTTC TCACCGAGAT GGACATCTAC GGCTACCAGA GCCAGGCCCA CGTGCGCCCG
CCGCTGCGTT CCACCGGCGA CCGCGACGCC CTGCGCCGGG CGCTGGCGGC CGGCGAGCTT
CCGGTCCTCT GCTCCGACCA CCAGCCCCAC GATCCGGACG CCAAGCGTTG CCCCTTCGCC
GAGAGCGAAC CAGGCATCTC CGGGCTGGAC AGCCTGCTGG CGCTCGTCTT GCGCCTGGCC
GACGAGCTCA ACCTGCCCCT GACCCGCGCC CTGGCACCGG TCACCAGCGG CCCGGCACGG
GTCCTGGACC TGCCGGGTGG GCGCCTGACC GAGGGCGCCC CGGCGGACAT CTGCCTGTTC
GATCCGGACG AGGTCTGGTG GTTCAAGGCC AGCGACATGC ACAGCCGGGG CGAGAACAGC
CCGTTTACGG GCTGGGAATT CACCGGCCGG GCCCGCTACA CCATCGTCGA CGGACTCCGG
GTCTATGACG CCCACAACTG A
 
Protein sequence
MTAWSITGAR VVDPASDRDE VVDLHIADGR IAALGAPPSG WQPEHILQAT GLVACPGLID 
LAARTREPGQ ARKANIASEA RAAAAGGITT LICPPDTRPV TDTPSVVELI RNRSAAAGGA
RVLPLGALTR NLDGGQLSEM VALSEAGCPG LSDGGRPIAD SLVLRRALEY AATFDLPVHL
TPEEPILAQG LAHEGQLATR MGLPGIPVAA ETAGLGRMLA LAEEIGARVH FGRLSSRRGL
ELILAAQRNG QPVTADAAIH QLFLTEMDIY GYQSQAHVRP PLRSTGDRDA LRRALAAGEL
PVLCSDHQPH DPDAKRCPFA ESEPGISGLD SLLALVLRLA DELNLPLTRA LAPVTSGPAR
VLDLPGGRLT EGAPADICLF DPDEVWWFKA SDMHSRGENS PFTGWEFTGR ARYTIVDGLR
VYDAHN
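
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator written for illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and this gene starts with ATG, so no alternative start codon handling is needed. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order; identical for table 1 and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGACGGCAT GGAGCATCAC GGGGGCCCGG"))  # MTAWSITGAR
print(translate("GTCTATGACG CCCACAACTG A"))           # VYDAHN*
```

The two fragments reproduce the first ten and last six residues of the listed protein, followed by the stop codon. Pasting the full gene sequence into `translate` should give the complete 426-residue protein plus a trailing `*`.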