Gene Mlg_1463 details

Gene Information

Locus tag: Mlg_1463
Symbol:
ID: 4270244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1669378
End bp: 1670400
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 68%
IMG OID: 638126219
Product: dihydroorotate oxidase
Protein accession: YP_742302
Protein GI: 114320619
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
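
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1669378
end_bp = 1670400

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1023 bp
print(protein_length, "aa")  # 340 aa
```

Both values agree with the Gene Length and Protein Length fields.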


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0318461
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTATTCAC TGATTCGACC GCTGTTGATG CGCATGGATG CCGAGCGCAG CCATGAGTTT 
TCCCTGGCCT GGATGGACCG GCTGGCCCGG CTGGGGTTGG GGCGTCTGCT GTGCGGCCAC
CGCCTGCCGG ACATGCCGCG CCGGGTCATG GGCCTGACGT TCGCCAATCC GGTGGGTCTG
GCCGCGGGGC TGGACAAAAA CGGTGAGCAC CTGGAGGCCC TGGGGCACGT GGGCTTTGGG
TTTATTGAGG TGGGCACGGT GACCCCCAGG CCGCAGCCCG GCAACCCGGA GCCCCGGCTC
TTTCGCCTGC CCGCCCACGA GGCCATCATC AACCGCATGG GCTTCAACAA CCAGGGCGTG
GACGCCCTGG TCCAGCGCCT GCGGGTGACC CGCTACCAGG GGGTCTTGGG CGTCAATATC
GGTAAGAACA AGGACACGCC CACCGAACGG GCCACCGACG ACTACCTGAG CTGTTTACAG
AAGGTCTACC CCTACGCCGA TTACGTGGCG GTGAACGTCT CCTCGCCAAA CACCCCCGGG
CTGCGCGACC TGCAGGGGGG CGAGTTGCTG GAAGCGTTGC TGGGCCGACT CACTCACCTG
CGGGGTGTGC TGGCCCGGGA GTACGGCCGT TACGTGCCCC TGGTGGTCAA GATCGCGCCG
GATATGGATG AGGCCCAGCG GGCCCACTTC TGCCAACAGG TGCTGCGTTA CGGCATCGAC
GGCGTCGCGG CCACCAATAC CACCCTGTCC CGCGACGGGG TGGAGGATGA CCCGCTGGCC
CGGGAGCAGG GCGGGCTCTC CGGCGCCCCC TTGCGGCCGC GCGCCCAGGC GGTGCTCGAG
GAGCTGGGAC AGCGGCTCGG TCACCGGGTG CCATTGATCG GTGTCGGCGG CATCATGAGC
GGTGCCGATG CCCAGGCCCG CATGGCGGCA GGCGCCGACC TGCTTCAGAT CTACTCGGGG
TTCATCTACC GCGGGCCGCT CCTGCTGGAG GAGCTGCTCA AGGCGGTGGC GCCCGAGCAC
TGA
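
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be computed. The sketch below shows one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Usage with the first two blocks of the gene sequence in this record.
sample = "TTGTATTCAC TGATTCGACC"
print(len(clean_sequence(sample)))
print(f"{gc_fraction(sample):.0%}")
```

Applying the same functions to the full gene sequence gives values that can be compared with the Gene Length and GC content fields.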
 
Protein sequence
MYSLIRPLLM RMDAERSHEF SLAWMDRLAR LGLGRLLCGH RLPDMPRRVM GLTFANPVGL 
AAGLDKNGEH LEALGHVGFG FIEVGTVTPR PQPGNPEPRL FRLPAHEAII NRMGFNNQGV
DALVQRLRVT RYQGVLGVNI GKNKDTPTER ATDDYLSCLQ KVYPYADYVA VNVSSPNTPG
LRDLQGGELL EALLGRLTHL RGVLAREYGR YVPLVVKIAP DMDEAQRAHF CQQVLRYGID
GVAATNTTLS RDGVEDDPLA REQGGLSGAP LRPRAQAVLE ELGQRLGHRV PLIGVGGIMS
GADAQARMAA GADLLQIYSG FIYRGPLLLE ELLKAVAPEH
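
The protein sequence can be reproduced from the gene sequence using the translation table named in this record (table 11). The sketch below is a simplified pure-Python translation, assuming the amino-acid assignments of table 11 match the standard genetic code and that a recognised alternative start codon such as TTG is read as methionine at the first position. `translate_cds` is a hypothetical helper, and the start-codon set is listed from memory of the NCBI table, so it is worth checking against the official definition.

```python
# Amino-acid assignments in NCBI's TCAG codon order (assumed identical for
# the standard code and table 11; the tables differ in their start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
# Assumed table 11 start codons.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    dna = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # a start codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate_cds("TTGTATTCAC TGATTCGACC GCTGTTGATG"))  # MYSLIRPLLM
```

The output matches the first block of the protein sequence, including the leading M that results from the TTG start codon.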