Gene Mlg_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1014 
Symbol 
ID4270043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1153884 
End bp1155350 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content66% 
IMG OID638125765 
Productinosine-5'-monophosphate dehydrogenase 
Protein accessionYP_741857 
Protein GI114320174 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0516] IMP dehydrogenase/GMP reductase
[COG0517] FOG: CBS domain 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG CTCAAGAAGC CCTGACCTTT GACGACGTTC TCCTGCTCCC CGCGCACTCC 
GCCGTCCTGC CGCGGGATGT TGACCTCAGC ACCCAGCTGA CCCGTGGTAT CCGTCTGCGC
GCGCCTATCG TCTCCGCCGC GATGGATACC GTCACCGAGG CCCGGTTGGC CATCGCGTTG
GCCGAGCAGG GTGGTATCGG CATTGTCCAC AAGAACATGA CCGTGGCGCA GCAGGCCAAC
GAGGTGCGCC GGGTCAAGAA GTTCGAGAGC GGGGTGATCA AGGAGCCCAT TACCGTATCC
CCTCGCACCA CCATCCGCGA GGTGTTGGAG CTGACGCGGG CCAATGGTAT CTCCGGCGTG
CCCGTGGTGG ACGGTGAGGA CCTGGTGGGT ATCGTCACCA GCCGCGATCT GCGCTTCGAG
ACCCGCCTGG ACGAACCGGT CTCGGTGGCG ATGACCCCGC GCGAGCGACT GGTCACGGTG
ACCGAGGGCG CCGATCGCGA GGAGATCCTC AGCAAGCTGC ACGGAAACCG CATCGAGAAG
GTGTTGGTGG TGGACGATGC CTTCCACCTG CGGGGCATGG TCACGGTCAA GGATATCCAG
AAGGCCAAGG ATTACCCGAA CGCCAGTAAG GACGAGCACG GCCGCCTGCG GGTGGGCGCC
GCAGTGGGTA CTGGCGGCGA TACCGAGGAA CGGCTGGCCG CGCTGGTGGA GGCCGGGGTG
GACGTGGTGG TGGTGGATAC CGCGCACGGC CACTCCCAGG GCGTGCTCAA CCGGGTGCGG
TGGATCAAAC AGCACTATCC CGACCTCCAG GTGATTGGGG GGAACATCGC CACTGCCCAG
GCGGCGCTCG ATCTCAAGGA GGCGGGTGTG GACGCGGTCA AGGTCGGTAT CGGCCCGGGC
TCCATCTGTA CCACCCGGGT GGTGGCCGGG GTGGGCGTCC CGCAGATCAC CGCGATCTCC
AACGTGGCCG AGGCCCTGGC GGGTACCGAC ATCCCCCTGA TTGCCGACGG CGGCGTGCGC
TTTTCCGGCG ATATGGCCAA GGCCCTGGCG GCCGGTGCCT ATTGCGTGAT GGTGGGGAGC
CTGCTGGCCG GCACCGAAGA GGCACCGGGT GAGGTGGAGC TCTATCAGGG ACGCTCCTAC
AAATCCTACC GCGGGATGGG GTCGCTGGGC GCCATGTCCC AGAGCCAGGG CTCGGCCGAC
CGCTATTTCC AGGACCCCAC CGCCAACGTG GACAAGCTGG TGCCCGAGGG CATCGAGGGC
CGGGTGCCCT ATAAGGGTTC CATGGTCACC ATCGTCCACC AGTTGCTGGG GGGGATCCGC
GCCAGCATGG GTTATGTGGG CTGCGCCAGC ATTGAGGAGA TGCGGACCCG GCCGGAGTTC
GTGCGCATCA CCAACGCCGG TATGCGCGAG AGCCATGTGC ACGACGTGAG TATCACCAAA
GAGGCGCCCA ACTACCGCGT GGGCTGA
 
Protein sequence
MRIAQEALTF DDVLLLPAHS AVLPRDVDLS TQLTRGIRLR APIVSAAMDT VTEARLAIAL 
AEQGGIGIVH KNMTVAQQAN EVRRVKKFES GVIKEPITVS PRTTIREVLE LTRANGISGV
PVVDGEDLVG IVTSRDLRFE TRLDEPVSVA MTPRERLVTV TEGADREEIL SKLHGNRIEK
VLVVDDAFHL RGMVTVKDIQ KAKDYPNASK DEHGRLRVGA AVGTGGDTEE RLAALVEAGV
DVVVVDTAHG HSQGVLNRVR WIKQHYPDLQ VIGGNIATAQ AALDLKEAGV DAVKVGIGPG
SICTTRVVAG VGVPQITAIS NVAEALAGTD IPLIADGGVR FSGDMAKALA AGAYCVMVGS
LLAGTEEAPG EVELYQGRSY KSYRGMGSLG AMSQSQGSAD RYFQDPTANV DKLVPEGIEG
RVPYKGSMVT IVHQLLGGIR ASMGYVGCAS IEEMRTRPEF VRITNAGMRE SHVHDVSITK
EAPNYRVG