Gene Mlg_0375 details


Gene Information

Locus tag: Mlg_0375
Symbol:
ID: 4269000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 420126
End bp: 421235
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 77%
IMG OID: 638125106
Product: diaminohydroxyphosphoribosylaminopyrimidine deaminase / 5-amino-6-(5-phosphoribosylamino)uracil reductase
Protein accession: YP_741220
Protein GI: 114319537
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0117] Pyrimidine deaminase
        [COG1985] Pyrimidine reductase, riboflavin biosynthesis
TIGRFAM ID: [TIGR00227] riboflavin-specific deaminase C-terminal domain
            [TIGR00326] riboflavin biosynthesis protein RibD
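
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 420126
END_BP = 421235


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1110
print(protein_length(length_bp))  # 369
```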


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.989809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.14287
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGTT TCTCCCCCGC CGACAACCTC TTCATGGGCC GGGCCCTGCG CCTGGCCCGC 
CGCCCGCAGC AGCCGCCCCA TCCCAACCCC GCCGTCGGCT GCGTGCTGGT GCGCGACGGG
CTTATCGTCG GCGAGGGCTG GCATGAGCGG GCGGGCGAGC CCCACGCGGA GGCCATGGCC
CTGCACCGGG CCGGCGAGCA GGCCAGCGGC GCCACCGCCT ACGTCACCCT GGAGCCGTGC
AGCCACCACG GCCGCACCCC GCCCTGCAGC GAGGCCCTGC TGGCGGCCGG CGTGGTCCGG
GTGGTGGCGG CCATGACCGA CCCCAACCCG CAGGTGGCCG GCCGCGGGCT GCGCCGCTTG
CGCGCCGCCG GGCTGGAGGT GGCCACCGGC TTGATGGCCG AGCAGGCGGC GGCGCTGAAC
CCCGGCTTCA CCCAGCGGAT GCGCACCGGG CGGCCCTGGC TGCGGCTGAA ATCGGCGGCC
AGTCTCGACG GGCGGACCGC CATGGCCTCC GGCGAGAGCC GCTGGATCAC CTCGCCCCAG
GCCCGGGCCG ACGTCCACCG CTGGCGGGCG CGCAGCGACG CCATGCTCAC CGGCATCGGC
ACCGTGCTGG CGGACGACCC GCGCCTGGAT GTCCGCGATG CCGGCATCGA GGCGCCGCGC
CAGCCGCGCC GCTGCGTGCT CGACCGCGAC CTGCGGACCC CCGCGGACGC GGTCCTGCTC
CGCGGCGAGG GCGCGACCCT GTTCCACGGC CCGGACGTGG CCGCCGGACA GATCCGGCGG
CTGACCGACG CCGGCGCCCA CTGCGTGGCG CTGCCGCTGG CGGACGGGCG CCTGGACCTG
GGCGCGGCCC TGGACTGGCT GGGCGGTCAG GGGTGCAATG AGGTGCTGGT GGAGGCCGGG
CCGACCCTGG GCGGGGCCTT GAGCCGCGCC GGTCTGGTGG ATGAATGGCT GCTCTACCTG
GCGCCCCACC TGATGGGCGA CGCGGCGCGG CCGCTGCTGC ACTGGCCGGG GCTGGAGACG
ATGAGCCAGC GCCAGCCCCT CCGGGTGCAG GACTGCCGCT TGGTGGGGCC GGATCTGCGG
TTGACGCTGC GGCTGGGGAG CGGGACCTGA
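
A GC fraction can be computed from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch using one common definition, (G + C) / total bases; the record does not say how its GC content figure was derived, so this is not necessarily the same calculation. The helper name `gc_fraction` is hypothetical.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Small illustrative input, not taken from the record.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

To apply it to this gene, paste the full block of sequence lines into a string and pass it to `gc_fraction`; the whitespace between blocks is stripped automatically.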
 
Protein sequence
MSGFSPADNL FMGRALRLAR RPQQPPHPNP AVGCVLVRDG LIVGEGWHER AGEPHAEAMA 
LHRAGEQASG ATAYVTLEPC SHHGRTPPCS EALLAAGVVR VVAAMTDPNP QVAGRGLRRL
RAAGLEVATG LMAEQAAALN PGFTQRMRTG RPWLRLKSAA SLDGRTAMAS GESRWITSPQ
ARADVHRWRA RSDAMLTGIG TVLADDPRLD VRDAGIEAPR QPRRCVLDRD LRTPADAVLL
RGEGATLFHG PDVAAGQIRR LTDAGAHCVA LPLADGRLDL GAALDWLGGQ GCNEVLVEAG
PTLGGALSRA GLVDEWLLYL APHLMGDAAR PLLHWPGLET MSQRQPLRVQ DCRLVGPDLR
LTLRLGSGT
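
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to do that translation in plain Python. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may act as start codons), and it assumes the reading frame begins at the first base. The function and constant names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; these codon-to-residue
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCGGTT TCTCCCCCGC CGACAACCTC"))  # MSGFSPADNL
```

The output matches the first ten residues of the protein sequence listed above.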